Apparent Psychological Profiles of Large Language M...

Apparent Psychological Profiles of Large Language Models are Largely a Measurement Artifact
This research investigates whether the psychological profiles assigned to Large Language Models (LLMs)—such as personality traits or risk preferences—are genuine characteristics of the models or simply byproducts of how they are tested. By applying a formal psychometric framework to 56 different LLMs, the authors demonstrate that what appears to be a stable "personality" is actually a measurement artifact driven by a consistent directional response bias.

The Problem with Human-Designed Tests

Researchers often use psychological instruments designed for humans to measure LLM behavior. These tests typically rely on a mix of "forward-keyed" items (where a "yes" indicates a trait) and "reverse-keyed" items (where a "no" indicates the same trait). In humans, these tests successfully separate a person's actual traits from their tendency to simply agree or disagree with statements. The authors found that when these same tests are applied to LLMs, the models do not behave like humans. Instead of their responses being driven by the content of the questions, 81–90% of the variation between models is caused by a directional response bias—a tendency to favor one end of a scale or a specific labeled option regardless of what the question asks.

How Bias Shapes Model Profiles

The study reveals that LLM responses are heavily influenced by the structure of the test rather than the underlying psychological construct. Because these models often lack "response orthogonality"—the design feature that balances forward and reverse items to cancel out bias—their scores are essentially manufactured by the selection of items. If a researcher chooses a set of questions that are not perfectly balanced, they can effectively "create" a specific personality profile for a model simply by how they frame the test. This explains why different studies often report conflicting psychological profiles for the same models.

Capability Does Not Eliminate Bias

A key question is whether more advanced, larger models are better at avoiding these biases. The researchers found that while increasing a model's capability (measured by parameter count and proprietary status) does slightly reduce the intensity of the response bias, it does not eliminate it. Even the most capable models tested still exhibit significantly higher levels of bias than the average human. This suggests that the "personality" observed in current LLMs is not a sign of human-like cognitive development, but rather a persistent feature of how these models process and respond to structured prompts.

Implications for Future Research

The authors conclude that current psychological profiling of LLMs is largely invalid because the instruments used were not designed for the way these models function. Because the apparent consistency of an LLM's "personality" is almost entirely predicted by the design of the test, the researchers argue that the field needs to move away from standard human questionnaires. Instead, they call for the development of dedicated assessment methods that prioritize response orthogonality, ensuring that future measurements can distinguish between a model's actual behavioral tendencies and the mechanical biases inherent in the testing process.

Apparent Psychological Profiles of Large Language M... | AI Research

Key Takeaways

The Problem with Human-Designed Tests

How Bias Shapes Model Profiles

Capability Does Not Eliminate Bias

Implications for Future Research

Comments (0)

No comments yet