Why do two experts with identical knowledge produce different outcomes in the same situation?

This explores why expertise can't be reduced to knowledge stock — two people who know identical things still act differently because expertise is judgment, performance, and communication, not just retrieval.

This explores why two experts with the same knowledge diverge in practice — and the corpus has a sharp answer: knowledge is the smallest part of expertise. The bigger part is *situational judgment* — knowing when to speak, when to defer, which piece of knowledge applies right now, and how to deliver it to this particular audience Is expertise really just knowing more than others?. Two people can hold the same facts and still read the same moment differently, because reading the moment is its own skill.

Part of that skill is *selecting which differences matter.* Expert observation isn't pattern-matching across everything in view; it's a qualitative choice about which details are relevant to this situation and this audience's needs Can AI distinguish which differences actually matter?. Give two experts the same scene and they may foreground different signals — and that single upstream choice cascades into different actions downstream. The knowledge is identical; the judgment about what's salient is not.

The other half is that expert claims are *communicative acts that anticipate a response.* A claim succeeds when it's both factually right and socially acceptable to the community hearing it, and experts are constantly running that second calculation — what will land, what will be accepted, what their standing lets them assert Can AI anticipate whether expert claims will be socially valid? Can AI replicate the communicative work experts do?. The same true statement carries different force depending on who says it and the track record behind it Can language models distinguish expert arguments from common assumptions?. So two experts with equal knowledge but different reputations, framings, or audiences will produce genuinely different outcomes — the difference lives in the social situation, not the facts.

There's a deeper reason the divergence is unavoidable: even the words don't mean the same thing to both. Referential grounding is person-specific — how language connects to the world has to be actively negotiated, not assumed from shared vocabulary Why do speakers need to actively calibrate shared reference?. The same explanation is effective or useless depending on the source-framing-recipient triad it lands in, not on anything intrinsic to the explanation itself What if XAI is fundamentally a communication problem?. The thing worth taking away: 'identical knowledge' is a fiction once you realize that expertise is a performance calibrated to a moment and an audience — which is also precisely the layer the corpus argues AI can't reproduce, because it has the text but not the standing, the situation, or the read of the room.

Sources 7 notes

Is expertise really just knowing more than others?

Real expertise involves situational judgment—knowing when to speak, when to defer, which knowledge applies now, and how to communicate it to a specific audience. This role-performance dimension is at least as important as the underlying knowledge stock, and it is what AI cannot structurally perform.

Can AI distinguish which differences actually matter?

Experts observe by choosing which differences matter (qualitative judgment); AI finds patterns and probabilities (quantitative). AI generates text from prompts without observing context, audience needs, or knowledge states—producing fabrication that mimics observation's form without its epistemic process.

Can AI anticipate whether expert claims will be socially valid?

Expert claims are validity claims that succeed when both factually correct and socially acceptable within a community. AI can estimate statistical correctness but cannot anticipate contextual acceptability because it lacks embedded knowledge of expert communities' evolving standards.

Can AI replicate the communicative work experts do?

Expertise requires anticipating audience acceptability and social validity, not just retrieving information. AI lacks the mechanism to perform this communicative work, making its fluent output epistemically misleading despite its confident form.

Can language models distinguish expert arguments from common assumptions?

LLMs lose the social context that gives expert claims their force—reputation, track record, and standing—because they process only text, not the social world where expertise is built and evaluated.

Why do speakers need to actively calibrate shared reference?

The same words can mean different things to different speakers because referential grounding is person-specific. True communicative grounding demands collaborative negotiation of how language connects to the world, not mere surface-level word sharing.

What if XAI is fundamentally a communication problem?

Explanation quality is not intrinsic to the explanation itself but depends on the rhetorical situation: who presents it, how it is framed, and what role the recipient plays. Evaluations that ignore this triad measure only a narrow slice of real-world effectiveness.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a research analyst re-testing claims about expertise divergence in high-stakes judgment. The question remains open: why do two experts with identical knowledge produce different outcomes? A curated library found — and when (dated claims, not current truth):

• Expertise is not knowledge-holding but *situational judgment*—knowing when to apply which fact to which audience (2023–2026 range).
• Expert observation selects which differences matter; two experts reading the same scene may foreground different signals upstream, cascading into divergent actions (arXiv:2304.01481, ~2023).
• Expert claims are communicative acts; identical true statements carry different force depending on speaker reputation and audience standing (arXiv:2307.10928, ~2023; arXiv:2507.01936, ~2025).
• Referential grounding is person-specific; shared vocabulary does not guarantee shared reference—explanations succeed in source-framing-recipient triads, not in isolation (arXiv:2505.09862, ~2025).
• AI systems hold text but lack standing and situational read; they cannot reproduce the performance-calibration layer that distinguishes expert outcomes (synthesis consensus, 2023–2026).

Anchor papers (verify; mind their dates):
• arXiv:2304.01481 (2023) — The Vector Grounding Problem
• arXiv:2507.01936 (2025) — The Thin Line Between Comprehension and Persuasion in LLMs
• arXiv:2508.19004 (2025) — AI Models Exceed Individual Human Accuracy in Predicting Everyday Social Norms
• arXiv:2602.16488 (2026) — Learning to Learn from Language Feedback with Social Meta-Learning

Your task:
(1) RE-TEST THE STANDING + SALIENCE CONSTRAINT. The library claims AI cannot reproduce expert divergence because it lacks social standing and situational judgment. New models (GPT-4o, Claude 3.5+) and multi-agent scaffolds now operate in sustained dialogue, reputation logging, and role-play. Do they now *simulate* standing and salience-selection credibly enough that two instances can diverge on identical inputs—and if so, does that divergence match *human* expert divergence in direction and magnitude? Or does it remain decorative?
(2) Surface the strongest contradicting work from the last ~6 months: arXiv:2508.19004 claims AI *exceeds* humans in predicting social norms. Does this undercut the claim that AI lacks situational judgment, or does it show AI can pattern-match norms without inhabiting them? Flag what you find.
(3) Propose two research questions that assume the regime may have shifted: (a) If AI can now learn *whose* judgment to defer to in a given situation, does the divergence problem move upstream—to training data selection or prompt framing rather than inference-time judgment? (b) Does the emergence of AI-mediated expert teams (human + model in dialogue) actually *increase* or *decrease* the divergence problem, and through which mechanism?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Why do two experts with identical knowledge produce different outcomes in the same situation?

Sources 7 notes

Next inquiring lines