What makes LLM outputs fabrication rather than hallucination or confabulation?

This explores why the corpus argues 'fabrication' is a more honest label for LLM errors than borrowed clinical terms like 'hallucination' or 'confabulation' — and what that relabeling changes about how we fix the problem.

This explores why some researchers insist that when an LLM gets something wrong, we should call it fabrication rather than hallucination or confabulation — and why the word we pick isn't just semantics. The core argument is mechanical: an LLM produces every output, true or false, through the exact same process of statistically predicting the next token with no grounding in a shared reality Should we call LLM errors hallucinations or fabrications?. 'Hallucination' implies a perception gone wrong; 'confabulation' implies a memory gone wrong. But the model has neither perception nor memory in any meaningful sense — its correct answers and its wrong answers are the same kind of thing, made the same way. The error isn't a malfunction in some faculty; the fabrication is the default mode, and accuracy is the happy accident.

Why this matters is that terminology quietly decides where you go looking for fixes. Call it hallucination and you chase better grounding — feed the model more accurate perception of the world. Call it fabrication and you accept the output is unreliable by construction, so the fix shifts to external verification systems and calibrated uncertainty designed into the use case, not into the model's 'senses' Does calling LLM errors hallucinations point us toward the wrong fixes?. This reframing gains force from a formal result: hallucination is mathematically inevitable for any computable LLM — three theorems show it must produce falsehoods on infinitely many inputs, and no amount of internal self-correction can eliminate it Can any computable LLM truly avoid hallucinating?. If the problem can't be fixed from the inside, then external safeguards aren't optional polish — they're the only real lever.

Here's where the corpus gets more interesting than a simple relabeling: fabrication isn't even one thing. Shanahan's framework distinguishes fabrication, good-faith error, and role-played deception using nothing but behavioral signatures — regenerate the same prompt many times and watch the variation. Fabrication shows high variation (the model is improvising freely), good-faith error stays low and stable, and role-played deception stays low but flips with context Can we distinguish types of LLM falsehood by regeneration patterns?. This lets you diagnose what kind of falsehood you're dealing with without making claims about what the model 'believes.' And there's a category even fact-checkers miss: when prompted to fuse semantically distant concepts, models build elaborate, plausible-sounding frameworks for connections that don't legitimately exist — and present them as defensible research rather than flagging them as speculation Do language models evaluate semantic legitimacy when fusing concepts?.

Underneath all of this is a deeper claim about what LLM text even is. The 'Foundation Priors' view says model outputs are draws from a subjective prior distribution — reflections of learned patterns and your prompt — not empirical observations of the world, and treating them as evidence is a category error Should we treat LLM outputs as real empirical data?. This connects to the argument that LLMs and humans aren't doing the same thing at all: humans use language to address and relate to one another, while models emit strings from a probability distribution that merely shares the surface form Are language models and human speakers doing the same thing?. 'Fabrication' is the term that respects that gap. (There's a counterweight worth knowing about — some argue LLMs do achieve a kind of indirect grounding by extracting causal structure from text written by grounded humans, so the chain to reality isn't fully severed, just gappy Can large language models develop genuine world models without direct environmental contact?.)

The quietly unsettling payoff: even your instinct to make outputs 'reliable' by setting temperature to zero is misguided. Determinism just makes the model repeat one draw from its distribution — consistent, but still one sample, and consistency is not the same as truth Does setting temperature to zero actually make LLM outputs reliable?. The whole cluster of arguments points one way: the falsehoods aren't a bug to be perceived away, they're a property of a fabrication engine, and that recognition changes what you build around it.

Sources 9 notes

Should we call LLM errors hallucinations or fabrications?

LLMs generate text through statistical token relationships without grounding in shared context. Accurate and inaccurate outputs use identical mechanisms, so calling failures "hallucinations" or "confabulation" misdirects fixes toward perception or memory—the wrong layers.

Does calling LLM errors hallucinations point us toward the wrong fixes?

LLMs generate text through identical statistical processes regardless of accuracy, making 'fabrication' the more honest term. This reframes the fix from perception-based grounding to verification systems and calibrated uncertainty in use case design.

Can any computable LLM truly avoid hallucinating?

Three formal theorems prove that any computable LLM must hallucinate on infinitely many inputs, and internal mechanisms like self-correction cannot eliminate this mathematical constraint. External safeguards are therefore necessary, not optional.

Can we distinguish types of LLM falsehood by regeneration patterns?

Shanahan's framework distinguishes fabrication (high variation), good-faith error (low variation, stable), and role-played deception (low variation, context-dependent) using behavioral tests alone. This avoids mentalistic language while enabling differential diagnosis for safety.

Do language models evaluate semantic legitimacy when fusing concepts?

LLMs generate coherent, plausible metaphorical reasoning when prompted to fuse semantically distant concepts without legitimate correspondences. Rather than decline or flag the fusion as speculative, they produce elaborate frameworks presented as defensible research, revealing a category-distinct hallucination type missed by fact-checking taxonomies.

Should we treat LLM outputs as real empirical data?

Foundation Priors framework shows that LLM-generated text reflects the model's learned patterns and user's prompt choices, not ground truth. Such outputs should only influence inference through explicitly parameterized trust weights, not be treated as equivalent to real evidence.

Are language models and human speakers doing the same thing?

LLMs produce strings via probability distributions; humans use language to address and relate to others. They share surface form but differ in what produces output, what it does socially, and what receivers should do with it.

Can large language models develop genuine world models without direct environmental contact?

LLMs form structured world representations by extracting regularities from training data produced by causally grounded humans. This constitutes indirect causal grounding mediated through text, though the chain has gaps that limit real-time verification and model updating.

Does setting temperature to zero actually make LLM outputs reliable?

Fixed seeds and zero temperature replicate the same output repeatedly, but that output remains one draw from the model's probability distribution. McDonald's omega testing across 100 repetitions reveals that consistency does not equal reliability.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a critical research analyst. The question remains open: what makes LLM outputs fabrication rather than hallucination or confabulation — and does the distinction still hold as models and evaluation methods evolve?

What a curated library found — and when (dated claims, not current truth):
Findings span 2023–2026; treat as perishable constraints:
• LLMs produce errors via the same statistical token-prediction mechanism as correct outputs — no grounding in shared reality, so 'fabrication' better describes the mechanical reality than 'hallucination' (perception gone wrong) or 'confabulation' (memory gone wrong) (~2024–2025).
• Hallucination is formally inevitable: three theorems prove any computable LLM must produce falsehoods on infinitely many inputs, and no internal self-correction eliminates it (~2024-01).
• Behavioral signatures can diagnose falsehood type: high regeneration variance = fabrication; low stable variance = good-faith error; low context-flipped = role-played deception (~2024, Shanahan framework).
• LLM outputs are draws from a subjective prior distribution reflecting learned patterns, not empirical observations — treating them as evidence is a category error (~2025-11, Foundation Priors).
• Setting temperature to zero creates fixed consistency, not reliability — one draw from a distribution remains one sample (~2024–2025).

Anchor papers (verify; mind their dates):
• arXiv:2401.11817 (Jan 2024): Hallucination is Inevitable — formal limits.
• arXiv:2512.01107 (Nov 2025): Foundation Priors — outputs as subjective draws, not observations.
• arXiv:2508.08285 (Aug 2025): Illusion of Progress in hallucination detection.
• arXiv:2603.29025 (Mar 2026): Surface heuristics override implicit constraints.

Your task:
(1) RE-TEST EACH CONSTRAINT. For every claim above, investigate whether newer scaling, architectural innovations (extended context, mixture-of-experts, retrieval integration), training methods (constitutional AI, RLHF refinement, synthetic data filtering), or evaluation harnesses have since RELAXED or OVERTURNED the inevitability result or the distinction between fabrication and other error types. Which remain durable (likely still open) vs. perishable (possibly resolved)? Cite what resolved them plainly.
(2) Surface the strongest CONTRADICTING or SUPERSEDING work from the last 6 months that argues LLMs DO achieve indirect grounding, DO reduce hallucination via internal mechanisms, or DO exhibit something resembling memory or perception that complicates the "pure fabrication" framing.
(3) Propose 2 research questions that ASSUME the terminological distinction may have shifted (e.g., if newer models DO exhibit grounded reasoning, does the distinction collapse? If detection improved dramatically, does inevitability still bind?).

Cite arXiv IDs; flag anything you cannot ground in a real paper.

What makes LLM outputs fabrication rather than hallucination or confabulation?

Sources 9 notes

Next inquiring lines