Why do LLMs understand efficient language but fail to produce it?
This explores why models can recognize and describe efficient, well-formed language yet don't reliably generate it themselves — and the corpus suggests the gap isn't about knowledge but about how comprehension and production sit on separate tracks.
This reads the question as: a model can spot economical, well-structured language and even explain what makes it good, but its own output doesn't follow those principles. The corpus has a sharp answer for the first half. LLMs learn whatever is statistically present in text — sound symbolism, priming effects, surface patterns — but they don't pick up the *communicative logic* behind why language takes efficient forms. Things like word-length economy (frequent words get shorter) or pragmatic discourse inference require optimizing for a goal the training signal never explicitly contains, so the model absorbs the shape of efficient language without the principle that produces it Why do language models fail at communicative optimization?.
The deeper reason production lags comprehension shows up across several notes as a recurring split: models can state a principle correctly and still fail to act on it. One line of work calls this a 'computational split-brain' — explanation and execution run on dissociated pathways, with accuracy around 87% when articulating a rule but dropping to ~64% when applying it Can language models understand without actually executing correctly?. The 'Potemkin understanding' framing makes the same point from another angle: a model can explain a concept, fail to apply it, *and* recognize its own failure — a combination that doesn't look like a human knowledge gap at all, but like two functionally disconnected systems Can LLMs understand concepts they cannot apply?. The 'knowing-doing gap' note shows the same 87%-vs-64% signature persisting across model scales Why do language models fail to act on their own reasoning?.
What makes this lateral rather than a single finding: the understand-but-can't-produce pattern isn't specific to language efficiency — it's a general architecture-level dissociation. The same shape appears in planning, where models acquire planning knowledge fluently but only 12% of GPT-4's generated plans actually execute without error Can large language models actually create executable plans?. So 'efficient language' is one instance of a broader phenomenon: recognition is cheap because it's pattern-matching, but production requires assembling and committing to a structure that satisfies a goal, and that's where the seams show.
There's also a structural-complexity story worth knowing. Production degrades *predictably* as the thing being produced gets harder to hold together — grammatical competence falls off as syntactic depth and embedding increase, suggesting models learned surface heuristics rather than generative grammatical rules Does LLM grammatical performance decline with structural complexity?. The blind spots map to specific failures in discourse intentionality and forward-planning, not just surface form Where exactly do language models fail at structural language tasks? Why do large language models fail at complex linguistic tasks?. Efficient language is forward-planned — you have to anticipate the whole utterance to make it economical — which is exactly the kind of structure these notes say breaks down first.
If you want a frame for the whole pattern, the epistemic-failure-modes note catalogs these as structurally distinct gaps between statistical pattern-tracking and real competence, rather than as the model simply being 'wrong' How do LLMs fail to know what they seem to understand?. The thing you might not have expected: the reason a model can grade good writing better than it can write is the same reason it can describe a plan it can't execute — comprehension and generation aren't two ends of one ability, they're two different machines that happen to share a vocabulary.
Sources 9 notes
LLMs successfully replicate statistical regularities learnable from text distributions (sound symbolism, priming) but fail at principles requiring pragmatic optimization (word length economy, discourse inference). The gap reveals that communicative logic—why language has certain forms—isn't present as a trainable signal.
Large language models can articulate correct principles but systematically fail to apply them due to dissociated instruction and execution pathways. The 87% accuracy in explanations versus 64% in actions reveals this is not knowledge deficit but structural disconnect.
Models can explain concepts accurately, fail to apply them, and recognize the failure—a triple pattern incompatible with human cognition. This indicates functionally disconnected explanation and execution pathways rather than simple knowledge gaps.
LLMs generate correct reasoning 87% of the time but follow it only 64% of the time. Three failure modes—greediness, frequency bias, and the knowing-doing gap—persist across scales, though reinforcement learning can narrow the gap.
Only 12% of GPT-4 generated plans are actually executable without errors. LLMs excel at acquiring planning knowledge but fail at the reasoning assembly required to handle subgoal and resource interactions.
LLMs show systematic performance decline as syntactic depth and embedding increase. Simple sentences are handled well while complex structures with recursion and embedding fail consistently, suggesting LLMs learned surface heuristics rather than structural grammar rules.
Language models excel with explicit discourse markers and simple grammar but fail predictably on implicit relations, embedded structures, and forward-planning discourse. These breakdowns map to failures in discourse intentionality and attention layers, not just linguistic surface structure.
Top-tier LLMs like Llama3-70b consistently misidentify embedded clauses, verb phrases, and complex nominals. Performance degrades predictably as syntactic depth increases, revealing that statistical learning captures surface patterns but not deep grammatical rules.
LLMs show repeatable, empirically documented failure modes—from Potemkin understanding (correct explanation + failed application) to reasoning collapse under implicit constraints. These failures reveal gaps between statistical pattern-tracking and actual epistemic competence.