Where does mode collapse in language models really come from?
Researchers investigate whether mode collapse—when models narrow to repetitive outputs—stems from training algorithms or the preference data itself. Understanding the root cause is crucial for fixing diversity loss in creative and synthetic tasks.
Post-training alignment narrows LLM output to a few favored responses — mode collapse — which kneecaps creative writing, social simulation, pluralistic alignment, and synthetic-data generation. Prior work blames algorithmic causes (inadequate reward models, majority-favoring optimization). This paper relocates the cause to the data: a pervasive typicality bias in preference data, where annotators systematically prefer familiar, typical text (a well-established cognitive-psychology effect). Mode collapse is thus an inherent property of preference data itself, formalized theoretically and verified on real preference datasets.
The fix follows from the diagnosis and is strikingly cheap: Verbalized Sampling (VS), a training-free prompting strategy that asks the model to verbalize a distribution over responses with their probabilities ("generate 5 jokes and their probabilities") rather than a single answer. Across creative writing, dialogue simulation, open-ended QA, and synthetic data, VS lifts diversity 1.6-2.1× over direct prompting without sacrificing factual accuracy or safety — and, tellingly, more capable models benefit more, suggesting the diversity was latent and suppressed rather than absent.
This reframes the vault's diversity-collapse thread at the source. Since Does outcome-based RL diversity loss spread across unsolved problems? locates collapse in RL dynamics, VS adds the data-level origin and a decoding-time remedy; and it offers a practical lever for Why do LLMs generate novel ideas from narrow ranges? — verbalize the distribution to surface the suppressed tail.
Related concepts in this collection 3
This note in its neighbourhood — explore the map, then jump to a related concept in the list below.
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
-
Does outcome-based RL diversity loss spread across unsolved problems?
When RL concentrates probability mass on correct answers for solved problems, does that narrowing propagate to problems the model cannot yet solve? And if so, what are the separate mechanisms for preserving diversity during training versus at test time?
RL-dynamics account of collapse; VS adds the data-level cause and a training-free fix
-
Why do LLMs generate novel ideas from narrow ranges?
LLM research agents produce individually novel ideas but cluster them in homogeneous sets. This explores why high average novelty coexists with poor diversity coverage and what it means for automated ideation.
VS is a candidate remedy for the suppressed-tail problem in ideation
-
Why aren't bigger models better for generating diverse outputs?
When generating many unique outputs within a fixed budget, does model size actually matter? Exploring whether the conventional wisdom of using larger models holds for diversity-focused tasks.
both about eliciting diversity; VS shows capable models hold latent diversity that elicitation unlocks
Related papers in this collection 8
Papers most semantically related to this note, ranked by cosine similarity in the embedding space.
- Verbalized Sampling: How to Mitigate Mode Collapse and Unlock LLM Diversity
- Thinkless: LLM Learns When to Think
- NoveltyBench: Evaluating Language Models for Humanlike Diversity
- Artificial Hivemind: The Open-Ended Homogeneity of Language Models (and Beyond)
- Evaluating the Diversity and Quality of LLM Generated Content
- The Curse Of Recursion: Training On Generated Data Makes Models Forget
- Temporal Self-Rewarding Language Models: Decoupling Chosen-Rejected via Past-Future
- Outcome-based Exploration for LLM Reasoning
Original note title
mode collapse is a data-level property of preference data driven by typicality bias not an algorithmic artifact — and verbalized sampling restores diversity training-free