Agentic and Multi-Agent Systems Language Understanding and Pragmatics LLM Reasoning and Architecture

Can structured debate roles help small models detect ambiguity?

Small language models struggle to recognize when problems are underspecified. Can assigning explicit leader-follower roles in multi-agent debates overcome this limitation and boost ambiguity detection accuracy?

Note · 2026-04-18 · sourced from Reasoning Architectures
What makes multi-agent teams actually perform better? Where exactly do reasoning models fail and break?

Small models (7-9B parameters) individually struggle with ambiguity detection — recognizing when a problem statement is underspecified or admits multiple interpretations. But a structured multi-agent debate protocol with explicit leader-follower roles and rotation significantly boosts performance: Mistral-7B-led debates achieve 76.7% success rate, well beyond single-model baselines.

The protocol matters more than the models. A leader agent proposes an interpretation, two follower agents challenge or extend it, and roles rotate across rounds. The two-follower configuration creates a stronger consensus mechanism than pairwise debate because disagreement must survive two independent challenges rather than one. This is a different mechanism from the general multi-agent debate finding that When does debate actually improve reasoning accuracy? — ambiguity detection is not a verifiability problem but a recognition problem, and the structured role protocol prevents the persuasive-framing failure mode by forcing role rotation.

The result is notable because ambiguity detection is a prerequisite for the information-seeking behavior that models systematically lack. Since Can models identify what information they actually need?, the ability to detect ambiguity is upstream of the ability to ask clarifying questions. Leader-follower debate offers a multi-agent route to a capability that single models achieve at only 40-50% accuracy (QuestBench).

This also connects to the broader finding that Does cognitive diversity alone improve multi-agent ideation quality?. The leader-follower protocol imposes structural diversity through role assignment rather than relying on emergent diversity — a design choice that may explain why it works with small models that individually lack the expertise threshold.


Source: Reasoning Architectures Paper: "Debate for Ambiguity Detection" (2507.12370)

Original note title

leader-follower multi-agent debate enhances ambiguity detection in small models through structured role rotation and consensus forcing