Conversational AI Systems

Do persona consistency metrics actually measure dialogue quality?

Personalized dialogue systems can achieve high persona consistency scores by simply restating character descriptions, ignoring conversational relevance. Does optimizing for persona fidelity necessarily harm the conversational coherence users actually care about?

Note · 2026-02-23 · sourced from Personalization

Personalized dialogue generation faces a persistent dual optimization problem: persona consistency and discourse coherence pull in different directions, and most methods sacrifice one for the other.

The measurement trap is revealing. Methods that achieve the highest personalization scores (e.g., PAA) do so by "frequently generating sentences that are exact restatements of the persona description, often ignoring the relevance to the query." High persona adherence metrics can be achieved trivially through description copying — which looks like success on the persona dimension while failing on the coherence dimension. This is not a training failure but a measurement artifact that rewards surface-level persona adherence.
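The gaming dynamic can be made concrete with a toy metric. The sketch below (illustrative only, not the metric used in the paper) scores persona consistency as unigram F1 overlap between a response and the persona description; a verbatim restatement that ignores the query scores perfectly, while a contextually grounded reply is penalized.

```python
# Illustrative sketch: a naive overlap-based persona-consistency score
# is maximized by copying the persona description verbatim.

def token_f1(response: str, persona: str) -> float:
    """Unigram F1 between a response and the persona description."""
    r, p = response.lower().split(), persona.lower().split()
    overlap = sum(min(r.count(t), p.count(t)) for t in set(r))
    if overlap == 0:
        return 0.0
    precision = overlap / len(r)
    recall = overlap / len(p)
    return 2 * precision * recall / (precision + recall)

persona = "i love hiking in the mountains"
query = "what did you do last weekend?"

# Exact restatement of the persona, ignoring the query entirely.
copied = "i love hiking in the mountains"
# A response that actually answers the query while staying in persona.
grounded = "i went hiking last weekend and the mountains were beautiful"

assert token_f1(copied, persona) == 1.0   # trivially perfect persona score
assert token_f1(grounded, persona) < 1.0  # penalized despite being relevant
```

Any metric of this shape rewards the copying strategy, which is exactly the failure mode observed in methods like PAA: the persona dimension looks optimal while the coherence dimension collapses.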

The coherence side has two distinct components:

Local coherence — logical connections between adjacent sentences, ensuring they relate to each other and form a coherent sequence. This is sentence-to-sentence reasoning.

Global coherence — higher-level relationships across the entire dialogue, maintaining topic consistency and effectively conveying meaning throughout an interaction. Poor global coherence impairs understanding of the discourse as a cohesive whole.
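The two components can be sketched operationally. The toy code below (an assumption-laden stand-in, using bag-of-words cosine similarity in place of the Sentence-BERT embeddings the paper uses) scores local coherence as mean similarity between adjacent utterances, and global coherence as mean similarity of each utterance to the whole-dialogue centroid.

```python
# Minimal sketch of the two coherence components. Bag-of-words cosine
# stands in for real sentence embeddings; the definitions below are
# illustrative, not the paper's evaluation metrics.
import math
from collections import Counter

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def embed(utterance: str) -> Counter:
    """Toy 'embedding': a bag of lowercased tokens."""
    return Counter(utterance.lower().split())

def local_coherence(dialogue: list[str]) -> float:
    """Mean similarity between adjacent utterances (sentence-to-sentence)."""
    sims = [cosine(embed(a), embed(b)) for a, b in zip(dialogue, dialogue[1:])]
    return sum(sims) / len(sims)

def global_coherence(dialogue: list[str]) -> float:
    """Mean similarity of each utterance to the whole-dialogue centroid."""
    centroid = Counter()
    for u in dialogue:
        centroid += embed(u)
    return sum(cosine(embed(u), centroid) for u in dialogue) / len(dialogue)
```

A dialogue that stays on topic scores high on both; a dialogue whose turns each make sense in isolation but drift between unrelated topics scores near zero locally, which is the distinction the two components are meant to separate.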

MUDI addresses the trade-off by incorporating discourse relations directly into the generation architecture. Using 16 discourse relation types from the STAC annotation scheme plus a topic-shift relation, an LLM (LLaMA-3-70B) annotates coherence relations between utterance pairs. A graph encoder (DialogueGAT) captures these interactive relationships, with Sentence-BERT initializing node features for sentence-level semantics. The key architectural additions are order information and turn information integrated via attention mechanisms.
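The data structure MUDI builds can be sketched structurally. In the snippet below, the relation names, feature fields, and alternating-speaker assumption are all illustrative; the real system uses the 16 STAC relation types plus topic-shift, LLM-produced annotations, Sentence-BERT node features, and a DialogueGAT encoder rather than this plain adjacency structure.

```python
# Structural sketch of a discourse-annotated dialogue graph: utterance
# nodes carry order and turn information; edges carry discourse relations.
from dataclasses import dataclass, field

# A few STAC-style relation names plus the added topic-shift (illustrative
# subset, not the full 16-relation scheme).
RELATIONS = {"question_answer_pair", "elaboration", "acknowledgement",
             "contrast", "topic_shift"}

@dataclass
class UtteranceNode:
    text: str
    order: int    # position in the dialogue (order information)
    turn: int     # speaker turn index (turn information)
    embedding: list[float] = field(default_factory=list)  # e.g. Sentence-BERT

@dataclass
class DiscourseEdge:
    src: int      # index of source utterance
    dst: int      # index of target utterance
    relation: str # one of RELATIONS, annotated by an LLM in MUDI

def build_graph(utterances, annotations):
    """annotations: (src, dst, relation) triples from the LLM annotator.
    Assumes two speakers alternating turns."""
    nodes = [UtteranceNode(text=u, order=i, turn=i % 2)
             for i, u in enumerate(utterances)]
    edges = [DiscourseEdge(s, d, r) for s, d, r in annotations
             if r in RELATIONS]  # drop anything outside the relation scheme
    return nodes, edges
```

The point of the structure is that a graph encoder attending over these nodes sees not just what was said, but how each utterance relates to its neighbors and where it sits in the dialogue.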

The broader principle: persona fidelity and contextual coherence must be jointly optimized, not separately measured. As argued in "Why does supervised learning fail to enforce persona consistency?", the training method (RL for consistency) and the generation architecture (discourse-aware generation for coherence) address different dimensions of the same problem. Neither alone is sufficient.
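A hedged sketch of what "jointly optimized" means in practice: score each response on one combined objective rather than reporting persona and coherence separately. The weighting and the component scores below are hypothetical, not from the paper.

```python
# Hypothetical combined objective: alpha trades persona fidelity
# against coherence instead of measuring them in isolation.
def joint_score(persona_score: float, local_coh: float,
                global_coh: float, alpha: float = 0.5) -> float:
    coherence = 0.5 * (local_coh + global_coh)
    return alpha * persona_score + (1 - alpha) * coherence

# Verbatim persona restatement: perfect persona score, poor coherence.
restated = joint_score(persona_score=1.0, local_coh=0.2, global_coh=0.2)
# Contextually grounded reply: lower persona score, high coherence.
grounded = joint_score(persona_score=0.7, local_coh=0.9, global_coh=0.8)
assert grounded > restated  # the joint objective prefers the grounded reply
```

Under a separate-measurement regime the restated response wins on the persona leaderboard; under the joint objective it loses, which is the measurement change the note argues for.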

This connects to the three-failure-modes analysis. Building on "Why do static persona descriptions produce repetitive dialogue?", the persona-restatement failure identified by MUDI is a fourth failure mode: not just repetitiveness, shallowness, and contradiction, but contextual irrelevance, i.e. generating persona-consistent but conversationally inappropriate responses.

