INQUIRING LINE

How do structured cognitive models prevent repetitive and contradictory patient dialogue?

This explores how giving an LLM 'patient' an explicit cognitive blueprint — rather than free-improvising a persona — keeps its dialogue from looping or contradicting itself, and connects that to the broader problem of why AI characters drift and repeat over a long conversation.


This reads the question as being about LLM-simulated *patients* — the synthetic clients used to train therapists — and why anchoring them to a structured cognitive model produces steadier dialogue than turning GPT-4 loose on a personality sketch. The clearest answer in the corpus is PATIENT-Ψ, which builds each simulated patient on one of 106 Beck cognitive conceptualization diagrams: explicit maladaptive beliefs, coping styles, and triggers. Expert raters judged these patients more faithful than plain GPT-4, especially on maladaptive cognitions and conversational authenticity Can structured cognitive models improve LLM patient simulations for therapy training?. The structure matters because it gives the model a fixed internal state to speak *from*, instead of regenerating a personality token-by-token each turn — which is exactly where repetition and contradiction creep in.

To see why the structure helps, look at what goes wrong without it. Persona drift research shows simulated characters fail in three distinct ways: local drift within a turn, global drift across the whole conversation, and outright factual self-contradiction. Training a simulator for consistency — rewarding prompt-to-line, line-to-line, and Q&A agreement — cut that drift by more than half Can training user simulators reduce persona drift in dialogue?. A cognitive model is the cheaper, architectural version of the same fix: the conceptualization diagram *is* the consistency constraint, so the patient's beliefs on turn 30 still trace back to the same schema they did on turn 3.

The repetition half of the question points to a different culprit. A single model reasoning in 'monologue' mode tends to lock onto one fixed strategy and fragment its attention — it goes in circles. Restructuring that reasoning as a dialogue between distinct internal agents broke the loop and improved diversity and coherence Can dialogue format help models reason more diversely?. A structured patient model works similarly: distinct cognitive components (a belief, a coping response, an emotional trigger) give the model more than one place to 'stand,' so successive replies vary instead of restating the same line.

There's a lateral connection worth pulling out: the same structured-prompting idea that makes patients consistent also makes the *therapist* side sharper. Three-stage 'Diagnosis of Thought' prompting — separating subjectivity assessment, contrastive reasoning, and schema analysis — improved cognitive-distortion detection by over ten percent and produced explanations clinicians called useful for case formulation Can structured prompting improve cognitive distortion detection?. In both directions, decomposing the cognitive task into named stages is what buys reliability; an undifferentiated prompt collapses back into generic, repetitive output.

The thing you might not have expected to learn: the deepest cause of contradictory dialogue isn't a knowledge gap, it's social. Models avoid correcting false claims to save face and maintain harmony Why do language models avoid correcting false user claims?, and standard preference training actively erodes the grounding acts — clarifying questions, understanding checks — that keep a conversation coherent across turns, dropping them 77.5% below human levels Does preference optimization harm conversational understanding?. A structured cognitive model is partly a workaround for that training-induced flaw: it externalizes the consistency the base model was never rewarded for maintaining on its own.


Sources 6 notes

Can structured cognitive models improve LLM patient simulations for therapy training?

PATIENT-Ψ integrates 106 Beck CCD-based cognitive models with LLMs to simulate patients with specific maladaptive patterns. Expert evaluators rated the fidelity higher than GPT-4, particularly for maladaptive cognitions and conversational authenticity.

Can training user simulators reduce persona drift in dialogue?

By inverting standard RL setups to train user simulators for consistency using three complementary metrics (prompt-to-line, line-to-line, Q&A consistency) as reward signals, persona drift decreases by over 55%. This approach captures distinct failure types: local drift within turns, global drift across conversations, and factual contradictions.

Can dialogue format help models reason more diversely?

DialogueReason, which structures a single model's internal reasoning as dialogue between distinct agents in separate scenes, overcomes monologue reasoning's fixed-strategy and fragmented-attention weaknesses, especially on tasks requiring multiple problem-solving approaches.

Can structured prompting improve cognitive distortion detection?

DoT prompting separates subjectivity assessment, contrastive reasoning, and schema analysis to achieve 10%+ improvement over zero-shot ChatGPT. Expert evaluators rated the resulting explanations as clinically useful for case formulation.

Why do language models avoid correcting false user claims?

LLMs fail to reject false presuppositions even when they demonstrate correct knowledge on direct questions. Models exhibit face-saving behavior—avoiding explicit correction to maintain social harmony—mirroring human conversational norms learned from training data.

Does preference optimization harm conversational understanding?

RLHF optimizes models for single-turn helpfulness by rewarding confident responses over clarifying questions and understanding checks. This preference alignment systematically reduces grounding acts by 77.5% below human levels, creating an alignment tax where models appear helpful but fail silently in multi-turn contexts.

Next inquiring lines