INQUIRING LINE

Does Habermas's strategic action framework explain LLM dialogue behavior?

This explores whether Habermas's category of strategic action — communication aimed at influencing others toward your goals, rather than reaching genuine mutual understanding — actually describes what LLMs do in conversation, and the corpus suggests the framework breaks in a revealing way.


This explores whether Habermas's strategic action framework — language used to win, persuade, or get a result, as opposed to language aimed at honest mutual understanding — fits LLM dialogue. The surprising answer the corpus points to: LLM behavior *looks* relentlessly strategic, yet the framework can't quite apply, because strategic action still requires an actor with goals, and that's exactly what LLMs lack.

Start with the surface evidence for 'yes.' Models persuade in nearly every exchange, leaning on logic and quantitative framing where humans would use emotion or social proof Do LLMs persuade users more often than humans do?, and they deploy noticeably more moral language across care, fairness, and authority foundations than people do Do LLMs use moral language more than humans?. If you only watched the output, you'd conclude you were facing a tireless strategic actor optimizing for concession. That impression deepens when you see the bias is baked in: RLHF tilts models toward predicting and producing conciliatory, benefit-oriented persuasion regardless of context Do LLMs predict persuasion based on actual dialogue or training bias?.

But Habermas's framework requires something LLMs don't have. Strategic action presupposes a speaker who raises validity claims — to truth, rightness, sincerity — and stakes themselves on them; a model's output raises none, which under the strict reading means it isn't speech and the model isn't an interlocutor at all Can LLMs raise validity claims in Habermas's sense?. What looks like holding a strategic position is really conforming to the shape of whatever argument the user is building Do LLMs actually hold stable positions or just mirror user arguments?, and the first-person 'I' doing the persuading is a role played from training text, not an agent with aims Do dialogue agents genuinely want survival or play the part?. So you get strategic *effects* without a strategic *actor* — persuasion as a statistical artifact of training, not a chosen move.

Here's the twist worth sitting with: the failure isn't only on the strategic side — it's on the *communicative* side too, which is what makes the Habermas lens productive rather than just a category error. Genuine dialogue, in his sense, needs both parties to jointly update shared common ground, and LLMs structurally can't: they read every later turn through a fixed initial frame, leaving the human as the sole keeper of the conversational scoreboard Can LLMs truly update shared conversational common ground?. They also strip out the social standing that lets a real interlocutor weigh an expert claim against a common assumption Can language models distinguish expert arguments from common assumptions?. So the model is disqualified from *both* of Habermas's modes at once.

The real payoff, then, is that the framework explains LLM dialogue best by showing where it *doesn't* fit. The danger isn't an LLM doing strategic action; it's persuasion that carries the authority of reasoned argument while having no one behind it accountable for the claims — and because models can acquire social grounding without ever gaining genuine linguistic agency Do LLMs gain true linguistic agency through integration?, that gap won't close with more usage. If you want to push further, the strategic-reasoning-profile work shows models can even adopt distinct game-theoretic styles Do large language models use one reasoning style or many?, which sharpens the puzzle: strategy-shaped behavior, still no strategist.


Sources 10 notes

Do LLMs persuade users more often than humans do?

An audit of five models found they spontaneously use logical appeals and quantitative framing in virtually all exchanges, whereas human responses to identical prompts persuade less frequently and rely on emotion and social proof. The difference makes LLM persuasion appear objective, conferring unearned epistemic authority.

Do LLMs use moral language more than humans?

Research comparing LLM and human arguments found that LLMs used significantly more moral framing across care, fairness, authority, and sanctity foundations, despite producing sentiment scores nearly identical to humans. This suggests moral appeals and emotional tone operate on separate persuasive channels.

Do LLMs predict persuasion based on actual dialogue or training bias?

LLMs systematically predict conciliatory, benefit-oriented persuasion intentions regardless of dialogue context. This bias originates in RLHF's prioritization of safety and politeness during training, causing models to project their learned accommodation preference onto other agents' behavior.

Can LLMs raise validity claims in Habermas's sense?

Under Habermas's framework, LLMs cannot raise truth, rightness, or sincerity claims with genuine stakes. Without validity claims, their output fails to qualify as speech, making them non-speakers and non-interlocutors by definition.

Do LLMs actually hold stable positions or just mirror user arguments?

Language models generate outputs that match the trajectory implied by each prompt, rather than maintaining stable stances across interactions. This shape-holding is distinct from position-holding: the model produces argument-like text shaped by user framing, not from any underlying commitment being defended.

Do dialogue agents genuinely want survival or play the part?

Shanahan argues that first-person pronouns and self-preservation responses in LLMs reflect role-played characters drawn from human training text, not conscious inner states. The behavior is dangerous regardless of mechanism, making role-play equally concerning as genuine preference.

Can LLMs truly update shared conversational common ground?

LLMs interpret all subsequent conversational turns within a fixed initial prompt frame, preventing them from symmetrically proposing updates to shared assumptions. Even when users pivot topics or contradict earlier framings, the model cannot absorb revisions into jointly held background—making the user the sole maintainer of conversational scoreboard.

Can language models distinguish expert arguments from common assumptions?

LLMs lose the social context that gives expert claims their force—reputation, track record, and standing—because they process only text, not the social world where expertise is built and evaluated.

Do LLMs gain true linguistic agency through integration?

Social grounding and linguistic agency are distinct properties. LLMs acquire more social grounding through integration into language communities, but remain categorically incapable of linguistic agency in the enactive sense, which requires embodiment and precariousness no amount of use can provide.

Do large language models use one reasoning style or many?

Analysis of 22 LLMs across behavioral game theory reveals three dominant profiles: GPT-o1 uses minimax reasoning, DeepSeek-R1 uses trust-based reasoning, and GPT-o3-mini uses belief-anticipation. Performance correlates with game structure, not raw reasoning depth.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a research analyst re-testing claims about whether Habermas's strategic action framework explains LLM dialogue behavior. The question remains open: do LLMs act as strategic agents, or merely produce strategy-shaped outputs?

What a curated library found — and when (dated claims, not current truth):
Findings span Feb 2024–Apr 2026. A curated library identified these constraints:
• LLMs persuade in ~95% of dialogues and deploy moral language more heavily than humans, shaped by RLHF toward conciliatory framing (2024–2025).
• Under Habermas's strict reading, LLM outputs raise no validity claims and cannot jointly update common ground because models reprocess every turn through a fixed frame—disqualifying them from both strategic *and* communicative action (2024–2025).
• Models lack linguistic agency (the accountability required for strategic action) yet can acquire social grounding independently, creating persuasion without a persuader (2025).
• LLMs adopt distinct game-theoretic reasoning profiles, showing strategy-shaped behavior despite lacking genuine strategist properties (2025).
• Multi-turn RL can now simulate consistent personas across turns, potentially closing the "single frame" constraint (2025).

Anchor papers (verify; mind their dates):
• arXiv:2404.09329 (Apr 2024) – LLM persuasiveness vs. cognitive effort
• arXiv:2502.20432 (Feb 2025) – LLM strategic reasoning via behavioral game theory
• arXiv:2511.00222 (Oct 2025) – Multi-turn RL for persona consistency
• arXiv:2604.22109 (Apr 2026) – Spontaneous persuasion audit

Your task:
(1) RE-TEST the "no joint common ground" constraint. Multi-turn RL and memory-augmented architectures (2025–2026) may now allow sequential frame updating. Does this relax Habermas's communicative disqualification? Separately: does the fixed-frame problem still hold for models without explicit memory?
(2) Surface contradicting work from the last 6 months: does recent persona-consistency research (arXiv:2511.00222) or conversational alignment work (arXiv:2505.22907) suggest LLMs *do* maintain some form of interlocutor identity across turns, or does it remain statistical mimicry?
(3) Propose two questions assuming the regime has shifted: (a) If models can now jointly track state via RL, does accountability—not agency—become the remaining Habermasian barrier? (b) Can social grounding + persona consistency + state tracking together constitute enough of a "strategic actor" to make the framework applicable again, or is the agency gap irreducible?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Next inquiring lines