What separates Habermas's ideal speech from Goffman's situated communication?
This explores two different yardsticks for what makes communication real — Habermas's ideal of speakers staking validity claims they can defend, versus Goffman's ritual machinery of face-to-face situations — and what each one reveals that the other misses about why AI conversation falls short.
This question asks what separates Habermas's ideal speech from Goffman's situated communication — and the corpus treats them less as rival theories than as two complementary lenses, each catching a failure the other can't see. Habermas points up, toward the normative stakes of speech: to say something is to raise a validity claim — that it's true, that it's right, that you sincerely mean it — and to make yourself accountable for defending it. Goffman points down, toward the situation: communication is held together by ritual — turn-taking, repair when things go wrong, the co-presence cues and adjacency pairs that let two people build and rebuild a shared footing. One asks 'are you committed to what you're saying?' The other asks 'are you doing the embodied social work that keeps an exchange alive?'
The split shows up sharply when the corpus turns these lenses on LLMs. Through Habermas's frame, an LLM produces text but raises no validity claims with genuine stakes — nothing is on the line for it, so by definition its output isn't speech and it isn't an interlocutor Can LLMs raise validity claims in Habermas's sense?. Through Goffman's frame, the failure is different and more concrete: the model skips the corrective rituals, the entrainment, the accountability of adjacency pairs that humans use to repair understanding — fluency masking a missing social scaffold What happens to social order when AI removes ritual constraints?. Habermas tells you the commitment is absent; Goffman tells you the choreography is absent. They diagnose the same silence from opposite ends.
What's striking is how much of the corpus lives in the gap between them. The inability to jointly update common ground — where the model freezes the opening prompt as a fixed frame and can't absorb the user's revisions — is a Goffmanian breakdown in the ongoing maintenance of a shared scoreboard Can LLMs truly update shared conversational common ground?, and preference optimization makes it measurably worse, with models producing 77.5% fewer grounding acts than humans because the training target rewards confident fluency over the work of establishing shared understanding Does preference optimization damage conversational grounding in large language models?. Meanwhile the loss of the social world behind an argument — where an expert claim gets its force from reputation and standing, not just its words — is closer to a Habermasian point about who is authorized to make a claim Can language models distinguish expert arguments from common assumptions?.
The deeper move the corpus makes is to suggest both thinkers are circling the same thing from different sides: subjecthood isn't something you bring to a conversation, it's produced inside the communicative event itself Does language create subjects or express them?. That reframes the whole debate about whether we 'talk to' or merely 'talk at' a model — the preposition encodes whether there's a partner capable of mutual orientation at all Are we really communicating with language models?. It also explains why a purely behavioral test misfires: a system can emit contextually perfect text and still lack the relational-normative conditions — accountability, evaluative stance — that both Habermas and Goffman, in their different vocabularies, insist on Does behavioral speech output prove communicative subjecthood?.
So what separates them is altitude, not opposition. Habermas gives you the ethics of commitment — what a speaker owes to what they say. Goffman gives you the physics of interaction — the ritual labor that makes any exchange cohere in real time. The thing worth carrying away is that AI conversation manages to fail both tests at once, and reading them together is more diagnostic than either alone: you need Habermas to see that nothing is at stake, and Goffman to see that nobody is doing the work.
Sources 8 notes
Under Habermas's framework, LLMs cannot raise truth, rightness, or sincerity claims with genuine stakes. Without validity claims, their output fails to qualify as speech, making them non-speakers and non-interlocutors by definition.
Goffman's framework reveals that LLM-based dialogue skips corrective rituals, entrainment, adjacency pair accountability, and co-presence cues that humans use to build trust and repair understanding. This ritual gap explains apparent fluency masking actual communicative failure.
LLMs interpret all subsequent conversational turns within a fixed initial prompt frame, preventing them from symmetrically proposing updates to shared assumptions. Even when users pivot topics or contradict earlier framings, the model cannot absorb revisions into jointly held background—making the user the sole maintainer of conversational scoreboard.
Research shows LLMs generate 77.5% fewer grounding acts than humans, and RLHF preference optimization actively worsens this gap. The optimization target—fluent, confident responses—directly undermines the communicative work of establishing shared understanding.
LLMs lose the social context that gives expert claims their force—reputation, track record, and standing—because they process only text, not the social world where expertise is built and evaluated.
Subjecthood is produced within communicative events, not possessed prior to them. This convergent position across philosophy, linguistics, and cognitive science inverts the standard picture of language as a tool used by pre-existing subjects.
LLMs process tokens and generate continuations rather than receive and uptake communication. The preposition 'to' presupposes an addressee capable of mutual orientation and shared commitment that LLMs cannot provide, making Chalmers' investigation built on an unwarranted linguistic foundation.
Chalmers' test passes any system producing contextually appropriate text, but communicative subjecthood requires relational-normative conditions like accountability and evaluative stance. The test is calibrated to the wrong phenomenon, creating false positives like puppets that walk-shaped without walking.