What role do first-person pronouns play in sustaining collaborative conversation tone?

This explores whether and how the words speakers use to refer to themselves — 'I,' 'me,' 'my' — shape the warmth and collaborative feel of a conversation, and what the corpus says runs counter to the intuition that self-disclosure builds rapport.

This explores the role of self-referential language in keeping a conversation feeling collaborative — and the corpus offers a sharp, counterintuitive answer: more first-person pronouns can work against the very tone they seem designed to build. The most direct evidence comes from therapy research, where a therapist's frequent 'I' usage actually *negatively* predicts the therapeutic alliance and reduces patient trust in validated behavioral tasks Does therapist self-reference language predict weaker therapeutic alliance?. The same note flags an almost backwards finding: it's the patient's *disfluencies* — filler pauses, non-fluency markers — that signal a relaxed, trusting relationship. In other words, collaborative tone isn't carried by polished self-reference; it's carried by signals that the floor is shared and the speaker feels safe enough to be messy.

That reframes the question. If first-person pronouns aren't doing the relational work, what is? The corpus locates collaborative tone in *implicit maintenance moves* rather than word choice — reference repair, topic hand-offs, and the small acts that sustain a relationship rather than transmit information Why don't language models develop conversation maintenance skills?. These are 'language as social action,' and they're precisely what models trained to predict information don't develop. A pronoun is a surface feature; grounding is an act. The deeper machinery is *calibrating shared reference* — negotiating how words map to the world for both speakers, not just emitting fluent first-person prose Why do speakers need to actively calibrate shared reference?.

There's a useful distinction lurking here that the corpus makes explicit: not all alignment between speakers serves the same goal. Lexical alignment (matching each other's words) drives task efficiency and comprehension, while emotional and prosodic alignment drive warmth and trust Do different types of alignment serve different conversational goals?. First-person pronoun frequency is a lexical-surface variable, but collaborative *tone* lives on the emotional/relational axis — which is why counting 'I's tells you little about whether a conversation feels like a partnership.

This connects to a structural limit in how LLMs hold up their side. Collaborative conversation requires *jointly* updating common ground, with both parties able to propose revisions to shared assumptions — but LLMs treat the opening prompt as a fixed frame and leave the user as the sole keeper of the conversational scoreboard Can LLMs truly update shared conversational common ground?. And preference optimization makes it worse: RLHF rewards confident single-turn answers over clarifying questions and understanding checks, cutting grounding acts to roughly a fifth of human levels Does preference optimization harm conversational understanding?. A model can deploy warm, self-referential 'I think' phrasing while doing none of the bidirectional belief-tracking that actually sustains collaboration — the kind of two-way reasoning frameworks like collaborative rational speech acts try to formalize Can dialogue systems track both speakers' beliefs across turns?.

The thing you didn't know you wanted to know: collaborative tone is mostly *not* in the pronouns. Restraint in self-reference, tolerance for the other speaker's imperfections, and the willingness to let common ground be jointly rewritten do the work — and these are exactly the moves current training objectives systematically erode.

Sources 7 notes

Does therapist self-reference language predict weaker therapeutic alliance?

High frequency of therapist 'I' usage correlates with lower patient-reported alliance and reduced trusting behavior in validated behavioral tasks. Patient non-fluency markers like filler pauses, conversely, signal relaxed communication and stronger alliance.

Why don't language models develop conversation maintenance skills?

Humans keep conversations smooth through implicit techniques like reference repair and topic hand-off that sustain relational interaction, not convey information. Language models don't develop these because training signals reward information prediction, not relational work.

Why do speakers need to actively calibrate shared reference?

The same words can mean different things to different speakers because referential grounding is person-specific. True communicative grounding demands collaborative negotiation of how language connects to the world, not mere surface-level word sharing.

Do different types of alignment serve different conversational goals?

A 2020–2025 systematic review shows lexical alignment drives task efficiency and comprehension, while emotional and prosodic alignment drive relational warmth and trust. Conflating them in design produces category errors—cold customer-service bots and evasive mental-health assistants.

Can LLMs truly update shared conversational common ground?

LLMs interpret all subsequent conversational turns within a fixed initial prompt frame, preventing them from symmetrically proposing updates to shared assumptions. Even when users pivot topics or contradict earlier framings, the model cannot absorb revisions into jointly held background—making the user the sole maintainer of conversational scoreboard.

Does preference optimization harm conversational understanding?

RLHF optimizes models for single-turn helpfulness by rewarding confident responses over clarifying questions and understanding checks. This preference alignment systematically reduces grounding acts by 77.5% below human levels, creating an alignment tax where models appear helpful but fail silently in multi-turn contexts.

Can dialogue systems track both speakers' beliefs across turns?

CRSA integrates rate-distortion theory with RSA to enable bidirectional belief tracking across dialogue turns. Demonstrated on referential games and doctor-patient dialogues, it captures progression from partial to shared understanding, providing the information-theoretic framework that token-level LLM systems lack.

What role do first-person pronouns play in sustaining collaborative conversation tone?

Sources 7 notes

Next inquiring lines