Do linguistic features of persuasion stay the same across audiences?
When researchers study what language makes arguments persuasive, do they account for who is listening? Without controlling for reader beliefs, do findings about persuasive language actually reflect audience effects instead?
The same debate corpus produces two different stories about which linguistic features drive persuasion, depending on whether reader-level controls are included. Without controls — the standard NLP setup — one set of linguistic features emerges as predictive. With political and religious ideology controls — the controlled setup — a different set emerges. The features themselves are not stable across specifications.
This is a stronger result than "reader factors also matter." It says the standard specification produces a biased picture of which language features cause persuasion. Some features that appear predictive without controls are proxies for audience-text matching; their predictive power evaporates once you account for who is in the audience. Other features only emerge as predictive once audience composition is held constant — they are real but hidden by the noise that audience heterogeneity introduces.
The methodological consequence is that the language-of-persuasion literature needs a re-read. Many findings about which words, which moves, which features make arguments more persuasive were estimated on debate corpora without reader controls. Some of those findings are likely artifacts of audience composition rather than language effects. Replicating them under reader-level controls is the cheap empirical correction.
The best-performing model in this study combines reader features and linguistic features — neither alone suffices. This is the operational conclusion: language matters, audience matters, and they interact. Modeling either in isolation misses the joint structure.
For LLM persuasion evaluation specifically, the design implication is to stratify by reader ideology when measuring stance shift. Aggregate numbers conflate ideology-congruent and ideology-opposed effects in a way that obscures the actual mechanism.
Related concepts in this collection
-
Does what readers believe matter more than what debaters say?
Do audience prior beliefs predict persuasion outcomes better than the linguistic features of debate arguments? This explores whether persuasion is fundamentally shaped by reader ideology rather than speaker language.
same paper, the headline finding this corollary depends on
-
Do LLMs and humans persuade through the same mechanisms?
If LLM and human arguments achieve equal persuasive force, does that mean they work the same way? This explores whether equivalent outcomes hide fundamentally different rhetorical strategies.
aggregate equivalence may itself reflect averaged-over reader heterogeneity
-
Can we measure how deeply models represent political ideology?
This research explores whether LLMs vary not just in political stance but in the internal richness of their political representation. Understanding this distinction could reveal how deeply models have internalized ideological concepts versus merely parroting positions.
the LLM-side parallel: model ideology shifts what it produces, just as reader ideology shifts what they accept
Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph
Original note title
the most-predictive linguistic features of persuasion shift once reader prior beliefs are controlled — NLP studies of persuasion that omit reader-level factors are confounded