Language Understanding and Pragmatics Psychology and Social Cognition

Can humans detect AI by passively reading its text?

When people read AI-generated transcripts without the ability to ask follow-up questions, can they tell it apart from human writing? This matters because most real-world AI encounters are passive.

Note · 2026-02-23 · sourced from Social Theory Society
What kind of thing is an LLM really? Where exactly does language competence break down in LLMs?

The displaced Turing test introduces a human judge who reads a transcript of a Turing test conducted by a different interrogator. The inverted test places an AI in the interrogator role. Both variations reveal that removing interactivity collapses detection capability.

Key finding: both AI and displaced human judges were less accurate than interactive interrogators, with below-chance accuracy overall. The interactive interrogator's advantage is the ability to adapt questions adversarially in real time — pursuing fruitful lines of questioning and probing inconsistencies as they emerge. Passive consumption of the same conversation loses this adaptive capacity entirely.

This has direct ecological validity implications. Most real-world AI content consumption is passive — reading chatbot outputs, encountering AI text in search results, seeing AI-generated social media posts. If interactive interrogation only marginally succeeds at detection (GPT-4 fools interrogators 54% of the time), passive consumption is far worse.

The inverted test (AI as interrogator) tests what Watt (1996) called "naïve psychology" — whether an AI system has the innate tendency to recognize intelligence similar to its own. The results suggest AI judges lack this capacity: they cannot reliably discriminate between humans and machines that fooled human interrogators.

Since Can humans detect AI writing if it looks natural?, there is a paradox: statistical analysis reveals clear AI signatures, but neither interactive nor passive human judgment reliably detects them. The detection problem is not that AI text has no distinguishing features — it's that human perception cannot access them.

Since How can AI text disrupt structure yet feel normal to readers?, the displaced Turing test confirms the mechanism: when you remove provenance information (who generated this?), the text enters the same interpretive circuits regardless of origin.

Since Does AI text affect readers the same way human text does?, passive consumption is where this equivalence matters most. If observers cannot tell, then AI text is functionally indistinguishable from human text in the settings where most content consumption actually occurs.


Source: Social Theory Society

Related concepts in this collection

Concept map
13 direct connections · 85 in 2-hop network ·medium cluster

Click a node to walk · click center to open · click Open full network for a force-directed map

your link semantically near linked from elsewhere
Original note title

passive observers cannot distinguish AI from humans even when interactive interrogators partially can