Can rarity in feature space distinguish human authorship from AI output reliably?
This explores whether you can tell human writing from AI by measuring how *unusual* a text is — the idea that humans land in rarer, more scattered regions of some feature space while AI clusters in the predictable middle.
This explores whether "rarity" — being statistically unusual in some measured feature space — is a reliable tell for human authorship. The corpus says the signal is real and, in places, surprisingly strong, but "reliably" hides two very different questions: reliable for a machine, or reliable for a person?
The rarity idea is most explicitly developed in narrative work. StoryScope operationalizes originality as statistical rarity in *discourse-level* decisions — character agency, how time is sequenced — and finds human stories genuinely occupy rarer regions of that space while AI outputs cluster tightly together Can statistical rarity measure whether stories are truly original?. Crucially, the discriminating features aren't surface style; the same approach hits ~93% accuracy using structure alone, and that structure resists "humanization" because faking it requires a rewrite, not a word swap Can AI stories be detected without analyzing writing style?. So rarity isn't just a statistical artifact — it points at something AI structurally does differently.
Why would AI cluster? Because models converge. Analysis of 70+ models across 26K open-ended prompts found an "Artificial Hivemind": different systems independently produce strikingly similar outputs, since they share training data and alignment procedures Do different AI models actually produce diverse outputs?. That convergence is the flip side of rarity — if all the models pile into the same region, anything *outside* that region reads as human. And the gap is measurable along concrete axes: AI text diverges from human writing across six dimensions of lexical diversity, robustly enough to show up under statistical testing Can human judges detect measurable differences in AI text?.
Here's the catch that should reframe the whole question. "Reliably" for a *detector* and "reliably" for a *human reader* are not the same thing. The exact same measurable lexical differences that machines catch are invisible to people — even trained linguists and NLP researchers can't reliably tell the two apart, and newer models diverge *further* statistically while getting *harder* for humans to spot Can humans detect AI text if machines can measure it?. So rarity-in-feature-space can be a reliable machine signal precisely *because* it's imperceptible to us. For arguments rather than fiction, cheap interpretable linguistic features hit 99% accuracy spotting LLM counter-arguments — AI leaves signatures like over-accommodation to the prompt and textbook-clean argument markers humans don't bother to produce Can simple linguistic features detect AI-written arguments?.
The honest answer: rarity is a strong, reliable signal *today*, but it's a moving target, not a permanent fingerprint. It works because current models converge into a measurable common region; it's defined relative to that region, so it drifts as models change. And it tells you a text is statistically machine-typical — not that the idea behind it is hollow. The corpus's deeper claim is that AI lacks the *event structure* of genuine utterance and we supply the missing intent ourselves Does AI generate genuine utterances or just text patterns?. Rarity may be the closest measurable shadow of that absence — but it's a shadow, not the thing itself.
Sources 7 notes
StoryScope operationalizes originality as statistical rarity in discourse-level narrative decisions. Human stories are measurably rarer in this space than AI outputs, which cluster tightly, offering a quantifiable proxy for the human conception copyright law requires.
StoryScope achieved 93.2% accuracy separating AI from human fiction using only discourse-level features like character agency and chronological structure, retaining 97% of performance while eliminating stylistic cues. These structural choices resist humanization because they require rewrites, not surface edits.
INFINITY-CHAT analyzed 70+ models across 26K open-ended queries and found an "Artificial Hivemind" effect: models independently generate strikingly similar or identical responses due to overlapping training data and alignment procedures, undermining the diversity benefits of model ensembles.
Six-dimension MANOVA analysis confirms significant differences between ChatGPT and human writing across vocabulary volume, abundance, variety, evenness, disparity, and dispersion. Despite these robust statistical differences, human judges including linguists and NLP researchers fail to reliably distinguish AI from human text.
LLM-generated text differs significantly on six lexical diversity dimensions, confirmed through statistical analysis across multiple models. Yet human judges, including trained linguists, cannot reliably detect these differences—and newer models diverge further while becoming harder to spot.
General linguistic features combined with argument-quality measures achieved 99% accuracy detecting LLM-generated counter-arguments on r/ChangeMyView, matching heavyweight neural detectors while remaining computationally cheap and transparent. LLMs produce detectable stylistic signatures: accommodation to prompts and textbook-quality argument markers that humans don't replicate.
AI output carries communicative markers inherited from training data but lacks the event structure that produces actual utterances. Users supply the missing orientation through interpretive labor, creating a pseudo-event with structure only on the human side.