SYNTHESIS NOTE
Psychology, Society, and Alignment Reasoning, Retrieval, and Evaluation Language, Text, and Discourse

How often do legal AI tools actually hallucinate citations?

Legal vendors claim their AI research tools eliminate hallucinations, but do they? This preregistered study measures hallucination rates in leading commercial legal-research systems to test those marketing claims.

Synthesis note · 2026-06-03 · sourced from Domain Specialization

Legal-research vendors have marketed RAG-based tools as "eliminating," "avoiding," or guaranteeing "hallucination-free" citations. This preregistered empirical evaluation — the first of its kind — tests those claims and finds them overstated: while hallucinations are reduced relative to general-purpose GPT-4, LexisNexis (Lexis+ AI), Thomson Reuters (Westlaw AI-Assisted Research), and Ask Practical Law AI each hallucinate between 17% and 33% of the time, with substantial differences in responsiveness and accuracy across systems. The high-stakes context is real — lawyers have already been sanctioned for citing AI-invented cases.

The keeper is twofold: RAG reduces but does not eliminate hallucination even in a citation-grounded, high-stakes domain, so users must still verify; and the closed nature of these tools (no systematic access, no published benchmarks) makes the vendor claims unfalsifiable and responsible oversight "acutely difficult" — a marked contrast with the benchmarked open-AI field.

This is a domain-deployment anchor with a strong post angle (AI marketing vs measured reliability). It instantiates Why does retrieval-augmented generation fail in production? in law, complements Do LLMs overgeneralize when summarizing scientific research? (another measured fidelity-claim gap), and connects to Why do language models struggle with historical legal cases? on legal-AI reliability specifically.

Inquiring lines that use this note as a source 3

This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.

Related concepts in this collection 3

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map
14 direct connections · 144 in 2-hop network ·dense cluster Open in graph ↗

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Related papers in this collection 8

Papers most semantically related to this note, ranked by cosine similarity in the embedding space.

Original note title

legal AI research tools marketed as hallucination-free still hallucinate 17 to 33 percent of the time