Knowledge Retrieval and RAG

How do logic units preserve procedural coherence better than chunks?

Can structured retrieval units with prerequisites, headers, bodies, and linkers maintain step-by-step coherence in how-to answers where fixed-size chunks fail? This matters because procedural questions require sequential logic and conditional branching that chunk-based RAG cannot support.

Note · 2026-02-22 · sourced from Question Answer Search

RAG systems overwhelmingly use fixed-size chunks as their retrieval granularity. This works acceptably for factoid "5W" questions (who, what, where, when, why), where the answer is localized in one passage. It fails systematically for "1H" (how-to) questions, which require sequential, procedurally coherent answers in which step ordering, prerequisites, and conditional branching all matter.
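To make the failure concrete, here is a minimal sketch of fixed-size chunking applied to a short procedure. The procedure text, the 40-character chunk size, and the function name `fixed_size_chunks` are all hypothetical choices for illustration; real systems usually chunk by tokens and often add overlap, which this sketch does not model.

```python
def fixed_size_chunks(text: str, size: int) -> list[str]:
    """Split text into consecutive chunks of at most `size` characters."""
    return [text[i:i + size] for i in range(0, len(text), size)]


# Hypothetical three-step procedure with one conditional step.
steps = [
    "Step 1: Check server load.",
    "Step 2: If load is high, drain connections before restarting.",
    "Step 3: Restart the service.",
]
procedure = " ".join(steps)

chunks = fixed_size_chunks(procedure, 40)
for chunk in chunks:
    print(repr(chunk))

# The conditional step is longer than one chunk, so no single chunk holds it
# whole: its condition and its action can land in different retrieval units.
step_kept_whole = any(steps[1] in chunk for chunk in chunks)
print("Step 2 kept whole:", step_kept_whole)
```

Chunk boundaries here are set by character count alone, so they ignore where a step begins or ends. That is the sense in which a chunk cannot carry step ordering or a condition together with its consequence.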

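THREAD proposes logic units (LUs) as an alternative retrieval granularity with four components: a prerequisite, a header, a body, and a linker. The header expresses the intent the unit serves, and the linker points to the units that can follow it.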

The linker is what makes THREAD fundamentally different from chunk-based RAG. Chunks have no mechanism for specifying what should come next: retrieval of subsequent chunks relies on the same query or on the partial answer generated so far, and both become weaker retrieval signals as the procedure progresses. Linkers provide explicit navigation between steps, enabling branching paths (if server load is high → do X; if normal → do Y).
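The idea of a linker can be sketched as a small data structure. This is an illustrative sketch only, not THREAD's actual format: the `LogicUnit` class, the `walk` function, the `"next"` fallback label, and the example steps are assumptions made for this note. The linker is modeled as a mapping from an outcome label to the header of the next unit.

```python
from dataclasses import dataclass, field


@dataclass
class LogicUnit:
    header: str             # intent the unit addresses (the retrieval key)
    body: str               # the instructions for this step
    prerequisite: str = ""  # what must hold before the step applies
    # Linker: outcome label -> header of the unit to visit next (assumed shape).
    linker: dict[str, str] = field(default_factory=dict)


def walk(units: dict[str, LogicUnit], start: str,
         outcomes: dict[str, str]) -> list[str]:
    """Follow linkers from `start`, branching on the observed outcomes."""
    path: list[str] = []
    seen: set[str] = set()
    current = start
    while current is not None and current not in seen:
        seen.add(current)  # guard against cycles in the links
        unit = units[current]
        path.append(unit.body)
        # Use the outcome observed at this unit, or the default "next" link.
        choice = outcomes.get(current, "next")
        current = unit.linker.get(choice)
    return path


# Hypothetical units mirroring the server-load example.
units = {
    "check load": LogicUnit(
        header="check load",
        body="Check server load.",
        linker={"high": "drain", "normal": "restart"},
    ),
    "drain": LogicUnit(
        header="drain",
        body="Drain connections.",
        prerequisite="server load is high",
        linker={"next": "restart"},
    ),
    "restart": LogicUnit(header="restart", body="Restart the service."),
}

high_path = walk(units, "check load", {"check load": "high"})
normal_path = walk(units, "check load", {"check load": "normal"})
print(high_path)
print(normal_path)
```

The next step comes from the current unit's linker, not from re-running the original query, so the path stays ordered and can branch on a condition.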

This connects to the broader RAG failure mode raised in "Do vector embeddings actually measure task relevance?". The chunk+embedding approach fails doubly for procedural questions: embeddings can't capture sequential dependency, and chunks can't preserve it. Logic units address both by structuring retrieval around intent (header) and navigation (linker) rather than semantic similarity.



Original note title

logic units with prerequisite-header-body-linker structure preserve document coherence that fixed-size chunking destroys for procedural how-to questions