Can knowledge graphs enable multi-hop reasoning in one retrieval step?
Standard RAG retrieves once but misses chains; iterative RAG follows chains but costs more. Can we encode multi-hop paths in a knowledge graph so one retrieval pass discovers them all?
Multi-hop question answering requires finding information that depends on other information: "Who was the mentor of the person who founded X?" requires finding X's founder, then finding that person's mentor. Standard RAG retrieves once per question. Iterative RAG retrieves multiple times, issuing new queries based on intermediate answers. Both approaches have costs: single-step RAG misses the chain; iterative RAG is slow and expensive.
HippoRAG takes a third approach, inspired by hippocampal indexing theory of human memory. The corpus is converted into a schemaless knowledge graph: LLMs extract entities and relationships from passages, forming a graph where nodes are concepts and edges are relationships. This happens offline, at indexing time.
At query time: extract key concepts from the query; seed Personalized PageRank (PPR) on the knowledge graph using those concepts. PPR propagates importance scores through the graph from the seed nodes, naturally following edges (relationships) and discovering connected subgraphs. Passages associated with highly-ranked graph regions are retrieved.
The multi-hop capability emerges naturally from graph traversal: PPR follows relationship edges, which means it discovers concepts multiple hops away from the query seeds. The traversal is complete in a single retrieval step because the graph encodes all multi-hop paths.
Results: 20% better than state-of-the-art on multi-hop QA; single-step HippoRAG matches or beats iterative retrieval (IRCoT) while being 10-20x cheaper and 6-13x faster. Integrating HippoRAG into IRCoT brings further gains.
The memory analogy holds architecturally: the knowledge graph functions as an associative index (analogous to hippocampus) over perceptual content (analogous to neocortex), enabling pattern-completion retrieval from partial query cues.
Complementary KG-based approaches: GraphRAG (Microsoft) uses knowledge graphs for a fundamentally different purpose than HippoRAG. Where HippoRAG uses the KG for traversal-based retrieval (find specific facts via graph navigation), GraphRAG uses KG + Leiden community detection for modular summarization — answering global corpus questions ("What are the main themes?") that no single retrieval step can address. The distinction: HippoRAG exploits graph connectivity; GraphRAG exploits graph modularity. See Can community detection enable RAG systems to answer global corpus questions?. SymAgent takes yet a third approach: deriving symbolic rules from KG structure to create navigational plans that align natural language with graph topology — using the graph not for retrieval or summarization but for structural reasoning guidance. See Can symbolic rules from knowledge graphs guide complex reasoning?.
Source: RAG; enriched from Knowledge Graphs
Related concepts in this collection
-
Can query-time graph construction replace pre-built knowledge graphs?
Does building dependency graphs from individual queries at inference time offer a more flexible and cost-effective alternative to constructing knowledge graphs over entire document collections upfront?
LogicRAG inverts this: query-time graph vs corpus-time graph; HippoRAG pre-builds; both achieve relational reasoning via graph traversal with different cost/flexibility trade-offs
-
When do graph databases outperform vector embeddings for retrieval?
Vector similarity struggles with aggregate and relational queries that require traversing multiple entity connections. Can graph-oriented databases with deterministic queries solve this failure mode in enterprise domain applications?
same architectural principle; HippoRAG implements it with PPR over a schemaless KG rather than a structured graph database
-
Can community detection enable RAG systems to answer global corpus questions?
Standard RAG struggles with corpus-wide questions that require understanding overall themes rather than retrieving specific passages. Can graph community detection overcome this limitation at scale?
complementary KG approach: community-based summarization vs traversal-based retrieval
-
Can symbolic rules from knowledge graphs guide complex reasoning?
Can deriving symbolic rules directly from knowledge graph structure help align natural language questions with structured reasoning paths? This explores whether explicit structural patterns outperform semantic similarity for multi-hop inference.
third approach: KG structure for navigational reasoning guidance
Click a node to walk · click center to open · click Open full network for a force-directed map
Original note title
knowledge graph plus Personalized PageRank achieves multi-hop reasoning in a single retrieval step