Can a model's partial response guide what to retrieve next?
Can generation reveal implicit information needs that the original query cannot express? This explores whether using in-progress responses as retrieval signals outperforms upfront query formulation.
Standard RAG asks: "what documents are relevant to this query?" before any generation has occurred. The query is the only signal available. For complex tasks, the query is often an inadequate signal — it expresses what was asked but not what is needed to answer it fully.
ITER-RETGEN (Iterative Retrieval-Generation Synergy) demonstrates an alternative: use the model's current response to the task as the retrieval query. The model's response "shows what might be needed to finish the task" — it contains implicit signals about the gaps between what has been answered and what remains unaddressed.
The synergy is iterative: generate a response → use response as retrieval query → retrieve more relevant documents → regenerate with new context → repeat. Each generation round surfaces new implicit information needs that the original query did not express. Performance on multi-hop question answering, fact verification, and commonsense reasoning improves substantially over single-pass RAG.
This reframes what generation is for in RAG pipelines. Generation is not only the terminal output step — it is also a diagnostic step that identifies what retrieval should target next. The generator functions as both an answer producer and an information-need clarifier.
The connection to human information seeking: humans working on complex research do not submit all their queries upfront. They read, understand what they know and don't know, then query for the specific gaps that reading revealed. ITER-RETGEN operationalizes this workflow.
Source: RAG
Related concepts in this collection
-
When should retrieval happen during model generation?
Explores whether retrieval should occur continuously, at fixed intervals, or only when the model signals uncertainty. Standard RAG retrieves once; long-form generation requires dynamic triggering based on confidence signals.
complementary trigger mechanism: FLARE uses confidence (token probability) as the signal; ITER-RETGEN uses response content (what was generated so far) as the signal
-
What makes deep research fundamentally different from RAG?
Explores whether current systems using the label 'deep research' actually meet a rigorous three-component definition involving multi-step gathering, cross-source synthesis, and iterative refinement, or if they're performing something narrower.
ITER-RETGEN is an implementation of iterative query refinement as the third component of deep research
-
Do iterative refinement methods suffer from overthinking?
Iterative refinement approaches like Self-Refine structurally resemble token-level overthinking in o1-like models. Does revision across multiple inference calls reproduce the same accuracy degradation seen within single inferences?
ITER-RETGEN is iterative refinement with an external information escape: each iteration retrieves new evidence rather than re-processing the same context, avoiding the variance inflation that pure self-revision causes; the contrast shows that iterative refinement fails when information is held constant but may succeed when each iteration adds genuinely new knowledge
-
Does revising your own reasoning actually help or hurt?
Self-revision in reasoning models often degrades accuracy, while external critique improves it. Understanding what makes revision helpful or harmful could reshape how we design systems that need to correct themselves.
ITER-RETGEN is a concrete implementation of external-signal-driven revision: the retrieval signal is external (new documents), not internal (model's assessment of its own output); this positions ITER-RETGEN on the side of revisions that help rather than hurt
Click a node to walk · click center to open · click Open full network for a force-directed map
Original note title
model response quality is a retrieval signal — the partial answer reveals what information is still needed