LLM Reasoning and Architecture

Do reasoning cycles in hidden states reveal aha moments?

What if the internal loops in model reasoning—visible in hidden-state topology—correspond to the reconsidering moments that happen during reasoning? This note explores whether graph cyclicity captures a mechanistic signature of insight.

Note · 2026-02-22 · sourced from Reasoning Architectures

The Topology of Reasoning paper introduces an internal mechanistic lens for reasoning model performance that is distinct from the external graph taxonomy (CoT/ToT/GoT as formal graph types). By extracting reasoning graphs from hidden-state representations at each step — clustering hidden states to identify repeated states as cycles — it quantifies three graph-theoretic properties and shows they predict accuracy.
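The extraction step described above can be sketched in a few lines. This is a minimal illustration, not the paper's pipeline: the note says only that hidden states are clustered, so the nearest-centroid assignment with fixed centroids, the toy 2-D vectors, and the function names `assign_clusters`, `build_reasoning_graph`, and `revisited_nodes` are all assumptions made for the example.

```python
from math import dist

def assign_clusters(hidden_states, centroids):
    """Map each per-step hidden-state vector to the index of its nearest centroid."""
    return [min(range(len(centroids)), key=lambda k: dist(h, centroids[k]))
            for h in hidden_states]

def build_reasoning_graph(node_path):
    """Collect directed edges between consecutive, distinct cluster ids."""
    edges = set()
    for a, b in zip(node_path, node_path[1:]):
        if a != b:
            edges.add((a, b))
    return edges

def revisited_nodes(node_path):
    """Return cluster ids the path comes back to after having left them."""
    seen, revisits, prev = set(), [], None
    for node in node_path:
        if node != prev and node in seen:
            revisits.append(node)  # a repeated state closes a cycle
        seen.add(node)
        prev = node
    return revisits

# Hypothetical data: one 2-D "hidden state" per reasoning step, and three
# centroids assumed to come from an earlier clustering run.
centroids = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0)]
steps = [(0.1, 0.0), (0.9, 0.1), (1.0, 0.9), (0.05, 0.1), (0.95, 0.0)]

path = assign_clusters(steps, centroids)   # [0, 1, 2, 0, 1]
edges = build_reasoning_graph(path)        # {(0, 1), (1, 2), (2, 0)}
cycles = revisited_nodes(path)             # [0, 1]
```

In this toy trace the path leaves cluster 0, passes through clusters 1 and 2, and then returns to 0. That return is the kind of repeated state the note treats as a cycle.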

The three properties are cyclicity, diameter, and small-world structure.
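As a rough sketch of how such properties could be measured on an extracted graph, the code below checks for a directed cycle, computes the diameter as the longest shortest path, and computes the two usual ingredients of a small-world measure: mean clustering coefficient and mean shortest-path length. The paper's exact definitions and normalizations are not given in this note, so these formulas, the helper names, and the toy edge set are assumptions. The code also assumes integer node ids, no self-loops, and at least one edge.

```python
from collections import deque

def adjacency(edges, directed=True):
    """Build a node -> neighbour-set map from an iterable of (a, b) edges."""
    adj = {}
    for a, b in edges:
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set())
        if not directed:
            adj[b].add(a)
    return adj

def has_cycle(edges):
    """True if the directed graph contains at least one cycle (DFS colouring)."""
    adj = adjacency(edges)
    state = {}  # 1 = on the current DFS stack, 2 = fully explored

    def visit(u):
        state[u] = 1
        for v in adj[u]:
            if state.get(v) == 1 or (v not in state and visit(v)):
                return True
        state[u] = 2
        return False

    return any(u not in state and visit(u) for u in adj)

def shortest_paths(adj, src):
    """BFS hop counts from src to every reachable node."""
    hops = {src: 0}
    queue = deque([src])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in hops:
                hops[v] = hops[u] + 1
                queue.append(v)
    return hops

def diameter_and_mean_path(edges):
    """Longest and mean shortest-path length over reachable ordered pairs."""
    adj = adjacency(edges)
    lengths = [d for s in adj
               for t, d in shortest_paths(adj, s).items() if t != s]
    return max(lengths), sum(lengths) / len(lengths)

def mean_clustering(edges):
    """Average local clustering coefficient on the undirected graph."""
    adj = adjacency(edges, directed=False)
    coeffs = []
    for u, nbrs in adj.items():
        k = len(nbrs)
        if k < 2:
            coeffs.append(0.0)
            continue
        links = sum(1 for a in nbrs for b in nbrs if a < b and b in adj[a])
        coeffs.append(2 * links / (k * (k - 1)))
    return sum(coeffs) / len(coeffs)

# Hypothetical reasoning graph: a 3-cycle plus one exploratory branch.
toy_edges = {(0, 1), (1, 2), (2, 0), (2, 3)}
cyclic = has_cycle(toy_edges)
diameter, mean_path = diameter_and_mean_path(toy_edges)
clustering = mean_clustering(toy_edges)
```

High clustering combined with a short mean path length is the standard signature of small-world structure. A full index would normally compare both values against a random-graph baseline, which is omitted here.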

The aha moment connection: RL-trained models are reported to exhibit "aha moments," in which they reconsider intermediate answers during reasoning. From the hidden-state topology perspective, these moments correspond to cyclic structures in the reasoning graph, where the trajectory returns to a previously visited state. The paper thus recasts a phenomenon previously identified at the generated-token level as a property of internal representation dynamics.

Overthinking and underthinking reinterpreted: Overthinking corresponds to redundant cyclic structures (excessive cycling). Underthinking — observed in o1-family models — corresponds to overly large exploration diameter without adequate cycling back to check.

Design implication: Supervised fine-tuning on an improved dataset systematically expands reasoning graph diameters in tandem with performance gains, providing concrete guidelines for dataset construction aimed at boosting reasoning.

This adds a mechanistic dimension to the note "Can reasoning topologies be formally classified as graph types?", which covers external topology. Together they provide a two-layer analysis: what reasoning structure looks like from outside (CoT = chain, ToT = tree, GoT = graph) and what reasoning dynamics look like from inside (cycles, diameter, small-world structure).


Source: Reasoning Architectures

Original note title

reasoning graph topology (cyclicity, diameter, and small-world structure) correlates with reasoning performance and reveals the aha moment mechanism