
Can cognitive science methods unlock how LLMs actually work?

Does Marr's three-level framework—developed to understand biological minds—offer interpretability researchers the structured methodology they need to decode opaque language models?

Note · 2026-05-18 · sourced from Philosophy Subjectivity

David Marr's framework has been the backbone of cognitive science for decades. It separates three levels of analysis: the computational level (what abstract problem is the system solving), the algorithmic level (what representations and processes does it use), and the implementation level (what physical mechanisms realize the computations). The argument in "Levels of Analysis for Large Language Models" is that this framework now transfers usefully to LLM interpretability, because the field faces structurally the same problem cognitive science has faced for 70 years: opaque systems whose behavior is interesting and whose internals resist direct inspection.

The historical asymmetry was that cognitive science had a methodology and few systems to study, while AI had many systems and no methodology for understanding them. The asymmetry inverts now. Cognitive science's accumulated toolkit — behavioral probes, implicit association tests, double-dissociation paradigms, representational similarity analysis, causal interventions — was developed for one kind of mind and can be redeployed for another. The methodology was always more general than its initial object.
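Representational similarity analysis, one of the methods named above, is compact enough to sketch. The following is a minimal illustration, not the procedure of any particular paper: the activation matrices are random stand-ins (assumptions, not real model or brain data), and the helper names `rdm`, `upper_triangle`, `rank`, and `rsa_score` are hypothetical. The idea is to describe each system by how dissimilar its responses are across the same set of stimuli, then compare those dissimilarity structures rather than the raw activations.

```python
import numpy as np


def rdm(activations: np.ndarray) -> np.ndarray:
    """Representational dissimilarity matrix: 1 - Pearson r between stimulus rows."""
    return 1.0 - np.corrcoef(activations)


def upper_triangle(matrix: np.ndarray) -> np.ndarray:
    """Off-diagonal upper triangle, flattened (each stimulus pair once)."""
    i, j = np.triu_indices(matrix.shape[0], k=1)
    return matrix[i, j]


def rank(values: np.ndarray) -> np.ndarray:
    """Ordinal ranks; ties are not averaged, which is fine for continuous data."""
    order = np.argsort(values)
    ranks = np.empty(len(values), dtype=float)
    ranks[order] = np.arange(len(values))
    return ranks


def rsa_score(acts_a: np.ndarray, acts_b: np.ndarray) -> float:
    """Spearman-style correlation between the two systems' RDMs."""
    a = rank(upper_triangle(rdm(acts_a)))
    b = rank(upper_triangle(rdm(acts_b)))
    return float(np.corrcoef(a, b)[0, 1])


# Hypothetical data: 8 stimuli, two "systems" that encode them differently.
rng = np.random.default_rng(0)
stimuli = rng.normal(size=(8, 16))
system_a = np.tanh(stimuli @ rng.normal(size=(16, 32)))  # e.g. one model layer
system_b = np.tanh(stimuli @ rng.normal(size=(16, 24)))  # e.g. another system
unrelated = rng.normal(size=(8, 24))                     # no shared structure

print("a vs b:        ", rsa_score(system_a, system_b))
print("a vs unrelated:", rsa_score(system_a, unrelated))
```

Because the comparison happens at the level of stimulus-by-stimulus dissimilarities, the two systems need not share dimensionality or units, which is what lets the same analysis apply to a brain region and a transformer layer.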

The Marr framework does specific work in this redeployment. The computational level reframes interpretability questions around the abstract problem the LLM is solving (next-token prediction with learned objectives), independent of how it solves it. The algorithmic level surfaces the representations and processes (circuits, features, attention patterns) and the cognitive-architecture question (Newell, Anderson) of which level the algorithms run on. The implementation level connects representations to the artificial neurons that realize them.
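To make the three levels concrete, here is a toy sketch under heavy assumptions: a hand-built two-unit ReLU network that computes XOR, nothing like an actual LLM. The weights, the function names `hidden` and `forward`, and the `ablate` parameter are all hypothetical and chosen for illustration. The truth table plays the role of the computational level, the two hidden features play the role of the algorithmic level, and the weight arrays are the implementation level. Ablating one hidden unit is a minimal causal intervention of the kind cognitive science uses to test which component does what.

```python
import numpy as np

# Implementation level: concrete weights and ReLU units (hand-set, hypothetical).
W1 = np.array([[1.0, 1.0],
               [1.0, 1.0]])   # input -> hidden; each column is one hidden unit
b1 = np.array([0.0, -1.0])
W2 = np.array([1.0, -2.0])    # hidden -> output


def hidden(x: np.ndarray) -> np.ndarray:
    """Algorithmic level: unit 0 counts active inputs, unit 1 detects 'both on'."""
    return np.maximum(0.0, x @ W1 + b1)


def forward(x: np.ndarray, ablate=None) -> np.ndarray:
    """Run the network, optionally zeroing one hidden unit (a causal intervention)."""
    h = hidden(x)
    if ablate is not None:
        h = h.copy()
        h[..., ablate] = 0.0
    return h @ W2


inputs = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)

# Computational level: the abstract problem, stated without reference to mechanism.
xor_spec = np.array([0.0, 1.0, 1.0, 0.0])

intact = forward(inputs)
ablated = forward(inputs, ablate=1)

print("matches spec:", np.allclose(intact, xor_spec))
print("intact: ", intact)
print("ablated:", ablated)
```

In this toy case, removing the "both on" detector leaves three of the four inputs untouched and breaks only the case where both inputs are active, which localizes that unit's role in the algorithm. Real interpretability work faces the same logic at far larger scale and with features that are not hand-designed.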

Beyond the framework, the deeper claim is that interpretability needs layered analysis rather than monolithic explanation. A complete account of why an LLM does what it does requires all three levels, and the disciplines that have learned to do this work for biological minds are the natural source of the methods.

Original note title

Marr's three levels of analysis provide a structured toolkit for making LLMs interpretable