Agentic and Multi-Agent Systems Psychology and Social Cognition LLM Reasoning and Architecture

What should a world model actually be designed to do?

Current AI research treats world models as either video predictors or RL dynamics learners, but what if their real purpose is simulating actionable possibilities for decision-making rather than predicting next observations?

Note · 2026-05-03 · sourced from World Models

What a world model is supposed to do has been contested in recent AI research. Some schools treat it as a video predictor — a system that generates the next frame given current observations. Others treat it as a learned latent dynamics model used for model-based RL. The Critiques of World Models essay argues both formulations miss what makes a world model useful: the capacity for hypothetical thinking, the capacity to simulate alternatives that did not happen and may not happen.

The argument draws on hypothetical thinking in psychology and on the Sci-Fi imagination from Dune (where the Bene Gesserit and the Mentat exemplify different modes of inner simulation). The proposed primary goal: a world model should simulate all actionable possibilities of the real world for purposeful reasoning and acting. The keyword is actionable — possibilities that an agent could choose between, not all metaphysical possibilities. This grounds the WM in decision-making rather than in passive prediction.

The simulation scope is broader than typical proposals. A general-purpose WM must simulate physical dynamics (how water pours, how machines operate), embodied experiences (balance, posture, motor sequences), emotional states (affective responses for therapy or social interactions), social situations (other agents' internal states and intentions), the mental world (logistics, tactics, strategies in adversarial settings), the counterfactual world (what-if scenarios for decisions under uncertainty), and the evolutionary world (generational dynamics like adaptation and inheritance). Each of these is a domain of actionable possibility; a WM that cannot simulate any of them cannot support reasoning in that domain.

This breadth implies that a single uniform representation will not suffice — the architectural answer the essay proposes is hierarchical, multi-level, mixed continuous/discrete representations within a generative self-supervised framework. The Physical, Agentic, and Nested (PAN) AGI concept is the architectural commitment: world models should be nested (worlds within worlds, simulations of agents that themselves simulate), agentic (a WM is for an agent, not a passive predictor), and physical (grounded in dynamics rather than only in symbol manipulation) — a commitment that aligns with the five-aspect WM decomposition by privileging aspect 5 (decision-integration) and aspect 3 (architecture for compositional/counterfactual operations).


Source: World Models

Related concepts in this collection

Concept map
16 direct connections · 141 in 2-hop network ·medium cluster

Click a node to walk · click center to open · click Open full network for a force-directed map

your link semantically near linked from elsewhere
Original note title

the primary goal of a world model is to simulate all actionable possibilities of the real world for purposeful reasoning — not to predict the next observation