Conversational AI Systems Psychology and Social Cognition

Do simulated training interactions transfer to real conversations?

Most conversational recommender systems train on simulated entity-level exchanges, not natural dialogue. The question is whether models built this way actually work when deployed with real users who speak naturally and deviate from expected patterns.

Note · 2026-05-03 · sourced from Recommenders Conversational
What breaks when specialized AI models reach real users?

The conversational recommender literature splits into two practically incompatible strands. Standard CRS research uses simulated interactions — turns exchange entity-level information (item names, attribute values), not natural-language utterances. The user model is a programmatic simulator that emits "I like attribute X" rather than "you know, I'm in the mood for something fun but not silly." This simulation makes training tractable and benchmarks reproducible, but it short-circuits the actual problems of language understanding, response generation, topic planning, and knowledge engagement.

Holistic CRS, in contrast, trains on conversational data collected from real-world scenarios. The system must handle imperfect intent understanding, unexpected dialogue turns, and the social dynamics of recommendation conversation (encouragement, hedging, explanation). Holistic CRS approaches structurally combine three components: a backbone language model, optional external knowledge, and optional external guidance.

The dichotomy matters because conclusions from standard CRS evaluation do not transfer to deployed systems. Models that win on simulated benchmarks may collapse on real conversation, where users say "Whatever, I'm open to any suggestion" because they don't have a specific preference yet, or where the conversation goes off-topic and back. Real human conversation includes content that simulators don't generate — and the systems trained on simulated data have no exposure to this distribution.

The practical consequence is that CRS research has accumulated a decade of methodology against a problem (entity-level dialogue) that no production system actually faces. Holistic CRS is under-explored because data is harder to collect, but it is the only setting that maps to real applications.


Source: Recommenders Conversational

Related concepts in this collection

Concept map
15 direct connections · 126 in 2-hop network ·medium cluster

Click a node to walk · click center to open · click Open full network for a force-directed map

your link semantically near linked from elsewhere
Original note title

CRS models trained on simulated entity-level interactions do not generalize to real human conversation — the holistic CRS gap