SYNTHESIS NOTE
Reasoning, Retrieval, and Evaluation Model Architecture and Internals Training, RL, and Test-Time Scaling

Does learning from mistakes improve in-context learning?

Explores whether inducing models to make errors on few-shot examples, then having them articulate principles from those mistakes, leads to better performance than learning from correct examples alone.

Synthesis note · 2026-06-03 · sourced from Prompts Prompting

In-context learning has always learned from correct input-output pairs only. LEAP (Learning Principles) revisits that: given the same few examples, it (1) intentionally induces the model to make mistakes on them, (2) has the model reflect on those mistakes and articulate explicit, task-specific principles — with no human supervision — that help avoid common errors, then (3) prompts the test question with the original few-shot examples plus the learned principles. It uses exactly the same number of labeled examples as standard few-shot, yet improves strong models (GPT-3.5/4/4-turbo, Claude-2.1, Gemini Pro) across DROP, HotpotQA, GSM8K, MATH, and Big-Bench Hard (e.g., +7.5% on DROP with GPT-4).

The keeper is a generative-learning principle at the prompt level: the model extracts more usable structure from examples by erring and explaining the error than by imitating correct answers. Negative experience, articulated, transfers better than positive examples alone — within a single inference-time prompt, no fine-tuning.

This is the in-context, self-supervised cousin of learning-from-mistakes at training time. It rhymes with Can reconstructing expert thinking improve reasoning transfer? (articulating the latent process behind surface examples) and with Can confidence trajectories reveal when reasoning goes wrong? in deriving a usable training/prompting signal from the model's own errors rather than external labels.

Inquiring lines that use this note as a source 4

This note is a source for these synthesized inquiries. Follow a line forward into its question, or open it to trace back to all of its sources.

Related concepts in this collection 3

This note in its neighbourhood — explore the map, then jump to a related concept in the list below.

Concept map
15 direct connections · 173 in 2-hop network ·dense cluster Open in graph ↗

Click a node to walk · click center to open · click Open in graph to see this note in the full knowledge graph

your link semantically near linked from elsewhere

Related papers in this collection 8

Papers most semantically related to this note, ranked by cosine similarity in the embedding space.

Original note title

in-context learning improves when you induce the model to err on the few-shot examples then have it articulate explicit principles from those mistakes