Recommender Systems

Do accuracy-optimized recommendations preserve user interest diversity?

Standard recommender systems rank by predicted relevance, which tends to saturate lists with the highest-confidence items. Does this approach naturally preserve the proportions of a user's multiple interests, or does it systematically crowd out smaller ones?

Note · 2026-05-03 · sourced from Recommenders Architectures
Does personalization in AI increase trust or manipulation risk? What breaks when specialized AI models reach real users?

Steck's calibration result identifies a failure mode that standard accuracy metrics make invisible. A user has watched 70 romance and 30 action movies. The accuracy-optimized recommender, ranking by predicted relevance, will tend to fill the recommendation list with romance. Each romance item has slightly higher predicted relevance than each action item, so a list ranked by relevance produces 100% romance — and the user's 30% action interest is crowded out entirely.

Calibration is the property that the recommended list reflects the user's interest distribution proportionally: 70% romance, 30% action. This is empirically not what optimization-for-accuracy produces, even though it sounds like what users want. The mismatch comes from how ranking metrics aggregate per-item predictions: top-K lists are determined by per-item ranking, not by distributional match between the recommendation set and the user's history.

Steck's proposal is post-processing. Define metrics that measure the divergence between the user's category distribution and the recommended list's category distribution, then use a re-ranking algorithm to enforce calibration on top of the base recommender output. The technique is simple and works.

The conceptual contribution is identifying the gap. Accuracy-as-defined-by-ranking-metrics does not entail proportional representation of interests. These are two different things, and they pull apart whenever a user has multiple interests of unequal strength — which is most users. Calibration is a separate optimization target that has to be added explicitly because the standard objective does not produce it.


Source: Recommenders Architectures

Related concepts in this collection

Concept map
12 direct connections · 73 in 2-hop network ·medium cluster

Click a node to walk · click center to open · click Open full network for a force-directed map

your link semantically near linked from elsewhere
Original note title

calibrated recommendations preserve interest proportions — accuracy-optimized lists otherwise crowd out lesser interests