Recommender Systems

How do ranking systems handle conflicting objectives without feedback loops?

Industrial rankers must balance incompatible goals like engagement versus satisfaction while avoiding training on biased feedback from their own prior decisions. What architectural patterns prevent these systems from converging on degenerate solutions?

Note · 2026-05-03 · sourced from Recommenders Architectures

Industrial ranking systems face two distinct problems that interact. First, objectives conflict: engagement (clicks, watch time) and satisfaction (ratings, likes, shares) are different signals, and naively aggregating them into a single target hides the trade-off between them. YouTube's solution uses Multi-gate Mixture-of-Experts (MMoE): each objective has its own gating network that decides how heavily to weight each shared expert. This is soft parameter sharing, a middle ground between one fully shared model and fully separate models.
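The gating idea can be sketched as a forward pass in plain Python. This is a minimal illustration, not YouTube's implementation: the class name `MMoE`, the layer sizes, the single-layer experts, and the random weights are all assumptions made for the example, and no training is shown.

```python
import math
import random


def softmax(xs):
    """Normalize a list of logits into weights that sum to 1."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def linear(weights, x):
    """Multiply a (rows x len(x)) weight matrix by the vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


def relu(v):
    return [max(0.0, a) for a in v]


def rand_matrix(rng, rows, cols):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


class MMoE:
    """Hypothetical toy model: shared experts, one gate and one head per task."""

    def __init__(self, in_dim, expert_dim, n_experts, tasks, seed=0):
        rng = random.Random(seed)
        # Experts are shared by every task.
        self.experts = [rand_matrix(rng, expert_dim, in_dim) for _ in range(n_experts)]
        # Each task owns a gate (one logit per expert) and a scalar output head.
        self.gates = {t: rand_matrix(rng, n_experts, in_dim) for t in tasks}
        self.heads = {t: rand_matrix(rng, 1, expert_dim) for t in tasks}

    def gate_weights(self, task, x):
        """Softmax weights this task assigns to each expert for input x."""
        return softmax(linear(self.gates[task], x))

    def forward(self, x):
        expert_out = [relu(linear(w, x)) for w in self.experts]
        scores = {}
        for task in self.gates:
            g = self.gate_weights(task, x)
            # Task-specific mixture of the shared expert outputs.
            mixed = [
                sum(g[k] * expert_out[k][j] for k in range(len(expert_out)))
                for j in range(len(expert_out[0]))
            ]
            scores[task] = linear(self.heads[task], mixed)[0]
        return scores


model = MMoE(in_dim=4, expert_dim=3, n_experts=3, tasks=["engagement", "satisfaction"])
x = [0.2, -1.0, 0.7, 0.1]  # made-up feature vector
gate_eng = model.gate_weights("engagement", x)
gate_sat = model.gate_weights("satisfaction", x)
scores = model.forward(x)
print(gate_eng, gate_sat, scores)
```

Both tasks read the same expert outputs, but each mixes them with its own gate, so the engagement and satisfaction heads can lean on different experts for the same input.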

Second, and more insidious: training data comes from the logs of the current ranker. A user may have clicked a video because it was shown at position 1, not because they preferred it. Train on that data and you reinforce whatever the ranker did before: a positive feedback loop in which the model keeps learning what it has already taught itself. The fix extends a Wide & Deep style architecture with a shallow tower whose only job is to model position bias, factoring the rank-induced effect out of the engagement signal.
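A rough sketch of how such a shallow tower can separate position from preference: during training, the position logit is added to the main model's logit so that the position term absorbs the rank-induced part of the click; at serving time the position is treated as missing. The function names, the bias values in `position_bias`, the linear main model, and the choice to map a missing position to a zero logit are all assumptions made for illustration.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def main_logit(features, weights):
    """Stand-in for the main ranking tower: a plain linear score."""
    return sum(w * f for w, f in zip(weights, features))


def position_logit(position, position_bias):
    """Shallow tower: a learned offset per display position.

    A missing position (None) contributes nothing; this is how the sketch
    handles serving, where the final position is not yet known.
    """
    if position is None:
        return 0.0
    return position_bias[position]


def click_probability(features, weights, position_bias, position=None):
    """Combine the two towers by adding their logits before the sigmoid."""
    return sigmoid(main_logit(features, weights) + position_logit(position, position_bias))


# Made-up parameters: higher slots get a larger click offset.
position_bias = {1: 1.2, 2: 0.6, 3: 0.2, 4: -0.3, 5: -0.8}
weights = [0.3, 0.4]
features = [1.0, 0.5]  # the same video in every call below

train_top = click_probability(features, weights, position_bias, position=1)
train_low = click_probability(features, weights, position_bias, position=5)
serve_score = click_probability(features, weights, position_bias)  # position missing

print(train_top, train_low, serve_score)
```

The same video gets a higher predicted click rate at position 1 than at position 5 during training, but the serving score depends only on the main tower, so ranking is no longer rewarded for where an item happened to be shown before.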

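There are two mechanisms because there are two failure modes: MMoE for objective conflict, and the shallow position tower for selection bias. Leave either one untreated and the model drifts toward a degenerate equilibrium, where one objective crowds out the others or the ranker keeps promoting whatever it already ranked highly.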


Source: Recommenders Architectures

Original note title

multi-objective ranking systems must explicitly model selection bias because data generated by the current ranker produces feedback loops