How do different feed-weighting schemes construct distinct network topologies at population scale?

This explores how the scoring rules a recommendation feed uses to rank content (its 'weights') don't just change what individuals see — they reshape the connective structure of who-is-exposed-to-what across an entire population.

The clearest statement of this in the corpus is the argument that recommendation feeds are persuasion infrastructure, not neutral pipes: feed weights influence what producers make, the resulting network topology pushes opinions to converge, and these effects compound at population scale through rating contamination and selection biases (see "How do recommendation feeds shape what people see and believe?"). The key move is recognizing that a weighting scheme is a topology-construction policy: change what the feed rewards and you change the graph of exposure.
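To make "change what the feed rewards and you change the graph of exposure" concrete, here is a toy sketch. It does not come from any of the cited notes: the two-term score, the `exposure_edges` helper, and every number are hypothetical, and the only assumption is that two users are linked when the feed shows them at least one common item.

```python
from itertools import combinations

def exposure_edges(affinity, popularity, w_affinity, w_popularity, k=1):
    """Return the set of user pairs that share at least one top-k item.

    Score per (user, item) is a hypothetical two-term weighting:
    w_affinity * personal affinity + w_popularity * global popularity.
    """
    top = {}
    for user, prefs in affinity.items():
        ranked = sorted(
            prefs,
            key=lambda item: -(w_affinity * prefs[item]
                               + w_popularity * popularity[item]),
        )
        top[user] = set(ranked[:k])
    # Two users are connected if the feed exposed them to a common item.
    return {(u, v) for u, v in combinations(sorted(top), 2) if top[u] & top[v]}

# Illustrative data only.
popularity = {"hit": 1.0, "nicheA": 0.1, "nicheB": 0.1}
affinity = {
    "u1": {"hit": 0.3, "nicheA": 0.9, "nicheB": 0.1},
    "u2": {"hit": 0.3, "nicheA": 0.9, "nicheB": 0.1},
    "u3": {"hit": 0.3, "nicheA": 0.1, "nicheB": 0.9},
}

pop_heavy = exposure_edges(affinity, popularity, w_affinity=0.2, w_popularity=0.8)
aff_heavy = exposure_edges(affinity, popularity, w_affinity=0.8, w_popularity=0.2)

print(sorted(pop_heavy))  # [('u1', 'u2'), ('u1', 'u3'), ('u2', 'u3')]
print(sorted(aff_heavy))  # [('u1', 'u2')]
```

The users and catalogue are identical in both runs. Only the weights differ, yet the popularity-heavy feed yields a fully connected exposure graph while the affinity-heavy feed splits the population into separate components.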

The mechanism by which weights harden into topology is feedback. A ranker trained on its own past outputs converges on degenerate equilibria that amplify earlier decisions unless selection bias is explicitly modeled out. YouTube's multi-objective ranker pairs a multi-gate mixture-of-experts (MMoE) layer for conflicting objectives with a separate shallow position tower precisely to break this self-reinforcing loop (see "Why do ranking systems need to model selection bias explicitly?"). Without that correction, the weighting scheme doesn't describe the network; it sculpts it over time, each round of exposure narrowing the next.
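The feedback loop can be sketched with a small deterministic simulation. This is not YouTube's architecture. It swaps in a simpler inverse-propensity-style correction that assumes the per-position examination probabilities are known, where the position tower described above learns the position effect from data. The function name, the probabilities, and the relevance values are all hypothetical.

```python
def simulate_ranker(rounds, true_relevance, exam_prob, correct_bias):
    """Re-rank items each round from accumulated expected clicks.

    exam_prob[p] is the assumed probability that a user examines slot p.
    With correct_bias=False the ranker scores items by raw clicks, so
    whatever it showed first keeps collecting the most clicks.
    With correct_bias=True it divides clicks by accumulated examination
    probability, recovering an estimate of true relevance.
    """
    n = len(true_relevance)
    clicks = [0.0] * n
    exposure = [0.0] * n
    order = list(range(n))  # arbitrary initial ranking
    score = [0.0] * n
    for _ in range(rounds):
        for pos, item in enumerate(order):
            # Expected clicks = P(examined at pos) * P(click | examined).
            clicks[item] += exam_prob[pos] * true_relevance[item]
            exposure[item] += exam_prob[pos]
        if correct_bias:
            score = [clicks[i] / exposure[i] for i in range(n)]
        else:
            score = list(clicks)
        order = sorted(range(n), key=lambda i: -score[i])
    return order

true_relevance = [0.2, 0.3, 0.5]   # item 2 is actually the best
exam_prob = [1.0, 0.3, 0.1]        # top slot gets most of the attention

naive_order = simulate_ranker(5, true_relevance, exam_prob, correct_bias=False)
corrected_order = simulate_ranker(5, true_relevance, exam_prob, correct_bias=True)

print(naive_order)      # [0, 1, 2]
print(corrected_order)  # [2, 1, 0]
```

In the naive run the arbitrary initial order never changes, because the top slot's exposure advantage outweighs the relevance gap. That is a small instance of a ranker amplifying its own past decisions.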

A subtler lever is the representational machinery underneath the weights. When user and item embeddings are too low-dimensional, the system overfits toward popular items to maximize ranking quality, and niche content gets starved of exposure. This structural skew compounds over time and can't be fixed post hoc unless dimensionality itself is treated as a fairness hyperparameter (see "Does embedding dimensionality secretly drive popularity bias in recommenders?"). So 'feed-weighting' includes not just the visible objective weights but the dimensionality of the latent space, which silently decides whether the population graph collapses toward a popular core or stays diverse.

The corpus also shows that the *structure* of the signal you weight changes how stable the resulting topology is. Taobao's Swing algorithm builds product graphs from quasi-local bipartite patterns rather than single edges, and those structural signals resist noise because multiple independent noisy edges rarely align by chance (see "Can graph structure patterns outperform direct edge signals in noisy data?"). Weighting on structure versus weighting on raw edges yields measurably different, and differently robust, networks. There's even a hint that embedding geometry itself imposes a coarse-to-fine organization, where leading eigenvectors split broad categories before fine ones (see "Do embedding eigenvectors organize taxonomy from coarse to fine?"), suggesting the topology a feed produces is partly inherited from how its representation space is shaped before any explicit weight is tuned.
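A minimal sketch of a Swing-style score shows why structural patterns resist noise. It assumes the commonly stated form of the score: for each pair of users who both clicked items i and j, add 1 / (alpha + number of items those two users share). The unordered-pair summation, the `swing_scores` name, the default `alpha`, and the click data are assumptions for illustration; the production algorithm may add per-user weights and other refinements not shown here.

```python
from itertools import combinations

def swing_scores(user_items, alpha=1.0):
    """Score item pairs by how many *pairs* of users co-clicked both.

    user_items maps each user to the set of items they clicked.
    A single user clicking both items contributes nothing, so one
    noisy edge cannot create a link on its own.
    """
    item_users = {}
    for user, items in user_items.items():
        for item in items:
            item_users.setdefault(item, set()).add(user)

    scores = {}
    for i, j in combinations(sorted(item_users), 2):
        common = sorted(item_users[i] & item_users[j])
        total = 0.0
        for u, v in combinations(common, 2):
            # User pairs with little else in common are stronger evidence.
            overlap = len(user_items[u] & user_items[v])
            total += 1.0 / (alpha + overlap)
        if total > 0:
            scores[(i, j)] = total
    return scores

# Illustrative clicks: two users agree on (a, b); only one user links a and c.
clicks = {
    "u1": {"a", "b"},
    "u2": {"a", "b"},
    "u3": {"a", "c"},
}
swing = swing_scores(clicks)
print(swing)  # {('a', 'b'): 0.333...}; ('a', 'c') gets no score
```

The pair (a, c) has a direct co-click edge from `u3`, so edge-level weighting would link them. The structural score ignores it because no second user corroborates the pattern.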

What you might not expect: the corpus frames the strongest topology lever as *selection* rather than *scale*. Routing queries to specialized models per semantic cluster outperforms one larger model (see "Can routing beat building one better model?"), and the same logic applies to feeds. How you partition and route a population matters more than how big the model is, which means feed-weighting schemes are best understood as routing topologies, where the choice of clustering quietly determines which communities ever see each other.
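The arithmetic behind "selection over scale" can be sketched in a few lines. The accuracy table is entirely made up and the clusters are assumed to be equally weighted. It illustrates the argument only and does not reproduce any reported result.

```python
def single_model_accuracy(acc, model):
    """Mean accuracy of one model across all clusters (equal weights assumed)."""
    return sum(acc[cluster][model] for cluster in acc) / len(acc)

def best_single_model(acc):
    """Pick the one model with the highest mean accuracy."""
    models = list(next(iter(acc.values())))
    return max(models, key=lambda m: single_model_accuracy(acc, m))

def routed_accuracy(acc):
    """Mean accuracy when each cluster is sent to its own best model."""
    return sum(max(per_model.values()) for per_model in acc.values()) / len(acc)

# Hypothetical per-cluster accuracies for one generalist and two specialists.
accuracy = {
    "code": {"big": 0.70, "small_code": 0.80, "small_math": 0.40},
    "math": {"big": 0.70, "small_code": 0.40, "small_math": 0.80},
}

best = best_single_model(accuracy)
print(best)                                   # big
print(single_model_accuracy(accuracy, best))  # about 0.70
print(routed_accuracy(accuracy))              # about 0.80
```

Routing can never do worse than the best single model in this setup, since the mean of per-cluster maxima is at least the maximum of the means. How much it gains depends on how the clusters are drawn, which is the point the paragraph above makes about partitioning a population.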

Sources (6 notes)

How do recommendation feeds shape what people see and believe?

Research shows recommendation systems operate as political actors: feed weights influence producer behavior, network topology drives opinion convergence, and automation enables targeted persuasion at population scale. These effects compound through rating contamination and selection biases.

Why do ranking systems need to model selection bias explicitly?

YouTube's multi-objective ranker uses MMoE for conflicting objectives and a shallow position tower to remove selection bias from training data. Without both mechanisms, models converge on degenerate equilibria that amplify their own past decisions.

Does embedding dimensionality secretly drive popularity bias in recommenders?

Research shows that when user/item embedding dimensions are too small, recommender systems overfit toward popular items to maximize ranking quality. This compounds over time as niche items receive insufficient exposure, and cannot be fixed post-hoc without treating dimensionality as a fairness hyperparameter.

Can graph structure patterns outperform direct edge signals in noisy data?

Taobao's Swing algorithm constructs more robust product substitute graphs by exploiting quasi-local bipartite patterns rather than single edges. Structural signals are inherently noise-resistant because they require multiple independent noisy edges to coincidentally align, which rarely happens by chance.

Do embedding eigenvectors organize taxonomy from coarse to fine?

Leading eigenvectors of embedding Gram matrices separate broad taxonomic branches first, then progressively finer sub-branches—a coarse-to-fine spectral order that tracks the WordNet hypernym tree level by level, confirming predictions from co-occurrence statistics.

Can routing beat building one better model?

Avengers-Pro achieves 7% higher accuracy than GPT-5-medium by routing queries to optimal models per semantic cluster, or matches its performance at 27% lower cost. Ten 7B models with routing previously surpassed GPT-4.1 and 4.5, suggesting selection is a stronger lever than scaling.

Research prompt for your LLM

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a recommendation systems researcher testing whether feed-weighting topology claims from 2020–2026 still hold under current models, methods, and evaluation. The question remains open: How do different feed-weighting schemes construct distinct network topologies at population scale?

What a curated library found — and when (dated claims, not current truth):
• Recommendation feeds are persuasion infrastructure: weighting schemes influence producer behavior, compress opinion diversity, and harden topology through rating contamination and selection bias (2023).
• Rankers trained on their own outputs converge on degenerate equilibria unless selection bias is explicitly modeled; YouTube's multi-gate mixture-of-experts (MMoE) plus a shallow position tower are both needed to break the feedback loop (2023).
• Low-dimensional embeddings structurally overfit toward popular items, starving niche content; the skew compounds over time and cannot be fixed post hoc unless dimensionality is treated as a fairness hyperparameter (2023).
• Weighting on quasi-local bipartite structure (Swing algorithm) yields more robust, noise-resistant networks than edge-level weighting (2020).
• Embedding geometry imposes coarse-to-fine taxonomy organization via leading eigenvectors before explicit weights are tuned (2026).
• Routing queries to specialized models per semantic cluster outperforms single large models; routing topology matters more than scale (2025).

Anchor papers (verify; mind their dates):
• arXiv:2010.05525 — Large Scale Product Graph Construction for Recommendation in E-commerce (2020)
• arXiv:2305.13597 — Curse of "Low" Dimensionality in Recommender Systems (2023)
• arXiv:2508.12631 — Beyond GPT-5: Making LLMs Cheaper and Better via Performance-Efficiency Optimized Routing (2025)
• arXiv:2605.23821 — Hierarchical Concept Geometry in Language Models Emerges from Word Co-occurrence (2026)

Your task:
(1) RE-TEST EACH CONSTRAINT. For every claim above, determine whether newer architectures (mixture-of-experts, sparse routing, adaptive compute), training methods (DPO, preference learning, multi-objective RL), or online evaluation infrastructure have since relaxed, inverted, or overturned it. Separate the durable question (feedback loops and representational collapse likely still core) from perishable limitations (e.g., can modern high-dimensional embeddings + explicit diversity objectives fix the popularity bias?). Ground resolution claims in real papers.
(2) Surface the strongest contradicting or superseding work from the last ~6 months that challenges the "weighting schemes sculpt topology" frame — e.g., work claiming user agency, algorithmic transparency, or emergent degeneracy-resistance outweigh feed design.
(3) Propose 2 research questions that assume the regime has shifted: (a) Do hierarchical routing topologies with per-cluster weighting schemes reduce opinion convergence compared to monolithic rankers? (b) Can steering embedding geometry (e.g., via contrastive objectives) prevent popularity collapse without tuning visible feed weights?

Cite arXiv IDs; flag anything you cannot ground in a real paper.
