Why would compute-replacement cost determine wages instead of productivity?

This explores the claim that once AI can do a job, what an employer pays a human stops tracking the value that human creates and starts tracking what it would cost to run a machine doing the same thing.

This explores the argument that in an AI-saturated economy, your wage is set not by how much value your work produces but by the price of the cheapest thing that could replace you — and that cheapest thing is compute. The core of this idea comes from What happens to human wages in an AGI economy?, which argues that as AI automates the highest-value bottleneck tasks first, human pay drifts away from reflecting economic contribution and toward reflecting the cost of buying enough GPU time to replicate that contribution. Labor's share of the economy shrinks toward zero not because humans become worthless, but because their wage gets anchored to a falling external benchmark.

The reason this substitution holds is that in a competitive market, a price floats down to the marginal cost of the next-cheapest supplier. If a task can be done by either a person or a model, an employer won't pay the person more than it costs to run the model — regardless of how productive the person is. Productivity used to set the price because there was no substitute; once compute becomes a substitute, productivity sets the ceiling on value created, but compute cost sets the wage. The two decouple.

What makes compute a genuine substitute (rather than a fixed alternative) is that its cost is fluid and falling. Several notes show compute is not one fixed resource but a budget that can be reshuffled to get more for less: smaller models with extra inference time match bigger ones on hard problems Can inference compute replace scaling up model size?, and spending that inference budget adaptively — little on easy tasks, lots on hard ones — beats spending it uniformly Can we allocate inference compute based on prompt difficulty?. There are even cheaper tricks lurking, like recomputing weights instead of moving them on constrained hardware Does recomputing weights cost less than moving them on mobile?, and persistent agents that amortize cost across reused context so the real denominator becomes finished work, not tokens Do persistent agents really cost less per token?. Each of these pushes the replacement price down — and the wage benchmark down with it.

There's a deeper framing worth pulling in: this is what it looks like when intelligence stops behaving like a commodity. Two notes argue AI output isn't a fixed, possessable object but a contextual flow — a token valued by what it does for the receiver, not what it is Does AI actually commodify expertise or tokenize it?, Is AI fundamentally changing how value gets produced?. When the thing you're selling is a flow generated on demand, its price collapses toward the marginal cost of generating the next unit — which is compute. The wage-equals-compute story is the labor-market version of that same collapse.

The corpus also pushes back on treating this as inevitable. Whether AI displaces or merely reshuffles labor depends on how exposure is distributed: concentrated exposure lets workers shift to non-displaced tasks and offsets job losses Does concentrated AI exposure enable workers to adapt and reallocate?, and the inequality outcomes hinge on access and incentives, not the technology itself Does generative AI inevitably worsen or reduce inequality?. There's also a productivity wrinkle: AI's gains show up when people apply skills they already have, and vanish when they try to learn new ones When does AI actually boost worker productivity? — which hints that the human work resisting compute-replacement is exactly the work where value and substitutability haven't yet decoupled.

Sources 10 notes

What happens to human wages in an AGI economy?

As AGI automates bottleneck work first, human wages shift from reflecting economic value to reflecting compute costs. Labor's share of GDP approaches zero even as some accessory work remains human, driven by compute-allocation efficiency rather than irreplaceability.

Can inference compute replace scaling up model size?

Snell et al. (2024) showed that inference-time compute trades off against model parameter scaling, especially on difficult prompts. This reveals pretraining and inference compute are not independent resources.

Can we allocate inference compute based on prompt difficulty?

Research shows inference effectiveness varies dramatically by prompt difficulty. Reallocating the same total compute adaptively—giving easy prompts less and hard ones more—substantially outperforms larger models under uniform budgets.

Does recomputing weights cost less than moving them on mobile?

MobileLLM shows that on memory-bound mobile hardware, sharing weights between adjacent transformer blocks by recomputing one block twice uses less latency than fetching separate weights, gaining accuracy with no parameter increase.

Do persistent agents really cost less per token?

A 115-day case study found 82.9% of tokens were cache reads. When context persists and reuses, the meaningful cost denominator becomes completed artifacts, not individual tokens.

Does AI actually commodify expertise or tokenize it?

AI output lacks the fixed, identical, possessable properties of commodities. Instead it functions like tokens—mutable mediums of exchange valued by what they do for receivers, not what they are.

Is AI fundamentally changing how value gets produced?

AI production is organized around contextual token-flows generated at point of use, not identical mass-produced objects. This creates different effects than commodification: inflationary devaluation, contextual variation, and skill transformation from production to validation.

Does concentrated AI exposure enable workers to adapt and reallocate?

Analysis of task-level AI exposure across firms 2010-2023 shows that while higher mean exposure reduces labor demand, more concentrated exposure (affecting few tasks) enables workers to reallocate to non-displaced tasks, producing modest net employment effects.

Does generative AI inevitably worsen or reduce inequality?

An interdisciplinary review found that across information, work, education, and healthcare, generative AI can both exacerbate and reduce inequality. The direction is determined by access, integration, and incentive structures, not the capability itself.

When does AI actually boost worker productivity?

Studies showing AI productivity gains measured tasks within workers' existing domains. When workers used AI to learn new skills, productivity gains disappeared and learning suffered, suggesting prior findings do not generalize to skill acquisition.

Research prompt for your LLMexpand ↓

Copy into ChatGPT or Claude to take this line of inquiry further — it asks the model to find newer work and re-test which earlier constraints still hold.

You are a labor economist and AI systems analyst re-testing claims about whether compute cost—not productivity—anchors wages in an AI-saturated economy. The question remains open: what actually sets the floor on human compensation as AI capability grows?

What a curated library found — and when (dated claims, not current truth):
Findings span 2023–2026; treat these as perishable benchmarks:
• Compute cost acts as a wage floor when AI can substitute for human tasks, decoupling pay from productivity (2026, arXiv:2602.00139).
• Inference-time compute can be squeezed: smaller models + adaptive compute per task-difficulty match larger models, pushing replacement cost down (2025–2026).
• Worker reallocation offsets job losses only when task exposure is concentrated, not dispersed (2024).
• AI productivity gains appear only when workers apply *existing* skills, not during learning; this hints human work resisting substitution is skill-dependent (2025).
• Persistent agentic systems amortize cost across reused context, shifting the economic unit from cost-per-token to cost-per-outcome (2026, arXiv:2605.26870).

Anchor papers (verify; mind their dates):
• arXiv:2602.00139 (2026) — direct firm-level evidence on labor–AI substitution.
• arXiv:2605.26870 (2026) — persistent agents and amortized cost economics.
• arXiv:2601.20245 (2026) — how AI impacts skill formation (counterweight to substitution thesis).
• arXiv:2501.17161 (2025) — RL generalizes where SFT memorizes; implications for wage-floor stability.

Your task:
(1) RE-TEST EACH CONSTRAINT. For compute-cost wage anchoring: does it still hold given (a) recent progress on reasoning models that may *increase* task specificity/irreplaceability, (b) tooling shifts (multi-agent orchestration, long-context amortization, reasoning caching), and (c) labour-market tightness in domains where AI is weak? Distinguish the durable question (does substitutability drive wage floors?) from perishable findings (e.g., specific compute-cost figures, inference-budget tricks). Flag where the constraint *still* appears to hold and where evidence suggests regime shift.
(2) Surface the strongest CONTRADICTING or SUPERSEDING work from the last 6 months. Look for papers showing: skill recombination *raises* human value; long-context or reasoning compute costs *aren't* falling; or labour markets segment in ways that decouple wages from compute benchmarks.
(3) Propose 2 research questions that assume the regime may have moved: (a) Does reasoning-time compute substitution follow the same marginal-cost logic as inference-time compute? (b) In domains where AI amplifies rather than replaces (e.g., research, creative direction), does the wage floor detach from compute cost entirely?

Cite arXiv IDs; flag anything you cannot ground in a real paper.

Why would compute-replacement cost determine wages instead of productivity?

Sources 10 notes

Next inquiring lines