
When should human values enter the LLM development pipeline?

Explores whether human-centered concerns like safety and fairness work better as early design principles throughout development, or as post-training alignment patches. Matters because pipeline placement determines whether human priorities shape the foundation or fight against it.

Note · 2026-05-28 · sourced from Human Centered Design

The dominant industry pattern treats human-centered concerns — safety, fairness, steerability, user values — as alignment problems handled in a "cursory post-training stage," downstream of the real work of capability scaling. The Human-Centered Large Language Models framework rejects this sequencing. It argues that human priorities must be embedded with rigor at every stage of the pipeline: data sourcing and filtering, model training, evaluation, deployment, and long-term maintenance. The distinction it draws is between post-hoc human factors design, which accounts for user needs in only a thin slice of the process, and genuine human-centered design, where stakeholders are central to ideating, building, evaluating, and deploying the system.

Why the placement matters: a value introduced only at post-training inherits whatever the pretraining data and objective already baked in, so the patch is forever fighting the foundation. If the data sourcing stage ignored privacy or representational harm, post-training alignment cannot fully undo the damage; if evaluation optimizes for leaderboard metrics, human flourishing is invisible to the gradient. Embedding objectives upstream means treating the LLM as a sociotechnical system with global influence rather than an isolated tool measured by static benchmarks. The counterpoint, that pipeline-wide human-centering is slower and harder to operationalize than a final-stage fix, is real, and the framework concedes that the optimal path resists universal solutions. But its analysis is that treating alignment as a patch is precisely what subordinates human concerns to the capability race. The architecture of the pipeline encodes the priority.


— "Reflections and New Directions for Human-Centered Large Language Models", https://arxiv.org/abs/2605.06901

Original note title

human-centered objectives must be embedded across the entire llm pipeline not bolted on as a post-training patch