Design & LLM Interaction Language Understanding and Pragmatics LLM Reasoning and Architecture

Why do LLMs excel at feasible design but struggle with novelty?

When LLMs generate conceptual product designs, they produce more implementable and useful solutions than humans but fewer novel ones. This explores why domain constraints flip the novelty advantage seen in research ideation.

Note · 2026-02-23 · sourced from Design Frameworks
What kind of thing is an LLM really? How should researchers navigate LLM reasoning research?

Expert evaluation of LLM-generated conceptual design solutions compared to crowdsourced ones reveals a profile that INVERTS the research ideation finding:

Few-shot learning further constrains: it makes LLM solutions more similar to crowdsourced examples (improving quality alignment) but reduces the diversity of solutions the LLM can generate.

This inverts Why do LLMs generate more novel research ideas than experts?, where LLM research ideas were rated MORE novel but LESS feasible than human expert ideas. The critical variable is domain structure:

The pattern suggests that Can LLMs generate more novel ideas than human experts? — in design, the evaluation criteria are embedded in the prompt (feasibility, usefulness ratings), channeling generation toward conservative solutions. In research, evaluation criteria are absent from the prompt, allowing unconstrained generation.

The few-shot finding connects to How much does demo position alone affect in-context learning accuracy? — examples constrain not just accuracy but creative scope. Each example narrows the generative space.

The Pron vs Prompt contest (2024) provides complementary evidence from creative writing specifically. In a direct contest between Patricio Pron (an award-winning novelist) and GPT-4, evaluated by literature critics and scholars using a Boden-inspired creativity rubric across 5,400 manual assessments, "LLMs are still far from challenging a top human creative writer." The authors conclude that "reaching such level of autonomous creative writing skills probably cannot be reached simply with larger language models." This extends the feasible-not-novel pattern beyond design: LLMs generate competent but uncreative output across both design and literary domains. Source: Arxiv/Prompts Prompting.


Source: Design Frameworks

Related concepts in this collection

Concept map
15 direct connections · 109 in 2-hop network ·medium cluster

Click a node to walk · click center to open · click Open full network for a force-directed map

your link semantically near linked from elsewhere
Original note title

LLMs generate more feasible and useful but less novel conceptual design solutions than humans — few-shot learning decreases diversity further