The Model Is Only 10%: The Real Lesson of the New SDLC

📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

A recent whitepaper from Google reveals that in AI-assisted software development, the model itself is only about 10% of system behavior. The majority depends on how developers design the harness and context, shifting focus from models to configuration and verification.

A new whitepaper from Google, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, states that the AI model accounts for only about 10% of the behavior in AI-driven software systems. The more significant factors are the harness and context engineering, which determine how the model is used and how its outputs are verified. This shifts the traditional focus on models to the importance of configuration, scaffolding, and verification processes.

The whitepaper emphasizes that the dominant part of an AI agent’s behavior lies outside the core model, which is often mistakenly viewed as the primary driver. Instead, the harness — including prompts, tools, rules, and observability — makes up roughly 90% of the effective system performance. Evidence from benchmarks shows that changing the harness can dramatically improve AI agent outcomes without switching models, highlighting the importance of configuration over raw model power.

The authors introduce the concept of context engineering as a core discipline, involving the strategic loading of instructions, knowledge, memory, and tools. They argue that the key to scalable and cost-effective AI development is designing dynamic, on-demand context loading rather than static, always-loaded information, which impacts both performance and costs.

Economically, the paper notes that while vibe coding appears cheap initially, it incurs high long-term costs due to token wastage, maintenance, and security vulnerabilities. Conversely, disciplined engineering with well-structured schemas and verification can reduce total costs, despite higher upfront investments.

At a glance
reportWhen: published early 2026
The developmentGoogle’s new whitepaper on the SDLC highlights that the core of effective AI development is not the model but the harness and context engineering, with the model representing only 10% of the system’s behavior.
The Model Is Only 10% — The New SDLC With Vibe Coding
AI Dispatch · Field Notes
Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified
Vibe Coding
Casual prompts · “does it seem to work?” · disposable code · high risk
Structured AI-Assisted
Detailed prompts + constraints · manual testing · features in real codebases
Agentic Engineering
Formal specs · automated tests + evals + CI gates · production scale · low risk
Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.
The idea worth building your strategy around
Agent = Model + Harness
~10%
HARNESS — prompts · tools · context · hooks · sandboxes · observability
MODEL~90% IS YOUR SURFACE AREA, NOT THE PROVIDER’S
Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.
“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.
The economics: it’s a token-cost problem (CapEx vs OpEx)
Vibe Coding
Low CapEx · High OpEx
Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.
Agentic Engineering
High CapEx · Low OpEx
Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.
85%
of devs use AI coding agents (51% daily)
41%
of all new code is AI-generated
~90%
of agent behavior is the harness, not the model
+19%
longer on some tasks (METR) — verification is the cost
The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.
thorstenmeyerai.com

Why the 10% Model Is Not the Main Focus

This shift in understanding has significant implications for AI development strategies. It suggests that investing in better harness design, configuration, and verification processes can yield more substantial and cost-effective improvements than simply adopting newer or larger models. For organizations, this means rethinking resource allocation, emphasizing configuration and testing to maximize AI system reliability and efficiency.

Moreover, it challenges the common narrative that the latest model innovations are the primary source of AI progress, redirecting attention to the surrounding infrastructure and engineering practices. This could influence how companies plan their AI roadmaps and allocate budgets for model development versus system engineering.

Amazon

AI prompt engineering tools

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Evolution of AI Development Practices and the SDLC Shift

The new whitepaper builds on the ongoing evolution of AI development, where the focus has shifted from solely improving models to optimizing how they are integrated into systems. As of early 2026, AI adoption is widespread, with 85% of professional developers using AI coding agents regularly, and 41% generating code via AI daily, according to industry reports.

Previous trends emphasized model improvements, but recent experiments and benchmarks demonstrate that configuration and scaffolding can dramatically enhance outcomes. This aligns with broader industry observations that system design and verification are now central to AI success, especially as models become more commoditized.

The paper also contextualizes this within the broader SDLC (Software Development Life Cycle), advocating for a disciplined, structured approach to AI system engineering that prioritizes verification, guardrails, and context management over raw model capabilities.

“The model accounts for only about 10% of the behavior; the rest is the harness and context engineering.”

— Addy Osmani

Amazon

AI system verification software

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

What Aspects of the Harness and Context Are Still Unclear

While the whitepaper provides strong evidence that harness and context engineering dominate system behavior, it does not specify precise best practices or standardized frameworks for implementation. The exact methods for optimal context loading, the impact across different AI tasks, and how to measure success in harness design remain areas for further research and experimentation.

Additionally, the long-term effects of this shift on AI model development, industry standards, and organizational workflows are still evolving and not yet fully understood.

Amazon

AI harness configuration tools

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Next Steps for AI Engineering and Industry Adoption

Organizations are likely to prioritize investing in harness and context engineering, developing better tools, and establishing standards for verification and configuration. Further research and case studies will clarify best practices and quantify cost savings. Industry leaders may also explore new training programs focused on system configuration and verification skills, shifting the AI development paradigm further toward disciplined engineering.

Monitoring how these insights influence AI product development and organizational workflows in the coming months will be critical to understanding their full impact.

Amazon

context engineering software for AI

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

Why is the model only 10% of the system behavior?

The whitepaper argues that most of an AI agent’s behavior depends on how the system is configured, including prompts, tools, rules, and verification processes, which collectively shape the output more than the model itself.

How does this shift affect AI development costs?

While initial investments in harness design and context engineering may be higher, they lead to lower long-term costs through improved efficiency, reduced token waste, and better security, making AI development more cost-effective overall.

What practical steps can organizations take now?

Organizations should focus on designing flexible, dynamic context loading, establishing verification protocols, and building scaffolding that can be owned and improved over time, rather than relying solely on model upgrades.

Does this mean models are becoming less important?

Models remain essential as the core engine, but their role is now understood as a component within a larger system where configuration and verification play a more significant role in determining outcomes.

Will this change the future of AI research?

Yes, future research is likely to focus more on system engineering, verification, and context management, rather than solely on developing larger or more sophisticated models.

Source: ThorstenMeyerAI.com

This content is for general information only and is not financial, tax or legal advice. Consult a qualified professional for decisions about your money.
You May Also Like

2 Best Home Night Lights In 2026

Discover the best home night lights of 2026, featuring top models with adjustable brightness and automatic sensors for safe, energy-efficient lighting.

Aleph Alpha. The retrospective case.

Analyzing Aleph Alpha’s strategic pivot, founder departure, and recent merger with Cohere to understand the costs of late structural adaptation in European AI.

2 Best Home Night Lights in 2026

Discover the best home night lights of 2026, featuring adjustable brightness and low-power options. Find out which fits your needs best.

The Power Bottleneck: AI Data Centers and the Grid Cliff Approaching 2027-2028

AI data center growth faces a power supply crunch as grid expansion delays clash with hyperscaler capex commitments, risking deployment delays by 2028.