📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.

TL;DR

A recent Google whitepaper reveals that in AI-assisted software development, the model accounts for only 10% of system behavior. The focus should be on harness design and context engineering, which drive performance and cost efficiency.

A new Google whitepaper released in early 2026 states that in AI-assisted software development, the model accounts for only about 10% of the system’s behavior. The paper emphasizes that the harness and context engineering are far more influential in determining performance, cost, and reliability. This challenges the common perception that advances hinge mainly on new AI models, shifting the strategic focus toward system design and configuration.

The whitepaper, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, argues that the biggest shift in software engineering is moving from writing code to expressing intent and trusting machines to interpret that intent. It highlights that as of early 2026, 85% of professional developers use AI coding agents regularly, with 41% generating roughly half of their new code via AI. Despite the hype around new models, the paper stresses that model improvements alone are insufficient for system performance.

Instead, the authors focus on the harness—the prompts, tools, rules, and observability layers surrounding the model—and the context engineering—the quality and structure of information fed into the AI. Concrete experiments show that changing only the harness or the prompt can dramatically improve results, even with the same model, indicating that system configuration is the primary driver of AI behavior.

At a glance

reportWhen: published early 2026

The developmentThe whitepaper by Addy Osmani and colleagues shifts focus from AI models to the surrounding system components in software development.

The Model Is Only 10% — The New SDLC With Vibe Coding

AI Dispatch · Field Notes

Google · Osmani, Saboo & Kartakis · May 2026

The model is only 10%

A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.

A spectrum, not a binary — the differentiator is how outputs get verified

Vibe Coding

Casual prompts · “does it seem to work?” · disposable code · high risk

Structured AI-Assisted

Detailed prompts + constraints · manual testing · features in real codebases

Agentic Engineering

Formal specs · automated tests + evals + CI gates · production scale · low risk

Tests verify the deterministic; evals verify the rest. Without both, it’s vibe coding — however clever the prompt.

The idea worth building your strategy around

Agent = Model + Harness

~10%

HARNESS — prompts · tools · context · hooks · sandboxes · observability

MODEL~90% IS YOUR SURFACE AREA, NOT THE PROVIDER’S

Outside Top 30 → Top 5 on Terminal Bench 2.0 by changing only the harness — same model.

“Most agent failures, examined honestly, are configuration failures” — a missing tool, a vague rule, a noisy context.

The economics: it’s a token-cost problem (CapEx vs OpEx)

Vibe Coding

Low CapEx · High OpEx

Looks free, hides debt: token burn (fix-it loops), maintenance tax (AI spaghetti), security remediation. Crosses over to 3–10× more per feature.

Agentic Engineering

High CapEx · Low OpEx

Pay upfront (specs, evals, context), then ship cheaply. Levers: context engineering for first-pass success + intelligent model routing — cheap models for the easy work.

85%

of devs use AI coding agents (51% daily)

41%

of all new code is AI-generated

~90%

of agent behavior is the harness, not the model

+19%

longer on some tasks (METR) — verification is the cost

The read

The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.

Source: Osmani, Saboo & Kartakis, “The New SDLC With Vibe Coding,” Google (May 2026). Figures are the paper’s own, incl. METR & LangChain. Analysis is the author’s.

thorstenmeyerai.com

Why System Design Outweighs Model Advances in AI Development

This shift has major implications for organizations investing heavily in AI. It suggests that cost efficiency, reliability, and performance depend more on how systems are built and configured than on acquiring the latest AI models. Companies can gain a competitive edge by focusing on harness design and context management, rather than solely chasing model improvements, which are becoming commoditized.

Moreover, understanding that model performance is only 10% underscores the importance of developing expertise in system engineering, prompt engineering, and context structuring. This paradigm change could influence how organizations allocate resources, prioritize skill development, and approach AI deployment strategies.

AI Engineering: Building Applications with Foundation Models

As an affiliate, we earn on qualifying purchases.

Systemic Shift from Model-Centric to System-Centric AI Strategies

The whitepaper builds on the ongoing evolution of AI development, where early focus was on creating larger, more capable models. However, recent experiments and benchmarks demonstrate that tuning and configuring existing models can yield more significant gains than developing new models. For example, a team improved an agent’s performance by moving from outside the top 30 to the top 5 on a benchmark solely through harness modifications, with no change in the model itself.

This reflects a broader trend: AI system success increasingly depends on the surrounding infrastructure—prompt design, rule sets, tools integration, and observability—rather than the core model. The whitepaper advocates for a disciplined approach, termed agentic engineering, where system components are meticulously engineered for correctness and efficiency.

“The model is only about 10% of what determines behavior; the harness and context are the other 90%.”
— Addy Osmani

AI Prompt Engineering: Foundations of Communication with LLMs – Building Generative AI and Agentic AI Prompt Systems Across Development, Testing, and Deployment (AI Engineering)

As an affiliate, we earn on qualifying purchases.

Unclear Impact of System Engineering on Long-Term AI Progress

While the whitepaper makes a compelling case for system-centric AI development, it remains unclear how quickly organizations will adopt these practices at scale. The long-term impact of focusing on harness and context engineering versus ongoing model innovation is still being evaluated, and some industry observers question whether this approach can sustain breakthroughs in AI capabilities.

Designing Instruction with Generative AI: 24/7 Support for Optimizing Teaching and Learning

As an affiliate, we earn on qualifying purchases.

Next Steps for AI System Optimization and Industry Adoption

Organizations are likely to begin prioritizing system design, prompt engineering, and context management in their AI workflows. Future research and development may focus on creating standardized tools, frameworks, and best practices for harness and context engineering. Additionally, industry benchmarks and case studies will help quantify the benefits of this approach, encouraging broader adoption across sectors.

Mold Test Kit for Home – 12 Simple Detection Tests,Test HVAC System,Home Surfaces,& Indoor Air Quality Testing Kits,DIY Mold Detector at Home,Includes Detailed Mold Identification Guide,air Tester

The Kayenqo Mold Test Kit contains 12 test plates, detailed mold guide and mold removal instructions, 12 swabs,…

As an affiliate, we earn on qualifying purchases.

Key Questions

Why is the model only 10% of the system’s behavior?

The whitepaper shows that the surrounding components—prompts, tools, rules, and observability—are responsible for most of how an AI system performs, with the model itself being only a minor influence.

How can organizations improve AI performance without changing models?

By focusing on harness design, prompt engineering, and context structuring, organizations can significantly enhance AI behavior and efficiency without needing to upgrade to new models.

Does this mean AI model development is no longer important?

Model development remains valuable, but the whitepaper suggests that system configuration and engineering are now more critical for practical performance and cost management.

What skills should AI teams focus on now?

Teams should develop expertise in system engineering, prompt design, context management, and observability to optimize AI deployment effectively.

Will this approach reduce AI development costs?

Yes, focusing on system configuration can lower marginal costs, improve reliability, and reduce vulnerabilities, making AI projects more economically sustainable.

Source: ThorstenMeyerAI.com

This content is for general information only and is not financial, tax or legal advice. Consult a qualified professional for decisions about your money.

The Model Is Only 10%: The Real Lesson of the New SDLC

Up next

Cutrova: Edit the Words, Not the Timeline

Author

Lifevest Advisors Team

The model is only 10%

Why System Design Outweighs Model Advances in AI Development

AI Engineering: Building Applications with Foundation Models

Systemic Shift from Model-Centric to System-Centric AI Strategies

AI Prompt Engineering: Foundations of Communication with LLMs – Building Generative AI and Agentic AI Prompt Systems Across Development, Testing, and Deployment (AI Engineering)

Unclear Impact of System Engineering on Long-Term AI Progress

Designing Instruction with Generative AI: 24/7 Support for Optimizing Teaching and Learning

Next Steps for AI System Optimization and Industry Adoption

Mold Test Kit for Home – 12 Simple Detection Tests,Test HVAC System,Home Surfaces,& Indoor Air Quality Testing Kits,DIY Mold Detector at Home,Includes Detailed Mold Identification Guide,air Tester

Key Questions

Why is the model only 10% of the system’s behavior?

How can organizations improve AI performance without changing models?

Does this mean AI model development is no longer important?

What skills should AI teams focus on now?

Will this approach reduce AI development costs?

10 Best Computers, Tablets & Components For Flexible Work In 2026

Private Cloud Storage Gets Smarter With These AI NAS Devices In 2026

Software-Defined Warfare: How Ukraine’s Delta Turned The Battlefield Into A Shared, Real-Time Map

The Best AI Student Planners For Smarter Study Scheduling In 2026

FRANKLIN RESOURCES INC Files 8-K: Executive Change

Inside Room 23: The AI Techniques Powering ‘Kanton Alpin Verkehrsbetriebe’

How to Prepare a Home Office for Power Outages

How to Build a Paycheck System From Investment Accounts

The Model Is Only 10%: The Real Lesson of the New SDLC

Up next

Author

Lifevest Advisors Team

The model is only 10%

Why System Design Outweighs Model Advances in AI Development

AI Engineering: Building Applications with Foundation Models

Systemic Shift from Model-Centric to System-Centric AI Strategies

AI Prompt Engineering: Foundations of Communication with LLMs – Building Generative AI and Agentic AI Prompt Systems Across Development, Testing, and Deployment (AI Engineering)

Unclear Impact of System Engineering on Long-Term AI Progress

Designing Instruction with Generative AI: 24/7 Support for Optimizing Teaching and Learning

Next Steps for AI System Optimization and Industry Adoption

Mold Test Kit for Home – 12 Simple Detection Tests,Test HVAC System,Home Surfaces,& Indoor Air Quality Testing Kits,DIY Mold Detector at Home,Includes Detailed Mold Identification Guide,air Tester

Key Questions

Why is the model only 10% of the system’s behavior?

How can organizations improve AI performance without changing models?

Does this mean AI model development is no longer important?

What skills should AI teams focus on now?

Will this approach reduce AI development costs?

You May Also Like