AI Engineering Insight

Why Anthropic Says "Don't Build Agents, Build Skills Instead"

After building Claude Code (their production coding agent), Anthropic realized something fundamental: we've been building AI agents wrong. The future isn't domain-specific agents—it's composable skills that any general-purpose agent can pull in at runtime.

Put simply, we think code is all we need.

— Barry Zhang, Anthropic (00:01:20)

16 min

Visionary talk from AI Engineer 2024

Anthropic logoSkills

Modular knowledge folders

5 weeks

To enterprise adoption

The Core Problem: Intelligence ≠ Expertise

Who do you want doing your taxes—Mahesh, a 300 IQ mathematical genius, or Barry, an experienced tax professional? Most people would pick Barry every time.

Who do you want doing your taxes? Is it going to be Mahesh, the 300 IQ mathematical genius, or is it Barry, an experienced tax professional, right? I would pick Barry every time. I don't want Mahesh to figure out the 2025 tax code from first principles.

— Barry Zhang, Anthropic

Watch the tax analogy (00:02:18)

Current Agents Are Like Mahesh

They're brilliant (high intelligence) but lack domain expertise. They can't absorb expertise well and don't learn over time.

The Missing Piece

We've been trying to make agents smarter. The real solution is giving them domain expertise in a way they can actually use.

The Solution: Skills as Modular Knowledge Units

Instead of building a "tax agent," "coding agent," or "research agent," Anthropic proposes building one general-purpose agent and giving it skills—modular, composable folders of expertise.

Skills are organized collections of files that package composable procedural knowledge for agents. In other words, they're folders. This simplicity is deliberate.

— Barry Zhang, Anthropic

Watch (00:03:02)

skill.md

Metadata and core instructions for what the skill does

Scripts/Executables

Tools and utilities the agent can use

Code/Assets

Supporting materials and context

Why This Works

Code is self-documenting, modifiable, and can live in the file system until needed. When Claude kept writing the same Python script to apply styling to slides, they saved it as a skill. Now Claude just runs the script instead of regenerating it every time.

Technical Breakthrough: Progressive Disclosure

If you have thousands of skills, you can't load them all into the context window at once. You'd run out of tokens immediately. The solution: progressive disclosure.

Step 1: Metadata Only

At runtime, only skill metadata is shown to the model to indicate what's available.

Lightweight

Step 2: Load on Demand

When an agent needs to use a skill, it reads the full skill.md with core instructions.

Full Content
That's why skills are progressively disclosed. At runtime, only this metadata is shown to the model just to indicate that he has the skill. When an agent needs to use a skill, it can read in the rest of the skill.md, which contains the core instruction and directory for the rest of the folder.

— Barry Zhang, Anthropic

Watch (00:04:27)

Real-World Adoption: Enterprise in 5 Weeks

Since launching just 5 weeks before this talk, the skills ecosystem has grown rapidly—with thousands of skills and strong enterprise adoption.

Since our launch five weeks ago, this very simple design has translated into a very quickly growing ecosystem of thousands of skills.

— Barry Zhang, Anthropic (00:05:04)

Fortune 100 Adoption

Using skills to teach agents about organizational best practices and bespoke internal software.

"We've been talking to Fortune 100s that are using skills as a way to teach agents about their organizational best practices and the weird and unique ways that they use this bespoke internal software."

— Mahesh Murag (00:06:43)

Developer Productivity

Teams serving thousands of developers are using skills to teach agents about code style best practices.

"These are teams serving thousands or even tens of thousands of developers in an organization that are using skills as a way to deploy agents like Claude Code and teach them about code style best practices."

— Mahesh Murag (00:07:03)

Why This Works

Skills solve the "bespoke software" problem. Every company has weird internal tools and processes. Skills let you teach agents about your unique environment without rebuilding the agent itself.

Third-Party Skills: The Partner Ecosystem

Partners aren't building agents—they're building skills that extend what Claude can do. This is the composable future Anthropic is betting on.

Notion

Workspace research and analysis skills that help Claude better understand your Notion workspace and do deep research.

"Notion launched a bunch of skills that help Claude better understand your Notion workspace and do deep research over your entire workspace."

— Barry Zhang (00:06:21)

Browserbase

Web Automation

Built a skill for their open-source browser automation tooling, Stage Hand.

"Browserbase is a pretty good example of this. They built a skill for their open-source browser automation tooling, stage hand."

— Barry Zhang (00:06:04)

Cadence

Scientific Research

Scientific research skills for EHR data analysis and Python bioinformatics libraries.

"We're also really excited to see people like Cadence build scientific research skills that give Claude new capabilities like EHR data analysis and using common Python bioinformatics libraries better than it could before."

— Barry Zhang (00:05:45)

Three Emerging Trends in Skills Development

1. Skills Getting More Complex

The most basic skill is a markdown file with prompts. But we're seeing skills that package software, executables, binaries, files, code, scripts, and assets—much like traditional software applications.

"But we're starting to see skills that package software, executables, binaries, files, code, scripts, assets, and a lot more. But we think that increasingly much like a lot of the software we use today, these skills might take weeks or months to build and be maintained."

— Mahesh Murag (00:07:46)

2. Skills Complementing MCP

MCP (Model Context Protocol) provides connectivity to the outside world. Skills provide expertise and orchestration. Developers are using skills to orchestrate workflows of multiple MCP tools.

"Developers are using and building skills that orchestrate workflows of multiple MCP tools stitched together to do more complex things with external data and connectivity. In these cases, MCP is providing the connection to the outside world while skills are providing the expertise."

— Mahesh Murag (00:08:12)

3. Non-Technical Builders (Most Exciting)

People in finance, recruiting, accounting, and legal are building skills—without coding. This democratizes AI development by letting domain experts encode their knowledge directly.

"And finally, and I think most excitingly for me personally, is we're seeing skills that are being built by people that aren't technical. These are people in functions like finance, recruiting, accounting, legal, and a lot more. I think this is pretty early validation of our initial idea that skills help people that aren't doing coding work extend these general agents."

— Mahesh Murag (00:08:40)

MCP + Skills: The Perfect Pair

The division of labor is clear: MCP provides connectivity (APIs, databases, services) while skills provide expertise (how to use those connections effectively).

MCP = Connection

Connects agents to the outside world—APIs, databases, services, data sources.

Connectivity Layer

Skills = Expertise

Provides orchestration and domain logic—how to use those connections effectively.

Intelligence Layer

Example: Financial Reporting Skill

A "financial reporting" skill might orchestrate multiple MCP servers:

  • MCP Server for Bloomberg API (market data)
  • MCP Server for Salesforce (CRM data)
  • MCP Server for PostgreSQL (historical data)
  • Skill contains the logic for combining these sources

Clean separation: MCP servers stay simple (just connectivity). Skills get smart (orchestration and domain logic).

Future Vision: Continuous Learning

Current agents don't learn across sessions. Every conversation starts fresh. Skills make learning transferable.

When you first start using Claude, this standardized format gives a very important guarantee. Anything that Claude writes down can be used efficiently by a future version of itself. This makes the learning actually transferable.

— Barry Zhang (00:13:35)

The Vision

"Our goal is that Claude on day 30 of working with you is going to be a lot better than Claude on day one."

Day 30 > Day 1

Already Real

"Claude can already create skills for you today using our skill creator skill and we're going to continue pushing in that direction."

Available Now

How It Works

  1. 1. Claude creates a skill to remember something important
  2. 2. Future versions of Claude can read that skill
  3. 3. Knowledge persists across sessions
  4. 4. The agent gets better over time

The Collective Knowledge Base

The vision that excites Anthropic most is a collective and evolving knowledge base of capabilities curated by people and agents inside organizations.

Network Effects

As you interact with an agent and give it feedback, all agents in your team and org get better too.

Instant Onboarding

When someone joins your team, Claude already knows what your team cares about.

Community Benefits

Skills built by others make your agents more capable, reliable, and useful.

"So just like when someone else across the world builds an MCP server that makes your agent more useful, a skill built by someone else in the community will help make your own agents more capable, reliable, and useful as well."

— Barry Zhang (00:13:00)

The Computing Analogy: Why Skills Are the Application Layer

In a rough analogy, models are like processors. Both require massive investment and contain immense potential, but are only so useful by themselves.

The OS made processors far more valuable by orchestrating the processes, resources, and data around the processor. A few companies build processors and operating systems, but millions of developers like us have built software that encoded domain expertise and our unique points of view.

— Barry Zhang (00:14:52)

Watch the computing analogy (00:14:41)
LayerComputingAI Agents
HardwareProcessorsModels (Claude)
OSOperating SystemAgent Runtime
ApplicationsSoftwareSkills

The Opportunity

The real value in computing wasn't in building processors or operating systems—it was in the applications layer. Skills open up the "application layer" for AI development, where domain expertise gets encoded and shared.

Key Takeaways for AI Engineers

1. Intelligence ≠ Expertise

Agents are brilliant but lack domain knowledge. Skills bridge that gap.

2. Code is Universal Interface

You don't need domain-specific scaffolding. Code works for everything.

3. Progressive Disclosure Enables Scale

Load metadata first, full content on-demand. Thousands of skills possible.

4. MCP + Skills = Complete Stack

MCP provides connectivity, skills provide expertise.

5. Non-Technical Users Are Building Skills

Finance, legal, accounting teams are already doing this.

6. Enterprise Adoption is Real

Fortune 100s using skills for organizational knowledge and internal tools.

7. Continuous Learning is Possible

Agents can create skills for their future selves. Day 30 Claude > Day 1 Claude.

8. Skills Enable Collective Intelligence

One person's skill makes everyone's agent better.

9. Stop Building Agents, Start Building Skills

The future is composable, not monolithic. As Barry Zhang says: "So we think it's time to stop rebuilding agents and start building skills instead." (00:15:55)

Source Video

Anthropic logo

Don't Build Agents, Build Skills Instead

Barry Zhang & Mahesh Murag, Anthropic • AI Engineer 2024

Video ID: CEvIs9y1uogDuration: ~16 minutes
Watch on YouTube

Research Note: All quotes in this report are timestamped and link to exact moments in the video for validation. This analysis was conducted using multi-agent transcript analysis.

Technologies Mentioned: MCP (Model Context Protocol), Claude Code, Agent Skills, Python, Git, File Systems, Bash, Notion, Browserbase, Cadence

Research sourced from AI Engineer 2024 conference transcript. Analysis conducted using dedicated agents for transcript analysis, highlight extraction, fact-checking, and content strategy. All quotes verified against original VTT file.