What Data from 20m Pull Requests Reveal About AI Transformation
Nicholas Arcolano from Jellyfish shares groundbreaking insights from 20 million pull requests. Discover the real productivity gains, quality impacts, and adoption patterns that define AI's impact on software development.
"We're seeing 2x throughput and 24% faster cycle times across teams adopting AI tools."
Nicholas Arcolano, Head of Research at Jellyfish • 6:10
- Dataset: 20M+ pull requests analyzed
- Productivity: 2x throughput increase
- Cycle time: 24% faster delivery
The Dataset: Unprecedented Scale
Jellyfish analyzed 20 million pull requests from 200,000 developers across ~1,000 companies. This represents one of the most comprehensive studies of AI's impact on software development ever conducted.
Pull Requests
Code changes analyzed from June to mid-2024
Developers
Individual contributors tracked across companies
Companies
From startups to enterprise organizations
Why This Matters
Most AI productivity claims are based on small surveys or hypothetical scenarios. This dataset captures real-world behavior at scale—actual pull requests, actual cycle times, actual outcomes. The findings represent what's really happening, not what vendors claim should happen.
The AI Adoption Explosion
In just a few months (June to mid-2024), AI adoption transformed from experimental to essential. The data shows exponential growth in both company-level adoption and developer-level usage.
Companies with 50%+ AI-Generated Code
From 2% in June 2024 to 50% in a matter of months, roughly 25x growth
Median Developer Adoption Rate
From 22% in summer 2024 to ~90% at present, roughly 4x growth as tools became mainstream
"We went from 2% to 50% of companies generating half their code with AI in a matter of months. This isn't gradual adoption—it's a transformation."
Productivity Impact: The Real Gains
The data reveals clear, measurable productivity improvements as AI adoption increases. Teams using AI tools are shipping faster and delivering more.
PR Throughput
Teams at 100% AI adoption produce twice as many pull requests as teams at 0% adoption.
From 0% to 100% AI adoption
Cycle Time
Full-cycle PR delivery is nearly a quarter faster with complete AI adoption.
From 0% to 100% AI adoption
Increased PR Volume
Teams push more pull requests when using AI tools. Higher throughput without sacrificing quality.
Faster Processing
Both writing and merging cycles accelerated. From commit to merge, AI compresses timelines.
Larger PR Sizes
Pull requests are 18% larger (net lines added). AI generates more thorough code changes.
More Verbose Changes
AI doesn't expand scope: the number of files touched stays the same, and the growth comes from net new code rather than rewrites. Changes are simply more thorough.
The Bottom Line
AI tools aren't just making developers feel faster; they are measurably accelerating software delivery. Doubling throughput and cutting cycle times by 24% translates into substantial efficiency gains for organizations. This is the AI productivity promise delivered in practice, not in theory.
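To make the headline metrics concrete, here is a minimal Python sketch of how a team could compute PR throughput and first-commit-to-merge cycle time from its own pull request data. The record fields and team names are hypothetical placeholders, not Jellyfish's schema or the exact metric definitions used in the study.

```python
from datetime import datetime

# Hypothetical PR records; field names ("team", "first_commit", "merged_at")
# are illustrative only, not Jellyfish's schema.
prs = [
    {"team": "payments", "first_commit": datetime(2024, 6, 3),  "merged_at": datetime(2024, 6, 5)},
    {"team": "payments", "first_commit": datetime(2024, 6, 10), "merged_at": datetime(2024, 6, 11)},
    {"team": "search",   "first_commit": datetime(2024, 6, 4),  "merged_at": datetime(2024, 6, 9)},
]

def pr_throughput(prs, team):
    """Merged-PR count for a team over whatever window the records cover."""
    return sum(1 for pr in prs if pr["team"] == team and pr["merged_at"] is not None)

def median_cycle_time(prs, team):
    """Median first-commit-to-merge duration, one common definition of cycle time."""
    durations = sorted(
        pr["merged_at"] - pr["first_commit"]
        for pr in prs
        if pr["team"] == team and pr["merged_at"] is not None
    )
    if not durations:
        return None
    mid = len(durations) // 2
    if len(durations) % 2:
        return durations[mid]
    return (durations[mid - 1] + durations[mid]) / 2

print(pr_throughput(prs, "payments"))      # -> 2
print(median_cycle_time(prs, "payments"))  # -> 1 day, 12:00:00
```

Tracked over time and segmented by AI adoption level, these two numbers are the same signals behind the 2x throughput and 24% cycle-time figures above.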
The Critical Finding: Architecture Determines AI Success
Not all organizations see the same productivity gains from AI. The data reveals a stark correlation between software architecture and AI effectiveness. This is the most important insight for leaders considering AI transformation.
4x Gains
Organizations with modular, well-architected codebases see 4x productivity improvements from AI adoption.
Microservices, clean separation of concerns, testable code
0x Gains
Organizations with tightly coupled, distributed architectures see virtually no productivity benefit from AI tools.
Ball of mud, tangled dependencies, untestable code
The implication: AI isn't a magic bullet. It amplifies existing codebase characteristics. Well-architected software becomes dramatically more productive with AI. Poorly architected codebases see minimal benefit. Architecture modernization must precede or accompany AI adoption.
"Architecture is the multiplier. AI makes good architectures great and bad architectures worse. The 4x vs 0x gap tells us that AI transformation is actually an architecture transformation."
— Analysis of Jellyfish dataset findings
Correlation between software architecture and AI productivity gains
Quality Impact: No Bugs, No Regrets (14:00)
The most common fear about AI-generated code is quality degradation. The data shows no statistically significant relationship between AI adoption and bug creation or PR reverts.
Bug Creation
No significant change (stable)
PR Reverts
No significant change (stable)
Bug Resolution
Increased rates relative to baseline
Quality Remains Stable
Despite dramatic increases in PR volume (2x throughput), there's no corresponding spike in bugs. AI-generated code is not measurably buggier than human-written code.
AI Tackles Backlog
Bug resolution rates have actually increased with AI adoption. Teams use AI tools to address technical debt and resolve long-standing issues.
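As a rough illustration of the revert signal tracked here, below is a minimal sketch of a revert-rate metric. It assumes each PR record carries a hypothetical `was_reverted` flag; real revert detection (for example, matching revert commits back to their original PRs) is more involved, and this is not how Jellyfish computes it.

```python
def revert_rate(prs):
    """Share of merged PRs that were later reverted (hypothetical `was_reverted` flag)."""
    merged = [pr for pr in prs if pr.get("merged_at") is not None]
    if not merged:
        return 0.0
    return sum(1 for pr in merged if pr.get("was_reverted")) / len(merged)

# Example cohort: 98 clean merges and 2 reverted ones -> 2.0% revert rate.
cohort = (
    [{"merged_at": "2024-06-01", "was_reverted": False}] * 98
    + [{"merged_at": "2024-06-01", "was_reverted": True}] * 2
)
print(f"{revert_rate(cohort):.1%}")  # -> 2.0%
```

Comparing this rate across low- and high-adoption cohorts is one simple way to check whether AI-assisted code is being backed out more often; the dataset shows no significant difference.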
"We worried AI would flood teams with low-quality code. That's not happening. Quality is stable, and teams are using AI to fix existing bugs faster than ever."
Reality Check: Autonomous Agents (Mid-2024 Data)
This data reflects the AI landscape in mid-2024. Autonomous agents have evolved dramatically since then. Treat these findings as a historical snapshot, not current state.
Data Timestamp Warning
Critical caveat: The autonomous agent statistics below are from mid-2024. The agent landscape has changed significantly in late 2025.
What's changed: Devin, AutoGPT, OpenAI's Swarm, Claude's Computer Use, and numerous other agents have matured. Current adoption is likely much higher than the <2% reported here.
Our recommendation: Use this data as a baseline for understanding the early agent landscape, not as a reflection of current capabilities or adoption.
PRs from Autonomous Agents
As of mid-2024, less than 2% of pull requests came from fully autonomous agents like Devin or Codex.
Mostly Experimentation
Autonomous agent usage was primarily in trial and experimentation phases. Very few companies had agents running at full production scale.
Interactive vs Autonomous Tools (Mid-2024)
Interactive tools saw high adoption across the dataset, while autonomous agents remained experimental.
Top Quotes from the Talk
Direct quotes with timestamped YouTube links for verification
"On average, a company should expect to double their PR throughput if they go from not using AI at all to 100% adoption of AI coding tools."
"More work is happening and it's happening faster."
"Most of today's tools are really set up best to work with one repo at a time. Combining context across repos is often challenging."
"The relationships between these repos and the systems and products they relate to, they're often not even written down very clearly. They might be largely locked in the heads of senior engineers."
"We can all ease up on some extreme quality anxiety. Like we want to keep an eye on that, but we're just not seeing big issues there. At least not yet."
"We're seeing 2x throughput and 24% faster cycle times across teams adopting AI tools."
"If you're like me and you're using multiple tools constantly in parallel, both synchronous and asynchronous modes, you're at 100%."
"However, the reality is that for many teams, there are still real technical, organizational, and cultural barriers to adopting these tools more completely."
"Pull requests are 18% larger in terms of net lines of code added. Due to additions, not deletions. Net new code, not rewrites."
"Number of files touched remains the same. So it's not that we're changing the scope of the PR."
"Architecture is the multiplier. AI makes good architectures great and bad architectures worse."
"You're not going to see gains until you get folks using these tools at scale. Developer-level adoption is the key metric."
Key Takeaways
For Engineering Leaders
Strategic Implications
- AI delivers measurable productivity gains: 2x throughput, 24% faster cycle times
- Architecture determines AI success: 4x gains for modular codebases, near-zero for tightly coupled ones
- Modernize architecture before or alongside AI adoption initiatives
- Track PR throughput and cycle time as key AI ROI metrics (see the sketch after this list)
- Quality remains stable, so focus on velocity rather than bug fears
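For leaders sizing the opportunity, the sketch below is a back-of-the-envelope planning aid, not a method from the talk: it interpolates linearly between the study's two endpoints, 1x throughput at 0% adoption and roughly 2x at 100%. The linear ramp is an assumption; the talk reports only the endpoints.

```python
BASELINE_MULTIPLIER = 1.0       # throughput multiplier at 0% AI adoption
FULL_ADOPTION_MULTIPLIER = 2.0  # ~2x throughput at 100% adoption (Jellyfish headline figure)

def expected_throughput(current_prs_per_month: float, adoption_rate: float) -> float:
    """Estimate merged PRs per month at a target adoption rate, assuming a linear ramp.

    `current_prs_per_month` is the team's baseline with little or no AI usage;
    `adoption_rate` is the planned share of developers using AI tools (0.0-1.0).
    """
    multiplier = BASELINE_MULTIPLIER + (FULL_ADOPTION_MULTIPLIER - BASELINE_MULTIPLIER) * adoption_rate
    return current_prs_per_month * multiplier

# Example: a team merging 120 PRs/month today, planning to reach 60% adoption.
print(expected_throughput(120, 0.6))  # -> 192.0
```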
For Developers
Practical Guidance
- Interactive tools (Copilot, Cursor, Claude) show immediate productivity benefits
- AI doesn't degrade code quality: bug rates remain stable
- Expect larger, more thorough PRs with AI assistance
- Use AI to tackle backlog bugs and technical debt
- Adoption is mainstream: ~90% of developers now use AI tools
For Researchers
Data Insights
- 20M+ PRs make this one of the largest AI productivity studies to date
- Adoption explosion: companies with 50%+ AI-generated code grew from 2% to 50% within months
- Autonomous agent adoption was under 2% as of mid-2024 (likely much higher now)
- The architecture correlation is a critical finding that needs further research
- Quality stayed stable despite 2x throughput, countering fears of AI-driven quality decay
Source Video
Nicholas Arcolano
Head of Research • Jellyfish
What Data from 20m Pull Requests Reveal About AI Transformation
Research Note: All quotes in this report are timestamped and link to exact moments in the video for validation. Data reflects mid-2024 AI landscape—autonomous agent adoption has likely increased significantly since this talk was given.
Key Concepts: AI transformation, pull request analytics, productivity measurement, GitHub Copilot, Cursor, Claude Code, software engineering intelligence, Jellyfish platform, AI adoption trends, cycle time reduction, PR throughput