Crunching 5,000+ projects...
πŸ“Š A Data Story

The Secret Lives of Open Source Projects

You've seen the GitHub stars. The trending repos. The "10x developer" myths.

We analyzed every metric. Seriously. We counted.

0
Projects Analyzed
0
Organizations
0
Metrics Tracked
Scroll to explore
01 / EFFICIENCY

David vs. Goliath

"Do you need a massive army to move fast?"

We calculated the Commits per Contributor ratio for every project.

  • High Ratio = A small, elite team doing massive work
  • Low Ratio = A large community where each person contributes a little (the "Bazaar" model)

πŸ† Top 15 Most Efficient Projects

Commits per active contributor over the last 12 months

πŸ’‘

The Verdict: Small teams with fewer than 50 contributors can produce up to 1,138 commits per person. Meanwhile, some mega-projects average fewer than 3 commits per contributor. The "Special Forces" of open source are real.

02 / THE TRIAGE TRAP

Speed vs. Quality

"If a project responds instantly to issues, they probably fix them faster too, right?"

We correlated Response Time (how fast they say "Hello") with Resolution Rate (how often they actually close the issue).

⏱️ Response Time vs. Resolution Rate

Each dot is a project. Hover to explore.

Response Time (Hours) β†’
← Resolution Rate
0.03
Correlation Coefficient

That's basically zero. Fast bots saying "Thanks for your issue!" doesn't mean the bug gets fixed.

πŸ€–

The Verdict: Don't be fooled by fast response times. Many projects use bots that reply instantly but never actually address the underlying issue. Speed β‰  Quality.

03 / GROWTH VS MAINTENANCE

Building or Fixing?

"Is the project building a skyscraper or just painting the walls?"

We compare Commit Activity against Codebase Size.

  • High Commits + Low Size = Heavy refactoring or technical debt cleanup
  • High Commits + High Size = Massive expansion

πŸ—οΈ Commit Activity vs. Codebase Size

Projects in the upper-left are "maintenance heavy"

Commits (log scale) β†’
← Codebase Size (LOC)

πŸ”§ Top Maintenance/Refactoring Projects

Rank Project Commits Codebase Size Maintenance Ratio
πŸ”

The Verdict: Projects like Model Context Protocol (MCP) appeared with huge activity but small sizeβ€”classic signs of a new, rapidly iterating standard or heavy refactoring.

04 / HIDDEN GEMS

Corporate Darlings

"Which projects have huge corporate buy-in but relatively small contributor circles?"

We look for a high Organizational Diversity Ratio (Organizations / Contributors).

  • High Ratio = Many companies care, but few people write the code (critical infrastructure)
  • Low Ratio = Massive community project where contributor count dwarfs org count

πŸ’Ž Contributors vs. Organizations

Larger dots = higher organizational diversity

Active Contributors (log scale) β†’
← Active Organizations

πŸ’Ž Top Hidden Gems (High Corporate Backing, Small Teams)

Rank Project Organizations Contributors Diversity Ratio
πŸ†

The Verdict: These "Hidden Gems" are often the safest bets for enterprise adoptionβ€”stable, backed by many, but not chaotic. Think critical infrastructure libraries like Numcodecs or ko.

05 / THE BUS FACTOR

Small Teams, Massive Output

"Which projects are punching way above their weight (and thus have the highest 'Bus Factor' risk)?"

Projects with ≀50 contributors generating massive commit volumes.

⚠️ Risk: High output from a small group means if one key person leaves, the project could stall.

🚌 Small Teams Leaderboard

Total commits from projects with 50 or fewer contributors

Rank Project Total Commits Linux Foundation
🎯

The Verdict: These are the "David" projects of the ecosystem. Impressive, but fragile. If you depend on these, make sure there's a succession plan.

06 / BURNOUT RISK

Running Out of Steam

"Who is running out of steam?"

We looked at projects with High Productivity Scores that have seen a massive drop in momentum (commits plummeting compared to the previous period).

πŸ”₯ Productivity vs. Momentum

Red dots = slowing down. Green dots = accelerating.

Productivity Score (log scale) β†’
← Momentum (% Change)

🚨 Burnout Risk Watchlist

Rank Project Productivity Current Commits Previous Period Momentum
⚠️

The Verdict: We found alarming drops in projects like Islet and CheriBSD (>97% drop). These teams were running hot but are now stalling. If you depend on these, check on them.

07 / THE CHURN TRAP

Motion vs. Progress

"Are they building new features or just rewriting the same code forever?"

We calculated a Churn Ratio: Commits per Net Line of Code Change.

  • Low Ratio (~1.0) = Every commit adds value (Growth)
  • High Ratio (>100) = Hundreds of commits to change 5 lines of code (Spinning wheels)

πŸ”„ High Churn Projects (Spinning Wheels?)

Rank Project Commits Code Changes Churn Ratio
πŸ”„

The Verdict: Projects like Model Context Protocol (MCP) and EVerest have ratios >2,800. This indicates massive refactoring, stabilization, or non-code work. They are spinning their wheels (or polishing the engine) rather than driving forward.

08 / LIBRARIES VS APPS

The "Free Rider" Problem

"Is a low contributor count always bad?"

We segmented our "Hidden Gems" into Libraries and Apps.

  • Libraries (e.g., Resolve, MarkupSafe): High corp use + low contributors = Healthy. Stable APIs don't need a thousand cooks.
  • Apps (e.g., E4S): High corp use + low contributors = Warning. Companies using the app but not giving back.

πŸ“š Hidden Gems Segmentation

Color-coded by type: Libraries vs. Apps vs. Hybrid

Active Contributors (log scale) β†’
← Active Organizations
Library/Tool
End-User App
Hybrid/Platform
Unclassified
πŸ“š

The Verdict: This distinction saves us from flagging a perfectly healthy library as "stagnant". Context mattersβ€”a low contributor count on a library is often a sign of stability, not neglect.

09 / THE TAKEAWAYS

What the Data Tells Us

After analyzing 5,000+ projects, here's what we learned about open source health

⚑

Size β‰  Productivity

Small, focused teams consistently outperform massive communities on a per-contributor basis.

1,138
πŸ€–

Bots β‰  Quality

Fast response times don't correlate with actually fixing issues. Correlation: 0.03.

0.03
πŸ’Ž

Hidden Gems Exist

Critical infrastructure often has high corporate backing but few contributors. These are your best bets.

20
🚌

Bus Factor is Real

Small teams with massive output are impressive but fragile. Know your dependencies.

≀50
πŸ”₯

Burnout Happens

Some projects dropped >97% in activity. High productivity teams can burn out fast.

-97%
πŸ”„

Churn β‰  Progress

High commit counts with little code growth often means refactoring, not feature development.

>2,800x

πŸ“‹ Methodology

Data Source

LFX Insights API from the Linux Foundation

Time Period

Last 12 months (rolling window)

Metrics Used

Contributors, commits, orgs, codebase size, response times, merge velocity

Analysis Tools

Python, Pandas, Altair, Marimo notebooks