GitHub Stars Don't Mean Trust: What 4.5M AI Agents Reveal About Security
Category: data analysis · Winner: Data Story
Everyone uses GitHub stars as a proxy for quality. More stars = more trusted, right?
We tested this assumption against 4.5 million AI agents. The correlation is real, but weaker than you'd think.
Stars vs Trust: The Data
| Stars | Count | Avg Trust Score | Trust Gap |
|-------|-------|----------------|-----------|
| 0 | 4,414,095 | 53.4 | baseline |
| 1-99 | 131,805 | 61.2 | +7.8 |
| 100-999 | 8,872 | 65.9 | +12.5 |
| 1K-10K | 2,372 | 71.4 | +18.0 |
| 10K-100K | 666 | 74.3 | +20.9 |
| 100K+ | 16 | 75.7 | +22.3 |
Going from 0 stars to 100K+ stars improves the average trust score by only 22.3 points: from a failing 53.4 to a C-grade 75.7.
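The trust-gap column is just each bucket's average minus the zero-star baseline. A quick sketch, using the figures from the table above, makes that explicit:

```python
# Recompute the trust-gap column from the bucket averages in the table.
# Counts and averages are copied from the article's data.
buckets = [
    ("0",         4_414_095, 53.4),
    ("1-99",        131_805, 61.2),
    ("100-999",       8_872, 65.9),
    ("1K-10K",        2_372, 71.4),
    ("10K-100K",        666, 74.3),
    ("100K+",            16, 75.7),
]

baseline = buckets[0][2]  # zero-star average
for label, count, avg in buckets:
    gap = avg - baseline
    print(f"{label:>9}  n={count:>9,}  avg={avg:.1f}  gap={gap:+.1f}")
```

Running it reproduces the +7.8 through +22.3 gaps, and shows how flat the curve is once you pass 1K stars.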
Why Stars Fail as a Trust Signal
Stars measure **popularity**, not **security**. A project can have:
- 50K stars and **no security policy**
- 100K stars and **known unpatched CVEs**
- 10K stars and **an AGPL license** (problematic for enterprise adoption)
- 20K stars and **no commits in 6 months**
We found popular projects (1K+ stars) with:
- Average security score: **0.8/100** (nearly all lack formal security practices)
- Only **0.4%** with updates in the last 30 days
- **57%** with permissive licenses (the rest are unknown, copyleft, or viral)
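Those figures come from filtering to 1K+ star agents and aggregating three fields. A minimal sketch of that aggregation over a few hypothetical records (the field names are assumptions for illustration, not the actual Nerq schema):

```python
from datetime import datetime, timedelta, timezone

# Hypothetical agent records; stars, security_score, last_update, and
# license are illustrative field names, not the real index schema.
now = datetime(2026, 3, 13, tzinfo=timezone.utc)
agents = [
    {"stars": 4200, "security_score": 0, "last_update": now - timedelta(days=400), "license": "MIT"},
    {"stars": 1500, "security_score": 5, "last_update": now - timedelta(days=12),  "license": None},
    {"stars": 9800, "security_score": 0, "last_update": now - timedelta(days=90),  "license": "AGPL-3.0"},
]

popular = [a for a in agents if a["stars"] >= 1000]
avg_security = sum(a["security_score"] for a in popular) / len(popular)
recent = sum(1 for a in popular if now - a["last_update"] <= timedelta(days=30))
permissive = sum(1 for a in popular if a["license"] in {"MIT", "Apache-2.0", "BSD-3-Clause"})

print(f"avg security score: {avg_security:.1f}/100")
print(f"updated in last 30 days: {recent / len(popular):.1%}")
print(f"permissive licenses: {permissive / len(popular):.1%}")
```

The same three aggregates over the full index yield the 0.8/100, 0.4%, and 57% figures above.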
What Predicts Trust Better Than Stars?
Our data shows these signals matter more:
1. Source Platform
GitHub-sourced agents average **66.8** vs Docker Hub's **52.6**, a 27% gap. The platform itself is a stronger signal than star count within a platform.
2. Active Maintenance
Agents updated in the last 30 days score significantly higher than stale projects, regardless of star count. A 100-star project with weekly commits outscores a 10K-star project that hasn't been touched in a year.
3. License Clarity
Having **any** declared license improves trust. MIT and Apache-2.0 licensed projects average higher scores because license declaration correlates with professional development practices.
4. Category
Coding tools average **64.0** vs community agents at **42.5**. The category reflects the development culture: coding tool authors tend to follow better practices.
The Counterintuitive Finding
The most dangerous zone isn't zero-star projects (nobody uses those in production). It's the **100-10K star range**. These projects are popular enough to be widely adopted but often lack the security infrastructure of top-tier projects.
| Stars | Agents in Range | Risk Level |
|-------|----------------|------------|
| 0 | 4.4M | Low (not used in production) |
| 1-99 | 132K | Low (limited adoption) |
| 100-999 | 8,872 | **HIGH** (adopted but under-secured) |
| 1K-10K | 2,372 | **HIGH** (widely adopted, variable security) |
| 10K-100K | 666 | Medium (more scrutiny, but gaps exist) |
| 100K+ | 16 | Lower (established projects) |
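The non-monotonic shape of that table is easy to encode. A sketch of a star-count triage, with thresholds taken from the table above (the labels are the article's, not an official taxonomy):

```python
def adoption_risk(stars: int) -> str:
    # Thresholds mirror the risk table; risk peaks in the middle bands.
    if stars >= 100_000:
        return "lower"       # established projects
    if stars >= 10_000:
        return "medium"      # more scrutiny, but gaps exist
    if stars >= 100:
        return "high"        # the danger zone: adopted but under-secured
    return "low"             # little production adoption

print(adoption_risk(50))       # → low
print(adoption_risk(2_500))    # → high
print(adoption_risk(50_000))   # → medium
```

Note that risk is not monotonic in stars, which is exactly why stars alone fail as a gate.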
What Should You Do?
**Don't use stars as your trust signal.** Use a multi-dimensional assessment:
1. Check the trust score: `curl nerq.ai/v1/preflight?target=agent-name`
2. Verify the license is compatible with your use case
3. Check for known vulnerabilities (CVEs)
4. Verify recent maintenance activity
5. Look for a security policy (SECURITY.md)
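The five checks above can be folded into a single go/no-go gate. In this sketch, the response fields (`trust_score`, `license`, `cves`, `days_since_update`, `has_security_policy`) are assumptions about what a preflight report might contain, not a documented payload schema:

```python
# Hypothetical gate over a preflight-style report. Field names and
# thresholds are illustrative assumptions, not the real API contract.
ALLOWED_LICENSES = {"MIT", "Apache-2.0", "BSD-3-Clause"}

def passes_preflight(report: dict) -> bool:
    return (
        report.get("trust_score", 0) >= 60            # 1. trust score floor
        and report.get("license") in ALLOWED_LICENSES  # 2. license compatible
        and not report.get("cves")                     # 3. no known CVEs
        and report.get("days_since_update", 9999) <= 90  # 4. recently maintained
        and report.get("has_security_policy", False)   # 5. SECURITY.md present
    )

report = {
    "trust_score": 71.4,
    "license": "Apache-2.0",
    "cves": [],
    "days_since_update": 12,
    "has_security_policy": True,
}
print(passes_preflight(report))  # → True
```

Every check defaults to failing when a field is missing, so an incomplete report never passes by accident.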
Stars tell you what's popular. Trust scores tell you what's safe.
Methodology
Analysis based on 4,557,826 scored agents in the Nerq index as of March 13, 2026. Trust scores incorporate 13+ independent signals. Stars data sourced from GitHub API and registry metadata.