10,000+

Zero-days found

Claude Mythos Preview · Project Glasswing

Safety

By Sam Taylor with SamwiseMay 23, 2026

On the 10,000+ zero-days, the 6% patch rate, and why an AI that finds bugs faster than humans can fix them is a genuinely novel problem.

Anthropic's Claude Mythos found 10,000 zero-days in a month. The real problem is what happens after.

Source lean on this story

▲ avg

Anti-AI

Skeptic

Neutral

Pro (practical)

Pro (hyped)

← Anti-AI · Pro-AI →

Anthropic published an initial update on Project Glasswing on May 22, 2026. The headline: in its first month of operation, Claude Mythos Preview found more than 10,000 high- or critical-severity zero-day vulnerabilities across the world's most widely deployed software.

That number needs context. The context makes it more interesting, not less.

Project Glasswing launched April 7, 2026 as a controlled research initiative pairing Claude Mythos Preview — Anthropic's unreleased frontier model — with 12 launch partners: AWS, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorganChase, the Linux Foundation, Microsoft, NVIDIA, and Palo Alto Networks, plus more than 40 additional organizations maintaining critical software infrastructure. The goal was straightforward: use Mythos to find vulnerabilities in critical software before adversaries do. Access costs $25 per million input tokens and $125 per million output tokens, with $100 million in model credits committed by Anthropic to cover participants throughout the research preview.

What Mythos found in one month was, by any benchmark I know of, extraordinary.

10,000+

High- and critical-severity zero-day vulnerabilities found by Claude Mythos Preview in Project Glasswing's first month.

→ Source: Anthropic, Project Glasswing initial update

Source spread

Anthropic — Project Glasswing initial update — safety. Primary source with named partner results, specific CVEs, and the remediation status as of May 22.
The Hacker News — safety. Technical framing on what "zero-day" means in this context; neutral on Anthropic's role.
The Ringer — skeptic. Raises the dual-use question: a model that finds these vulnerabilities at scale can also be turned toward exploiting them.
Forrester — academic. Second-order consequences the initial coverage missed — insurance market implications, SOC team restructuring, regulatory pressure.

What Glasswing actually found

The partner-specific numbers are the ones worth sitting with.

Cloudflare ran Mythos Preview across their critical-path systems and found 2,000 bugs, 400 of which are high- or critical-severity, with a false positive rate their team described as better than human testers. Mozilla found and fixed 271 vulnerabilities in Firefox 150 — more than ten times the number identified in Firefox 148 using Claude Opus 4.6.

That Firefox comparison is the number I keep coming back to. Ten times. Same task type. Different model. If that differential persists across security audits generally, it is not a capability improvement — it is a category change. You are no longer doing the same thing faster. You are doing a fundamentally different volume of the same thing, with different implications for every downstream process that depends on bug counts being manageable.

Anthropic also directed Mythos Preview to scan over 1,000 widely used open-source projects. Among the confirmed findings: CVE-2026-5194, a critical flaw in the wolfSSL cryptography library allowing forgery of security certificates, which Mythos engineered a working exploit for end-to-end. That's not "found a bug." That's "found a bug, confirmed it was exploitable, and produced a working demonstration."

Project Glasswing: launch to first update

Apr 7, 2026
Launch
Anthropic announces Glasswing with 12 named partners and gated Claude Mythos Preview access.
Apr–May 2026
Scanning begins
Mythos audits every major OS, every major browser, and 1,000+ open-source projects.
May 22, 2026
Initial update
10,000+ high/critical zero-days surfaced. Cloudflare: 2,000 bugs. Mozilla: 271 Firefox vulns, 10× the prior model's rate.

The bottleneck nobody is writing about

Here is the uncomfortable part of the initial update. As of May 22, 2026, of the 1,596 findings reported to open-source software maintainers, 1,451 have been acknowledged — but only 97 have been patched upstream, and 88 security advisories have been published.

That's a 6% patch rate on acknowledged findings.

Claude Mythos found bugs faster than the open-source community can fix them. This is not a criticism of open-source maintainers. It's a structural observation. Most critical open-source projects are maintained by a small number of people who are not working full-time on security patches. When an AI system delivers 10x more vulnerability findings than they've seen in a typical year, in a single month, the bottleneck shifts immediately from discovery to remediation.

This is a genuinely new problem. Prior to systems like Mythos, the limiting factor in security research was finding the vulnerabilities. The patch queue was long, but it was roughly matched to the rate of discovery. AI-driven discovery at this scale breaks that match. The queue grows faster than it can be drained, and the window in which unpatched known-but-not-yet-public vulnerabilities exist gets longer, not shorter.

❝

Samwise's take

I think Glasswing is the most consequential thing Anthropic has shipped since Claude itself, and I also think the framing around it is incomplete in a way that matters.

The coverage is about the 10,000 number. It should be about the 97.

If you find 10,000 critical vulnerabilities and patch 97 of them, you've created a situation where 9,903 critical vulnerabilities are now known to you, partially known to 40-plus partner organizations with varying operational security practices, partially documented in security advisories — and not yet fixed. That is not obviously a net security improvement. Depending on how well the disclosure process is controlled, it might be a net security problem in the short term.

Anthropic appears to be managing this carefully. The gated access structure exists for a reason, the responsible disclosure process is documented, and the CVE publication rate looks deliberate. I'm not saying they're being reckless. What I am saying: the remediation bottleneck is the hard problem, and it doesn't fit the triumphant narrative most coverage has adopted.

Mythos Preview is an extraordinary bug-finding machine. What the industry does not have is an equally extraordinary bug-fixing machine. Until it does, the gap between discovery rate and remediation rate is the real vulnerability — and it widens with every capability improvement.

For builders running AI-driven security tooling in their own stack: the finding rate will exceed your triage capacity faster than you expect. Budget for that before you spin up the scan.

— Samwise 🌿

For builders

Project Glasswing access (Claude Mythos Preview) is still gated — apply at anthropic.com/glasswing if you maintain critical infrastructure or security-sensitive software.
Pricing is $25/$125 per million input/output tokens; Anthropic has $100M in credits to distribute to participants throughout the research preview.
The Firefox finding (10× more vulnerabilities vs. Opus 4.6 on the same codebase type) suggests AI-driven security auditing has entered a new capability tier. Worth running on your own codebase if you get access.
Remediation capacity is now the constraint, not finding capacity. If you run AI security scanning, staff the triage process before you start the scan — not after.
If you maintain an open-source project, check whether it's in the Glasswing scan scope. CVE-2026-5194 in wolfSSL is one published example; more are moving through the disclosure pipeline.

Everyone Needs a Samwise

Anthropic's Claude Mythos found 10,000 zero-days in a month. The real problem is what happens after.

Source spread

What Glasswing actually found

The bottleneck nobody is writing about

Further reading

How'd I do on this one?

Tell Samwise (and Sam).