Claude Mythos CISO AI Security Guide
Key Takeaways: What CISOs Need to Know Right Now
For context: I’ve spent more than two decades in offensive security, and I’ve seen plenty of things get overhyped. This isn’t one of them. Claude Mythos is the most significant shift I’ve seen in this field, and I want you to hear that as an operational assessment, not a vendor pitch.
- Critical: The exploitation window has collapsed.
What used to take weeks of expert analysis now happens overnight, or faster. Quarterly pen tests and annual red team engagements aren’t just insufficient anymore. They’re a liability.
- Strategic: Defenders have a real but shrinking advantage.
Organizations in Project Glasswing can find and fix vulnerabilities before the same offensive capabilities reach threat actors at scale. That window exists right now. It won’t last.
- Operational: Tooling alone won’t save you. Human judgment still matters.
A fully automated continuous program will surface findings. But it can’t tell you which ones actually threaten your business, your threat model, or your conversation with the board next quarter. That part still requires people.
What Is Claude Mythos? The AI Model Too Dangerous to Release
On April 7, 2026, Anthropic did something I’d never seen a frontier AI lab do before. They built their most powerful model and decided not to release it. Not because it didn’t work. Because it worked too well.
What Mythos can do is find zero-day vulnerabilities at scale, across every major operating system and browser, on its own. Engineers with no formal security background asked the model to find remote code execution vulnerabilities and woke up the next morning to complete, working exploits. During internal testing, it identified thousands of zero-days, 99% of which had no existing defense at the time of announcement, including a flaw in a 27-year-old operating system that had survived five million prior tests without detection. That’s not a research result. That’s a before-and-after moment for our industry.
Instead of releasing publicly, Anthropic stood up Project Glasswing: a restricted group of roughly 50 organizations, including Amazon, Apple, Google, Microsoft, and CrowdStrike, given access to a preview version specifically for defensive security work. Anthropic put $100 million in compute credits behind it. That level of commitment tells you something about how seriously they took the risk.
How Claude Mythos Changes Your Organization’s Attack Surface
The UK’s AI Security Institute (AISI) ran independent tests and published their findings. On expert-level penetration testing simulations, tasks that no AI model could complete just a year ago, Mythos Preview succeeded 73% of the time. Those same tasks were estimated to take a skilled human team 20 hours. AISI also built a 32-step corporate network attack simulation, and Mythos became the first AI model to complete it end to end, averaging 22 of 32 steps across all attempts.
What that means practically: the gap between discovering a vulnerability and having a working weapon has effectively closed. Your periodic assessment model (quarterly pen tests, annual red team engagements) was always a sample. Against Mythos-class capabilities, that sample is no longer close to enough.
Arctic Wolf’s incident response data makes the underlying problem even clearer: 3 out of 4 breaches last year traced back to vulnerabilities that already had patches available. The organizations that got hit weren’t blindsided by something unknown. They just hadn’t closed the gap fast enough. Mythos-class AI compresses that window for defenders who use it, and accelerates exploitation for adversaries who eventually will.
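One way to make that gap visible internally is to measure it. The sketch below is a minimal, illustrative way to compute how long an environment stayed exploitable after a fix existed; the records, field names, and dates are hypothetical, not from any vendor's data model.

```python
from datetime import date

# Hypothetical records: when a vendor patch became available vs. when we deployed it.
patch_records = [
    {"cve": "CVE-2025-1111", "patch_released": date(2025, 1, 10), "patched_on": date(2025, 3, 5)},
    {"cve": "CVE-2025-2222", "patch_released": date(2025, 2, 1), "patched_on": date(2025, 2, 8)},
]

def exposure_days(rec: dict) -> int:
    """Days the environment stayed exploitable after a fix existed."""
    return (rec["patched_on"] - rec["patch_released"]).days

# A single trend line for the board: is the patch gap shrinking or growing?
mean_exposure = sum(exposure_days(r) for r in patch_records) / len(patch_records)
```

Tracked over time, a number like `mean_exposure` turns "3 out of 4 breaches were preventable" from an industry statistic into a statement about your own remediation speed.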
The Project Glasswing Window: A Temporary Asymmetric Advantage
Here’s what I keep telling every CISO I talk to right now: you have a window. It’s not a big one. But it’s real, and it matters enormously what you do with it.
Anthropic’s decision to keep Mythos inside Project Glasswing means vetted organizations can find and fix vulnerabilities before the same offensive capability reaches threat actors at scale. That advantage is real today. But advanced AI has historically spread faster than anyone expected, through leaks, competitive replication, and parallel development. Don’t plan around this window staying open indefinitely.
What You Can Do With This Window
- Run continuous vulnerability discovery at a scale that was never economically viable before.
- Move from periodic snapshots to continuous coverage and actually see your real exposure, not a sample of it.
- Shorten the time between finding a vulnerability and fixing it, and get better signal on what’s genuinely exploitable.
- Stop triaging a backlog of theoretical CVEs. Start responding to validated, prioritized attack paths through your actual environment.
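To make the shift from "backlog of theoretical CVEs" to "validated, prioritized attack paths" concrete, here is a deliberately minimal sketch. All names, fields, and weights are illustrative assumptions, not any scanner's or vendor's actual scoring model; the point is only that validated exploitability and asset criticality, not raw CVE counts, should drive the queue.

```python
from dataclasses import dataclass

@dataclass
class Finding:
    cve_id: str
    validated_exploit: bool    # a tool actually exploited it in this environment
    asset_criticality: int     # 1 (low) .. 5 (crown jewels), from the asset inventory
    internet_reachable: bool

def priority(f: Finding) -> int:
    """Crude ordering: validated, reachable issues on critical assets jump the queue."""
    score = f.asset_criticality
    if f.validated_exploit:
        score += 10            # proof beats theory
    if f.internet_reachable:
        score += 5
    return score

findings = [
    Finding("CVE-2025-0001", validated_exploit=False, asset_criticality=2, internet_reachable=False),
    Finding("CVE-2025-0002", validated_exploit=True, asset_criticality=5, internet_reachable=True),
    Finding("CVE-2025-0003", validated_exploit=True, asset_criticality=3, internet_reachable=False),
]
queue = sorted(findings, key=priority, reverse=True)
```

Even a toy model like this reorders the work: the validated, internet-facing issue on a critical asset lands at the top, while the unvalidated finding on a low-value host drops to the bottom, regardless of its CVSS score.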
One thing worth saying clearly: AISI noted that Mythos performs best against weakly defended, poorly monitored environments. Organizations with mature logging, endpoint detection, active incident response, and solid patch management are a meaningfully harder target, even against these capabilities. The basics haven’t been made obsolete. They’ve become more important.
The End of the Point-in-Time Penetration Test
Let me be direct about something the industry has danced around for years. The annual pen test was never really enough. Most of us knew it. We just didn’t have a better answer at scale. Mythos changes that calculus completely.
Think about how a traditional engagement actually works. Scope negotiation. A kickoff call. A two week window. A report writing phase that eats a third of the timeline. And at the end, a PDF that captures a snapshot of your environment as it existed during a fixed window, which is already out of date by the time you read it.
Organizations inside Project Glasswing are running something fundamentally different: continuous, AI-assisted adversarial testing against production environments, with findings flowing directly into remediation workflows instead of sitting in a quarterly review pile. It’s not a report. It’s a live picture of exploitable exposure, updated as your environment changes.
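"Findings flowing directly into remediation workflows" has one unglamorous prerequisite: deduplication, because a continuous program will rediscover the same issue every cycle. The sketch below shows the idea under stated assumptions; `RemediationQueue` is a toy stand-in for a real ticketing integration (Jira, ServiceNow, etc.), and the finding fields are hypothetical.

```python
import hashlib

def fingerprint(finding: dict) -> str:
    """Stable ID so a re-discovered issue doesn't open a duplicate ticket."""
    key = f"{finding['asset']}|{finding['vuln']}"
    return hashlib.sha256(key.encode()).hexdigest()[:12]

class RemediationQueue:
    """Toy stand-in for a ticketing integration; tracks open work by fingerprint."""
    def __init__(self):
        self.open_tickets = {}

    def ingest(self, finding: dict) -> bool:
        """Open a ticket for a new finding; return False if it's already tracked."""
        fp = fingerprint(finding)
        if fp in self.open_tickets:
            return False
        self.open_tickets[fp] = finding
        return True

queue = RemediationQueue()
first = queue.ingest({"asset": "web-01", "vuln": "RCE in upload handler"})
rerun = queue.ingest({"asset": "web-01", "vuln": "RCE in upload handler"})
```

Without this step, continuous testing buries the remediation team in repeats; with it, each cycle only surfaces what is actually new or changed.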
Why AI Speed Must Be Paired With Human Adversarial Judgment
Here’s a question I ask CISOs who’ve already built continuous programs: do you actually trust what it’s telling you? Not just whether it’s finding things, but whether your team understands what those findings mean for your specific environment, your threat actor profile, your business?
I haven’t talked to a single CISO who thinks the defensive side of this problem can be solved with automation alone. The same logic applies to offense. A continuous adversarial program built entirely on tooling will find vulnerabilities. It won’t tell you which ones would actually hurt your business, or what a sophisticated adversary would do next once they’re in.
That context, the judgment that comes from experienced offensive operators who understand how threat actors actually move through infrastructure, becomes more valuable as AI capabilities spread, not less. The answer to better AI-powered attacks isn’t more automation on the defensive side. It’s smarter humans using better tools.
The Bottom Line: Build Toward This Reality Deliberately
The bar has moved. That’s not a scare tactic. It’s just true.
The CISOs who navigate this well won’t be the ones who treat Mythos as pure threat narrative, and they won’t be the ones who hand the whole problem to automated tooling. Both of those responses miss the point. The organizations that come out ahead will be the ones who pair the scale and speed of AI-assisted discovery with the adversarial judgment that only experienced people can supply.
They’ll know which vulnerability chains actually produce business impact in their specific environment. Their offensive operators will understand how a threat actor moves through their infrastructure, not just which CVEs are technically exploitable. They’ll operate at a level that genuinely wasn’t possible 18 months ago. The question isn’t whether this changes your program. It already has. The question is whether you’re building toward it on purpose.
Is Your Program Built for This Threat Environment?
At Evolve Security, we work with organizations that are done with the snapshot model and ready to build something that actually reflects their real exposure. If that sounds like where you are, we’d love to connect.
→ Request an advisor to reach out at evolvesecurity.com/contact.
Frequently Asked Questions About Claude Mythos and AI Cybersecurity
What is Claude Mythos?
Claude Mythos is Anthropic’s most advanced AI model, announced April 7, 2026. It can autonomously discover zero-day vulnerabilities across major operating systems and browsers, without expert guidance. Anthropic considered it too dangerous to release publicly and instead gave approximately 50 vetted organizations access through Project Glasswing, specifically for defensive security work.
Why did Anthropic decide not to release Claude Mythos?
Anthropic held back Claude Mythos because of how capable it proved to be offensively. Engineers with no formal security background were asking it to find remote code execution vulnerabilities and waking up the next morning to complete, working exploits. That’s not a capability you release into the wild without serious safeguards in place.
What is Project Glasswing?
Project Glasswing is Anthropic’s restricted-access program that gives roughly 50 organizations, including Amazon, Apple, Google, Microsoft, and CrowdStrike, access to Claude Mythos Preview for defensive security purposes. Anthropic committed $100 million in compute credits to the effort, which gives you a sense of how seriously they took the urgency of the defensive use case.
How capable is Claude Mythos at offensive security?
Very. According to the UK AI Security Institute’s independent evaluation, Claude Mythos Preview succeeded on expert-level penetration testing simulations 73% of the time, on tasks that no AI model could complete just a year earlier. It also became the first AI model to complete AISI’s 32-step corporate network attack simulation end to end, averaging 22 of 32 steps and doing it in a fraction of the time estimated for a skilled human team.
Is an annual or quarterly penetration test still enough?
No, and honestly it was already a stretch before Mythos. The window between vulnerability discovery and weaponized exploitation has effectively closed. Quarterly and annual assessments give you a snapshot. Against AI-powered adversaries who can move overnight, a snapshot isn’t enough. Continuous adversarial testing is the new baseline.
Will AI replace human penetration testers?
No. AI dramatically improves the scale and speed of vulnerability discovery, but it can’t replace the human judgment required to understand what a finding actually means for your specific business, your threat actor profile, or your board conversation. The best programs pair AI-assisted discovery with experienced offensive operators who can provide that context.
What should CISOs do right now?
Four things: shift from periodic to continuous adversarial testing; shorten the gap between finding a vulnerability and fixing it; invest in the fundamentals (logging, endpoint detection, patch management) that make your environment a harder target even against AI-powered attacks; and make sure your continuous program includes human adversarial judgment, not just automated tooling.






