Logo
FrontierNews.ai

How a Jailbreak Triggered the First Government Takedown of a Live AI Model

On June 12, 2026, the US government issued an export control directive ordering Anthropic to immediately disable two of its newest AI models, Claude Fable 5 and Claude Mythos 5, worldwide. This marks the first time a leading AI company has taken a publicly deployed model offline due to federal government intervention, setting a precedent that could reshape how the industry handles model releases and government oversight.

What Triggered the Government's Action?

Commerce Secretary Howard Lutnick sent a letter to Anthropic CEO Dario Amodei citing national security concerns. According to an administration official, the Commerce Department decided to act after another company claimed it could jailbreak Mythos 5, a model designed to find software vulnerabilities. The administration had previously asked Anthropic to pause the release, but the company proceeded anyway, prompting the export control letter.

The jailbreak in question involved a narrow technique that Anthropic later argued was widely available from other publicly available models, including OpenAI's GPT-5.5. Despite this, the government maintained that the model needed to remain locked down until the US national security apparatus could be hardened, an official said could happen within weeks.

Why Are Fable 5 and Mythos 5 So Significant?

Both models launched just days before the ban, making this one of the most abrupt model pullbacks in AI history. Fable 5 represented Anthropic's first release of such an advanced offering to the general public, thanks to new safeguards designed to block responses in high-risk areas like cybersecurity. Mythos 5, available to a separately vetted set of organizations, operates with some of those constraints removed.

Before launch, Anthropic subjected both models to thousands of hours of red-teaming by the US government and the UK, suggesting the company had taken extensive precautions. Yet the government's concern over a single jailbreak technique was enough to trigger a complete worldwide shutdown affecting hundreds of millions of potential users.

How Does This Ban Affect Anthropic and the Broader AI Industry?

The practical effect of the ban is severe. Because Anthropic cannot reliably separate foreign nationals from its user base in real time, the company had to implement a hard shutoff of both models worldwide. All other Anthropic models, including Claude Opus, Claude Sonnet, and Claude Haiku, remain available.

Anthropic is complying with the legal directive but has publicly disagreed with it. The company stated that it believes the government should have the ability to block unsafe deployments through a transparent, fair, and technically grounded statutory process, but argued this action does not meet those standards. Anthropic emphasized that the safeguards in Fable are so strong that many users have complained they are overly broad.

Steps to Understand the Broader Context of This Decision

  • The February 2026 Conflict: The Trump administration ordered federal agencies to stop using Anthropic technology after the company refused to grant unrestricted access, citing concerns about mass domestic surveillance and lethal autonomous weapons development.
  • The Pentagon's Supply Chain Label: The Department of Defense declared Anthropic a supply chain risk, a designation historically reserved for foreign adversaries, requiring defense contractors to certify they would not use Claude models in military work.
  • The March Court Injunction: A US judge ruled the Pentagon had no legal ground to label Anthropic a supply chain threat, temporarily slowing the administration's efforts but not stopping them entirely.

The latest ban represents a shift in legal mechanism. Rather than relying on supply chain designations or contract terminations, the government used export controls, a tool traditionally applied to sensitive technologies and foreign adversaries.

What Are Industry Experts Saying About This Precedent?

Reactions across the tech community have been sharp and divided. Dean Ball, an AI policy expert who briefly served in the Trump administration but is critical of its recent Anthropic decisions, expressed confusion about whether this represents targeted legal action against Anthropic specifically or broader national security concerns. He noted the apparent contradiction in an administration that supports exporting advanced AI chips to China while also seeking to restrict the best US AI models from non-Americans.

"We disagree that the finding of a narrow potential jailbreak should be a cause for recalling a commercial model deployed to hundreds of millions of people," Anthropic stated in response to the ban.

Anthropic, in official statement regarding the export control directive

The broader implication is significant for every AI company building frontier models. If the government can pull one model citing a narrow, non-universal jailbreak, it establishes a precedent that could theoretically apply to any model in the industry. This raises questions about the balance between national security oversight and commercial innovation in the rapidly evolving AI landscape.

The Anthropic AI models ban also reflects a larger shift happening across the cybersecurity landscape in 2026, where governments are increasingly treating AI tools as strategic security assets rather than simply commercial software. How this precedent shapes future government-industry relations in AI development remains to be seen.