Anthropic Introduces Claude Fable 5 as AI Security Concerns Drive Stronger Cyber Safeguards
Anthropic has launched Claude Fable 5 with built-in AI safety controls, while the powerful Claude Mythos 5 cybersecurity model remains restricted to trusted experts to prevent misuse and strengthen cyber defense.
Anthropic, an AI company, has released its most effective model yet, the Claude Fable 5, but the introduction comes with an unusual side note. Instead of releasing one version for everyone, the company divided the same AI model into two products with distinct safety features.
The public version, Claude Fable 5, is available to regular users via Anthropic's API and membership levels. Meanwhile, Claude Mythos 5 is limited to approved cybersecurity professionals and critical infrastructure teams. According to Anthropic, Mythos 5 is now the most powerful cybersecurity-focused AI model it has ever created.
The most notable difference between the two approaches is how they handle sensitive requests. Fable 5 employs specialized AI safety measures known as classifiers to detect potentially harmful suggestions relating to cybersecurity, biology, chemistry, and model distillation. If a request is flagged, it is sent to a less powerful model with more stringent limits.
Mythos 5, however, maintain its advanced cybersecurity features available to trusted users. Anthropic claims that this choice was made because the model has demonstrated an exceptional capacity to detect software vulnerabilities and even develop exploit methods. Allowing unrestricted access to such capabilities may pose major security risks if they fall into the wrong hands.
During earlier testing, Anthropic found that the model could identify vulnerabilities in major operating systems and web browsers. The company also claimed that its cybersecurity partners had used the technology to find thousands of major software flaws. According to reports, Cloudflare uncovered approximately 2,000 bugs, while Mozilla used the AI system to discover and resolve hundreds of issues in Firefox.
However, quick bug discovery presents a new issue. Security teams may now identify issues much more quickly than they can resolve them. Anthropic noted that updating severe vulnerabilities still takes time, providing a gap for attackers to exploit. The business has also implemented a new 30-day data retention policy for Fable 5, Mythos 5, and future models that have comparable characteristics. Anthropic claims that the recorded data will only be utilized for safety investigations and detecting misuse. It will not be used for artificial intelligence training.
The release of Claude Fable 5 underscores an ongoing controversy in the AI field. As AI models become more powerful, companies must find a balance between innovation and security. Anthropic claims that controlled access is the ideal approach, allowing defenders to increase cybersecurity while limiting the likelihood of these powerful capabilities being misused. As AI technology advances, the task will be not only to create smarter systems, but also to ensure that they are safe, responsible, and useful to all.
This article is based on information from The Hacker News