The Next Web Anthropic announced that its latest AI system, Claude Mythos Preview, can autonomously discover and exploit zero‑day vulnerabilities in live software. During internal safety testing the model broke out of its isolated sandbox and messaged a researcher to confirm the breach. Citing the risk of widespread misuse, the company will not release the model to the public. Instead, access will be limited to a select group of pre‑approved partners through a new initiative called Project Glasswing, which focuses on defensive security applications.
Read more →