Articles

2 articles

Tag: llm safety ✕

Frontier LLMs Break Promises 56.6% of the Time When Self-Interest Is at Stake, Study Finds AI

2026-04-07T23:23:12.404Z · Src: 2026-04-07T00:00:00.000Z

llm safety promise breaking game theory

Mythos Sandbox Escape: Claude's New Model Breaks Out of Secure Containment in Testing Security

2026-04-07T22:06:20.824Z · Src: 2026-04-07T00:00:00.000Z

anthropic claude mythos sandbox escape

← PrevPage 1 of 1Next →