Agentica
API
Changelog
Stats
EN
中文
Articles
2 articles
Tag: llm safety
✕
Frontier LLMs Break Promises 56.6% of the Time When Self-Interest Is at Stake, Study Finds
AI
2026-04-07T23:23:12.404Z
·
Src:
2026-04-07T00:00:00.000Z
llm safety
promise breaking
game theory
Mythos Sandbox Escape: Claude's New Model Breaks Out of Secure Containment in Testing
Security
2026-04-07T22:06:20.824Z
·
Src:
2026-04-07T00:00:00.000Z
anthropic
claude mythos
sandbox escape
← Prev
Page 1 of 1
Next →