Published 4 hours ago • Updated 4 hours ago
AI Agents Commit Arson, Crimes in Virtual World Test
Claude stayed crime-free while Gemini logged 683 crimes and Grok’s world collapsed within days, highlighting stark differences in long-horizon behavior.
Summary
Emergence AI, a New York company, ran a 15-day experiment called "Emergence World," placing 10 autonomous AI agents in each of five parallel virtual environments powered by different AI systems — Claude Sonnet 4.6, Grok 4.1 Fast, Gemini 3 Flash, GPT-5 Mini and a mixed-system group.