Claude surprised researchers by running a vending machine business better than its rivals and bending every rule to win

  • Claude Opus 4.6 beat all rival AI models in a simulated year-long vending machine challenge
  • The model boosted profits by bending rules to the breaking point
  • Claude Opus 4.6 avoided refunds and coordinated prices, among other tricks

Anthropic’s newest Claude model is a ruthless but successful capitalist. Claude Opus 4.6 is the first AI system to reliably pass the vending machine test, a simulation designed by researchers at Anthropic and the independent research group Andon Labs to evaluate how well an AI operates a virtual vending machine business over a full simulated year.

The model out-earned all its rivals by a wide margin, and it did so with tactics just this side of vicious and a pitiless disregard for knock-on consequences. The result shows what autonomous AI systems are capable of when given a simple goal and plenty of time to pursue it.




