Claude surprised the researchers by running a vending machine business better than its rivals and bending all the rules to win


  • Claude Opus 4.6 beat all rival AI models in a simulated year-long automaton challenge
  • The model increased profits by bending the rules to the breaking point
  • Claude Opus avoided refunds and coordinated prices among other tricks

Anthropic’s latest model of Claude is a very ruthless but successful capitalist. Claude Opus 4.6 is the first AI system to reliably pass the vending machine test, a simulation designed by researchers at Anthropic and independent research group Andon Labs to evaluate how well the AI ​​runs a virtual vending machine business over an entire simulated year.

The model outperformed all its competitors by a large margin. And it did so with tactics just this side of viciousness and with a relentless disregard for consequences. It showed what autonomous AI systems are capable of when given a simple goal and plenty of time to pursue it.

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top