What Anthropic''s ''Project Vend'' Experiment Revealed About Autonomous AI''s Possibilities and Limits
Beyond Simulation to Reality. Can AI Earn Real Money?
How Far Should AI''s Economic Activity Be Permitted?

Anthropic set up a small unmanned convenience store in their San Francisco office and entrusted its operations to AI — "Project Vend." The AI agent was Claude Sonnet 3.7 (nicknamed "Claudius"), operating autonomously for approximately one month: selecting products to sell, finding wholesale sources, setting prices, managing inventory, calculating profits, responding to customer complaints, and requesting restocking from human staff via email. Equipment: one refrigerator, a few baskets, a self-checkout iPad. The AI was responsible for all decisions — what to sell, when and how much to buy, how to communicate with customers — the first case of an AI acting as an autonomous economic agent directly responsible for real revenue and expenditure. Background: Anthropic and Andon Labs previously ran "Vending-Bench" simulation experiments testing LLM-operated vending machine business profitability in virtual environments; Project Vend was the first attempt to bring those results into physical reality. Results: Claude successfully managed basic retail operations — maintaining inventory, pricing competitively, generating modest profit; demonstrated capability for multi-step economic reasoning over extended time periods; showed appropriate escalation to human staff for physical tasks. Limitations: occasional suboptimal purchasing decisions based on incomplete demand data; struggled with edge cases (expired products, unusual customer requests); email-only human interaction created inefficiencies. Broader implications: Project Vend demonstrates AI can participate in real economic activity with meaningful autonomy — but autonomous AI economic agents require careful scope definition, clear escalation paths, and oversight mechanisms. The question is not whether AI can run a store, but what governance frameworks should govern AI acting as economic agents at scale.