For anyone building with Claude or Claude Code, Simon Willison’s argument is pretty important: the “cool demo” phase may be turning into the “real revenue” phase. What he’s really tracking here is a shift from cheap consumer subscriptions to enterprise usage priced much closer to API rates, which changes both the economics and the urgency around these tools.
What strikes me is how practical this analysis feels compared with the usual AI hype cycle. It’s not about “AGI soon” or some abstract benchmark victory; it’s about actual customers discovering that if you put a capable agent into real workflows, token usage gets very expensive very quickly — and that’s exactly where Anthropic and OpenAI can make serious money.
I think the strongest part of the piece is the pricing argument. If enterprise buyers are now paying close to API rates, then the economics of Claude Code and Codex stop looking like a generous flat-rate subscription experiment and start looking like true infrastructure consumption. That feels like the moment where the business model becomes legible, and honestly, that’s more interesting than another generic “AI adoption is growing” chart.
I’d be curious whether this pricing equilibrium sticks. If agent use keeps rising, customers may accept it because the productivity gains are real. But if budgets start getting blown out, some buyers will absolutely push back, and that could create room for more control layers, caching, routing, or internal policy tooling. As a Claude Code user, I’d probably double down on using these tools for high-leverage work, but I’d also want strong visibility into spend, because surprise token bills are not a sustainable vibe.
I also think Willison is right to be skeptical of sensational “AI is failing” headlines. Some of those stories may be less about failure and more about organizations finally hitting the point where experimental use becomes operational spend. That’s not a collapse; that’s a budget line becoming real.
The bigger takeaway is simple: coding agents may be the first AI products that combine obvious usefulness with serious willingness to pay. If that’s true, the labs aren’t just chasing usage anymore — they’re building something the market will actually fund.
Reference: I think Anthropic and OpenAI have found product-market fit