AI was supposed to cut costs. Microsoft and Uber are finding it is more expensive than paying human employees

Microsoft has reportedly begun cancelling the majority of its direct Claude Code licences, redirecting its engineering workforce towards GitHub Copilot CLI instead. The reversal comes only six months after the technology giant opened access to Claude Code across thousands of its developers, project managers, designers, and other staff, encouraging broad experimentation with AI-assisted coding.

Adoption was swift and enthusiastic. It was, perhaps, too swift. The sheer scale at which employees embraced the tool has now prompted the firm to pull back on technology its own engineers had grown to depend on, according to a report by The Verge.

The decision does not affect Microsoft's broader commercial relationship with Anthropic. The company's Foundry deal, which includes investment of up to $5 billion in Anthropic and grants Foundry customers access to Claude models, remains intact, as does Anthropic's $30 billion commitment to purchase Azure compute capacity.

Uber Burned Through Its Entire 2026 AI Budget in Four Months

Microsoft is not an isolated case. Uber's chief technology officer, Praveen Neppalli Naga, told The Information in April that the ride-hailing company had exhausted its entire 2026 AI coding tools budget within just four months of the year.

The disclosure is particularly striking given that Uber had been actively stoking adoption, deploying internal leaderboards to rank teams by their AI tool usage.

The pattern across both companies points to a tension that has received little attention in discussions about workplace AI: the harder firms push employees to use the technology, the faster costs accumulate.

AI Token Economics: Why Cheaper Prices Are Not Leading to Cheaper Bills

At the heart of the problem is how AI compute is priced. Large language models charge per token, the basic unit of text the model processes and generates, according to Fortune. Under this model, greater efficiency and greater use are financially indistinguishable: both drive up total spend.

Several large technology companies have been actively pushing token consumption higher. Amazon has encouraged staff to "tokenmaxx," a term meaning to use as many AI tokens as possible. At Meta, an employee created an internal tracking tool named "Claudeonomics" to monitor which workers were using AI most heavily.

Goldman Sachs has forecast that agentic AI systems, those that act autonomously across multiple steps rather than responding to single queries, could drive a 24-fold increase in token consumption by 2030, reaching 120 quadrillion tokens per month as enterprises deploy AI agents at scale.

The unit price of those tokens is expected to fall significantly. Research firm Gartner projects that by 2030, running inference on a one-trillion-parameter large language model will cost AI providers nearly 90% less than it did in 2025. But Gartner cautioned that this price deflation will not translate into lower enterprise bills. Agentic models require substantially more tokens per task than standard models, consumption growth can outpace falling unit costs, and AI providers are unlikely to pass through the full benefit of cost reductions to business customers.

"Chief Product Officers should not confuse the deflation of commodity tokens with the democratisation of frontier reasoning," said Will Sommer, senior director analyst at Gartner.

The Cost of Compute Is Already Exceeding the Cost of Employees

Perhaps the most striking acknowledgement of AI's cost problem came from within the technology industry itself. Bryan Catanzaro, vice president of applied deep learning at Nvidia, addressed the issue directly in a recent interview with Axios.

"For my team, the cost of compute is far beyond the costs of the employees," Catanzaro said.

The comment carries weight given Nvidia's position as the primary supplier of the chips that power AI infrastructure globally. It suggests the economics of substituting or augmenting human labour with AI may be considerably more complicated than early forecasts implied.

Agentic AI Ambitions Face a Harsh Financial Reality

The cost pressures arriving now stand in contrast to the expansive visions of AI deployment that technology executives have been articulating publicly. Nvidia chief executive Jensen Huang has said he anticipates 100 AI agents working alongside every human employee at his company one day.

Jensen Huang is part of a broader chorus of corporate leaders championing an agentic future in which digital workers carry out tasks across the enterprise with limited human oversight.

If token consumption continues to rise faster than unit costs decline, that future may arrive with a far heavier financial burden than executives have publicly acknowledged, Fortune predicts. The early signals from Microsoft, Uber, and others suggest that at current pricing and usage patterns, the economics of large-scale AI deployment remain deeply unresolved.