The team bought more tokens. The agent still gave confident, wrong answers. The bottleneck was never compute.

I keep seeing AI budgets pour into bigger models and more tokens while the thing actually blocking the agent sits upstream: it has no reliable way to know what your data means.

An agent answering a business question needs three things most stacks don’t have ready:

  • A semantic layer. One governed definition of “active user,” “revenue,” “churn,” so the agent isn’t guessing which of four tables is the real one.
  • Governance and access. What this agent is allowed to see, and a way to prove it later. Added after launch, this becomes the thing that stalls the rollout.
  • A protocol for tools and context. Something like MCP, so the agent reaches data through a defined interface instead of a pile of bespoke glue.

That’s all data foundation work, the kind that’s been undervalued for a decade and now has a deadline attached, because the agent exposes every gap instantly. Compute doesn’t touch any of it.

The order that works: get definitions, governance, and access right first. Then the model has something trustworthy to reason over. Skip it and a bigger model gets to the wrong answer sooner. That’s all you bought.

Before the next compute invoice clears, find out what the agent reads from and whether a single person owns it.

Does your AI feature read from governed definitions, or from whichever table someone wired up first?