Chapter 40 · The Next Decade — The Agentic Enterprise

In 2030, the question "are you ready for agentic AI?" will be as dated as "are you ready for cloud?" It will not be because the question was wrong — it was the right question for its moment — but because readiness will have been folded into the ordinary operations of a competent enterprise. The organisations that spent the years between 2025 and 2030 building institutional maturity — governance structures, evaluation cultures, integration discipline — will find that the new generation of agentic capability lands on fertile ground. The organisations that spent those years in perpetual pilot mode will find it landing on sand.

Agent-as-employee accountability frameworks

The most significant institutional shift of the next decade will not be technical; it will be legal. The question of who is accountable when an agent causes harm — the enterprise that deployed it, the model provider that built the underlying model, the vendor whose tool the agent called, or the human who approved the deployment — is currently answered inconsistently by different jurisdictions and differently by different courts. By 2030, this inconsistency will have been reduced, not by a single global standard but by a set of converging national frameworks that share a common architecture: the enterprise that deploys the agent is the accountable party, in the same way that the employer of a human employee is the accountable party for that employee's actions within the scope of their employment.

This framing — agent as employee, enterprise as employer — has significant implications for how agents are designed and deployed. An employer is accountable for the actions of an employee only within a defined scope; actions outside that scope may shift liability to the employee. The analogue for agents is the agent's permission model: actions within the defined permission scope are the enterprise's responsibility; actions that exceed that scope because of a failure in the enterprise's governance model are still the enterprise's responsibility, because the enterprise owns the governance model. The EU AI Act's accountability provisions are a first approximation of this framework; by 2030, they will have been refined through enforcement actions and court decisions into something more precise.

Agent identity and credentials standards mature

By 2030, the problem of agent identity — currently solved ad hoc by each enterprise, with service accounts, API keys, and OAuth tokens assigned in ways that reflect the convenience of the deployment rather than a principled model — will have been addressed by mature open standards. The trajectory is already visible: the Model Context Protocol has established a lingua franca for tool access; the A2A protocol is extending it to agent-to-agent communication; and identity standards bodies are beginning to publish guidance on machine-identity scope and lifecycle management for AI systems.

The 2030 state will likely include: a standardised format for agent identity credentials that is legible to identity providers, audit systems, and regulatory inspection tools; a scope language that allows the enterprise to express, in machine-readable form, exactly what an agent is authorised to do and under what conditions; and a lifecycle management protocol that governs how credentials are issued, rotated, suspended, and revoked as the agent programme evolves. Anthropic's Responsible Scaling Policy and similar commitments from other frontier model labs will, by 2030, have been translated into interoperable technical standards rather than individual vendor commitments.

"Identity is the unsexy problem that determines everything. The enterprises that built their agent identity model early — before the standards existed — will spend 2027 migrating to the standard. The enterprises that waited will adopt the standard on day one. Neither will have a competitive advantage; both will have a functional system." — observed at an enterprise AI architecture summit, 2025.

Agent-native procurement clauses

The procurement contracts of 2030 will look different from those of 2025 in ways that reflect the maturing understanding of agent-specific risk. Today's contracts are written with human software in mind and adapted (imperfectly) for AI. Tomorrow's contracts will include native clauses for: model behaviour change notification, specifying the enterprise's right to receive advance notice of any change to the model's behaviour above a defined threshold of deviation from the baseline evaluation; agentic scope limitation, specifying the maximum permission scope the provider's agent runtime is permitted to request; incident correlation, specifying the provider's obligation to assist in post-incident root-cause analysis when a model output is implicated in a harm; and evaluation interoperability, specifying the provider's commitment to support the enterprise's evals harness against the provider's model API.

These clauses will not emerge from regulatory fiat alone; they will emerge from the accumulation of incidents, post-mortems, and legal precedents produced by the agent deployments of the mid-2020s. The organisations that negotiated the early contracts, discovered the gaps, and documented what they wished they had insisted on — the programme teams that wrote their procurement playbooks in year one and updated them every year — will be the organisations whose lawyers write the first industry-standard agent contract templates.

Multi-agent labour markets

By 2030, the most sophisticated enterprise agent programmes will not be operating a collection of individual agents; they will be operating agent networks — ensembles of specialised agents that collaborate on complex tasks, with one agent decomposing a goal, delegating to specialised sub-agents, and synthesising the results. This architectural pattern is already emerging in research contexts; its enterprise deployment will require governance frameworks that do not yet exist.

The governance challenge for agent networks is the chain-of-custody problem: when a network of five agents collaborates to produce an output, and that output is flawed or harmful, which agent in the chain is the failure point? The answer requires a full trace of the agent network's execution — which agent made which decision, with which inputs, at which step — and an evals framework that can test the network's behaviour as a system, not just the behaviour of each agent in isolation. MIT CISR research on multi-system risk suggests that the failure modes of agent networks will be qualitatively different from the failure modes of individual agents — more like the failure modes of organisational processes than the failure modes of individual software components.

The lab-grade evaluation regime as commodity

The evals regimes of 2025 — custom test suites, LLM-graded subjective evaluations, red-team exercises run by hand — are sophisticated and expensive. By 2030, they will be commodities. The trajectory is already established: evaluation frameworks are being open-sourced, benchmark datasets are being standardised by bodies like NIST, and the AI safety research community is moving toward evaluation-as-a-service offerings that allow enterprises to run comprehensive safety and reliability evaluations without maintaining a dedicated evals team.

The commoditisation of evals will lower the barrier to entry for smaller enterprises that currently cannot afford a dedicated evals function. It will also raise the floor: an enterprise that cannot demonstrate that its agents have passed a standardised evaluation battery will face increasing scrutiny from regulators, insurers, and enterprise customers. By 2030, the evals pass report may be as standard a supplier qualification document as an ISO 9001 certificate is today — not a differentiator but a minimum expectation.

What ready looks like in 2030

The agentic enterprise of 2030 is not the organisation with the most agents or the largest model investment. It is the organisation that has built the institutional capacity to govern, operate, and improve its agent estate continuously — the organisation for which the maturity model, the scorecard, the operating model, and the economic framework described in this part of the book are not aspirational tools but operational defaults.

Ready looks like this: a governance model that is tested quarterly, not written annually. An evals culture that treats a failure in a test suite as a success of the evaluation process, not a failure of the team. A procurement playbook that is updated after every significant vendor negotiation. An operating model that has been reviewed and deliberately chosen, not inherited by default. A workforce that understands the agents it oversees well enough to catch the failures the evals missed. And a board that can answer, in a single conversation, where the enterprise sits on the maturity model, what risks are open, and what the programme will look like in twelve months.

This is not a vision of an enterprise run by machines. It is a vision of an enterprise in which machines are trusted enough to act, watched closely enough to be caught when they fail, and governed carefully enough that the consequences of those failures are bounded. That is what readiness means — not the absence of risk but the institutional capacity to manage it. The organisations that build that capacity in this decade will not merely survive the agentic transition; they will define what it means to operate well within it.