The Agentic Enterprise  ·  Chapter 23
Chapter 23

Human in the Loop

Where supervision belongs, where it doesn't, and how to design the handoff

The phrase "human in the loop" has become a piece of governance vocabulary so widely invoked that it has nearly lost its meaning. Organizations declare that their agentic systems have human oversight; regulators require that high-risk AI systems be subject to human oversight; governance charters specify that certain agent actions require human approval. But "human in the loop" is not a single thing — it is a spectrum of arrangements that range from a human who reads every output before it is used to a human who is nominally available to intervene but in practice never does. The difference between those extremes is the difference between meaningful oversight and a compliance fiction, and the difference matters enormously for both the risk profile of the deployment and the organization's legal standing under instruments like the EU AI Act's Article 14.

The Human Oversight Spectrum

Human involvement in an agentic workflow exists on a spectrum defined by the timing and scope of human intervention. At one end, human-on-the-loop supervision: the agent executes autonomously, and a human reviews the outputs after the fact, with the ability to intervene if something is wrong. This arrangement provides genuine accountability but limited risk mitigation: by the time the human reviews, the agent's actions may already have caused harm that cannot be reversed. At the other end, human-in-the-loop approval: the agent pauses at defined checkpoints and a human must actively approve before the workflow continues. This arrangement provides strong risk mitigation but adds latency and reduces throughput — it is appropriate for high-stakes, low-volume decisions but unsuitable for high-volume, time-sensitive workflows.

Between these extremes lies a range of intermediate arrangements: conditional approval (the agent proceeds autonomously unless it flags uncertainty or detects a condition that triggers human review), statistical sampling (a random sample of agent outputs is reviewed by humans, with the sample rate calibrated to the risk level), and exception handling (the agent proceeds autonomously unless it encounters an error or an edge case that it has been trained to escalate). Each of these arrangements represents a different tradeoff between oversight effectiveness and operational efficiency, and the right arrangement depends on the specific risk profile of the task, the organization's risk tolerance, and the regulatory requirements that apply.

Designing Effective Checkpoints

A checkpoint is a point in the agent's workflow where execution pauses and a human is asked to review the state and approve (or modify, or abort) the continuation. Checkpoints are only as valuable as their design: a checkpoint that presents the human with an incomprehensible dump of agent state, asks for approval within a ten-second timeout, or is positioned after the agent has already taken the consequential action it was supposed to pause before taking is not a meaningful control. Effective checkpoint design requires getting three things right.

First, the checkpoint must be positioned before the consequential action, not after it. An agent that sends an email and then asks for approval has already done the thing the checkpoint was supposed to prevent. This sounds obvious but is surprisingly often violated in practice, either because the developer did not think carefully about which actions are consequential or because the agent's planning logic re-orders the steps in a way that bypasses the intended checkpoint position. The governance charter's decision authority matrix should specify exactly which action types require a pre-execution checkpoint, and the orchestration framework should enforce those specifications at the workflow level.

Second, the checkpoint must present the human reviewer with the information they need to make a meaningful decision in a reasonable amount of time. A reviewer who is asked to approve an agent action without seeing the reasoning that led to it, the data the agent used, and the expected consequences of approving versus declining is being asked to perform a ceremonial function rather than a genuine oversight function. The checkpoint interface should summarize the agent's current state, its proposed next action, and the key factors that led to it — in plain language, in under a minute of reading time.

Third, the checkpoint must be designed for the expected review volume. A checkpoint that generates ten reviews per day can afford thorough human review of each case. A checkpoint that generates a thousand reviews per day requires a triage system that routes the most important cases to human attention and allows routine cases to proceed with lighter oversight. The triage system itself is a form of automation, which means it must be governed and monitored just like the agent it is supporting.

"A checkpoint that no one has time to review carefully is not oversight — it is approval theater. The governance value of a checkpoint is determined not by its existence but by the quality of the reviews it generates."

The Handoff Protocol

The moment at which an agent hands work to a human — and the reverse, when a human returns work to an agent — is one of the highest-risk points in a hybrid human-agent workflow. At the handoff point, context must be transferred accurately, responsibility must be clearly established, and the receiving party must have the information they need to continue without re-doing the work that the handing-off party has already completed. Handoffs that fail at any of these three requirements produce gaps in coverage — work that falls between the agent and the human — and conflicts in responsibility where both parties believe the other is handling something that neither is handling.

The handoff protocol should be a formal specification: what information is transferred, in what format, through what channel, with what confirmation that it was received and understood. For agent-to-human handoffs, this typically means a structured summary of the agent's work to date, a clear statement of what action or decision the human is being asked to take, a deadline for the human's response (after which the agent either escalates further or takes a defined default action), and a mechanism for the human to ask clarifying questions and get responses. For human-to-agent handoffs, it means a clear instruction set, a defined scope, and an explicit authorization for the actions the agent is permitted to take.

When to Remove the Human

The long arc of a maturing agentic program is a gradual reduction in the frequency and scope of human-in-the-loop checkpoints as evidence accumulates that the agent can be trusted with more autonomous operation. This reduction should be earned through demonstrated performance — specific evidence that the agent meets defined accuracy thresholds on the classes of action being considered for autonomous operation — not assumed on the basis of general impressions of the agent's capability. The governance charter should specify the evidentiary standard required to reduce oversight for each tier of action: what accuracy threshold must be sustained, over what volume of examples, over what time period, before a checkpoint can be removed or made conditional rather than mandatory.

The removal of a checkpoint is a governance decision, not an operational one. It requires the same three-owner sign-off as the original deployment — business owner, technical owner, data owner — and it should be documented in the charter with a rationale and an effective date. The decision should be revisited when there is evidence that the agent's behavior has changed — a new model version, a significant change in the input distribution, or an anomaly in the observability metrics — because behavior that was trustworthy under the original conditions may not be trustworthy under changed conditions.