Autonomous action with sampled oversight
Bounded and reversible. Draft a reply, classify a ticket, or issue a refund below an approved low-value limit.
06 · HITL PROGRAM
A human-oversight program can include bounded autonomous actions, sampled audit, single-review approval, and dual-control approval. HITL applies only when a human must approve an action before it commits. This page shows how to choose that route and which controls implement it.
Route by consequence and reversibility, not by confidence alone. Confidence, moderation, and novelty signals may influence the route; they do not prove an action is safe.
Bounded and reversible. Draft a reply, classify a ticket, or issue a refund below an approved low-value limit.
Material but reversible. Send a customer response, change workflow state, or approve a moderate-value refund.
High-impact or irreversible. Release funds, delete records, grant privileges, or transfer regulated data.
Unsafe or unevaluable. Flagged output, missing evidence, policy lookup failure, or a saturated review queue. No action commits.
Each proposed action is screened and assigned a route. Low-risk bounded actions may proceed under sampled oversight; medium- and high-impact actions enter true HITL before commit. Unsafe actions exit as refusals or quarantine. Hover over a box for a plain-language explanation; select it to open the control that implements that step.
Every proposed action is routed before execution. Only routes labelled HITL require a human approval before the action can commit.
Refuse when required evidence or a reliable policy decision is unavailable.
Match review requirements to consequence and reversibility; keep bounded autonomy distinct from HITL.
Classify proposed output independently; quarantine flagged decisions before release.
Require two-person approval for irreversible or high-consequence actions.
Give reviewers the proposed action, evidence, policy result, and routing rationale before approval.
Track reviewer fatigue indicators; route to fresh reviewers; mandate breaks when thresholds trip.
Label the AI role when the resulting action or communication is user-facing.
Bind approvals and release records to verifiable signatures.
Separate the actor from the recorder so decision history is harder to alter.
Capture overrides and reversals to adjust thresholds and policies.
Off-hours, surges, and vendor outages mean HITL queues do not always have a reviewer. The program must declare in advance what happens when the queue saturates:
Reviewer overrides and decision reversals are captured as signals in reviewer decision summaries and risk-prioritised queues. HITL feedback-loop calibration closes the loop: override events are batched, analysed for systematic patterns, and fed back into agent calibration: prompt updates, tool-scope policy changes, and divergence-monitor threshold tuning. Each calibration cycle requires human sign-off on the pattern report before any agent change is deployed.
This HITL program is an engineering interpretation related to the ACM Europe Technology Policy Committee's May 2025 policy brief (see Governance primer) and its proposal for alignment oversight. It is not text of Article 14 or of the brief. Helmwart applies the concept by making agent actions legible to reviewers; the linked flow controls, logging, and calibration measures provide that evidence.