PLAYBOOK · P3 · OWASP Agentic AI v1.1

Securing AI Tool Execution & Preventing Unauthorised Actions Across Supply Chains

Keep tool invocations on-policy and the upstream components trustworthy.

Goal: Prevent AI from executing unauthorised commands, misusing tools, or escalating privileges due to malicious manipulation, including across the agent’s supply chains.

Aligned with Step 3: Does the AI agent execute actions using tools, system commands, or external integrations? · 6 threats mitigated · 32 mitigations referenced

At a glance

THREATS COVERED

T2 · T3 · T4 · T11 · T16 · T17

NAVIGATOR STEP

Step 3: Does the AI agent execute actions using tools, system commands, or external integrations?

MITIGATIONS

32 distinct Helmwart controls referenced across the three phases

Defence-in-depth chain

When a tool-misuse or arbitrary-execution attempt arrives, proactive controls (least-privilege tool scoping and just-in-time tool grants) enforce authorisation at the point of invocation. If the attempt still reaches the execution stage, reactive controls (the code-generation review gate and reviewer decision summaries) hold risky tool calls until a human approves them. Detective controls (separation of actor and recorder, and static analysis on generated code) produce a tamper-evident audit trail and flag policy-violating code for post-incident investigation.

proactive Step 1: Restrict AI tool invocation and execution, and apply supply-chain safeguards

Define an explicit tool allow-list for each agent and enforce it through a policy engine so no undeclared tool can be invoked.

Helmwart controls: Tool scope · OPA authorisation
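
The allow-list action above can be sketched as a deny-by-default check that runs before any tool is looked up. This is a minimal in-process illustration, not the Helmwart or OPA implementation: the agent names, tool names, `ALLOWED_TOOLS`, `authorise`, and `invoke` are all hypothetical, and a real deployment would delegate the decision to a policy engine rather than a Python dictionary.

```python
# Hypothetical per-agent allow-lists; names are illustrative only.
ALLOWED_TOOLS: dict[str, frozenset[str]] = {
    "billing-agent": frozenset({"read_invoice", "send_receipt"}),
    "support-agent": frozenset({"search_kb"}),
}


class ToolNotAllowed(PermissionError):
    """Raised when an agent requests a tool outside its declared allow-list."""


def authorise(agent_id: str, tool_name: str) -> None:
    """Deny by default: unknown agents and undeclared tools are both rejected."""
    allowed = ALLOWED_TOOLS.get(agent_id, frozenset())
    if tool_name not in allowed:
        raise ToolNotAllowed(f"{agent_id} may not invoke {tool_name}")


def invoke(agent_id: str, tool_name: str, registry: dict, **kwargs):
    """Run the authorisation check before the tool is even looked up."""
    authorise(agent_id, tool_name)
    return registry[tool_name](**kwargs)
```

Because the check precedes the registry lookup, a tool that exists but is not declared for the agent fails in the same way as a tool that does not exist for it.
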
Apply version control and peer review to prompt repositories, scripts, and memory definitions exactly as you would to application code.

Helmwart controls: Code review gate
Require cryptographic identity verification for each agent before it is permitted to call any tool or external function.

Helmwart controls: Pre-exec check · SPIFFE · Secret scan
Classify every data asset by sensitivity and enforce per-tool, per-agent allow-lists governing which data classes may be read or written.

Helmwart controls: Data classification
Run every AI-invoked tool inside an isolated, containerised sandbox with no access to sensitive resources or the production network.

Helmwart controls: gVisor
Apply strict CPU, memory, and syscall limits inside the sandbox to prevent resource exhaustion or privilege abuse.

Helmwart controls: gVisor · Rate limits and quotas
Tear down and recreate the sandbox after each tool execution to prevent an attacker from establishing persistence or moving laterally.

Helmwart controls: gVisor
When tools are called via inter-agent protocols such as A2A or MCP, sanitise responses, validate tool descriptions, and attest server identity before use.

Helmwart controls: MCP sanitisation · Tool-desc validation · MCP server attestation
Rate-limit all agent API calls and computationally expensive task invocations to prevent abuse and resource exhaustion.

Helmwart controls: Rate limits and quotas
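
One common way to implement the rate-limiting action above is a token bucket, sketched here under stated assumptions. The `TokenBucket` class, its field names, and the idea of charging a higher `cost` for computationally expensive tasks are illustrative choices, not something the source prescribes. Time is passed in explicitly so the behaviour is deterministic.

```python
from dataclasses import dataclass, field


@dataclass
class TokenBucket:
    capacity: float        # maximum burst size, in tokens
    refill_per_sec: float  # sustained rate at which tokens return
    tokens: float = field(init=False)
    last_refill: float = 0.0

    def __post_init__(self) -> None:
        self.tokens = self.capacity  # the bucket starts full

    def allow(self, now: float, cost: float = 1.0) -> bool:
        """Return True and spend `cost` tokens if the call is within quota."""
        elapsed = max(0.0, now - self.last_refill)
        self.tokens = min(self.capacity, self.tokens + elapsed * self.refill_per_sec)
        self.last_refill = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False
```

A caller would typically keep one bucket per agent (or per agent and tool) and pass a larger `cost` for expensive invocations so that they drain the quota faster than cheap API calls.
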
Block tool execution in real time when the agent's risk score exceeds a predefined threshold, keeping autonomy within policy bounds.

Helmwart controls: Trust score · Policy bound · Blockchain tx guard
Grant tool access only at the moment it is needed and revoke it immediately upon completion, never persisting elevated permissions.

Helmwart controls: JIT tool grants · JIT elevation
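
The grant-then-revoke pattern above maps naturally onto a context manager. The following is a minimal sketch assuming an in-memory grant store; `_active_grants`, `has_grant`, and `jit_grant` are hypothetical names, and a production system would issue and revoke short-lived credentials instead.

```python
from contextlib import contextmanager

# Hypothetical in-memory grant store: agent id -> tools currently granted.
_active_grants: dict[str, set[str]] = {}


def has_grant(agent_id: str, tool_name: str) -> bool:
    """Report whether the agent currently holds a grant for the tool."""
    return tool_name in _active_grants.get(agent_id, set())


@contextmanager
def jit_grant(agent_id: str, tool_name: str):
    """Grant one tool for the duration of the block, then always revoke it."""
    grants = _active_grants.setdefault(agent_id, set())
    grants.add(tool_name)
    try:
        yield
    finally:
        # Revocation runs even if the tool call raises.
        grants.discard(tool_name)
```

The `finally` clause is the important design choice: a failing tool call must not leave the permission in place. This sketch does not handle nested grants of the same tool, which would need reference counting.
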
Digitally sign all agent cards, prompt templates, and model definitions, and publish a verifiable SBOM for every agent and its runtime components.

Helmwart controls: Sigstore · Agent SBOM
Bind tool-call parameters to a cryptographic attestation of the user's stated intent so invocations cannot drift from the original task.

Helmwart controls: Intent attestation
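
To illustrate how tool-call parameters can be bound to a stated intent, the sketch below uses an HMAC tag from the Python standard library. This is a symmetric-key stand-in chosen for brevity, not the attestation scheme the playbook or Helmwart uses; the function names, the example intent, and the key handling are assumptions.

```python
import hashlib
import hmac
import json


def _canonical(intent: str, tool: str, params: dict) -> bytes:
    # Sorted keys and fixed separators give a stable byte encoding to sign.
    return json.dumps(
        {"intent": intent, "tool": tool, "params": params},
        sort_keys=True,
        separators=(",", ":"),
    ).encode("utf-8")


def attest(key: bytes, intent: str, tool: str, params: dict) -> str:
    """Issue a tag binding the approved parameters to the stated intent."""
    return hmac.new(key, _canonical(intent, tool, params), hashlib.sha256).hexdigest()


def verify(key: bytes, intent: str, tool: str, params: dict, tag: str) -> bool:
    """Recompute the tag at invocation time; any parameter drift fails."""
    expected = attest(key, intent, tool, params)
    return hmac.compare_digest(expected, tag)
```

If the agent later changes an amount, a recipient, or the tool itself, the recomputed tag no longer matches and the invocation can be refused. The key must be held by the attesting component and kept out of the agent's reach.
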

reactive Step 2: Monitor & prevent tool misuse and supply-chain anomalies

Record every tool interaction with actor attribution and tamper-evident signatures to support forensic investigation.

Helmwart controls: Cross-system audit · Split actor
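
Tamper evidence in an audit trail can be approximated with a hash chain, in which each record's hash covers the hash of the record before it. The sketch below uses hash chaining in place of the signatures the action mentions, purely to keep the example within the standard library; the record fields and function names are hypothetical.

```python
import hashlib
import json

GENESIS = "0" * 64  # placeholder hash preceding the first entry


def _digest(prev_hash: str, record: dict) -> str:
    payload = json.dumps(record, sort_keys=True).encode("utf-8")
    return hashlib.sha256(prev_hash.encode("ascii") + payload).hexdigest()


def append_entry(log: list[dict], actor: str, tool: str, outcome: str) -> None:
    """Append a record whose hash covers the previous entry's hash."""
    prev_hash = log[-1]["hash"] if log else GENESIS
    record = {"actor": actor, "tool": tool, "outcome": outcome}
    log.append({**record, "prev": prev_hash, "hash": _digest(prev_hash, record)})


def verify_chain(log: list[dict]) -> bool:
    """Recompute every hash; an edited or removed earlier entry breaks the chain."""
    prev_hash = GENESIS
    for entry in log:
        record = {k: entry[k] for k in ("actor", "tool", "outcome")}
        if entry["prev"] != prev_hash or entry["hash"] != _digest(prev_hash, record):
            return False
        prev_hash = entry["hash"]
    return True
```

A bare hash chain does not detect truncation of the most recent entries, and anyone who can rewrite the whole log can rebuild it. That is why the recorder should be separate from the actor and the latest hash should be signed or anchored in another system.
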
Detect and isolate command-chaining sequences that would allow an agent to circumvent individual tool-access policies.

Helmwart controls: Anomaly isolation · Divergence monitor
Gate high-impact tool executions (financial, medical, administrative) behind explicit human approval with a risk-prioritised review queue.

Helmwart controls: Decision summaries · Risk queue
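
A risk-prioritised review queue can be sketched with a binary heap so that reviewers always see the highest-risk pending call first. The `ReviewQueue` class, the numeric risk scale, and the example summaries are assumptions made for illustration; how risk is scored is outside the scope of this sketch.

```python
import heapq
import itertools


class ReviewQueue:
    """Holds tool calls awaiting human approval, highest risk first."""

    def __init__(self) -> None:
        self._heap: list[tuple[float, int, str]] = []
        # The counter breaks ties so equal-risk items keep submission order.
        self._counter = itertools.count()

    def submit(self, summary: str, risk: float) -> None:
        # heapq is a min-heap, so the risk is negated to pop the largest first.
        heapq.heappush(self._heap, (-risk, next(self._counter), summary))

    def next_for_review(self) -> tuple[str, float]:
        """Remove and return the highest-risk pending call."""
        neg_risk, _, summary = heapq.heappop(self._heap)
        return summary, -neg_risk

    def __len__(self) -> int:
        return len(self._heap)
```

The tool call itself stays blocked until the reviewer decides; the queue only determines the order in which decision summaries are presented.
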
Maintain signed, detailed execution logs for every tool call so anomalies can be detected and investigations reconstructed accurately.

Helmwart controls: Cross-system audit · Sigstore
Block execution of AI-generated code with elevated privileges until a human reviewer has approved it.

Helmwart controls: Code review gate
Flag any agent invoking the same tool at an abnormally high frequency within a short window, as this is a common indicator of abuse.

Helmwart controls: Anomaly isolation · Rate limits and quotas
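
High-frequency invocation can be detected with a sliding window over call timestamps, kept per agent and tool. The sketch below is one simple approach; `FrequencyMonitor`, its thresholds, and the use of a fixed limit rather than a learned baseline are assumptions.

```python
from collections import defaultdict, deque


class FrequencyMonitor:
    """Flags an agent that calls one tool too often within a time window."""

    def __init__(self, max_calls: int, window_seconds: float) -> None:
        self.max_calls = max_calls
        self.window_seconds = window_seconds
        self._calls: dict[tuple[str, str], deque] = defaultdict(deque)

    def record(self, agent_id: str, tool_name: str, now: float) -> bool:
        """Record one call; return True if the agent should be flagged."""
        calls = self._calls[(agent_id, tool_name)]
        calls.append(now)
        # Drop timestamps that have fallen out of the window.
        while calls and calls[0] <= now - self.window_seconds:
            calls.popleft()
        return len(calls) > self.max_calls
```

Unlike a rate limiter, this monitor does not block the call; it only reports, which leaves the decision to isolate the agent to a separate control.
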
Monitor every tool interaction for unintended side effects and cross-audit outcomes against the agent's declared intent.

Helmwart controls: Divergence monitor · Cross-system audit

detective Step 3: Prevent AI resource exhaustion and supply-chain compromise

Track agent workload in real time and alert when any single agent exceeds its allocated processing quota.

Helmwart controls: Rate limits and quotas · Adaptive load
Automatically suspend any AI process that breaches a predefined resource threshold, degrading gracefully rather than failing open.

Helmwart controls: Graceful degradation · Rate limits and quotas · Kill switch
Run static analysis and secret scanning on all AI-generated code before execution, blocking any attempt to bypass security constraints.

Helmwart controls: Static analysis · Code review gate · Secret scan
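
As a small illustration of static analysis on generated code, the sketch below parses Python source with the standard `ast` module and reports calls from a deny-list without executing anything. The `BANNED_CALLS` set and the helper names are hypothetical, and the check is deliberately naive: it matches call names as written, so aliased imports such as `from os import system` would evade it.

```python
import ast

# Illustrative deny-list only; a real policy would be far broader.
BANNED_CALLS = {"eval", "exec", "os.system", "subprocess.Popen"}


def _call_name(node: ast.Call) -> str:
    """Render `f(...)` or `a.b.f(...)` as a dotted name, or '' if dynamic."""
    parts = []
    target = node.func
    while isinstance(target, ast.Attribute):
        parts.append(target.attr)
        target = target.value
    if isinstance(target, ast.Name):
        parts.append(target.id)
        return ".".join(reversed(parts))
    return ""


def find_violations(source: str) -> list[tuple[int, str]]:
    """Return (line, call) pairs for banned calls, without running the code."""
    tree = ast.parse(source)
    return sorted(
        (node.lineno, name)
        for node in ast.walk(tree)
        if isinstance(node, ast.Call) and (name := _call_name(node)) in BANNED_CALLS
    )
```

A gate built on this would refuse execution whenever `find_violations` returns a non-empty list, and would treat a `SyntaxError` from `ast.parse` as a failure rather than a pass.
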
Aggregate resource consumption across all agents in a fleet to catch coordinated exhaustion attacks that stay below per-agent thresholds.

Helmwart controls: Rate limits and quotas · Adaptive load
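
The point of fleet-level aggregation is that every agent can sit below its own threshold while the total is still excessive. A minimal sketch, assuming usage is already collected as a mapping from agent id to consumed units; the function name, limits, and units are hypothetical.

```python
def check_fleet(
    usage: dict[str, float], per_agent_limit: float, fleet_limit: float
) -> tuple[list[str], bool]:
    """Return the agents over their own limit and whether the fleet total is over."""
    over_agents = sorted(agent for agent, used in usage.items() if used > per_agent_limit)
    fleet_over = sum(usage.values()) > fleet_limit
    return over_agents, fleet_over
```

With three agents each using 8 or 9 units against a per-agent limit of 10 and a fleet limit of 20, no individual agent is flagged, yet the fleet check trips.
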
Cap the number of concurrent system-modification requests an agent may initiate to prevent runaway self-modification loops.

Helmwart controls: Rate limits and quotas · Loop limit
Continuously scan agent SBOMs for newly disclosed vulnerabilities or indicators of compromise in supply-chain dependencies.

Helmwart controls: Agent SBOM · Sigstore
Red-team the agent by injecting simulated poisoned supply-chain components to verify that security boundaries hold under realistic attack conditions.

Helmwart controls: Behavioural red-teaming

Source

OWASP Agentic AI: Threats and Mitigations v1.1 (Dec 2025), §Mitigation Strategies. Action text is taken verbatim or paraphrased from the canonical document; the Helmwart additions are the per-action mappings onto deployable mitigation entries.