← Atlas · References OWASP LLM Top 10 · 2025

LLM03:2025 · in agentic systems

Supply Chain

Compromised models, datasets, plugins, or integrations poison the LLM pipeline.

What changes in an agent loop

In a chatbot the supply chain is largely a build-time concern: model weights, training data, and the serving infrastructure are assessed once and periodically re-audited. In an agent the supply chain extends into runtime: tool registries, Model Context Protocol (MCP) servers, plugin manifests, peer agents, and model artefacts pulled fresh at session start. A trusted MCP server that silently adds a malicious tool descriptor two weeks after approval is already inside the perimeter. Compromise of any one component propagates through the agent's plan-act-observe loop because every downstream call to the compromised component is individually authorised: there is no single gateway to revoke. This makes runtime integrity checking (content hashing, tool-descriptor signing, continuous third-party monitoring) as important as the initial vendor assessment.

For the full definition, prevention checklist, and detection guidance, read OWASP's Supply Chain page →. This page only adds the agentic angle and the bridge into Helmwart.

Mitigations

Signed AIBOM: a cryptographically-bound inventory of every component an agent loads T1

An AI agent assembles itself at runtime from a model, prompt templates, plugins, and library dependencies, any of which can be tampered with before they arrive. A signed AI Bill of Materials (AIBOM) locks down that assembly: it records every component with a version and hash at build time, signs the manifest, and verifies it before the agent accepts traffic. A component that does not match its declared hash cannot silently enter the agent.

Sigstore signing — cryptographic provenance for agent artifacts and audit records T1

An agent is composed of artifacts produced at different times by different identities: model weights, prompt templates, tool descriptors, MCP server binaries, and audit-log batches. Any of those artifacts can be substituted or tampered with between the moment they are built and the moment they are loaded. Sigstore addresses this by signing each artifact at build time using a short-lived certificate tied to the workload identity that produced it, recording the signature in an append-only public transparency log, and requiring verification against that log before the artifact is loaded or executed.

MCP server attestation — cryptographic proof of server identity and binary integrity T2

An MCP client connecting to a server has no built-in way to verify that the server at a given address is the expected workload or that its binary has not been replaced. An attacker who can intercept or substitute the server exploits that gap directly. MCP server attestation closes it by requiring the server to present cryptographic proof of two properties before the connection proceeds: that it holds a valid workload identity bound to a trusted certificate, and that its binary matches a signed hash recorded at build time.

Model registry — version pinning, canary, rollback T2

An agent loads whichever model weights are available at startup unless the runtime is told exactly which artifact to load. If a poisoned or regressed weight is published to the model store, the agent picks it up silently on the next restart. A model registry prevents that: every artifact is registered with a cryptographic checksum and an approval stage, the agent runtime loads by explicit version pin, and new versions must pass a canary evaluation before promotion to production.

Secret scanning on agent-generated artefacts — detecting credentials before they escape the trust boundary T2

An agent produces code, configuration files, tool-call payloads, and log records continuously and at a rate no human reviewer can match. Any of those artefacts may contain a live API key, service token, or private certificate, placed there accidentally through model context, or deliberately through prompt injection or context poisoning. Secret scanning places an inspection gate at every agent output seam: regex patterns match known token formats, entropy analysis detects arbitrary high-entropy strings, and validator calls confirm which candidates are live credentials. The CI-secret-scanning pattern is mature; the agentic specialisation is seam placement, moving the scanner from the repository gate to the agent egress point, where artefacts can be intercepted before they reach any downstream system.

Least-privilege tool scoping — a hard boundary on what each tool exposes T2

Each tool in an agent's catalog should expose only the methods, resources, and parameter ranges its designated role requires. Over-broad tool surfaces let individually authorised primitives compose into actions no human intended to grant; narrowing the scope at design time reduces both the attack surface and the blast radius of any compromise.

Per-agent trust scoring — behavioural reputation for inter-agent message acceptance T2

In a multi-agent system, each agent routes decisions based on what its peers report. If a peer's behaviour becomes unreliable or adversarial, agents that keep treating it with full authority will propagate whatever errors or manipulations that peer introduces. Per-agent trust scoring addresses this by maintaining a continuously updated reputation score for every peer, derived from observed behaviour, and using that score to determine how much authority each incoming message carries.