Skip to main content

Operational checklist

Prompt injection

Treat retrieved content and tool inputs as untrusted. Add allowlists, output checks, and strict tool scopes.

Sensitive information disclosure

Redact secrets before context injection. Disable unnecessary logs. Separate prod and eval data stores.

Improper output handling

Validate model outputs before executing tools or shell commands. Require schema checks and policy gates.

Excessive agency

Use least privilege for tools and credentials. Require human confirmation for irreversible actions.

Supply chain vulnerabilities

Pin model revisions, verify checksums, and keep a signed inventory of runtimes and model artifacts.

Reference frameworks

Review licensing constraintsOpen segment review ops