$ clawproof --check-all --verbose

Your agent is in production.
The question is whether it should be.

Clawproof is a reliability and governance practice for teams shipping AI agents. Ten checks, distilled from real production incidents. A free assessment that takes five minutes. An open-source toolkit you can run today.

Score your agent  ·  Read the 10 checks
5 minutes  ·  20 questions  ·  no email required
example.report  ·  illustrative  ·  generated 2026-04-25 14:21 UTC
Clawproof Score: 0 / 100
Production-ready

Most checks are real. Two gaps below would block a clean audit.

Per-check breakdown
01  Tool Permissions           80%
02  Logging & Audit            60%
03  Prompt Injection           40%
04  Human-in-the-Loop          80%
05  Kill-Switches             100%
06  Secrets Management         80%
07  Evaluation & Regression    20%
08  Data Boundaries            60%
09  Cost Controls              80%
10  Multi-Agent Coordination   70%
Run yours →
this is a sample report. yours will be specific to your stack.
The operator
Werner Plutat
Legal Engineer. LL.M. Big 4 + DAX. I run AI agents in production every day. These checks come from real operations, not papers.
example.log  ·  illustrative failure patterns  ·  3 entries
[ERROR] rate_limiter: request burst exceeded cap → circuit open
        → Prevented by Check #09 Cost Controls

[WARN ] tool_permissions: agent requested scope beyond grant, denied by policy
        → Prevented by Check #01 Tool Permissions

[INFO ] kill_switch: manual halt triggered, in-flight actions cancelled
        → Prevented by Check #05 Kill-Switches
illustrative. not real production logs. each entry maps to the check that catches the failure mode.
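To make the three patterns concrete, here is a minimal Python sketch of a guard that sits in front of every tool call. It is not taken from the Clawproof toolkit: the `AgentGuard` class, the `PolicyError` exception, the scope strings, and the parameter names are all hypothetical, and the burst cap is a simple sliding window whose circuit stays open until someone resets it.

```python
import threading
import time


class PolicyError(Exception):
    """Raised when the guard refuses to let a tool call proceed."""


class AgentGuard:
    """Hypothetical guard; names and behavior are illustrative assumptions."""

    def __init__(self, granted_scopes, max_calls, window_seconds,
                 clock=time.monotonic):
        self.granted_scopes = frozenset(granted_scopes)
        self.max_calls = max_calls
        self.window_seconds = window_seconds
        self.clock = clock
        self.calls = []                  # timestamps of recently allowed calls
        self.circuit_open = False        # stays open until reset by hand
        self.halted = threading.Event()  # manual kill switch

    def halt(self):
        """Flip the kill switch; every later check() is refused."""
        self.halted.set()

    def check(self, scope):
        """Raise PolicyError unless a call needing `scope` may run now."""
        if self.halted.is_set():
            raise PolicyError("kill_switch: manual halt triggered")
        # Deny by default: only explicitly granted scopes pass.
        if scope not in self.granted_scopes:
            raise PolicyError(f"tool_permissions: scope {scope!r} beyond grant")
        if self.circuit_open:
            raise PolicyError("rate_limiter: circuit open")
        now = self.clock()
        # Keep only the calls that still fall inside the sliding window.
        self.calls = [t for t in self.calls if now - t < self.window_seconds]
        if len(self.calls) >= self.max_calls:
            self.circuit_open = True
            raise PolicyError("rate_limiter: request burst exceeded cap")
        self.calls.append(now)
```

An agent loop would call `guard.check("files:read")` before each tool invocation and treat `PolicyError` as a hard stop. The kill switch is checked first so that a manual halt wins over every other rule.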
Open source

Ten checks you can run on your own agent in five minutes.

★ on GitHub →
install (Claude Code)
# add the skill to Claude Code
claude skill install clawproof-checks

# run all ten checks against your repo
clawproof --check-all --verbose
install (yaml only)
# clone the repo
git clone https://github.com/lexbeam-software/clawproof-checks
cd clawproof-checks

# inspect any check
cat checks/03-prompt-injection.yaml
Read the project page →
$ clawproof --assess

Twenty questions. Five minutes. A score that tells you what to fix first.

The free assessment is the same questionnaire we use with paying clients. The score is generated locally in your browser. No email, no calendar invite, no follow-up.
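As a rough illustration of "what to fix first", the per-check percentages from the sample report can be sorted from lowest to highest. How Clawproof actually weights or ranks checks is not stated here, so the plain ascending sort and the `fix_first` helper below are assumptions made for the sketch.

```python
# Per-check percentages copied from the illustrative sample report.
sample_report = {
    "Tool Permissions": 80,
    "Logging & Audit": 60,
    "Prompt Injection": 40,
    "Human-in-the-Loop": 80,
    "Kill-Switches": 100,
    "Secrets Management": 80,
    "Evaluation & Regression": 20,
    "Data Boundaries": 60,
    "Cost Controls": 80,
    "Multi-Agent Coordination": 70,
}


def fix_first(report, n=2):
    """Return the n lowest-scoring checks (hypothetical ranking rule)."""
    return sorted(report.items(), key=lambda item: item[1])[:n]


for name, pct in fix_first(sample_report):
    print(f"{pct:>3}%  {name}")
# prints:
#  20%  Evaluation & Regression
#  40%  Prompt Injection
```

With the sample numbers, the two lowest checks are Evaluation & Regression and Prompt Injection.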

Start the assessment →
Browse the checks first
Newsletter · monthly · ~600 words
One operator's notes on running agents in production. No funnel, no upsell.