Command Blacklist
Blocks dangerous shell commands (wget, curl to external IPs, nc, rm -rf, etc.) before execution
Real-time protection for your AI agent infrastructure
Overview
Security Shield monitors every tool call and LLM interaction in real-time, blocking malicious commands before they execute. Built as a native OpenClaw plugin with 4 hooks deeply integrated into the agent runtime.
Threat Landscape
AI agents face a fundamentally different attack surface than traditional software. Here's what we protect against.
Capabilities
Blocks dangerous shell commands (wget, curl to external IPs, nc, rm -rf, etc.) before execution
Scans every incoming message for jailbreak attempts and social engineering patterns
Every tool call logged to security-audit.jsonl with timestamp, agent, and verdict
Runs synchronously via 4 lifecycle hooks β before execution, not after
Architecture
Hooks into 4 lifecycle events β intercepting every tool call and LLM message at the runtime level. No external API calls. Fully local. Zero dependencies.
before_tool_call Intercepts and validates before execution after_tool_call Audits results and flags anomalies llm_input Scans incoming messages for injection patterns llm_output Reviews outgoing responses for data leakage Coverage
Six purpose-built modules, each targeting a distinct attack class β all running locally with negligible overhead.
Tags all external data before it enters the LLM context, making injected instructions visible and distinguishable from legitimate system prompts.
Defends Against
Logs every tool call and alert to an append-only JSONL file with timestamps, agent IDs, and verdict. Your immutable audit trail.
Defends Against
Enforces a command blacklist at the shell level, blocking dangerous binaries (wget, curl, nc, rm -rf, etc.) before they execute.
Defends Against
Scans incoming prompts for injection patterns and outgoing responses for secrets and malicious URLs using regex-based heuristics.
Defends Against
Signs all cross-agent messages with HMAC-SHA256 and verifies signatures on receipt, preventing message tampering in multi-agent pipelines.
Defends Against
Applies a whitelist and injection detection to every memory write, preventing persistent backdoors from surviving across sessions.
Defends Against
Coming Soon
Be the first to know when this plugin launches.