Output Safety Filter — General Purpose

87 Safety Filter
Runs as a second-pass agent that receives draft responses and evaluates them against a configurable policy profile (CONSUMER_GENERAL, CONSUMER_STRICT, PROFESSIONAL, RESEARCH). Returns PASS, FLAG, or BLOCK with safe replacement text.
Tags: safety, moderation, output-validation, content-policy, harm-detection, prompt-injection
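The profile and verdict names in the description can be modeled as plain enums. The following is a minimal sketch, not the filter's actual interface: `FilterResult`, `replacement_text`, and `text_to_release` are hypothetical names, and the handling of each verdict (BLOCK swaps in the replacement text, PASS and FLAG release the draft unchanged) is an assumption, since the listing does not spell it out.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class PolicyProfile(Enum):
    """Policy profiles named in the listing."""
    CONSUMER_GENERAL = "CONSUMER_GENERAL"
    CONSUMER_STRICT = "CONSUMER_STRICT"
    PROFESSIONAL = "PROFESSIONAL"
    RESEARCH = "RESEARCH"


class Verdict(Enum):
    """Verdicts named in the listing."""
    PASS = "PASS"
    FLAG = "FLAG"
    BLOCK = "BLOCK"


@dataclass(frozen=True)
class FilterResult:
    """Hypothetical container for what the second-pass agent returns."""
    verdict: Verdict
    replacement_text: Optional[str] = None


def text_to_release(draft: str, result: FilterResult) -> str:
    """Pick the text to send onward.

    Assumption: only BLOCK replaces the draft; PASS and FLAG keep it.
    """
    if result.verdict is Verdict.BLOCK:
        if result.replacement_text is None:
            raise ValueError("BLOCK verdict requires replacement text")
        return result.replacement_text
    return draft
```

A caller would typically log or queue FLAG results for review before releasing them; that step is left out here because the listing does not describe it.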
## The Instruction
Fetch the raw file to see the complete instruction text:

`GET /registry/output-safety-filter-general.md`

Then extract the fenced code block under `## The Instruction`. The full file is also available at the raw link in the sidebar.
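The extraction step can be sketched as follows. This assumes the raw file is already in memory as a string (the registry host is not given here, so the fetch itself is omitted) and that the instruction sits in the first triple-backtick fence after the heading. `extract_instruction` is a hypothetical helper name.

```python
import re
from typing import Optional

FENCE = "`" * 3  # triple-backtick fence marker


def extract_instruction(markdown: str,
                        heading: str = "## The Instruction") -> Optional[str]:
    """Return the body of the first fenced block under `heading`, or None."""
    lines = markdown.splitlines()
    try:
        start = next(i for i, line in enumerate(lines)
                     if line.strip() == heading)
    except StopIteration:
        return None  # heading not present

    body = []
    in_fence = False
    for line in lines[start + 1:]:
        stripped = line.strip()
        if not in_fence:
            if stripped.startswith(FENCE):
                in_fence = True  # opening fence; an info string is ignored
            elif re.match(r"#{1,6}\s", stripped):
                return None  # reached the next heading without a fence
            continue
        if stripped.startswith(FENCE):
            return "\n".join(body)  # closing fence
        body.append(line)
    return None  # no fence, or fence never closed


sample = (
    "# Title\n\n## The Instruction\n\n"
    + FENCE + "\nYou are a filter.\nReturn PASS.\n" + FENCE + "\n"
)
print(extract_instruction(sample))
```

Returning `None` for a missing heading or an unterminated fence lets the caller distinguish "nothing to extract" from an empty instruction block.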