Status: Research / Future Roadmap
Priority: Medium
Depends on: Channel interception layer (optional)
When agents send outbound messages (responses to users), they may inadvertently include:
Current implementation removed outbound scanning from the adversary-detector crate to simplify the initial channel integration. This document captures the research directions for re-implementing outbound content filtering.
sk-[a-zA-Z0-9]{32,}, AKIA[0-9A-Z]{16}eyJ[a-zA-Z0-9_-]*\.[a-zA-Z0-9_-]*\.[a-zA-Z0-9_-]* (JWT)-----BEGIN (RSA |EC |DSA |OPENSSH )?PRIVATE KEY-----mongodb(\+srv)?://, postgres://, mysql://\d{3}-\d{2}-\d{4} (with contextual keywords)"my password is", "password:" + high-entropy string"api key:", "token:", "secret:" + following value"don't share this", "private:" + contentsecurity:
outbound_scanning:
enabled: true
mode: "flag" # "block", "flag", "log_only"
detectors:
high_entropy:
enabled: true
min_entropy: 4.5
min_length: 16
patterns:
enabled: true
patterns_file: "secrets-patterns.json"
context_keywords:
enabled: true
keywords: ["password", "secret", "token", "key", "credential"]
redaction:
enabled: true
mask: "***REDACTED***"
alerts:
on_detection: true
channel: "signal"
to: "+1XXXXXXXXXX"
outbound_contains_secrets → block