Custom Intents
Define your own detection categories with examples.
Overview
Custom intents extend the guardrails system with your own detection logic. Provide a name, description, and example prompts (both positive matches and negative non-matches) to train a lightweight classifier. Once created, your intent runs on every request alongside the built-in categories.
Key Features
- Create custom detection intents with positive and negative examples
- Intents are evaluated alongside built-in guardrails
- Assign block, monitor, or redact actions per intent
How It Works
- Name your intent (e.g.,
competitor-mentions) - Describe what the intent should detect
- Add positive examples - prompts that should trigger the intent
- Add negative examples - prompts that should not trigger the intent
- Assign an action - block, monitor, or redact
The classifier learns from your examples and applies the configured action when a match is detected.