Custom Intents

Define your own detection categories with examples.

Overview

Custom intents extend the guardrails system with your own detection logic. Provide a name, description, and example prompts (both positive matches and negative non-matches) to train a lightweight classifier. Once created, your intent runs on every request alongside the built-in categories.

Key Features

Create custom detection intents with positive and negative examples
Intents are evaluated alongside built-in guardrails
Assign block, monitor, or redact actions per intent

How It Works

Name your intent (e.g., competitor-mentions)
Describe what the intent should detect
Add positive examples - prompts that should trigger the intent
Add negative examples - prompts that should not trigger the intent
Assign an action - block, monitor, or redact

The classifier learns from your examples and applies the configured action when a match is detected.

Overview​

Key Features​

How It Works​

Overview

Key Features

How It Works