Working with AI Security Guards

AI Security Guards help you protect and govern the AI-powered applications that you build, and enforce security controls in runtime. They let you enforce rules to inspect and control traffic between the AI applications that you build, your users, and supported AI models. You use guards to detect malicious prompts, unsafe content, and sensitive data exposure before they affect your application. Guards also give you visibility into how AI applications are used, so you can monitor violations and enforce organizational policies with a consistent control point.

A guard acts as the enforcement layer for AI Security. It evaluates prompts and, when relevant, model responses against the detections and actions that you configure in your policy. Based on the policy match, the guard can allow, block, or log the interaction.

How Guards Work

Guards inspect AI traffic and compare it to the detections and actions that are defined in your policy. This lets you apply security and governance controls to AI interactions before the traffic reaches the model provider, and in some cases, before the response is returned to the user.

Traffic Flow

AI Security supports different traffic flow models, depending on how your application integrates with the guard.

Proxy Mode - In this mode, the guard sits inline between the application and the AI provider. Your application sends requests through the guard, which forwards traffic to the provider after evaluating the content against policy.

This mode lets you apply controls directly in the traffic path. The guard can inspect the request before it reaches the provider and, where supported, inspect the response before it is returned to the application. This gives you a direct enforcement point for real-time protection and visibility.
API Mode - In API mode, your application interacts with the guard via an API-based integration rather than routing traffic through an inline proxy. The application sends the relevant content to the guard for evaluation as part of the application flow.

This mode gives you more flexibility for environments where an inline deployment is not the preferred architecture. The guard still evaluates the content against policy, but the enforcement flow depends on how your application is integrated with the guard.

What does the Guard Detect and Enforce

You can use guards to detect and enforce policy for a range of AI-related risks, including:

Prompt injection attempts
Unsafe or restricted content
Sensitive data exposure, such as personally identifiable information (PII) or protected health information (PHI)
Topic or usage violations based on company policy
Suspicious or non-compliant AI interactions

When a guard detects a match, it applies the action that is configured in the policy. For example, the guard can block a request, allow it, or log the event for monitoring and investigation.

Use Case - Protecting an Internal AI Assistant

Your company uses an internal AI assistant to help employees search internal documentation and summarize content. To reduce the risk of sensitive data exposure, prompt injection, and policy violations, you apply an AI Security guard to the application traffic.

The guard inspects prompts and, where supported, responses against your AI Security policy. You can detect unsafe or non-compliant interactions and block or log them before the content reaches the AI provider or is returned to the user.

Supported Providers

Proxy mode supports the following providers and API formats.

Gemini AI Studio (Google AI Studio)
- Gemini API: generateContent, streamGenerateContent
- OpenAI-compatible API: completions (limited)
Anthropic's Claude (Claude Platform)
- Claude API: messages, complete (legacy)
- OpenAI-compatible API: completions (limited)
OpenAI (OpenAI Platform)
- OpenAI API: responses, completions, embeddings
Azure OpenAI Service (Azure OpenAI)
- OpenAI API: completions
Bedrock
- Bedrock API: Converse, ConverseStream
- OpenAI-compatible API: completions (limited)
Custom Endpoint (Azure Foundry, or any other OpenAI-compatible endpoint)
- OpenAI API: responses, completions, embeddings

Limitations

The following limitations apply to AI Security guards in this context:

Proxy mode supports only the providers and API formats listed in this article
Support for OpenAI-compatible completions is limited for some providers