Question 1

What are AI guardrails?

Accepted Answer

AI guardrails are runtime checks that sit between an AI model and the application around it. For AI agents, that means scoring every prompt, tool call, and output for intent and risk before it executes, then letting policy allow, escalate, or block. They are the difference between hoping a model behaves and proving it did.

Question 2

How is precision measured?

Accepted Answer

On held-out adversarial and benign traffic, with precision, recall, and false-positive rates reported per intent class and per risk band. You can run the engine in shadow mode against your own production traffic before enforcing anything.

Question 3

Does it work across models and frameworks?

Accepted Answer

Yes. Classification sits at the execution boundary, independent of model and framework. Switching providers or upgrading models does not change the policy surface.

Question 4

What happens to actions the engine cannot classify with confidence?

Accepted Answer

They are escalated, blocked, or routed for review according to your policy. The default posture is to never allow an unclassified execution silently.

Question 5

Can we bring our own intent taxonomy?

Accepted Answer

Yes. The taxonomy is configurable per product surface. Start from our generic baseline and extend it, or define one from scratch for a specific copilot or workflow.

Question 6

Where does the engine sit in the request path?

Accepted Answer

Inline, ahead of the model and ahead of any tool execution. Inputs are classified before they reach the agent, planned actions before they fire, and outputs before they reach the customer.

Question 7

Is this an AI firewall or AI guardrails?

Accepted Answer

Both terms describe the same job: a guardrails layer that inspects prompts and actions before they execute. Averta's Classification Engine is that layer for AI agents, scoring every input, tool call, and output inline so your policy layer can allow, escalate, or block.

Question 8

How is the data stored?

Accepted Answer

Sensitive data is redacted in flight, so account numbers, balances, and personal data are stripped before anything is written to a log or store. Classification metadata and audit records are encrypted in transit and at rest, retained according to your policy, and never used to train shared models. Averta can run in your own cloud or VPC, or as a managed service in the region you choose.

AI guardrails for agents in production.

What gets through without AI guardrails.

Prompt injection and jailbreaks

Misread intent, wrong action

Risk scored after the fact

Static rules going stale

Catch prompt injection and jailbreaks before they land.

A risk profile for every prompt, action, and output.

Know what every prompt is trying to do.

Built for the execution path.

The decision layer in front of every action.

Govern every tool call.

Every interaction recorded.

Govern MCP tool access.

Pressure-test your agents.

Classification, specifics.

See Averta OS in action