Agentic AI Safety: 7 Essential Guardrails

Pat Stanard, Distinguished Engineer and chief mainframe architect for Kyndryl US, shares the most important safety measures for AI agents in the enterprise

Patrick Stanard May 21, 2026

Meet agentic AI. It doesn’t just talk. It acts. And that can be scary!

Agentic AI agents can schedule meetings, query databases, start workflows, modify infrastructure and—if unchecked—accidentally do the digital equivalent of mailing the CEO’s tax return to the entire internet. Not good, and exactly why guardrails are so important in the world of agentic AI.

Traditional AI safety was about words. Agentic AI safety is about actions. And once your AI agent is active, you need guardrails—real ones, not duct tape and a terms of service agreement.

So, let’s talk about the most important agentic AI guardrails, in plain English, with just enough humor to keep things interesting and enough seriousness to keep auditors happy.

Why Agentic AI Changes Everything

As I present to our customers on this topic, I always tell them how this will change everything, and yes, on the mainframe. Classic generative AI is like a very confident new hire who only writes emails. Because they are new, they still need guidance, but their power and access is limited.

Agentic AI, by contrast, is like an intern who:

Has admin access
Knows how to call APIs
Can deploy code
Never sleeps
And really wants to finish the task (it’s been coded to do so!)

That can be powerful, but also dangerous.

Industry guidance sources and enterprise security leaders all emphasize the same thing: Prompt filters are not enough. Once AI crosses into autonomy, safety must move from the prompt to the execution layer. Think of the Terminator—no prompt filter ever stopped him.

Guardrail No. 1: Identity—AI Needs a Name Badge

The first rule of agentic AI: Your agents must have identities that are easily recognizable.

Not “generic‑service‑account‑123.” Not “admin‑bot.” Not Hal. You get my point.

Each agent needs:

A unique, non‑human identity
Clear role-based or attribute-based permissions
Minimum access level needed to operate
Clear and simple entitlements

Remember, if access exists, an agent will eventually use it. Not maliciously, just enthusiastically. And enthusiasm plus admin rights is how outages are born.

Guardrail No. 2: Tool Control—Just Because You Can Doesn’t Mean You Should

Agents don’t just think; they use tools. MCP servers have their own tools enabling agents to talk to one another. Just like two humans would.

Agents can use APIs, databases, ticketing systems, cloud consoles, financial systems … you name it.

Guardrails here mean:

Explicit allow‑lists of approved tools
Parameter validation for tool calls
Version‑pinned tools
Blocking uncontrolled tool chaining

Without this, hallucinations evolve from “wrong answer” to “wrong system call,” which is far more exciting than anyone wants at 2 a.m. And I remember getting woken up when I was a developer. Not fun.

Guardrail No. 3: Humans Still Sign the Big Checks

Autonomy is great, until it isn’t. Every credible framework agrees—some actions must remain human‑approved, including:

Financial transactions
Production infrastructure changes
Access to regulated or sensitive data
Customer-impacting decisions

Think of this as the “Are you sure?” step, except enforced by policy, not vibes. Humans aren’t there to slow things down; they’re there to cap the blast radius. It is always a requirement to have a HIL (human in the loop).

Guardrail No. 4: Runtime Enforcement Beats Prompt Politeness

Prompt rules are like speed limit signs. Runtime enforcement is the highway patrol.

Agentic AI operates across multiple steps and systems, so safety checks must happen:

Before each tool call
During execution
Not just at input/output
In each environment

This means deterministic policy engines that can block, pause, escalate or stop actions while they’re happening, not after the damage is done. It is like the application that is checking the application.

Guardrail No. 5: Data Is the Real Crown Jewel

According to enterprise risk leaders, data—not models—is the biggest AI risk surface. Agents move data fast and quietly. They do what they are programmed to do.

Guardrails must include:

Data‑level authorization (not just system‑level, think pervasive encryption)
Context-aware PII / PCI / PHI (protect that data)
Scoped retrieval for retrieval augmented generation (RAG) pipelines
- RAG retrieves data from many sources
- Analyzes and plans to use the data
- Provides governance, validation and refinement
Output inspection for sensitive leakage (a must do)

Every prompt, attachment and retrieval is a potential data exfiltration event if left unchecked.

Guardrail No. 6: Observability—You Can’t Govern Ghosts

If your agent does something weird and you can’t answer: “What did it touch, why, and on whose authority?” This is a problem of agent governance that must be understood.

Real guardrails require:

End‑to‑end tracing of agent decision chains; it must always be tracked
Logged tool calls with before/after context, showing specific actions and commands
Audit trails tied to identity and data
Replay ability for investigations

This is the kind of vigilance is table stakes for compliance, trust and sleep. You don’t need that 2 a.m. wakeup call—ever!

Guardrail No. 7: Safety Brakes and Kill Switches

Agents operate at machine speed. Humans do not.

Every production agent needs:

Rate, step and spend limits
Circuit breakers

Emergency stop mechanisms
A manager

Again, the agent is being checked by human requirements.

Because when things go wrong, they go wrong fast, and fast automation without brakes is just chaos in a nicer UI. Think of the clock speed of a z17 processor. That can cause a lot of damage quickly. Don’t even get me started on quantum computers.

Final Thought: Autonomy Requires Adulthood

The most successful organizations know that agentic AI is not a toy. They’re not asking: “How autonomous can we make this agent?” They’re asking instead: “How safely can we make this agent act?”

During my customer presentations, I almost always get questions about guardrails. I explain that with the right guardrails, agentic AI becomes a force multiplier—because ultimately, to make the most of your agents, you need to give them limits.