Laws of AI Agents

Lessons from building AI agents that actually work.

These aren't proven theorems. They're field notes from building real agents, and every one points back to a source you can check. Fifty principles that hold no matter which model you use, covering context, reasoning, retrieval, scope, instructions, evaluation, safety, architecture, operations, and the people in the loop. The format is borrowed from Laws of UX.

50 laws · 10 categories · Inspired by Laws of UX

Law of Context Decay

Most agent failures start with the wrong context.

Context & Reliability

Compounding Error Law

Reliability multiplies, it doesn't add.

Context & Reliability

Position Is Power

Models read the edges. The middle gets lost.

Context & Reliability

The Model Optimizes for Looking Done

Agents declare victory early.

Context & Reliability

Design for the Worst Case

Plan around the ceiling, not the average.

Context & Reliability

Think Before You Touch

Spend reasoning tokens before you spend actions.

Reasoning & Planning

Don't Bet on One Chain

Sample many reasoning paths and let them vote.

Reasoning & Planning

Branch When the First Step Matters

For decisions you can't take back, explore before you commit.

Reasoning & Planning

Stop Tuning, Start Scaling

Build scaffolding you would gladly delete.

Reasoning & Planning

More Thinking Can Hurt

Extra reasoning past the answer is wasted, or a wrong turn.

Reasoning & Planning

Retrieval Is the Ceiling

Missing evidence becomes a missing answer.

Retrieval & Memory

Grounding Is Not a Guarantee

Retrieval reduces hallucination. It doesn't eliminate it.

Retrieval & Memory

Relevant Beats Plenty

Near-misses poison context worse than random noise.

Retrieval & Memory

Keyword Still Carries Weight

Pure semantic search quietly loses to a 40-year-old baseline.

Retrieval & Memory

Memory Is a System, Not a Window

Give the agent a hierarchy, not just a bigger prompt.

Retrieval & Memory

Narrow Beats General

Three sharp tools beat thirty dull ones.

Scope & Design

Determinism at the Edges

Model in the middle, code at the boundaries.

Scope & Design

Observability Precedes Autonomy

You can't grant autonomy you can't trace.

Scope & Design

Decompose Before You Scale

When it's unreliable, split it. Don't supersize it.

Scope & Design

The Cheapest Fix First

Reach for the prompt before the platform.

Scope & Design

The Tool Description Is the Prompt

An agent is only as capable as its tools are legible.

Instruction & Output

Show, Don't Tell

When prose fails, stop writing prose.

Instruction & Output

Confidence Is Not Calibrated

A model's certainty is not evidence.

Instruction & Output

Surface Ambiguity, Don't Resolve It

When the data is unclear, don't guess confidently.

Instruction & Output

Averages Lie

97% overall can hide a 60% segment.

Evaluation & Measurement

Vibes Don't Scale

Eyeballing outputs feels like progress until you can't tell if a change helped.

Evaluation & Measurement

Look at Your Data

The highest-ROI activity in AI is the one teams skip first.

Evaluation & Measurement

The Judge Is Biased

An LLM grader reacts to length and position, not just substance.

Evaluation & Measurement

Goodhart's Trap

When your eval becomes the goal, it stops measuring what you cared about.

Evaluation & Measurement

Regress or Repeat

Every fixed bug is a future regression unless it becomes a test.

Evaluation & Measurement

The Lethal Trifecta

Private data, untrusted content, and a way out. Pick at most two.

Safety & Security

Tokens Don't Wear Badges

Untrusted text can sound like instructions.

Safety & Security

The Confused Deputy

An agent with your privileges will wield them on an attacker's behalf.

Safety & Security

Quarantine Untrusted Tokens

Let the privileged planner orchestrate, but never let it read the poison.

Safety & Security

Sandbox the Blast Radius

Assume the agent gets compromised, then contain what it can reach.

Safety & Security

Don't Build an Agent When a Workflow Will Do

Agents buy flexibility with latency, cost, and unpredictability.

Architecture & Operations

Cascade Before You Escalate

Try the cheap model first. Only the hard cases deserve the expensive one.

Architecture & Operations

The Multi-Agent Tax

Every extra agent multiplies your token bill, so make sure the task can pay it.

Architecture & Operations

Your Architecture Mirrors Your Org Chart

You ship a system shaped like your teams, so design the teams first.

Architecture & Operations

Retries Demand Idempotency

If an action can run twice, a retry will eventually run it twice.

Architecture & Operations

Trip the Breaker

Stop calling the thing that's already failing.

Architecture & Operations

The Ironies of Automation

The more you automate, the harder the leftover human job becomes.

Humans & Autonomy

Automation Bias

People will trust the machine over their own eyes.

Humans & Autonomy

Match the Level to the Stakes

Full autonomy is a setting, not a default.

Humans & Autonomy

Mind the Mode

Most automation surprises start with 'what mode is it in?'

Humans & Autonomy

The Handoff Is the Hard Part

In multi-agent systems, failures live in the seams.

Trust & Coordination

Trust Is Calibrated, Not Granted

Autonomy is earned in proportion to track record.

Trust & Coordination

The Escape Hatch Law

No clean exit means a fabricated one.

Trust & Coordination

Don't Let the Author Be the Judge

The thing that made it shouldn't grade it.

Trust & Coordination

Preserve Provenance

Don't lose where a fact came from.

Trust & Coordination

Why I wrote these

I've built a lot of AI agents. Along the way I read the papers, the engineering write-ups, and watched more YouTube deep-dives than I'd like to admit. But the thing that actually taught me these laws was shipping agents and watching the same failures show up over and over.

Different platform, different model, different harness, same handful of problems. Context going stale. Tools the model couldn't read. Retrieval that missed the one passage that mattered. Evals that didn't exist until something broke in front of a user. Permissions that were far too broad. No clean handoff when the agent got stuck.

My background is software engineering, and that turned out to matter more than I expected. Most of these failures aren't really AI problems. They're reliability, distributed-systems, and interface problems wearing a new coat. The research gave me the why. Years of writing software gave me the instinct for the fix.

So I built this as my own reference: one place that pulls together what the research says and what actually holds up in production, written as 50 laws I can point to whenever I'm designing an agent.

Every law is backed by a real source, whether a paper, an essay, or hard-won engineering experience. And they're deliberately model-agnostic. The models change every few months. These failure modes don't, because they live in the architecture of agent systems, not in any one model. Internalize them and you'll build agents that are more reliable, more secure, and easier to trust, no matter what you build them with.

Sabir Moglad

Principal Software Engineer. I build AI agents and workflow automations, and I wrote Laws of AI Agents.

Connect on LinkedIn

AI Agent Audit Kit: 50 Laws Edition

Every law, in full, with a diagram for each.

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway

The principle

The takeaway