In development ยท active draft (2026)

Confidently Wrong.

DevOps, SRE, cloud, platform engineering, and critical thinking in a world of AI-assisted automation.

AI is not just another productivity tool for production engineers. It is an untrusted system with real authority, real failure modes, and a real blast radius. We need to evaluate, constrain, test, and operate it the same way we would any risky dependency in production.

The machine got friendlier. The job got harder.

An honest argument for why production engineers need a different mental model for AI — not because AI is useless, but because it is most useful right before it leads you off a cliff.

Production engineering used to have a simple feedback loop: the machine was not your friend. If your config was wrong, the service did not start. If you misunderstood the system, the system did not gently explain. It just broke. That was frustrating, but it taught the right lesson — understand it, test your assumptions, build a mental model that matches reality.

AI changes the emotional texture of that relationship. Now the machine is polite. It explains things. It writes the Terraform, drafts the Kubernetes manifest, summarizes the outage, suggests the IAM policy. It feels less like a compiler and more like a helpful coworker. That is exactly what makes it dangerous.

This book is not anti-AI. It is anti-pretending-AI-is-a-coworker. It treats AI the way every senior engineer eventually learns to treat any new dependency that touches production: useful, constrained, observable, reviewable, and never blindly trusted. It maps that mental model onto the real surfaces production engineers work with — Terraform, Kubernetes, CI/CD, cloud ops, agentic workflows, prompt injection, observability — and gives you a practice you can take back to your team.

How the book sounds.

The full opening of the book — draft prose, subject to revision before publication.

Introduction · Working draft

I was eleven or twelve, sitting at a HP-UX my dad had hobbled together from old parts, trying to compile an IRC daemon I'd downloaded from some FTP mirror that probably doesn't exist anymore. The README was three paragraphs of assumptions. The dependencies weren't listed anywhere useful. I had no idea what I was doing, but I stumbled my way through. Also big thank you to my dad explaining libraries and compiling. The first compile attempt died with an error about a missing header file that I'd never heard of.

The machine didn't care. It just sat there, blinking cursor, waiting for me to figure it out.

No Stack Overflow. No ChatGPT. No helpful blog post titled "How to Fix Missing <some_header.h> on HP-UX in 1994." Just me, some half-broken man pages, my dad when he wasn't busy who would always lead me but never do it for me, and a nudge toward me needing to hunt down another tarball, extract it, read its README, compile that, and hope it didn't cascade into three more missing dependencies.

It took me four hours. Maybe five. I don't remember feeling proud when it finally worked. I just remember the relief. And the lesson: the machine is hostile, but it's honest. If it says the file is missing, the file is missing. If it says the function is undefined, you didn't link the library. It won't lie to you. It won't make you feel better. It'll just sit there being broken until you fix it.

That was how you learned systems back then. You broke things. You read errors. You tested assumptions. You built a mental model of what the hell was actually happening under the hood, because guessing didn't work and nobody was coming to save you.

You learned to isolate the problem. Not "it's broken" — which line, which function, which missing piece. You learned to read error messages like they were clues, not obstacles. You learned that "it works on my machine" meant you didn't understand the environment yet. You learned to keep pushing when the answer wasn't obvious, because the answer was never obvious.

And you learned that the feeling of being stuck — really, deeply stuck — wasn't a sign you were bad at this. It was just part of the work.

I didn't know it at the time, but that was the foundation. That's the skill set that still matters. Maybe more now than it did then.

Fast forward to today.

You can open a chat window and say, "Write me a Terraform module that provisions an EKS cluster with three node groups, autoscaling, and logging to CloudWatch," and thirty seconds later you've got 200 lines of clean-looking HCL with comments and variable definitions, with decent module structuring to boot.

You can ask for a GitHub Actions workflow that builds a Docker image, scans it for vulnerabilities, and deploys it to three environments with approval gates. You'll get YAML that looks like someone who knows what they're doing wrote it. Sometimes pretty clever in the implementation.

You can paste an incident timeline into an AI and ask it to write the postmortem summary. It'll give you something polished enough to send to your Sr. Director.

The machine got helpful. It got fast. It got really, really good at looking like it knows what it's doing.

And that's the problem.

Because here's the weird part for me: the machine used to be hostile and honest. Now it's helpful and sometimes full of crap.

It'll generate a Kubernetes manifest that deploys successfully but misconfigures resource limits in a way that'll bite you in production three weeks from now. It'll write a Lambda function that works in the happy path and fails silently when the input is slightly malformed. It'll suggest an architecture that sounds smart and reasonable and introduces a single point of failure you won't notice until 2 a.m. on a Saturday.

And it'll do all of this with confidence. Clean formatting. Helpful comments. No warnings. No "hey, you should probably double-check this part."

And that's exactly the trap.

The old machine wouldn't compile if you screwed up. The new one compiles and deploys and smiles while it does it.

The real danger isn't that AI is useless. It's that AI is useful enough to make you trust it before you understand it.

That's the shift. That's what makes this moment different.

You used to have to earn your way to a working system. You had to understand the pieces, or it wouldn't work. Now you can skip that part. You can copy, paste, deploy, and move on. And if you're lucky, it works. If you're not, it works just enough to hide the problem until it's expensive.

The skill that matters now — maybe more than ever — is judgment.

Not "can you write Terraform." Not "do you know Kubernetes." Those are table stakes.

The skill is: can you look at something that looks right and figure out if it is right? Can you spot the thing that'll break in production? Can you tell the difference between "this works" and "this works safely"? Can you read generated code and know what it's actually doing, not just what it says it's doing?

That's engineering. That's the job.

And it's the same job it was in 1994 when I was staring at a missing library file. You still have to isolate the problem. You still have to read the errors — or in this case, read the code that didn't throw an error but should have. You still have to test your assumptions. You still have to build a mental model.

The difference is that now the machine is trying to help you skip that part. And if you let it, you're in trouble.

This book is for people who work in infrastructure, operations, reliability, platform engineering, or any adjacent role where the systems you build have to actually work when nobody's watching.

It's for people who are already using AI — or thinking about it — and who've noticed that the output is useful but not always trustworthy.

It's for people who want to get better at the part of the job that AI can't do for you: thinking clearly, asking the right questions, and knowing when to trust the tool and when to rewrite the whole thing from scratch.

If you're a DevOps engineer, SRE, cloud engineer, platform engineer, or someone growing into one of those roles, this is for you.

This is not a book of prompts. I'm not going to teach you 47 ways to ask ChatGPT or the LLM of the month to write a Helm chart.

This is not a hype piece. I'm not going to tell you AI is going to solve all your problems or that you're obsolete if you don't use it.

This is not an anti-AI rant. AI is useful. I use it. You probably should too.

And this is not a list of frameworks, tools, or "best practices" that'll be outdated in six months.

This is a book about judgment. About how to frame a problem so you get useful output instead of plausible garbage. About how to review what AI gives you and spot the things that'll fail later. About how to build systems with guardrails that catch mistakes before they become incidents. About where AI actually helps and where it quietly makes things worse.

It's about thinking like an engineer in a world where the machine is happy to do the thinking for you.

By the end of this book, you'll be better at a few specific things:

  • Framing problems clearly enough that AI gives you something useful instead of something confident.
  • Giving AI the context it needs to generate output you can actually use.
  • Reviewing AI-generated code, configs, and recommendations with a skeptical eye.
  • Spotting answers that sound good but don't hold up under scrutiny.
  • Designing systems and workflows with guardrails that catch AI mistakes before they reach production.
  • Understanding where AI is a force multiplier and where it's a liability you're better off avoiding.

You'll still use AI. You'll just stop trusting it by default.

AI can write a lot of things.

Your job is knowing which ones should never reach production.

Working draft. Final manuscript will be edited and copyedited before publication.

What the book covers.

A preface, an introduction, sixteen chapters, and a closing argument. Subject to revision — chapters may be reordered, merged, split, or renamed as the manuscript develops.

Preface

This Book Is Not Here to Hype You Up

Set expectations up front. This is not a book about replacing engineers with prompts. It is a book about becoming more effective, more skeptical, and more dangerous in the good way.

Introduction

The Machine Got Friendlier. The Job Got Harder.

Why AI makes engineering feel easier on the surface while quietly raising the bar for judgment, context, and critical thinking.

Chapter 1

How AI Sounds Right When It's Wrong

How AI produces plausible answers, why that is dangerous in infrastructure work, and how engineers can learn to spot convincing nonsense before it reaches production.

Chapter 2

AI Is Not Your Senior Engineer

A practical framing for what AI is good at, where it fails, and why "sounds right" is not the same as "is right."

Chapter 3

The New Skill Is Judgment

Why the most valuable engineers will not be the ones who type the fastest, but the ones who can ask better questions, spot missing context, and validate the output.

Chapter 4

Prompting Is Requirements Gathering

How to give AI useful constraints, architecture context, acceptance criteria, risks, and examples instead of tossing vague requests over the wall.

Chapter 5

Context Is the Control Plane

Why AI output quality depends heavily on what it can see, what it is allowed to assume, and how well your repos, docs, ownership, and standards are structured.

Chapter 6

Treat AI Like an Untrusted System

The core operating model of the book: AI should be sandboxed, reviewed, tested, permissioned, logged, and constrained before it touches anything important.

Chapter 7

Guardrails Beat Vibes

How policy-as-code, IAM boundaries, branch protections, CI checks, approval flows, and test environments turn AI from a risky helper into a usable engineering tool.

Chapter 8

The Agent Has Permissions Now

A realistic look at agentic workflows, tool use, MCP-style integrations, CI/CD agents, and what changes when AI can take actions instead of just writing text.

Chapter 9

Multi-Agent Systems Without the Hype

How planner, executor, reviewer, security, and documentation personas can be useful, and where multi-agent setups become expensive prompt theater.

Chapter 10

AI in Terraform, Kubernetes, and Cloud Ops

Practical examples of using AI for infrastructure work while avoiding common traps around generated Terraform, Helm changes, IAM policy edits, and cluster troubleshooting.

Chapter 11

Prompt Injection Comes for DevOps

Why prompt injection is not only a chatbot problem, especially when AI reads tickets, logs, PRs, docs, build output, dependency files, or user-controlled content.

Chapter 12

The Pipeline Is the Safety Net

How CI/CD becomes even more important in the AI era, because the pipeline is often the last place where generated changes can be tested, blocked, reviewed, or rolled back.

Chapter 13

Observability for AI-Assisted Work

What teams should log, review, and measure when AI is contributing to engineering workflows, including generated changes, approvals, failures, and near misses.

Chapter 14

Productivity Claims Need Evidence

A practical way to think about AI productivity, DORA metrics, developer experience, false speed, rework, review burden, and the difference between faster output and better outcomes.

Chapter 15

The Human Still Owns Production

Why accountability does not disappear because AI suggested the change, and how teams should think about responsibility, review culture, incident response, and escalation.

Chapter 16

Building Your AI-Ready Engineering Practice

A maturity model for teams adopting AI in production engineering, from ad hoc prompting to governed workflows with standards, patterns, and repeatable safety controls.

Conclusion

The Engineers Who Still Think Win

A closing argument that AI will reward engineers who understand systems, constraints, risk, and tradeoffs, not engineers who blindly accept the most confident answer.

And who it's not for.

Best for

DevOps engineers, SREs, platform engineers, cloud engineers, infrastructure engineers, security-minded engineering leaders, and senior software engineers who work close to production systems.

Especially for engineers and teams who are already using AI tools, or expect to soon, but do not want their adoption strategy to be "let everyone paste things into a chatbot and hope nothing weird happens."

Also useful for engineering managers, staff engineers, principal engineers, and architects who need to create practical AI usage standards for infrastructure, CI/CD, cloud, Kubernetes, Terraform, incident response, documentation, and operational workflows.

Not for

This is not a beginner "what is AI?" book. It is not a prompt engineering tricks book. It is not a generic business book about AI transformation. It is not a hype-driven tour of every new agent framework.

It is also not trying to convince people that AI will replace engineers.

The more interesting and useful question is how production engineering changes when AI becomes another actor inside the engineering workflow — sometimes writing code, sometimes reviewing it, sometimes reading logs, and sometimes holding credentials that can break real systems.

Where the book is today.

Currently writing.

The thesis, table of contents, and opening prose are stable. Chapters are being drafted, refined, and pressure-tested against real engineering teams adopting AI in production. No firm publication date — this page will be updated as launch milestones land.

If you want a feel for the voice and the lens before the book ships, the same operator-focused thinking shows up every week on Ship It Weekly and in the talk topics on Speaking.

Scroll to Top