Ship It Weekly
The DevOps and SRE podcast for practitioners — honest conversations on shipping software, running production, and the on-call reality the keynotes skip.
The DevOps, SRE, Platform and Cloud Engineering News Podcast
Weekly DevOps, SRE, Platform, and Cloud engineering news plus real conversations with people building this stuff in production.
Where to Listen
Available on all major podcast platforms
Latest Episodes
Catch up on every episode of Ship It Weekly
Ship It Weekly – DevOps and SRE News for Engineers Who Run Production
Ship It Weekly delivers essential DevOps and SRE news for engineers managing production systems. Each episode distills key industry shifts, security incidents, and tooling updates into practical insights, helping listeners stay informed without the hype.
Ship It Conversations: Meta’s Francois Richard on AI Incident Response, SLOs, and Reliability at Scale
In this episode, Francois Richard from Meta discusses the evolving landscape of reliability at scale, particularly with AI's impact on production risks. He emphasizes the importance of recovery practices alongside prevention, and how SLOs should reflect a commitment to users.
Coinbase Outage, Meta AI Account Recovery, AWS AgentCore Code Injection, Apigee Tenant Isolation, and the Glue That Breaks Production
This episode of Ship It Weekly discusses critical infrastructure failures and their implications. Brian analyzes Coinbase's outage due to an AWS cooling failure, Meta's AI-driven account recovery issues, and vulnerabilities in AWS AgentCore and Google Apigee.
Kiro CLI Approval Bypass, Amazon Braket Pickle Risk, AWS Org Logging, KEDA Upgrades, and Automation’s Hidden Boundaries
This episode of Ship It Weekly explores automation's hidden boundaries, focusing on Kiro CLI's CVE-2026-9255 approval bypass and Amazon Braket's Python pickle risk.
GitHub Supply Chain Attacks, Railway’s GCP Outage, Discord’s Voice Failure, AWS Retry Changes, and Trusted Tool Risk
This episode of Ship It Weekly discusses the growing risks of trusted tools in production, highlighted by a GitHub supply chain attack involving a compromised VS Code extension.
Ship It Conversations: Jake Warner on Cycle.io, Bare Metal’s Comeback, and Why Private Cloud Is Getting Interesting Again
In this episode, Jake Warner of Cycle.io discusses the resurgence of bare metal and private cloud, emphasizing their benefits in cost, performance, and compliance.
CISA’s GitHub Leak, AI Root Cause Analysis, Copilot Agents, Claude Code in CI/CD, and Kubernetes Seccomp Risk
This episode of Ship It Weekly discusses the implications of the CISA GitHub leak, highlighting the exposure of AWS keys and internal documentation.
AI Agents Get API Access and Identity: GitHub Copilot Cloud Agents, MCP Auth, Ansible Automation, OpenAI Daybreak, and the New Production Risk
This episode explores the shift of AI agents from coding assistants to operational actors. Topics include GitHub's REST API for Copilot cloud tasks, Auth0's MCP authentication, and Ansible as an execution layer.
Cursor Deletes PocketOS Prod DB, .de DNSSEC Outage, Bluesky Postmortem, Argo CD, and Copy Fail
This episode of Ship It Weekly discusses modern reliability challenges, including the PocketOS database wipe and the .de DNSSEC outage.
Ship It Conversations: Gareth Kersey on IaCConf 2026, AI, and Corey Quinn’s Terraform Keynote
In this episode, Gareth Kersey discusses IaCConf 2026, focusing on how infrastructure teams adapt to AI-driven changes in software delivery.
GitHub RCE, AI Agent Prompt Injection, and the New Reality: Your Developer Toolchain Is Production Now
This episode of Ship It Weekly discusses the evolving role of developer tools in production environments. Brian covers critical vulnerabilities like GitHub's git push RCE, AI prompt injection, and supply chain incidents, highlighting the need for enhanced security measures in
Kubernetes 1.36, Gateway API v1.5, AWS Copilot End of Support, and Cloudflare Non-Human Identities
This episode of Ship It Weekly discusses Kubernetes 1.36's maturity release, focusing on deprecating legacy features for better security.
Ship It Conversations: Stephane Moser on Pipedrive’s Jenkins-to-GitHub Actions Migration, Argo CD, and CI/CD at Scale
In this episode, Stephane Moser discusses Pipedrive's migration from Jenkins to GitHub Actions and the implementation of Argo CD for GitOps.
AWS Interconnect GA, Cloudflare Mesh, GitLab 19, EKS Auto Mode, and OpenTelemetry Config
This episode of Ship It Weekly discusses the evolution of networking and ingress in cloud platforms, covering AWS Interconnect's GA, Cloudflare Mesh, GitLab 19.0 breaking changes, EKS Auto Mode, and OpenTelemetry's stable config.
Special: Claude Mythos Preview and Project Glasswing: AI Exploit Discovery, Zero-Day Risk, Business Fallout, and What It Means for DevOps, Cloud, and Platform Security
In this special episode, Brian discusses Claude Mythos Preview and Project Glasswing, emphasizing their significance for security in DevOps and cloud environments.
Amazon S3 Files, Malicious npm Plugins, Trivy Fallout, and Kubernetes’ Gateway Shift
This episode of Ship It Weekly explores the evolving interface layer in cloud infrastructure, focusing on Amazon S3 Files as a managed filesystem, the rise of malicious npm plugins, and the implications of Kubernetes' Gateway API shift.
Ship It Conversations: David Tuite on Backstage, Internal Developer Portals, and the Shift to AI Agents
In this episode, David Tuite discusses the evolution of internal developer portals (IDPs) and their shift towards AI agents in engineering workflows.
GitHub Actions Hardening, Airbnb Config Rollouts, Cloudflare Rust Restarts, ECS Managed Daemons, and Terraform Access Controls
This episode of Ship It Weekly discusses crucial platform work that enhances system safety, including GitHub Actions hardening, Airbnb's safer config rollouts, and Cloudflare's zero-downtime Rust restarts.
Hackerbot-Claw Grows, Xygeni Tag Poisoning, GitHub Search HA, Windows SID Failures, and AI Skills Supply Chain
In this episode of Ship It Weekly, Brian explores how convenience can lead to trust issues in software development.
Ship It Conversations: Ang Chen on Project Vera, AI Cloud Emulation, and Safer Infrastructure Testing
In this episode, Ang Chen discusses Project Vera, a cloud emulator designed to enable safer infrastructure testing before impacting real cloud environments.
McKinsey AI Flaw, Kafka Goes Diskless, Google Buys Wiz, AWS Copilot Ends, and AI Gateway on Kubernetes
In this episode of Ship It Weekly, Brian discusses the implications of new AI interfaces on existing responsibilities. He covers McKinsey's AI tool vulnerability, Kafka's diskless topics model, Google's acquisition of Wiz, AWS Copilot's end, and Kubernetes' AI Gateway initiative.
Meta Buys Moltbook, Block AI Layoffs Get Messier, Atlassian Cuts Jobs, and GitHub Explains the Outages
This episode of Ship It Weekly dives into five key stories at the intersection of AI and reality, focusing on the implications for DevOps and SRE teams. Topics include Meta's acquisition of Moltbook, Block's messy AI layoffs, Atlassian's job cuts, and GitHub's outage analysis.
Ship It Conversations: Yvonne Young on Linux Foundations, Mentorship, and Getting Job Ready in Cloud
In this episode, Yvonne Young discusses the essential skills for breaking into cloud and DevOps, emphasizing Linux fundamentals and the importance of focus.
AWS Bahrain/UAE Data Center Issues Amid Iran Strikes, ArgoCD vs Flux GitOps Failures, GitHub Actions Hackerbot-Claw Attacks (Trivy), RoguePilot Codespaces Prompt Injection, Block “AI Remake” Layoffs, Claude Code Security
In this episode of Ship It Weekly, Brian discusses the expanding boundaries of operations, including AWS issues in Bahrain/UAE amid Iran strikes, and GitOps failures with ArgoCD.
Cloudflare BYOIP BGP Withdrawals, Clerk’s Postgres Query-Plan Flip Outage, and AWS Kiro Permissions Lessons (Grafana Privesc + runc CVEs)
This episode of Ship It Weekly delves into Cloudflare's BYOIP outage, highlighting how automation can lead to unintended BGP withdrawals, and Clerk's performance issues from a query plan flip in Postgres.
Ship It Conversations: Mike Lady on Day Two Readiness + Guardrails in the AI Era
In this episode, Mike Lady discusses day two readiness and the importance of guardrails in the AI era. He explains how effective guardrails can enhance safety and predictability in code delivery, especially as AI-generated code becomes prevalent.
Browse by Topic
Jump into curated feeds for the subjects we cover most
Work With the Show
Come on as a guest or partner with Ship It Weekly as a sponsor
Be a Guest
Want to share your DevOps journey on Ship It Weekly? We're looking for passionate engineers to interview!
Apply to be a Guest →Become a Sponsor
Reach thousands of DevOps, Platform, and Cloud Engineering professionals. Partner with Ship It Weekly!
Latest Updates
Announcements, milestones, and behind-the-scenes from Ship It Weekly
Ship It Weekly is officially a video podcast now. Starting with the latest release, every episode going forward will be available on YouTube in full video format. If you like seeing the host, guests, or just prefer the “watch it while you work” vibe, that’s now a first-class option. Watch…
Read more →About Ship It Weekly
Ship It Weekly is a short, practical recap of what actually matters in DevOps, SRE, cloud infrastructure, and platform engineering.
Each episode, your host Brian Teller walks through the latest outages, releases, tools, and incident writeups, then translates them into “here’s what this means for your systems” instead of just reading headlines. Expect a couple of main stories with context, a quick hit of tools or releases worth bookmarking, and the occasional segment on on-call, burnout, or team culture.
This isn’t a certification prep show or a lab walkthrough. It’s aimed at people who are already working in the space and want to stay sharp without scrolling status pages, cloud updates, and blogs all week. You’ll hear about things like cloud provider incidents, Kubernetes and platform trends, Terraform and infrastructure changes, and real postmortems that are actually worth your time.
Most episodes are 15–30 minutes, so you can catch up on the way to work or between meetings. Every now and then there will be a “special” focused on a big outage or a specific theme, but the default format is simple: what happened, why it matters, and what you might want to do about it in your own environment.
If you’re the person people DM when something is broken in prod, or you’re building the cloud and platform everyone else ships on top of, Ship It Weekly is meant to be in your rotation.
Meet Brian Teller
Brian is the host of Ship It Weekly and the builder behind Teller's Tech, a media and training platform focused on DevOps, SRE, platform engineering, cloud infrastructure, and the real-world work of keeping production systems alive.
Brian started Ship It Weekly because most tech news says what happened—but not always why it matters to the people on-call when the headline becomes their incident.