Chaos Mesh: Chaos Engineering on Kubernetes

Shriira Press

Preface

Deliberately inject failures to test and improve resilience. Chaos engineering on Kubernetes — done as a disciplined, safe practice.

Welcome to Chaos Mesh: Chaos Engineering on Kubernetes.

Chaos Mesh is a Kubernetes-native chaos engineering platform for deliberately injecting failures into your systems to test and improve their resilience. This free book teaches it from the ground up. It begins with the resilience problem (why resilience must be tested, not assumed) and what Chaos Mesh does, then covers the principles of chaos engineering (hypothesis-driven experiments, the steady state, blast radius) and how Chaos Mesh works (CRDs, the controller manager, the per-node chaos daemon, and the dashboard). From there it surveys the fault types (pod kills, network latency/loss/partition, stress, IO, DNS, time, and application-layer faults), shows how to define experiments as declarative Kubernetes resources, and explains how to run them safely (blast-radius control, environments, auto-recovery, RBAC). The later chapters cover the experiment workflow and observing results, automating chaos in CI/CD and on schedules, chaos in production and the ecosystem, and adopting chaos engineering in practice. Together, the ten focused chapters and their diagrams turn resilience from a hope into a tested, continuously verified property.

This title is part of the Shriira library and is free to read in full, right here: our small contribution to making world-class knowledge easy to reach.

A note on reading it: open the Contents menu at the top of the reader to jump between chapters, and use the Aa menu to set a comfortable text size, theme (light, sepia, or night), and single- or two-page layout. Your place is saved automatically, so you can always pick up where you left off.

We hope it serves you well.

— Shriira Press

Contents

  1. Chapter 1 — What Chaos Mesh Is
  2. Chapter 2 — Chaos Engineering Principles
  3. Chapter 3 — How Chaos Mesh Works
  4. Chapter 4 — The Fault Types
  5. Chapter 5 — Defining Experiments as Resources
  6. Chapter 6 — Running Experiments Safely
  7. Chapter 7 — The Experiment Workflow and Observing Results
  8. Chapter 8 — Chaos in CI/CD and Automation
  9. Chapter 9 — Chaos in Production and the Ecosystem
  10. Chapter 10 — Adopting Chaos Engineering in Practice