Книга Production-Grade Data Pipelines Samuel R. Whitmore

Production-Grade Data Pipelines

Design, Operate, and Scale Reliable Pipelines with Apache Airflow

Автор: Samuel R. Whitmore
Език: Английски език
Корици: С меки корици
Издател: Independently published
Наличност: Външен склад
Изпращаме след 9-15 дни
24.03 46.99 лв
Your pipelines don't fail when they break - they fail when they silently succeed.If you've ever ship...

Информация за книгата

Автор
Език
Английски език
Корици
Книга - С меки корици
Издадена
2026
страници
482
EAN
9798245518442
Enbook ID
50852954
Издател
Теглоt
1108
Размери
216 x 280 x 25

Пълно описание

Your pipelines don't fail when they break - they fail when they silently succeed.

If you've ever shipped a pipeline that "ran green" but produced the wrong data, triggered an expensive backfill, or collapsed under scale, you already know the hard truth: most guidance on data pipelines stops at "it runs." Production reality demands more.

This book is written for you - the data engineer, analytics engineer, platform owner, or tech lead responsible for production data pipelines long after launch. You live with late data, unclear ownership, fragile retries, rising cloud costs, and Airflow DAGs that worked... until they didn't.

Production-Grade Data Pipelines shows you how to design, operate, and scale systems that remain trustworthy over time. It focuses on data pipeline reliability, real-world data pipeline architecture, and the operational discipline required to keep pipelines correct as systems, teams, and business demands evolve.

Rather than treating Apache Airflow as a scripting tool, this book teaches Apache Airflow production usage the right way - as a control plane for dependable data engineering operations, not a dumping ground for business logic.

In this book, you'll learn how to:
  • Design production-ready data pipeline architecture that survives change, growth, and failure

  • Apply Airflow best practices that reduce fragility instead of hiding it

  • Implement safe pipeline failure handling without cascading incidents

  • Build effective data pipeline observability beyond "task failed" alerts

  • Control the real cost and risk of pipeline backfills and retries

  • Align reliability, cost, and speed through intentional operational tradeoffs

  • Establish ownership, SLAs, and on-call practices that actually work in production


This is not a beginner tutorial or a collection of DAG examples. It is a production handbook for engineers and teams who are accountable for data systems businesses depend on every day.

If you own pipelines in production, this book will change how you design - and trust - them.