Building Data Pipelines at Scale
Data EngineerIntermediate

Building Data Pipelines at Scale

Pipeline fundamentals, Airbyte ingestion, Airflow DAGs, production dbt, data quality with Great Expectations, Spark, Kafka streaming, and a two-part capstone.

8 Weeks8 modules6 Modules + 2-Part Capstone
What you'll build5 outcomes
  1. 01Design ingestion and batch pipelines with Airbyte and Airflow operators.
  2. 02Ship production-grade dbt models with testing and maintainability in mind.
  3. 03Apply Great Expectations for profiling, tests, and documentation.
  4. 04Scale transforms with Apache Spark and PySpark; integrate real-time flows with Kafka.
  5. 05Build an end-to-end pipeline and harden it for production monitoring.
Full curriculum8 items · 2 projects
01Pipeline Fundamentals & Data Ingestion with Airbyte
02Orchestration with Apache Airflow : DAGs, Operators & Schedules
03Transforming at Scale with dbt : Production-Grade Models
04Data Quality with Great Expectations : Tests, Profiling & Docs
05Distributed Processing with Apache Spark & PySpark
06Real-Time Streaming with Apache Kafka
07Capstone Project Part 1 : End-to-End Pipeline Build
Project
08Capstone Project Part 2 : Production Readiness, Monitoring & Review
Project
Instructor
Isuru Alagiyawanna

Isuru Alagiyawanna

Lead Instructor & Head of AI Engineering

Isuru leads Zuu Crew AI's technical direction and curriculum design. He has trained students and professionals across practical machine learning, MLOps, and AI engineering systems with a strong emphasis on shipping production-grade work.

University of MoratuwaRobert Gordon University, UKHead of AI Eng, Veracity GroupAWS ML EngineerTensorFlow Certified

Explore More

See all our programs
and free content.

Back to HomeAll Programs