Skip to content

Building the Data Pipeline

About This Course

This course focuses on the methods used to acquire, store and process data for downstream analysis. You’ll analyze and compare available technologies to make informed decisions as data engineers. You’ll also explore the modern cloud data platform — building systems that handle data from ingestion, through storage, processing and, ultimately, serving.  

What You’ll Learn

  • How a data lake (like Delta Lake, which is part of the Spark ecosystem) can enhance the usability of your organization’s data 

  • Batch and streaming processing using Spark, Flink and other processing tools 

  • How to use Kafka to enable low-latency and real-time processing 

  • The unified log model and the ways the log notion recur in support of building robust, fault-tolerant distributed data systems

  • Data acquisition, data governance and modeling techniques 

Get Hands-On Experience

  • Organize and store data in a data lake and handle updates and changes to your data

  • Use Spark to connect to different data sources and process batch and streaming data 

  • Design, build and integrate a complete end-to-end data pipeline to support a realistic business case

Course Sessions

Online Synchronous

January 2027
Dates Jan 7 - Mar 18
Location Online
Instructor Jerry Kuch
Cost $1,665
Scheduled Meetings
Date
Day
Time
Location
Jan 7, 2027
Thu
6 – 9 p.m.
Online
Jan 14, 2027
Thu
6 – 9 p.m.
Online
Jan 21, 2027
Thu
6 – 9 p.m.
Online
Jan 28, 2027
Thu
6 – 9 p.m.
Online
Feb 4, 2027
Thu
6 – 9 p.m.
Online
Feb 11, 2027
Thu
6 – 9 p.m.
Online
Feb 18, 2027
Thu
6 – 9 p.m.
Online
Feb 25, 2027
Thu
6 – 9 p.m.
Online
Mar 4, 2027
Thu
6 – 9 p.m.
Online
Mar 11, 2027
Thu
6 – 9 p.m.
Online
Mar 18, 2027
Thu
6 – 9 p.m.
Online

All times are Pacific Time.

Noncredit Course

You'll earn 3.3 continuing education units (CEUs) for successfully completing this course.