Coronavirus Update

For the safety of our community, UWPCE programs will be taught remotely for the 2020-21 academic year.

For more information, see our Coronavirus FAQ

Building the Data Pipeline


Course Details

This course can only be taken as part of the Certificate in Big Data Technologies.

Get Program Details

About this Course

This course focuses on the process used to create usable data for downstream analysis. You'll analyze and compare available technologies in order to make informed decisions as data engineers. You'll also learn how to run a data processing workflow through several data stack platforms and design a data pipeline for a business-use case.

What You’ll Learn

  • How to use Spark for batch and streaming processing
  • How to use Kafka for low-latency and real-time processing
  • Data acquisition and modeling techniques
  • Workflow orchestration and automations
  • Pipeline design and integration

Program Overview

This course is part of the Certificate in Big Data Technologies.

  Stay up to date with emails featuring career tips, event invitations and program updates.       Sign Up Now