Building the Data Pipeline


Course Details

This course can only be taken as part of the Certificate in Big Data Technologies.

About this Course

This course focuses on the process used to create usable data for downstream analysis. You'll analyze and compare available technologies so you can make informed decisions as a data engineer. You'll also learn how to run a data processing workflow through several data stack platforms and design a data pipeline for a business use case.

What You’ll Learn

  • How to use Spark for batch and streaming processing
  • How to use Kafka for low-latency and real-time processing
  • Data acquisition and modeling techniques
  • Workflow orchestration and automation
  • The pros and cons of available technologies
  • Pipeline design and integration

Program Overview

This course is part of the Certificate in Big Data Technologies.
