Coronavirus Update

For the safety of our community, UWPCE programs will be taught remotely for the 2020-21 academic year.

For more information, see our Coronavirus FAQ

Introduction to Data Engineering

collapse

Course Details

This course can only be taken as part of the Certificate in Big Data Technologies.

Get Program Details

About this Course


In this course, you'll get an introduction to the fundamental building blocks of big data engineering. You'll learn the foundational concepts of distributed computing, distributed data processing, data management and data pipelines. You'll also survey a variety of available data stack technologies and learn how to run a data processing workflow through a commonly used platform.

What You'll Learn

  • The fundamentals of data stacks, their uses, advantages and limitations
  • The background of distributed systems, relational databases and key-value stores
  • The foundations of the Hadoop ecosystem
  • The pros and cons of batch processing versus in-memory processing (Spark, Hive, SQL)
  • The uses and limitations of NoSQL stores (HBase, Redis, Elasticsearch, Cassandra, etc.)

Program Overview

This course is part of the Certificate in Big Data Technologies.

  Stay up to date with emails featuring career tips, event invitations and program updates.       Sign Up Now