Skip to content

Latest commit

 

History

History
33 lines (30 loc) · 1.81 KB

File metadata and controls

33 lines (30 loc) · 1.81 KB

Data Engineer with Python

In this track, you’ll discover how to build an effective data architecture, streamline data processing, and maintain large-scale data systems. In addition to working with Python, you’ll also grow your language skills as you work with Shell, SQL, and Scala, to create data engineering pipelines, automate common file system tasks, and build a high-performance database.

Through hands-on exercises, you’ll add cloud and big data tools such as AWS Boto, PySpark, Spark SQL, and MongoDB, to your data engineering toolkit to help you create and query databases, wrangle data, and configure schedules to run your pipelines. By the end of this track, you’ll have mastered the critical database, scripting, and process skills you need to progress your career.


Courses

  1. Data Engineering for Everyone
  2. Introduction to Data Engineering
  3. Streamlined Data Ingestion with pandas
  4. Writing Efficient Python Code
  5. Writing Functions in Python
  6. Introduction to Shell
  7. Data Processing in Shell
  8. Introduction to Bash Scripting
  9. Unit Testing for Data Science in Python
  10. Object-Oriented Programming in Python
  11. Introduction to Airflow in Python
  12. Introduction to PySpark
  13. Building Data Engineering Pipelines in Python
  14. Introduction to AWS Boto in Python
  15. Introduction to Relational Databases in SQL
  16. Database Design
  17. Introduction to Scala
  18. Big Data Fundamentals with PySpark
  19. Cleaning Data with PySpark
  20. Introduction to Spark SQL in Python
  21. Cleaning Data in SQL Server databases
  22. Transactions and Error Handling in SQL Server
  23. Building and Optimizing Triggers in SQL Server
  24. Improving Query Performance in SQL Server
  25. Introduction to MongoDB in Python