Professional Documents
Culture Documents
Course Code:20CS11Q3 L T P C
3 0 0 3
Course outcomes: At the end of the course students will be able to
CO1: Understand data lakes architectures and data engineering tools and services. (L2)
CO2: Explain architectures and pipelines to create data lakes. (L2)
CO3: experiment with delta lake tables (L3)
CO4: Build the data pipeline for the data curation stage. (L2)
CO5: Develop gold layer for data aggregation to meet customer expectations. (L3)
Learning Outcomes: At the end of the module, students will be able to:
Learning Outcomes: At the end of the module, students will be able to:
Learning Outcomes: At the end of the module, students will be able to:
Learning Outcomes: At the end of the module, students will be able to:
Learning Outcomes: At the end of the module, students will be able to:
Text Books:
1. Manoj Kukreja,, Data Engineering with Apache Spark, Delta Lake, and Lakehouse, Packt
Publishing, 2021.
References Books:
1. Scott Haines, Modern Data Engineering with Apache Spark: A Hands-On Guide for
Building Mission-Critical Streaming Applications, Apress, 2022.
Web References:
1. https://www.coursera.org/learn/introduction-to-data-engineering
2. https://www.coursera.org/professional-certificates/microsoft-azure-dp-203-data-engineeri
ng
3. https://aws.amazon.com/compare/the-difference-between-a-data-warehouse-data-lake-an
d-data-mart/