Data Engineering
Course Objectives:
1. Learn the Basics and Importance of Data Engineering.
2. Understand the Steps in Handling Data.
3. Design Scalable, Reliable Data Systems.
4. Explore Data Tools and Technologies.
Course Outcomes:
On successful completion of this course, students will be able to:
1. Explain data engineering fundamentals, lifecycle, and organizational role.
2. Analyze technologies and architectures for effective data systems.
3. Apply methods for scalable and reliable data storage and ingestion.
4. Design data models, queries, and transformations for analytics and ML.
5. Evaluate security, privacy, and emerging trends in data engineering.
UNIT-I: Foundations of Data Engineering
Data Engineering Defined, Data Engineering Skills and Activities, Data Engineers in
Organizations, Data Engineering Lifecycle, Major Undercurrents Across the
Lifecycle, Designing Good Data Architecture, Principles of Good Data Architecture.
UNIT-II: Technologies Across the Data Engineering Lifecycle
Decision Factors, Technology Lifespan: Today vs. Future, Location Considerations, Monolith
Versus Modular, Serverless Versus Servers, Performance and Optimization Challenges, Data
Generation in Source Systems, Time in Data Systems, Source System Practical Details.
UNIT-III: Storage and Ingestion
Data Storage Systems, Data Engineering Storage Abstractions, Data Ingestion, Key Engineering
Considerations for the Ingestion Phase, Batch Ingestion Considerations, Message and Stream
Ingestion Considerations, Ways to Ingest Data.
UNIT-IV: Queries, Modeling, and Transformation
Queries, Data Modeling, Transformations, Upstream and Downstream Stakeholders, Serving
Data for Analytics, Machine Learning, Ways to Serve Data for Analytics and ML, Reverse
ETL.
UNIT-V: Security, Privacy, and the Future of Data Engineering
Security and Privacy: People, Processes, Technology, The Data Engineering Lifecycle Is
Here, Decline of Complexity, The Cloud-Scale Data OS, Enterprisey Data Engineering, Moving
Beyond the Modern Data Stack, Toward the Live Data Stack.
Text Books: Fundamentals of Data Engineering by Joe Reis and Matt Housley, First
Edition, O’Reilly Media.