Curriculum for Data Engineering for Business Analytics: 40 Hours
1. Data Pre-processing (Total 18 Hours)
Technical Needs (5 Hours)
o Review of the Core Modules NumPy and Pandas
o Review of Another Core Module – Matplotlib for Data Preprocessing
o Data – What Is It Really?
o Databases
Analytic Goals (2 Hours)
o Data Visualization
Data Cleaning (6 Hours)
o Data Cleaning Level I – Cleaning Up the Table
o Data Cleaning Level II – Unpacking, Restructuring, and Reformulating the Table
o Data Cleaning Level III – Missing Values, Outliers, and Errors
Mixing Sources of Data (1 Hour)
o Data Fusion and Data Integration
Data Reduction (1 Hour)
Data Transformation and Massaging (1 Hour)
Case Study (2 Hours)
2. Data Preparation (Total 7 Hours)
Data for Business Analytics Using Excel (4 Hours)
o Functions
o Pivots
o Preparing Data for Analysis Using Functions
Data for Business Analytics Using SQL (3 Hours)
3. Data Pipelines (Total 15 Hours)
Introduction to Data Pipelines (1 Hour)
Data Pipeline Patterns (1 Hour)
Data Pipeline Architecture (2 Hours)
o Stages
o Steps
o Components
Orchestrating Pipelines (3 Hours)
Data Pipeline Validation (1 Hour)
Data Pipeline Testing Techniques (1 Hour)
Data Pipeline vs ETL Pipeline (1 Hour)
Best Practices for Maintaining Pipelines (1 Hour)
Measuring and Monitoring Pipeline Performance (2 Hours)
Use Cases (2 Hours)