Databricks Course Curriculum
14-Oct-2023
Apache Spark Architecture
Spark internals (driver, executors, jobs, stages, tasks, etc.)
Spark Memory Allocation (Driver and Executor Memory Allocation)
Cluster Deployment Modes (Client, Cluster)
Narrow and Wide Transformations
Spark configuration options
Interview Questions based on Apache Spark Architecture
15-Oct-2023
Introduction to Databricks
Walkthrough of Databricks workspace
Different types of clusters and their uses
Magic Commands
dbutils (Databricks Utilities)
Notebook Parametrization
Secret Scopes
Accessing Azure Blob Storage / ADLS Gen2 from a notebook
Understanding of DBFS
Interview Questions based on session.
21-Oct-2023
Databricks Data Processing
Reading different file formats (CSV, JSON, Parquet, etc.)
Different read options
Interview Questions based on session.
22-Oct-2023
Narrow Transformations
select, filter, map, flatMap, etc.
Wide Transformations
groupBy, aggregations, window functions, joins
Interview Questions based on session.
28-Oct-2023
Spark SQL
Databricks Database
Global Temporary View
Temporary View
External Tables
Managed Tables
Views
Joins
Interview Questions based on session.
29-Oct-2023
Lakehouse Architecture
Delta Tables
Time Travel
History
Vacuum
Incremental File Loading
Interview Questions based on session.
04-Nov-2023 and 05-Nov-2023
Unity Catalog
Benefits of Unity Catalog
Data Lineage
Data Auditing
Mini Project on Unity Catalog
Interview Questions
11-Nov-2023 and 12-Nov-2023
Structured Streaming
Different sources (Event Hubs, Kafka, etc.)
Stateless/Stateful Transformations
Output Modes (Complete, Update, Append)
Tumbling and Sliding Windows
Watermarking
Stream Joins
Kafka Sink
Cosmos DB Sink
File Sink
18-Nov-2023 and 19-Nov-2023
Delta Live Tables
Quality Checks
Data Quarantine
Structured Streaming using Delta Live Tables
Auto Loader
Interview Questions
25-Nov-2023 and 26-Nov-2023
Azure DevOps CI/CD for Databricks