0% found this document useful (0 votes)
13 views33 pages

Data Analysis Hand Book

The document outlines a comprehensive curriculum for courses in Data Analytics, Business Intelligence with Power BI, and Database Fundamentals with MySQL. It covers various topics including data literacy, Excel functions, Power BI dashboard creation, SQL commands, and statistical analysis techniques. By the end of the modules, students will be equipped with practical skills in data handling, analysis, and reporting, preparing them for roles in data-driven environments.

Uploaded by

Roshid Mostakim
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
13 views33 pages

Data Analysis Hand Book

The document outlines a comprehensive curriculum for courses in Data Analytics, Business Intelligence with Power BI, and Database Fundamentals with MySQL. It covers various topics including data literacy, Excel functions, Power BI dashboard creation, SQL commands, and statistical analysis techniques. By the end of the modules, students will be equipped with practical skills in data handling, analysis, and reporting, preparing them for roles in data-driven environments.

Uploaded by

Roshid Mostakim
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 33

https://interactivecares.

com/course/data-analytics
https://interactivecares.com/course/data-analytics
https://cutt.ly/4eo9uKJb
https://forms.gle/UxUxpH2bwAE7tfwd9
Advanced Excel & Google Sheets
Week 01

• Class - 1: Understanding Data & Building Data Literacy


• Understanding Types of Data & Variables
• Statistics: Measures of Central Tendency, Location &
Dispersion (Spread)
• How to handle Outliers?
• Types of Variables & Level of Measurements
• Picking the appropriate Chart based on the Variable
Types?
• Statistical Functions in Excel

• Class - 2: Excel Basics, Functions & Data Analysis with


Pivot Table
• Basics of Excel
• Excel Formulas: COUNT, COUNTA, IF, IFS, SUM, SUMIF,
SUMIFS, COUNTIFS, UNIQUE, SORT, FILTER, LEN, SUBSTITUTE,
LEFT, RIGHT, MAX, MIN, TRIM, CONCATENATE, TEXTJOIN, DAYS
• Conditional Logic
• Importance of Cell Locking using F4 on your Keyboard
• XLOOKUP; Advantages over VLOOKUP
• Introduction to Data Analysis with Pivot Table
• Conditional Formatting
• Number Formatting
• Use of Secondary Axis in Combo Chart
Week 02

• Class - 3: Data Consolidation- Appending, Merging (joining)


and Report Automation using Power Query

• Normalization Concept; Primary vs. Foreign Key


• Why Normalization is Required?
• Consolidating/Automating Multiple Sheet, Files or a Folder
using Power Query
• Merge (Join) vs. Append
• Joining/Merging: Left Join vs. Inner Join
• XLOOKUP vs. Merging in Power Query
• Data Cleaning using Power Query and Conditional
Formatting
• Long (Unpivot) vs. Wide (Pivot) Format
• Calculated Field from Pivot Table
• Why do we need Calculated Measure from Power Pivot?

• Class - 4: Interactive Charts Building using Excel and


Pivot Table Best Practices

• More on Summarizing Data with Pivot Table


• % of Grand Total Calculation for Pie/Donut Chart
• Use of Slicers, Filters and Timeline for Interactiveness
• When to use Bar vs. Column Chart?
• When to use Pie vs. Donut Chart?
• Clustered Bar and Stacked Chart
• Combo Chart
• Waterfall Chart
• Bullet Chart
• Dynamic Number Formatting
• Use of Secondary Axis in Charts
Week 03

• Class - 5: Root Cause Analysis and Dashboard Building


with Excel
• Understanding the Hospitality/Hotel Management Data
• Root Cause Analysis
• Creating Pivot Tables and Charts
• Adding Slicers and Timeline for Interactiveness
• Connecting the Slicers & Timeline with the Charts and
Tables
• Dynamic Number Formatting for Charts and Visualization
• Creating Interactive Dashboard and Making
Recommendation based on Findings
• Storytelling with Data

• Class - 6: Statistical Analysis using Microsoft Excel

• Descriptive Analysis using Data Analysis Toolpak


• Difference between Correlation and Causation
(Regression Analysis)?
• Estimating Correlation Coefficient and Interpreting the
Coefficient
• Simple vs. Multiple Linear Regression Model?
• Interpreting the Regression Model Parameters
• Goodness of Fit of the Model.
• Hypothesis Testing: Independent Sample t-test, Paired
t-test; ANOVA (F-test)
• When to perform a Non-parametric Test?
Week 04

• Class - 7: Data Cleaning and Data Profiling with Power


Query in Excel
• Web Scraping using Power Query
• Divisional Map Visual in Microsoft Excel
• What is Data Dictionary/Data Profiling and Why is it so
important?
• Power Query User Interface (UI) Tour
• Difference between CSV & Excel Format
• Cleaning an Online Retail Data
• Power Query Steps and Documentation

• Class - 8: Sales & Financial Report using Power Pivot,


Data Modelling & DAX

• Knowing What cannot be achieved with Pivot Table and


Calculated Field
• Power Pivot came as a Rescue; You can work with more
than 1 Million Records in Excel as well!
• Working with Multiple CSV/Excel File from different
Sources
• Difference between Fact and Dimension/Lookup Table
• Creating Custom Date Table for Connecting multiple Fact
Table
• Solving the Date Conversion Problem from US to Local
Format
• Star Schema vs. Snowflake Schema
• Data Modelling and Normalization in Power Pivot
• Creating DAX Measures to Create Custom Measures that
we can use in our Pivot Tables and Report
• KPIs for Sales and Finance: COGS, NET SALES, PROFIT,
PROFIT MARGIN, Sales vs. Target, ect.
• P&L (Profit & Loss) Statement for a Organization

• Class - 8: Providing Feedback on Submitted Excel Projects.

*** By the end of this Module, a student will have a clear


understanding of different dataset and how to approach
a Dataset and Business Problem; Root-cause Analysis;
Sales and Financial Reporting and Statistical Analysis.
They will also learn applying different Statistical Techniques
to Draw Inference/Conclusion of Data. By completing the
End-to-End Project on E-commerce/Sales & Public Health
Data they will enrich their Knowledge in this Domain and
would be able to Demonstrate their Skill Sets through
Developing Interactive and Dynamic Dashboard.
Business Intelligence with Power BI
Week 01

• Class - 1: Introduction to Power BI & Healthcare Analytics


(Hospital Management)

• Installing Power BI Desktop from Microsoft Store


• Power BI User Interface Tour
• Necessary Settings
• Power BI Desktop vs. Services
• Power BI License Guide depending on your Organization’s
Needs
• Exploring Healthcare Data
• Importing Data in Power BI: Excel vs. Cloud Source
Advantages
• Transformation Steps in Power Query
• How Calendar Tables are Created why this is Required for
Time Intelligence DAX?
• Data Source Settings in Power BI
• Using Custom Theme and Background in Power
• ETL/Data Transformation in Power Query
• Building Basic Visuals by Dragging and Dropping to
Canvas
• Calculated Column vs. DAX Measure
• Writing DAX Measure
• Implicit vs Explicit Measure Difference
• Class - 2: End-to-End Hospital Management Dashboard
Building
• Designing the KPIs
• Slicers and Filters; Explaining the Difference and Use
Cases
• YoY % Change DAX and Making it Dynamic with Year
Selection
• Patient Demographic Page
• Key Trend Page
• Treatment & Cost Page
• AI Visuals (Key Influencer Plot, Decomposition Tree, Q&A)
• Conditional Formatting Columns/Bars
• Cost and LOS Comparison
• Use of Field Parameters
• Edit Interaction to control the effects of Slicers
• Matrix Visual, Bar & Column Chart, Line Chart, Pie & Donut
Chart
• Adding Dynamic Title to the Charts
Week 02
• Class - 1: Creating Power BI Service Account Free &
Publishing our Project

• Giving an Aesthetic Look to your Dashboard before


Publishing it
• Creating a Power BI Service Account Free with GMAIL
• GMAIL vs. Business Email Advantages and Disadvantages
• Publishing our Project in Power BI Service
• Publishing to Web and How to Embed on a Website?
• How to generate a shareable Link of your work
• Admin Portal and Tenant Settings
• Activating Pro License free for 60 Days
• My Workspace vs. Other Workspace
• Schedule Refresh for Automating our Report with New
Data
• Best Practice for Licensing

• Class - 2: ETL & Data Modeling in Power BI

• Briefing the Adventure Works (E-commerce) Project


• Working with Multiple Files in Power BI
• ETL Process in Power Query
• Data Validation and Data Dictionary Concepts
• Data Modeling in Power BI
• Star vs. Snowflake Schema
• Active vs. Inactive Relationship
• How to Activate an Inactive Relationship into an Active
one: USERELATIONSHIP DAX
• How to Make the Data Model Better for Q&A Visual: Using
the AI to Generate Insights
Week 03
• Class - 1: E-commerce Project- Executive Summary &
Territory Analysis
• Designing Custom KPIs with Reference like PY & YoY %
Change
• Using Field Parameters to accommodate multiple KPIs in a
single chart
• Trend (Line Chart) & MoM change using Ribbon Chart
• Custom Shape Map Visual using a geo/topo json file
• Learning to write Time Intelligence Function
• Conditional Formatting

• Class - 2: E-commerce Project- Executive Summary &


Territory Analysis Completion

• Custom Tooltip Page Designing


• YTD, PY YTD, SAMEPERIODLASTYEAR
• MoM % Change, YoY % Change
• No. of Returns vs. Return Quantity; Return Rate (%)
• Top 10 Products by Sold Quantity using Visual Level Filter
• No. of Customers vs. No. of Customers Purchased
• TEXT KPI Card
• Forecasting with Power BI
• Edit Interaction in Slicers
• Profit Margin Calculation
• % of Revenue
Week 04
• Class - 1: E-commerce Project- Customer Analytics &
RFM Segmentation
• What is RFM (Recency, Frequency, Monetary)
Segmentation?
• Performing RFM Segmentation to find the Most Loyal
Customers, Churned Customers, Slipping Away Customers
to Design Customize Targeted Campaigns for different
Segments
• Using Field Parameters in the X/Y axis of the charts to
create Dynamic Charts

• Class - 2: E-commerce Project- Product Analysis & AI Visuals

• Drillthrough Filter in Power BI for Detailed Analysis of a


Product
• Gauge Chart in Power BI track the Progress towards
Target
• What If Analysis using Numeric Parameters in Power BI
• Use of Field Parameters in Charts
• AI Visuals- Key Influencers, Decomposition Tree
• Explaining the Next Portfolio Project
Week 05
• Class - 1: Advanced Features
• How to download and work with Custom Visuals
• Setting up Row Level Security (RLS)
• How to Schedule Refresh & Incremental Refresh Data in
Power BI

• Class - 2: End-to-End Cohort Analysis Project with Power BI


• Advanced Retail Customer Analytics [Cohort Analysis]
• Customer Retention & Churn Analysis
• Customer Lifetime Value
• Finding the New, Lost & Recovered Customer
• Advanced DAX for Cohort Analysis

Week 06
• Class - 1: PL-300 Exam Preparation

• What is the difference between Course Completion


Certifications and Vendor Certifications?
• What is the PL-300 Exam?
• Why could this Certification add Extra Value for the
people with no Experience?
• How to Prepare and Pass the Exam?
• Sharing the Resources that would ensure you pass the
Exam
*** There will be more projects on Power BI connecting with
Database after we learn all about Data Cleaning and Analysis
in SQL ***

*** By the end of this module, Students would be able to


become an asset to any organization that works with Data
as well as would be able to Pass PL-300 Exam. They would
learn all the best practices to automate static reporting
of a company that would save thousands of hours and
share the reports with different levels of stakeholders
properly; They would learn about the different Licensing
Options of Power BI and when to pick which ones with
practical examples.
Database Fundamentals &
Data Analysis with MySQL &
Snowflake SQL (Cloud Version)

Week 01
• Class - 1: Fundamentals of DBMS-I & Structured
Query Language (SQL)

• What is a Relational Database Management System and


SQL?
• ACID Property
• How is data stored in a relational database?
• Concept of Normalization
• Understanding the Normalization of a Database through a
Sales Database's ER (Entity Relationship) Diagram
• OLTP vs. OLAP
• For BI Solution, which one should you Choose?
• Database vs. Data Warehouse vs. Data Lake/Lakehouse
• What is a NoSQL Database and BASE Property?
• Difference between Relational and NoSQL Database
• Which one should you choose in which case?
• Class - 2: Fundamentals of DBMS-II & Designing a Relational
Database in MySQL
• Installing MySQL Locally.
• MySQL Workbench UI Tour.
• Concepts of Database, Schema, Table and Fields
• Designing and Creating a Relational Database
(E-commerce/Fashion Brand) from Scratch
• Types of SQL Commands: DDL, DML, DCL, TCL, DQL.
• What are the Commands under each of these categories
and what does each of these commands do?
• As a Data Analyst & Data Scientist which Commands do
you only need to Master?
• Constraints: Primary Key, Foreign Key, Not Null, Unique,
Check, Default
• Data Types: DATE, DATETIME, VARCHAR, INT, NUMBER, FLOAT,
Auto Increment

Week 02
• Class - 1: Exploratory Data Analysis using SQL
• Most Used Queries: SELECT, FROM, WHERE, LIKE, ILIKE, IN,
DISTINCT, BETWEEN, GROUP BY, ORDER BY, LIMIT, ALIAS
• Aggregate Functions: COUNT, SUM, AVG, MIN, MAX
• Difference between WHERE and HAVING Clause
• Some built-in Functions: EXTRACT, DATE_PART, TO_DATE,
TO_CHAR, STR_TO_DATE
• CASTING, SUBSTRING, POSITION, COALESCE, NULLIF
• Class - 2: JOINING, UNION & CASE WHEN
• SQL Join: Left, Right, Inner & Full Join
• Be aware of Cross Join!
• UNION
• SQL Code Order of Execution
• Wide use of CASE WHEN Statement during data Cleaning,
Analysis & Feature Engineering

Week 03
• Class - 1: Subqueries and Common Table Expressions (CTEs)

• Subqueries in SELECT, FROM and WHERE Clause


• Common Table Expressions (CTE)
• Between Subqueries and CTE, which one is more efficient
and takes less time?
• Removing Duplicates

• Class - 2: Window Functions


• Window Functions, the most widely used SQL Commands
used by Data Analysts.
• Window Functions: Aggregate, Rank and Value Window
Functions
• RANK, DENSE_RANK, ROW_NUMBER: When to Choose
Which One?
• LEAD, LAG
• SUM, AVG, MIN, MAX, NTILE
• FRAME SPECIFICATION, WINDOW CHAINING
• YTD, MTD, YoY, MoM % Change using Window Functions
• ROLLING AVERAGE, RUNNING TOTAL
• REMOVING DUPLICATES USING WINDOW FUNCTIONS
Week 04
• Class - 1: RFM Segmentation
• Bulk Insert of CSV Data in SQL Database
• Segmenting Customers based on their Recency,
Frequency & Monetary value using SQL
• Using VIEW for more efficient coding/analysis
• Connecting the VIEW (Database) with Power BI for
Interactive Dashboard

• Class - 2: Cohort Analysis & Churn Rate Calculation


• Cohort Analysis using Retail Data
• Use case of CASE WHEN and PIVOTING
• Customer Lifetime Value, Retention, and Churn Rate
Calculation
• USER DEFINED FUNCTIONS, STORED PROCEDURES & TRIGGER
• Explaining End-to-end Project on Database Creation, Data
Insertion, Data Cleaning & Data Analytics Project Outline
and Instructions

*** By the end of this module, Students would be able to


confidently perform any sort of Data Analysis and Reporting
for different Departments in any Organizations. They would
be well-equipped to understand any complex queries;
validating data and help any Business by generating important
KPIs.
They would also be able to help Businesses by Performing
Complex Analyses like Cohort and RFM Segmentation. They
would understand the importance of VIEWS, Stored Procedures,
Triggers which would help them when they would be working
on BI (Power BI/Tableau, etc.) Tools and Databases as Data Source.
Python for Data Analysis &
ML Fundamentals
Week 01
• Class - 1: Basics of Python Programming Language
• Introduction to Python Programming Language
• Introduction to Google Colab & Jupyter Notebook for
Writing and Executing Python Codes
• Getting used to the User Interface (UI) of Jupyter
Notebook/Google Colab
• Python Data Types: Numeric, Strings, Boolean, List,
Dictionary, Tuple, Set, None
• Iterables & Mutability; List vs. Tuples.
• Assigning & Overwriting Variables; Naming Convention for
Variables
• Type Conversion
• Operators: Arithmetic & Logical/Comparison
• Numeric & String (Text) Functions
• String Indexing & Slicing
• String Methods
• F String
• Class - 2: Conditional Statement, User Defined Functions
& Iteration
• IF & ELIF Statements
• How to Define/Create a Function in Python?
• How to Create a Function with Parameter/Argument(s)?
Function within a Function
• Use Conditional Statements and Functions Together
• Using FOR & WHILE Loops
• Combining Conditional Statements & Loops
• Combining Conditional Statements, Functions & Loops
• How to Iterate over a Dictionaries

Week 02
• Class - 1: Data Wrangling with pandas and numpy
• Importing Data in Python using pandas
• Data Wrangling using pandas and numpy
• Visualization using matplotlib

• Class - 2: Visualization with matplotlib, seaborn & plotly


• Visualizing Data with matplotlib library
• Visualizing Data with seaborn library
• Generating Interactive Visualization with Plotly
Week 03
• Class - 1: Bank Churn Customer Project- Data Preparation
& EDA
• Importing and Exploring Data in Python
• Data Cleaning in Python
• Data Analysis in Python
• Preparing Data for Modelling

• Class - 2: Bank Churn Customer Project- Classification Model


• Machine Learning Fundamentals
• Train Test Split
• Classification Model (Logistic Regression & Random
Forest)
• Hyperparameter Tuning using RandomSearchCV and
GridSearchCV
• Feature Importance Plot
• Data Cleaning Code Optimization

Week 04
• Class - 1: A/B Testing in Python for Online Controlled
Experiments
• What is A/B Testing?
• What is it used for?
• A/B Testing Workflow
• Class - 2: Multiple Linear Regression Analysis
• Multiple Linear Regression Model
• Performing with Statistical Approach and ML Approach
• Diagnostic Checking
• Interpreting Coefficients and Test Results
• R-Squared, MAE, MSE, RMSE

Week 05
• Class - 1: Polynomial Regression & Regularization Methods
(Ridge and LASSO)
• Polynomial Regression
• Saving a ML Model for Future use using joblib
• Regularization Methods
• Ridge Regression
• LASSO

• Class - 2: Bank Customer Segmentation (KMeans Clustering)


• Behavioral Clustering using KMeans Clustering
• How to find the Optimum Number of Cluster
• Interpreting the Clustering Results
• Recommendation from the Clustering Results

*** By the end of this module, Students will have a fundamental


knowledge of Python Programming Language and all the
necessary Libraries used for Data Analysis and Machine Learning.
They would be able to analyze and Visualize any dataset
including Marketing Campaign Data.

They will have fundamental and business knowledge of ML


Algorithms and how their methodologies can impact
business-decision making. They would know how to present
their findings appropriately to their Stakeholders.
Statistics with R for Research

Week 01
• Class - 1: Basics of R Programming Language &
Data Wrangling
• Introduction to R Programming Language.
• Installing R & Rstudio.
• Basics of R Programming.
• Vector, Matrix, Factor, Data Frame, List.
• Selection of Elements in Vector, Matrix, Factor, Data
Frame, List.
• Nominal and Ordinal Categorical Variables/Factor.
• Importing Data in R.
• Data Wrangling using dplyr.
• Visualization using ggplot2.

• Class - 2: Survey Data Analysis using R & How to write a


Research Paper using Rmarkdown
• How to Analyze a Survey/Research Data using R
• What is causality and what is the difference between
Correlation and Regression Analysis?
• Performing Correlation and Regression Analysis; Also,
creating publication standard tables and charts.
• Interpreting the Results Properly
• Difference between Linear and Logistic Regression.
• Interpreting the results of Logistic Regression (Odds
Ratio).
• How to Write a Research Paper with Publication
Standards Plots and Tables.
*** Using the knowledge gained in this module, students
would be able to generate Reproducible Research Work which
would help them in their Higher Studies.
By learning the different steps of a Research Project and
various Important Statistical Methodologies students/
professionals would be largely benefitted in Higher Studies
and Development Projects.

Job Preparation
Week 01
• Class - 1:
• CV Making & How to Write a Cover Letter; Portfolio
Building.
• Interview Skill Development & How to effectively Apply for
a Position.

• Class - 2:
• Guidance on Freelancing Career,
• Mock Interview & Roadmap for Future Ahead.

You might also like