0% found this document useful (0 votes)
39 views51 pages

Data Analyst Compressed

Propaathshala offers a comprehensive 130-day Data Analyst course designed to equip learners with essential skills in data analysis, including Python programming, MySQL, and Excel. The curriculum covers various topics such as data structures, data visualization, and advanced data manipulation techniques, along with hands-on projects for real-world application. The training aims to bridge the gap between theory and practice, preparing students for success in the IT industry.

Uploaded by

Akhil Gaur
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
39 views51 pages

Data Analyst Compressed

Propaathshala offers a comprehensive 130-day Data Analyst course designed to equip learners with essential skills in data analysis, including Python programming, MySQL, and Excel. The curriculum covers various topics such as data structures, data visualization, and advanced data manipulation techniques, along with hands-on projects for real-world application. The training aims to bridge the gap between theory and practice, preparing students for success in the IT industry.

Uploaded by

Akhil Gaur
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 51

Course Content

DATA
ANALYST

130-days Module
Prepared By
Propaathshala

+91 - 9462782567, 9462392567


www.propaathshala.in
[email protected]
About Pro paathshala
Welcome to Pro paathshala, your gateway to an innovative and
transformative IT training experience! In today's fast-paced and
technology-driven world, the demand for skilled professionals
in the IT industry is constantly growing. To stay ahead of the
curve, individuals and businesses need reliable and cutting-
edge training solutions that empower them to harness the full
potential of technology.

Pro paathshala is an IT training company that understands the


evolving needs of the digital age. Our mission is simple yet
powerful: to provide accessible, comprehensive, and high-
quality training programs that equip learners with the skills
necessary to thrive in the IT landscape.

Pro paathshala is your ultimate destination for comprehensive


and cutting-edge training in the field of training. At Pro
paathshala, we are committed to providing top-notch coaching
and guidance to help you excel in various domains, including
Python, Java, AI, ML, Full Stack Web Development, Full-Stack
App Development, and BI tools.
Course Overview

Our training programs are designed to bridge the gap between


theory and practice, allowing you to gain real-world experience by
working on industry-relevant projects. This experiential approach not
only builds confidence but also prepares you to tackle real-life
challenges from day one.

The Data Analyst Fundamentals course offered by Pro Paathshala


is designed to equip aspiring professionals with the essential skills
and knowledge required to excel in the field of data analysis. This
comprehensive course provides a solid foundation in data
analytics techniques, tools, and best practices, empowering
students to transform raw data into meaningful insights.
.
An Introduction to python

What is Python?
WHY Python?
Features of Python.
Python Use In Other Technology.

Python Software
Python Distributions
Download &Python Installation Process in Windows, Unix,
Linux and Mac
Online Python IDLE
Python Real-time IDEs like Jupyter Note Book, Visual
Studio Code, ATOM, ETC.
Python Language Fundamentals

Python Implementation Alternatives


Keywords
Identifiers
Constants / Literals
Data types
Python VS JAVA
Python Syntax

Modes of Python

Interactive Mode
Scripting Mode
Programming Elements
Structure of Python program
First Python Application
Comments in Python
Python file extensions
Python Variables

bytes Data Type


byte array
String Formatting in Python
Math, Random, Secrets Modules
Introduction
Initialization of variables
Local variables
Global variables ‘
global’ keyword
Input and Output operations
Data conversion functions – int(), float(), complex(),
str(), chr(), ord()
Python operators

Arithmetic Operators
Comparison Operators
Python Assignment Operators
Logical Operators
Bitwise Operators
Shift operators
Membership Operators
Identity Operators
Ternary Operator
precedence Operator
Difference between “is” vs “==”
Input and Output operators
Print
Input
Command-line arguments

Control Statements
Conditional control statements
If
If-else
If-elif-else
Nested-if
Loop control statements
for
while
Nested loops
Branching statements
Break
Continue
Pass
Return
Case studies
Data Structures or Collections

Introduction
Importance of Data structures
Applications of Data structures
Types of Collections
Sequence
Strings, List, Tuple, range
Non sequence
Set, Frozen set, Dictionary
Strings
What is string
Representation of Strings
Processing elements using indexing
Processing elements using Iterators
Manipulation of String using Indexing and Slicing
String operators
Methods of String object
String Formatting
String functions
String Immutability
Case studies
List Collection

What is List
Need of List collection
Different ways of creating List
List comprehension
List indices
Processing elements of List through Indexing and
Slicing
List object methods
List is Mutable
Mutable and Immutable elements of List
Nested Lists
List_of_lists
Hardcopy, shallowCopy and DeepCopy
zip() in Python
How to unzip?
Python Arrays:
Case studies
Tuple Collection
What is tuple?
Different ways of creating Tuple
Method of Tuple object
Tuple is Immutable
Mutable and Immutable elements of Tuple
Process tuple through Indexing and Slicing
List v/s Tuple
Case studies

Set Collection
What is set?
Different ways of creating a set
Difference between list and set
Iteration Over Sets
Accessing elements of a set
Python Set Methods
Python Set Operations
Union of sets
functions and methods of set
Python Frozen set
Difference between set and frozenset?
Case study
Dictionary Collection

What is dictionary?
Difference between list, set and dictionary
How to create a dictionary?
PYTHON HASHING?
Accessing values of dictionary
Python Dictionary Methods
Copying dictionary
Updating Dictionary
Reading keys from Dictionary
Reading values from Dictionary
Reading items from Dictionary
Delete Keys from the dictionary
Sorting the Dictionary
Python Dictionary Functions and methods
Dictionary comprehension
Functions
What is Function?
Advantages of functions
Syntax and Writing function
Calling or Invoking a function
Classification of Functions :
No arguments and No return values
With arguments and No return values
With arguments and With return values
No arguments and With return values
Recursion
Python argument type functions :
Default argument functions
Required(Positional) arguments function
Keyword arguments function
Variable arguments functions
‘pass’ keyword in functions
Lambda functions/Anonymous functions
map()
filter()
reduce()
Nested functions
Non local variables, global variables
Closures
Decorators
Generators
Iterators
PYTHON
MODULES
Introduction to Python for Data
Analysis

Overview of Python and its libraries for data analysis


Installing Python and required libraries (Numpy,
Pandas, Matplotlib, Plotly)
Basic Python programming concepts and syntax

Numpy Fundamentals

Introduction to Numpy arrays and their advantages


Creating and manipulating Numpy arrays
Array indexing and slicing
Numpy mathematical functions and operations
Broadcasting and vectorization
Pandas Basics

Introduction to Pandas and its key data structures


(Series, DataFrame)
Loading data into Pandas DataFrames from different
sources (CSV, Excel, databases)
Data cleaning and preprocessing with Pandas
Data manipulation and transformation using Pandas
Handling missing data and duplicates
Merging, joining, and reshaping DataFrames

Exploratory Data Analysis with Pandas

Data exploration and descriptive statistics using Pandas


Data visualization with Matplotlib and Seaborn
Grouping, aggregation, and pivoting in Pandas
Working with dates and times in Pandas
Statistical analysis and hypothesis testing using Pandas
Advanced Data Visualization with
Matplotlib

Customizing plots and figures in Matplotlib


Plotting different types of charts (line, bar, scatter,
histogram, etc.)
Multiple subplots and layouts
Adding annotations and labels to plots
Plotting time series data

Interactive Data Visualization with Plotly

Introduction to Plotly and its key features


Creating interactive plots with Plotly Express
Customizing Plotly visualizations
Adding interactivity and animations to plots
Plotly Dash for building interactive web-based
dashboards
Advanced Pandas Techniques

Advanced data manipulation and transformation with


Pandas
Working with large datasets and optimizing
performance
Handling categorical data and encoding
Text data processing using Pandas
Reshaping and pivoting complex data structures

Final Projects and Applications

Applying Numpy, Pandas, Matplotlib, and Plotly to real-


world datasets
Solving data analysis problems using the learned
libraries
Exploring additional topics such as time series analysis,
machine learning integration, etc.
Building a complete data analysis project from start to
finish
Presenting and sharing project findings
My SQL
Introduction to Databases and MySQL

What is a database?
Introduction to MySQL
Installing MySQL
Creating a database
Basic SQL queries: SELECT, INSERT, UPDATE,
DELETE

Managing Data with MySQL

Creating tables and defining columns


Data types in MySQL
Inserting data into tables
Retrieving data with SELECT
Sorting and filtering data
Querying Data with MySQL

Using WHERE clause


Combining conditions with logical operators
Sorting query results
Limiting and paging data
Working with NULL values

Modifying Data with MySQL

Updating existing data


Deleting data from tables
Advanced UPDATE and DELETE queries
Using transactions for data integrity

Working with Multiple Tables

Relational databases and table relationships


Primary keys and foreign keys
JOIN statements
Inner, left, right, and full joins
Subqueries and nested queries
Database Design and Normalization

Introduction to database design principles


Normalization and its benefits
First, second, and third normal forms
Denormalization and its trade-offs

Advanced Querying Techniques

Aggregate functions: COUNT, SUM, AVG, MIN, MAX


Grouping data with GROUP BY
Filtering grouped data with HAVING
Advanced JOIN operations: self-joins, outer joins
Using aliases and expressions

Advanced Data Manipulation

INSERT...SELECT statement
Updating data with JOIN
Deleting data with JOIN
Merging data with UNION and UNION ALL
Working with views and stored procedures
Indexing and Performance Optimization

Understanding indexes and their types


Creating and managing indexes
Query optimization techniques
Analyzing query execution plans
Using EXPLAIN statement

Data Constraints and Integrity

Enforcing data constraints: NOT NULL, UNIQUE,


PRIMARY KEY
Working with foreign key constraints
Cascading actions: ON DELETE and ON UPDATE
Defining and using check constraints
Handling data integrity issues
String Functions and Patterns

Working with string data


Commonly used string functions
Pattern matching with LIKE and REGEXP
Regular expressions in MySQL
Using string functions in queries

Date and Time Functions

Working with date and time data types


Date and time functions in MySQL
Formatting dates and times
Date calculations and comparisons
Time zone considerations
Stored Procedures and Functions

Introduction to stored procedures


Creating and executing stored procedures
Parameters and variables in procedures
Conditional logic in procedures
Returning result sets from procedures

Triggers and Events

Introduction to triggers
Creating and managing triggers
Trigger syntax and execution
Event scheduling with MySQL
Automating database tasks with events
Security and User Management

Managing MySQL users and privileges


Creating and granting privileges
Revoking and removing privileges
Security best practices
Auditing and logging in MySQL

Backup and Recovery

Backup strategies and options


mysqldump and other backup tools
Restoring data from backups
Point-in-time recovery
Handling database failures
MySQL Performance Tuning

Identifying performance bottlenecks


Analyzing server and query performance
Optimizing query execution
Configuring MySQL server variables
Caching and buffer tuning

Full-Text Search

Introduction to full-text search


Full-text indexing and searching
Boolean search operators
Natural language search
Performance considerations
JSON Data in MySQL

Storing and retrieving JSON data


JSON data functions and operators
Indexing JSON columns
JSON path expressions
Working with nested JSON data

Working with Spatial Data

Introduction to spatial data types


Storing and indexing spatial data
Spatial functions and operators
Querying spatial data
Geospatial indexing techniques
MySQL and Web Applications

Integrating MySQL with web applications


Connecting to MySQL from different programming
languages
Executing queries from web applications
Securing database connections
Best practices for web application development

MySQL and Data Analytics

Introduction to data analytics with MySQL


Aggregating and summarizing data
Window functions and analytical queries
Advanced data analytics techniques
Working with large datasets
MySQL Replication

Understanding database replication


Setting up a MySQL replication environment
Configuring master and slave servers
Replication modes and topologies
Monitoring and managing replication

MySQL High Availability

High availability concepts and solutions


MySQL clustering and failover options
Setting up a MySQL cluster
Load balancing and failover strategies
Ensuring data consistency and reliability
MySQL and Cloud Platforms

Deploying MySQL on cloud platforms


Cloud database services and offerings
Managing MySQL in cloud environments
Backup and recovery in the cloud
Scaling and optimizing MySQL in the cloud

MySQL and NoSQL

Introduction to NoSQL databases


MySQL as a NoSQL database
Document storage and retrieval
Key-value storage with MySQL
Combining SQL and NoSQL approaches
MySQL Administration

MySQL server administration tasks


Managing user accounts and privileges
Monitoring server performance
Configuring server variables
Security and backup best practices

MySQL and Business Intelligence

Introduction to business intelligence


Data warehousing concepts
Extract, Transform, Load (ETL) processes
Building BI solutions with MySQL
Reporting and visualization tools
MySQL and Big Data

Big data concepts and challenges


Integrating MySQL with big data platforms
Working with Hadoop and Hive
Spark integration with MySQL
MySQL as a data source for analytics

MySQL Best Practices

Best practices for database design


Writing efficient queries
Optimizing server performance
Ensuring data integrity and security
Scalability and high availability considerations
Hands-on Projects and Case Studies

Applying MySQL knowledge to real-world scenarios


Designing and implementing database solutions
Optimizing performance and troubleshooting issues
Working on practical projects and exercises
EXCEL
Introduction to Databases and MySQL

What is a database?
Introduction to MySQL
Installing MySQL
Creating a database
Basic SQL queries: SELECT, INSERT, UPDATE,
DELETE

Managing Data with MySQL

Creating tables and defining columns


Data types in MySQL
Inserting data into tables
Retrieving data with SELECT
Sorting and filtering data
Introduction to Excel

Excel interface and workbook structure


Navigating worksheets and workbooks
Entering and editing data

Formatting Cells and Worksheets

Formatting numbers, text, and dates


Applying cell borders and background colors
Adjusting column width and row height

Basic Formulas and Functions

Using arithmetic operators in formulas


Applying basic functions: SUM, AVERAGE, MAX, MIN
Understanding cell references: relative and absolute
Working with Ranges and Tables

Selecting and managing ranges of cells


Sorting and filtering data in tables
Using basic Excel functions: COUNT, COUNTA,
COUNTIF

Data Visualization with Charts

Creating and customizing charts: column, line, pie


Adding titles, labels, and legends to charts
Formatting chart elements and axes

Conditional Formatting

Applying conditional formatting rules to highlight data


Using color scales, data bars, and icon sets
Managing conditional formatting rules
Data Validation and Protection

Setting data validation rules


Restricting data entry with drop-down lists
Protecting worksheets and workbooks with passwords

Working with Formulas

Using logical functions: IF, AND, OR


Lookup functions: VLOOKUP, HLOOKUP, INDEX,
MATCH
Text functions: CONCATENATE, LEFT, RIGHT, MID

Data Analysis Tools

Creating and using PivotTables


Grouping and filtering data in PivotTables
Analyzing data with PivotCharts
Advanced Charting Techniques

Creating dynamic and interactive charts


Using advanced chart types: scatter, radar, waterfall
Adding secondary axes and trendlines

Advanced Data Manipulation

Advanced sorting and filtering techniques


Using advanced filter options
Removing duplicates and data consolidation

Introduction to Macros

Recording and running macros


Assigning macros to buttons or keyboard shortcuts
Editing and managing macros
Advanced Formulas and Functions

Nested functions and complex formulas


Array formulas and functions
Date and time functions

Data Analysis with What-If Analysis

Goal Seek and Solver


Scenario Manager and Data Tables
Forecasting techniques

Data Import and External Connections

Importing data from external sources


Linking and updating data from other workbooks
Working with databases and SQL queries
Working with Text Data

Text functions: FIND, REPLACE, SUBSTITUTE


Splitting and combining text using formulas
Cleaning and transforming text data

Advanced Data Analysis Tools

Advanced filtering techniques: advanced criteria,


wildcard
Using database functions: DSUM, DAVERAGE, DGET
Advanced data analysis with Power Query

Advanced PivotTable Techniques

Calculated fields and items


Slicers and timelines
Grouping and customizing PivotTable layouts
Advanced Data Visualization

Creating interactive dashboards


Using form controls and dropdowns
Adding dynamic charts and sparklines

Data Analysis with Statistical Functions

Statistical functions: AVERAGEIF, COUNTIF, SUMIF


Correlation and covariance functions
Working with large datasets using Data Analysis
ToolPak

Advanced Formula Auditing and Error


Handling

Tracing precedents and dependents


Evaluating formulas and detecting errors
Using error-handling functions: IFERROR, ISERROR,
NA
Data Cleansing and Data Quality

Cleaning and removing duplicates in data


Text-to-columns and data splitting techniques
Managing and correcting common data errors

Advanced Data Import and


Transformation

Web scraping and data extraction


Combining and transforming data with Power Query
Advanced data transformation techniques

Data Visualization with Power BI

Introduction to Power BI and its features


Connecting to data sources and creating visualizations
Publishing and sharing reports and dashboards
Excel Tips and Tricks

Productivity shortcuts and time-saving techniques


Customizing the Excel environment
Excel add-ins and advanced customization options
POWER Bi
Introduction to Power BI
Overview of Power BI and its components
Understanding the Power BI ecosystem

Power BI Desktop
Installation and setup of Power BI Desktop
Importing data from various sources
Transforming and shaping data using Power Query
Editor
Creating and managing relationships between tables
Designing and formatting visuals using the report
canvas

Data Modeling in Power BI


Creating calculated columns and measures using DAX
(Data Analysis Expressions)
Understanding and utilizing DAX functions
Implementing basic calculations, aggregations, and
conditional logic
Data Visualization in Power BI
Choosing appropriate visuals for different data types
and insights
Applying formatting, colors, and themes to enhance
visual appeal
Utilizing drill-through and drill-down features
Incorporating custom visuals and third-party visuals

Power BI Data Sources and Querying


Connecting to various data sources (SQL Server,
Excel, SharePoint, etc.)
Querying data using Power Query and M language
Implementing advanced data transformations
Utilizing query parameters and functions

Power BI Service and Sharing


Publishing reports and dashboards to Power BI Service
Configuring datasets, gateways, and scheduled
refreshes
Collaborating and sharing content with colleagues
Setting up and managing security and access controls
Power BI Data Analysis and Modeling
Techniques

Advanced DAX calculations and techniques


Time intelligence functions for analyzing trends and
patterns
Implementing hierarchies, perspectives, and partitions
Applying advanced modeling concepts (e.g., role-
playing dimensions, advanced relationships)

Power BI Administration and


Governance

Managing workspaces, apps, and content packs


Implementing row-level security and security roles
Monitoring usage and performance of Power BI reports
Optimizing data models and query performance
Power BI Advanced Topics

Power BI Premium features and capabilities


Power Automate (formerly Microsoft Flow) integration
with Power BI
Power BI Embedded and embedding reports in external
applications
Exploring AI and machine learning capabilities in Power
BI (e.g., anomaly detection, clustering)
CALL TO REGISTER

+91-9462782567,
+91-9462392567

THANK
YOU

You might also like