Unit 1 Process For Making Sense of Data - Copy-1

The document outlines the process of Exploratory Data Analysis, including data sourcing, preparation, analysis, and deployment. It emphasizes the importance of descriptive statistics and various data visualization techniques such as bar charts, histograms, and pie charts for understanding data distribution. Additionally, it discusses the types of variables and the significance of cleaning and transforming data for effective analysis.

Uploaded by

pranav19072005

VCET R 2021 21PCS02- Exploratory Data Analysis 2024

UNIT 1 – EXPLORING AND UNDERSTANDING DATA

• Introduction: Sources of data
• Process for making sense of data
• Describing data: Variable types
• Distribution of data
• Hypothesis test
• Preparing data tables: Cleaning the data
• Data type conversion
• Combining variables
• Unstructured data
• CO1: Make use of modern ICT tools to explore the data and its characteristics. [K3]

Process for Making Sense of Data

1. Problem definition and planning: The problem to be solved and the projected deliverables should be clearly defined and planned, and an appropriate team should be assembled to perform the analysis.

2. Data preparation: Prior to starting a data analysis or data mining project, the data should be collected, characterized, cleaned, transformed, and partitioned into an appropriate form for further processing.

3. Analysis: Based on the information from steps 1 and 2, appropriate data analysis and data mining techniques should be selected. These methods often need to be optimized to obtain the best results.

4. Deployment: The results from step 3 should be communicated and/or deployed to obtain the projected benefits identified at the start of the project.
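
As a rough illustration of step 2 (data preparation), the sketch below cleans, transforms, and partitions a tiny made-up table using only the Python standard library. The records, the field names (`age`, `city`), the helper names, and the two-thirds split are all assumptions invented for this example; they are not part of the course material.

```python
import random

# Hypothetical raw records; None marks a missing value.
raw_records = [
    {"age": "34", "city": " Chennai "},
    {"age": None, "city": "Madurai"},
    {"age": "51", "city": "chennai"},
    {"age": "27", "city": "Salem"},
]

def clean(records):
    """Cleaning: drop records whose age is missing."""
    return [r for r in records if r["age"] is not None]

def transform(records):
    """Transformation: convert age to int and normalise city spelling."""
    return [
        {"age": int(r["age"]), "city": r["city"].strip().title()}
        for r in records
    ]

def partition(records, first_fraction=2 / 3, seed=0):
    """Partitioning: shuffle a copy, then split it into two parts."""
    shuffled = records[:]
    random.Random(seed).shuffle(shuffled)
    cut = round(len(shuffled) * first_fraction)
    return shuffled[:cut], shuffled[cut:]

prepared = transform(clean(raw_records))
train_part, holdout_part = partition(prepared)
```

Dropping incomplete records is only one possible cleaning choice; filling in a substitute value is another, and which is appropriate depends on the problem defined in step 1.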

Describing Data

• Descriptive statistics aim to summarize a sample, rather than use the data to learn about the population that the sample is thought to represent.
• Descriptive statistics help us simplify large amounts of data in a sensible way.
• The data (variables) can be either quantitative or qualitative in nature.
• Measures commonly used to describe a data set are:
  • measures of central tendency (such as the mean, median, and mode), and
  • measures of variability (such as the range, variance, and standard deviation).
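
The measures of central tendency and variability named above can be computed with Python's standard `statistics` module. This is a minimal sketch; the `marks` list is a hypothetical sample made up for illustration.

```python
import statistics

# Hypothetical sample of eight exam marks.
marks = [62, 75, 75, 81, 90, 48, 75, 66]

# Measures of central tendency
mean = statistics.mean(marks)
median = statistics.median(marks)
mode = statistics.mode(marks)

# Measures of variability
value_range = max(marks) - min(marks)
variance = statistics.variance(marks)  # sample variance (divides by n - 1)
std_dev = statistics.stdev(marks)      # sample standard deviation

print(mean, median, mode, value_range)
```

`statistics.variance` and `statistics.stdev` treat the data as a sample; `statistics.pvariance` and `statistics.pstdev` are the population versions.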

Types of Variables

Summary Statistics
Distribution of Data

Visualization is an aid to understanding the distribution of data.

Bar Chart
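
A bar chart draws one bar per category, with the bar length showing how often that category occurs. The sketch below computes those frequencies and prints a text-only chart; the `departments` values are hypothetical, and a plotting library would normally draw the bars.

```python
from collections import Counter

# Hypothetical categorical observations.
departments = ["CSE", "ECE", "CSE", "IT", "CSE", "ECE"]

# Frequency of each category.
counts = Counter(departments)

# One bar per category, most frequent first; bar length equals the frequency.
bar_lines = [f"{name:<4}{'#' * n} ({n})" for name, n in counts.most_common()]
print("\n".join(bar_lines))
```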
Histogram
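
A histogram groups a numeric variable into adjacent intervals (bins) and shows how many observations fall into each. The sketch below counts values in equal-width bins; the `heights_cm` data, the starting edge of 150, and the bin width of 10 are assumptions chosen for illustration.

```python
# Hypothetical numeric measurements, all assumed to lie in [150, 180).
heights_cm = [151, 158, 160, 162, 165, 167, 168, 171, 174, 179]

low = 150        # left edge of the first bin (assumed)
bin_width = 10   # every bin covers 10 cm (assumed)
n_bins = 3

# Count how many values fall into each half-open bin [left, left + width).
bin_counts = [0] * n_bins
for h in heights_cm:
    bin_counts[(h - low) // bin_width] += 1

# Text-only rendering: one row per bin.
for i, c in enumerate(bin_counts):
    left = low + i * bin_width
    print(f"[{left}, {left + bin_width}) {'#' * c}")
```

Unlike a bar chart, the bins here are ordered numeric ranges, so the shape of the rows reflects the distribution of the variable.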
Pie Chart
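
A pie chart shows each category as a slice whose angle is proportional to its share of the total. The sketch below computes those shares and angles; the `responses` data is hypothetical.

```python
from collections import Counter

# Hypothetical survey responses.
responses = ["Yes", "No", "Yes", "Yes", "Undecided", "No", "Yes", "Yes"]

counts = Counter(responses)
total = sum(counts.values())

# Share of the whole (percent) and slice angle (degrees) for each category.
shares = {label: 100 * n / total for label, n in counts.items()}
angles = {label: 360 * n / total for label, n in counts.items()}

for label in counts:
    print(f"{label}: {shares[label]:.1f}% ({angles[label]:.0f} degrees)")
```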
Thank You