Unit III Dev Data Exploration and Visualization

The document discusses univariate and bivariate data, focusing on descriptive statistics such as measures of central tendency (mean, median, mode) and measures of variability (range, variance, standard deviation). It provides examples and Python code for calculating these statistics. Additionally, it emphasizes the importance of visual representations like histograms and pie charts in analyzing single-variable data.

Uploaded by

kumaresan7751

UNIT III UNIVARIATE ANALYSIS

Introduction to Single Variable: Distributions and Variables – Numerical Summaries of Level and Spread – Scaling and Standardizing – Inequality – Smoothing Time Series.
1. Univariate data (Introduction to Single Variable)

Univariate data consists of observations on only one variable.

Suppose the heights of seven students in a class are recorded (figure). There is only one variable, height, and the analysis does not deal with any cause or relationship.

Patterns in this type of data are described using measures of central tendency
(mean, median and mode), measures of dispersion or spread (range, minimum,
maximum, quartiles, variance and standard deviation), and displays such as
frequency distribution tables, histograms, pie charts, frequency polygons and
bar charts.
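
The summaries listed above can be sketched in a few lines of Python. This is only an illustration: the seven heights are hypothetical values (the source figure is not reproduced here), and the quartiles use the default method of `statistics.quantiles`, which is one of several conventions.

```python
import statistics
from collections import Counter

# Hypothetical heights (cm) of seven students; assumed values, not from the source figure
heights = [164, 167, 170, 170, 172, 175, 180]

# Measures of central tendency
mean_height = statistics.mean(heights)
median_height = statistics.median(heights)
mode_height = statistics.mode(heights)

# Measures of spread
height_range = max(heights) - min(heights)
q1, q2, q3 = statistics.quantiles(heights, n=4)  # three cut points = quartiles

# Frequency distribution table: each distinct height mapped to its count
frequency = Counter(heights)

print("mean:", round(mean_height, 2))
print("median:", median_height)
print("mode:", mode_height)
print("range:", height_range)
print("quartiles:", q1, q2, q3)
for value in sorted(frequency):
    print(value, frequency[value])
```

The frequency table printed at the end is the same information a histogram or bar chart would display graphically.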
Bivariate data

Bivariate data involves two different variables.

The analysis of this type of data deals with causes and relationships; its aim is to find the relationship between the two variables.

An example of bivariate data is temperature and ice cream sales in the summer season.
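
One common way to quantify the relationship between two variables is the Pearson correlation coefficient. The sketch below is a minimal illustration, not a method prescribed by these notes: the temperature and sales figures are made up, and `pearson_r` is a hypothetical helper written out by hand so the arithmetic is visible.

```python
import math

# Hypothetical paired observations; values are assumed for illustration only
temperature_c = [20, 22, 25, 27, 30, 32, 35]
ice_cream_sales = [110, 125, 150, 170, 200, 215, 250]


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations, and sums of squared deviations
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


r = pearson_r(temperature_c, ice_cream_sales)
print("Pearson r:", round(r, 3))
```

A value of r close to +1 indicates that sales rise almost linearly with temperature in this made-up data; correlation alone does not establish a cause.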
2. Distributions and Variables

Types of Descriptive Statistics

● Measures of Central Tendency
● Measure of Variability
● Measures of Frequency Distribution

2.1 Measures of Central Tendency

A measure of central tendency represents the whole data set by a single value that indicates where the center of the data lies. There
are three main measures of central tendency:

● Mean
● Mode
● Median
Example Program (Mean, Median, Mode)
Mean: (To calculate the mean, find the sum of all values, and divide the sum by the number of values)
import numpy
speed = [99,86,87,88,111,86,103,87,94,78,77,85,86]
# numpy.mean returns the arithmetic mean of the values
x = numpy.mean(speed)
print(x)

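Median: (The median is the middle value after all the values have been sorted; with an even number of values, it is the average of the two middle values)
import numpy
speed = [99,86,87,88,111,86,103,87,94,78,77,85,86]
# numpy.median sorts the values internally and returns the middle one
x = numpy.median(speed)
print(x)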

Mode: (The mode is the value that appears the most times)
from scipy import stats
speed = [99,86,87,88,111,86,103,87,94,78,77,85,86]
# stats.mode returns the most frequent value together with its count
x = stats.mode(speed)
print(x)

2.2 Measure of Variability

● Range
● Variance
● Standard deviation
2.2.1 Range
The range is the difference between the largest and smallest data points in the data set. The
larger the range, the more spread out the data; the smaller the range, the less spread out.
Range = Largest data value – Smallest data value

Example Program:

arr = [1, 2, 3, 4, 5]
# The built-in max() and min() are enough here, so no import is needed
maximum = max(arr)
minimum = min(arr)
data_range = maximum - minimum
print('Range of your Data', data_range)
2.2.2 Variance

Variance is the average squared deviation from the mean. It is calculated by finding the
difference between every data point and the mean, squaring each difference, adding the squares,
and then dividing by the number of data points n (the population variance). When the data is a
sample, the sum is divided by n - 1 instead (the sample variance); this is what Python's
statistics.variance computes.
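
Written as formulas, with a worked example on the data 1, 2, 3, 4, 5 used in this section's example programs. The symbols ($\bar{x}$ for the mean, $\sigma^2$ for the population variance, $s^2$ for the sample variance) are standard notation assumed here, not taken from the source.

```latex
\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i ,
\qquad
\sigma^2 = \frac{1}{n}\sum_{i=1}^{n} (x_i - \bar{x})^2 ,
\qquad
s^2 = \frac{1}{n-1}\sum_{i=1}^{n} (x_i - \bar{x})^2

% Worked example for x = 1, 2, 3, 4, 5 (n = 5):
\bar{x} = \frac{1+2+3+4+5}{5} = 3 ,
\qquad
\sum_{i=1}^{5} (x_i - \bar{x})^2 = 4 + 1 + 0 + 1 + 4 = 10

\sigma^2 = \frac{10}{5} = 2 ,
\qquad
s^2 = \frac{10}{4} = 2.5
```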
Example Program:
import statistics
arr = [1, 2, 3, 4, 5]
# statistics.variance computes the sample variance (divides by n - 1)
print("Var =", statistics.variance(arr))
Output:
Var = 2.5

2.2.3 Standard Deviation

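The standard deviation is the square root of the variance, so it is expressed in the same units as the data.

Example Program:
import statistics
arr = [1, 2, 3, 4, 5]
# statistics.stdev computes the sample standard deviation; rounded to two decimals for display
print("Std =", round(statistics.stdev(arr), 2))
Output:
Std = 1.58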

How to calculate Variance:
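
The source's own steps under this heading did not survive extraction. As a stand-in, the calculation can be sketched step by step in Python, assuming the same data as the example programs; the variable names are illustrative, and the `statistics` module is used only to cross-check the hand computation.

```python
import math
import statistics

arr = [1, 2, 3, 4, 5]

# Step 1: find the mean
n = len(arr)
mean = sum(arr) / n

# Step 2: subtract the mean from each value and square the result
squared_deviations = [(x - mean) ** 2 for x in arr]

# Step 3: add up the squared deviations
sum_of_squares = sum(squared_deviations)

# Step 4: divide by n (population) or by n - 1 (sample)
population_variance = sum_of_squares / n
sample_variance = sum_of_squares / (n - 1)

# The standard deviation is the square root of the variance
sample_std = math.sqrt(sample_variance)

print("Population variance:", population_variance)
print("Sample variance:", sample_variance)
print("Sample standard deviation:", round(sample_std, 2))

# Cross-check against the standard library
print(statistics.pvariance(arr), statistics.variance(arr))
```

The sample variance matches the 2.5 reported by statistics.variance earlier in this unit, while dividing by n gives the smaller population value.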
