0% found this document useful (0 votes)

12 views2 pages

BA End Term Q1

The document details an Exploratory Data Analysis (EDA) on a dataset of 100 entries with five numerical columns related to customer demographics and behavior. Key findings include the absence of missing values, the age range of customers being predominantly between 25 and 45, and a churn rate of 40%. Visualizations of age, annual income, and spending score distributions are also provided, highlighting trends in customer characteristics.

Uploaded by

vaisurithi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views2 pages

BA End Term Q1

Uploaded by

vaisurithi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

Question 1:

Exploratory Data Analysis (EDA) on dataset, checking out the missing values, computing basic
statistics and visualizing the distributions. We will start by loading the dataset and summarizing it.

The dataset consists of 100 entries with 5 numerical columns:

● Customer ID (Unique identifier)

● Age

● Annual Income

● Spending Score

● Churn (1 = Churned, 0 = Not Churned)

There are no missing values.

All columns are in integer type.

Statistics for numerical data:

● Age: Ranges from 22 to 50.

Mean value = 34.1.

● Annual Income: Ranges from $20,000 to $100,000

Average = $53,000.

● Spending Score: Ranges from 30 to 90, average value is 64.5

● Churn Rate: 40% of customers churned.

Histogram Visualization of the distributions of Age, Annual Income and Spending Score

Main conclusions arrived:

● Age: Mostly between 25 and 45 and 30-35 age is at slightly high.

● Annual Income: Distributed widely from $30,000 to $70,000.

● Spending Score: Most customers have a spending score between 50 and 80.

Step 1: Load the Dataset and Perform EDA

CopyEdit

# Load necessary libraries

library(ggplot2)

library(dplyr)

library(cluster)

library(caret)

# Read the dataset

df <- read.csv("Customer_Segmentation_and_Churn.csv")

# View basic information

str(df)

summary(df)

# Check for missing values

colSums(is.na(df))

# Visualize distributions

ggplot(df, aes(x=Age)) + geom_histogram(binwidth=5, fill="blue", color="black") + ggtitle("Age

Distribution")

ggplot(df, aes(x=AnnualIncome)) + geom_histogram(binwidth=5000, fill="green", color="black") +

ggtitle("Annual Income Distribution")

ggplot(df, aes(x=SpendingScore)) + geom_histogram(binwidth=5, fill="red", color="black") +

ggtitle("Spending Score Distribution")

Part 7
No ratings yet
Part 7
26 pages
Experiment1 EDA Business Sales
No ratings yet
Experiment1 EDA Business Sales
2 pages
Exploratory Data Analysis
No ratings yet
Exploratory Data Analysis
19 pages
ch04 - DS Unit 3
No ratings yet
ch04 - DS Unit 3
64 pages
Exploratory Data Analysis
No ratings yet
Exploratory Data Analysis
17 pages
Unit 1
No ratings yet
Unit 1
23 pages
Data Science & EDA Essentials
No ratings yet
Data Science & EDA Essentials
151 pages
BI-LEc 3
No ratings yet
BI-LEc 3
24 pages
Exploratory Data Analysis Gam
No ratings yet
Exploratory Data Analysis Gam
10 pages
Document
No ratings yet
Document
21 pages
Exploratory Data Analysis - Tech
No ratings yet
Exploratory Data Analysis - Tech
6 pages
Data Science: EDA & Preprocessing Guide
No ratings yet
Data Science: EDA & Preprocessing Guide
16 pages
Exploratory Data Analysis in Data Science
No ratings yet
Exploratory Data Analysis in Data Science
31 pages
CCL Removed Merged
No ratings yet
CCL Removed Merged
9 pages
Unit 3
No ratings yet
Unit 3
47 pages
Lesson 5 Exploratory Data Analysis
No ratings yet
Lesson 5 Exploratory Data Analysis
10 pages
Exploratory Data Analysis
No ratings yet
Exploratory Data Analysis
10 pages
Unit3 Eda
No ratings yet
Unit3 Eda
13 pages
Tableau EDA Guide for Analysts
No ratings yet
Tableau EDA Guide for Analysts
16 pages
Telecom Customer Churn Project Report
50% (2)
Telecom Customer Churn Project Report
25 pages
AIDS C04-Session-22
No ratings yet
AIDS C04-Session-22
22 pages
Data Mining and Predictive Analytics, Second Edition, by Daniel Larose and Chantal Larose, John Wiley and Sons, Inc., 2015
No ratings yet
Data Mining and Predictive Analytics, Second Edition, by Daniel Larose and Chantal Larose, John Wiley and Sons, Inc., 2015
40 pages
Eda 1
No ratings yet
Eda 1
25 pages
22amh32 - Data Analytics and Data Science Unit I & Exploratory Data Analysis (Eda) 1. Exploratory Data Analysis (Eda)
No ratings yet
22amh32 - Data Analytics and Data Science Unit I & Exploratory Data Analysis (Eda) 1. Exploratory Data Analysis (Eda)
9 pages
EDA Summary Report
No ratings yet
EDA Summary Report
2 pages
FDS Unit 2
No ratings yet
FDS Unit 2
15 pages
Perform Exploratory Data Analysis
No ratings yet
Perform Exploratory Data Analysis
5 pages
What Is Exploratory Data Analysis
No ratings yet
What Is Exploratory Data Analysis
28 pages
Introduction To EDA: Exploratory Data Analysis (EDA) in Data Science
No ratings yet
Introduction To EDA: Exploratory Data Analysis (EDA) in Data Science
4 pages
Swaraj Project
No ratings yet
Swaraj Project
16 pages
Exploratory Data Analysis (EDA) For Banking and Finance: Unveiling Insights and Patterns
No ratings yet
Exploratory Data Analysis (EDA) For Banking and Finance: Unveiling Insights and Patterns
35 pages
Eda Sandhya
No ratings yet
Eda Sandhya
7 pages
03a EDA
No ratings yet
03a EDA
47 pages
Exploratory Data Analysis (EDA)
No ratings yet
Exploratory Data Analysis (EDA)
12 pages
Dev 1
No ratings yet
Dev 1
2 pages
Assignment 2 - Factor Hair
No ratings yet
Assignment 2 - Factor Hair
39 pages
Exploratory Data Analysis: Masters of Science
No ratings yet
Exploratory Data Analysis: Masters of Science
12 pages
What Is Exploratory Data Analysis?: Intuition
No ratings yet
What Is Exploratory Data Analysis?: Intuition
8 pages
Recall Bias
No ratings yet
Recall Bias
27 pages
What Is EDA in Data Science - Everything About Exploratory Data - by Aman Kharwal - Medium
No ratings yet
What Is EDA in Data Science - Everything About Exploratory Data - by Aman Kharwal - Medium
11 pages
IMPDAV
No ratings yet
IMPDAV
105 pages
EDA Techniques and Statistical Insights
No ratings yet
EDA Techniques and Statistical Insights
18 pages
Data Visualization Exam Guide
100% (1)
Data Visualization Exam Guide
4 pages
Master Exploratory Data Analysis For Fast Business Growth!2
No ratings yet
Master Exploratory Data Analysis For Fast Business Growth!2
21 pages
Exploratory Data Analysis of Heart Disease Dataset 1737826105
No ratings yet
Exploratory Data Analysis of Heart Disease Dataset 1737826105
50 pages
Wa0000.
No ratings yet
Wa0000.
15 pages
IoT Data Analytics Guide
No ratings yet
IoT Data Analytics Guide
70 pages
Exploratory Data Analysis
No ratings yet
Exploratory Data Analysis
4 pages
Internship Report 1 Internship Report 1
No ratings yet
Internship Report 1 Internship Report 1
24 pages
ChatGPT in Exploratory Data Analysis
No ratings yet
ChatGPT in Exploratory Data Analysis
6 pages
Exploratory Data Analysis Guide
No ratings yet
Exploratory Data Analysis Guide
6 pages
P23MBA547 Predictive Analytics
No ratings yet
P23MBA547 Predictive Analytics
133 pages
Unit 1 DXV
No ratings yet
Unit 1 DXV
28 pages
Exploratory Data Analysis
No ratings yet
Exploratory Data Analysis
13 pages
Systematic Approach To Perform Task Centric Exploratory Data Analysis With Case Study
No ratings yet
Systematic Approach To Perform Task Centric Exploratory Data Analysis With Case Study
8 pages
EDA Cheat Sheet - Supercharge Your Data Analysis!
No ratings yet
EDA Cheat Sheet - Supercharge Your Data Analysis!
2 pages

BA End Term Q1

Uploaded by

BA End Term Q1

Uploaded by

Question 1:

The dataset consists of 100 entries with 5 numerical columns:

● Customer ID (Unique identifier)

● Churn (1 = Churned, 0 = Not Churned)

There are no missing values.

All columns are in integer type.

Statistics for numerical data:

● Age: Ranges from 22 to 50.

● Annual Income: Ranges from $20,000 to $100,000

● Spending Score: Ranges from 30 to 90, average value is 64.5

● Churn Rate: 40% of customers churned.

Main conclusions arrived:

● Age: Mostly between 25 and 45 and 30-35 age is at slightly high.

● Annual Income: Distributed widely from $30,000 to $70,000.

Step 1: Load the Dataset and Perform EDA

# Load necessary libraries

# Read the dataset

# View basic information

# Check for missing values

ggplot(df, aes(x=Age)) + geom_histogram(binwidth=5, fill="blue", color="black") + ggtitle("Age

ggplot(df, aes(x=AnnualIncome)) + geom_histogram(binwidth=5000, fill="green", color="black") +

ggplot(df, aes(x=SpendingScore)) + geom_histogram(binwidth=5, fill="red", color="black") +

You might also like