0% found this document useful (0 votes)

44 views12 pages

Wholesale Spending Analysis in Portugal

stats2

Uploaded by

Githendra Vishal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

44 views12 pages

Wholesale Spending Analysis in Portugal

stats2

Uploaded by

Githendra Vishal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

1.

Problem 1: Wholesale Customers Analysis

Problem Statement:
A wholesale distributor operating in different regions of Portugal has information on annual spending of
several items in their stores across different regions and channels. The data consists of 440 large retailers’
annual spending on 6 different varieties of products in 3 different regions (Lisbon, Oporto, Other) and across
different sales channel (Hotel, Retail).
Solution:
1.1 Use methods of descriptive statistics to summarize data. Which Region and which Channel seems to
spend more? Which Region and which Channel seems to spend less?
Dataset has below variables
1. 'Buyer/Spender'
2. 'Channel' (categorical variable)
3. 'Region' (categorical variable)
4. 'Fresh'
5. 'Milk'
6. 'Grocery'
7. 'Frozen'
8. 'Detergents_Paper'
9. 'Delicatessen'

Observations are 440 and variables are 9

Let’s check datatypes of each variables

Let’s check descriptive statistics

Let’s see if there are any null values present in dataset

Let’s try to answer questions now,
1.1 Use methods of descriptive statistics to summarize data. Which Region and which Channel seems to
spend more? Which Region and which Channel seems to spend less?
I understand there are total four different questions to answer,
1. Which Region seems to spend more? – ‘Other’

2. Which Channel seems to spend more? – ‘Retail’

3. Which Region seems to spend less? – ‘Lisbon’

4. Which Channel seems to spend less? - ‘Hotel’

1.2 There are 6 different varieties of items are considered. Do all varieties show similar behaviour across
Region and Channel?
Lets see how strong corelation present between variables, check this heatmap
1.3 based on a descriptive measure of variability, which item shows the most inconsistent behaviour? Which
items show the least inconsistent behaviour?
Hmm…. Let’s see scatterplot also
Pair plot,
1.4 Are there any outliers in the data?
Below items hold outliers
1.Fresh
2.Milk
3.Grocery
4.Frozen
5.Detergents_Paper
6.Delicatessen

1.5 based on this report, what are the recommendations?

Recommendation
It appears that Grocery and Detergents_Paper have the strongest correlation of the pairs. It also looks like
there is some correlation between Detergents_Paper and Milk, and Grocery and Milk. This confirms my
suspicion above that Grocery was correlated with some other features that would allow for its value to be
predicted with some degree of accuracy. All of the distributions appear to be skewed to the right, with more
points hovering closer to the origin and some larger points extending it to the right. The shape of the
distributions of Detergents_Paper, Grocery, and Milk are all quite similar.
Problem 2 - (Download Survey Data)
The Student News Service at Clear Mountain State University (CMSU) has decided to gather data about the
undergraduate students that attend CMSU. CMSU creates and distributes a survey of 14 questions and
receives responses from 62 undergraduates (stored in the Survey data set).
2.1. For this data, construct the following contingency tables (Keep Gender as row variable)
2.1.1. Gender and Major

2.1.2. Gender and Grad Intention

2.1.3. Gender and Employment

2.1.4. Gender and Computer

2.2. Assume that the sample is representative of the population of CMSU. Based on the data, answer the
following question:
2.2.1. What is the probability that a randomly selected CMSU student will be male?
Probability that a randomly selected CMSU student will be male is 46.77 %
2.2.2. What is the probability that a randomly selected CMSU student will be female?
Probability that a randomly selected CMSU student will be Female is 53.23 %
2.3. Assume that the sample is representative of the population of CMSU. Based on the data, answer the
following question:
2.3.1. Find the conditional probability of different majors among the male students in CMSU.

2.3.2 Find the conditional probability of different majors among the female students of CMSU.

2.4. Assume that the sample is a representative of the population of CMSU. Based on the data, answer the
following question:
2.4.1. Find the probability That a randomly chosen student is a male and intends to graduate.
Probability That a randomly chosen student is a male and intends to graduate is 58.62
2.4.2 Find the probability that a randomly selected student is a female and does NOT have a laptop.
The probability that a randomly selected student is a female and does NOT have a laptop 12.12
2.5. Assume that the sample is representative of the population of CMSU. Based on the data, answer the
following question:
2.5.1. Find the probability that a randomly chosen student is either a male or has full-time
employment?
The probability that a randomly chosen student is either a male or has full-time employment is 14.32
2.5.2. Find the conditional probability that given a female student is randomly chosen, she is
majoring in international business or management.
The conditional probability that given a female student is randomly chosen, she is majoring in
international business or management is 24.0

2.6. Construct a contingency table of Gender and Intent to Graduate at 2 levels (Yes/No). The Undecided
students are not considered now and the table is a 2x2 table. Do you think the graduate intention and being
female are independent events?
No, they are not independent events.
2.7. Note that there are four numerical (continuous) variables in the data set, GPA, Salary, Spending, and
Text Messages.
Answer the following questions based on the data
2.6.1. If a student is chosen randomly, what is the probability that his/her GPA is less than 3?
If a student is chosen randomly, the probability that his/her GPA is less than 3 is 27.0
2.6.2. Find the conditional probability that a randomly selected male earns 50 or more. Find the conditional
probability that a randomly selected female earns 50 or more.
The conditional probability that a randomly selected male earns 50 or more is 48.28
The conditional probability that a randomly selected female earns 50 or more is 54.55

2.8. Note that there are four numerical (continuous) variables in the data set, GPA, Salary, Spending, and
Text Messages. For each of them comment whether they follow a normal distribution. Write a note
summarizing your conclusions.

GPA and Salary seem follow normal distributions

Spending and Text-messages seem not to follow normal distribution. They seem right skew

Problem 3 -
An important quality characteristic used by the manufacturers of ABC asphalt shingles is the amount of
moisture the shingles contain when they are packaged. Customers may feel that they have purchased a
product lacking in quality if they find moisture and wet shingles inside the packaging. In some cases,
excessive moisture can cause the granules attached to the shingles for texture and colouring purposes to fall
off the shingles resulting in appearance problems. To monitor the amount of moisture present, the
company conducts moisture tests. A shingle is weighed and then dried. The shingle is then reweighed, and
based on the amount of moisture taken out of the product, the pounds of moisture per 100 square feet is
calculated. The company would like to show that the mean moisture content is less than 0.35 pound per
100 square feet.
The file (A & B shingles.csv) includes 36 measurements (in pounds per 100 square feet) for A shingles and 31
for B shingles.
3.1 Do you think there is evidence that means moisture contents in both types of shingles are within the
permissible limits? State your conclusions clearly showing all steps.
1. Define null and alternative hypotheses
H0 = mean moisture content is not equal to 0.35 pound per 100 square feet
H1 = mean moisture content is less than 0.35 pound per 100 square feet
2. Decide the significance level
Here we select 𝛼 = 0.05 and the population standard deviation is not known
3. Identify the test statistic
We have two samples and we do not know the population standard deviation.
Sample sizes for both samples are not same. n1=36 n1=31
We use two sample t-test.
4. Calculate the p - value and test statistic
tstat 0.845
p-value for one-tail: 0.2025369351827172
5. Decide to reject or accept null hypothesis
Paired two-sample t-test p-value= 0.2025369351827172
We do not have enough evidence to reject the null hypothesis in favour of alternative hypothesis
We need to accept alternate hypothesis "mean moisture content is less than 0.35 pound per 100
square feet"
3.2 Do you think that the population mean for shingles A and B are equal? Form the hypothesis and conduct
the test of the hypothesis. What assumption do you need to check before the test for equality of means is
performed?
1. Define null and alternative hypotheses
H0: 𝜇𝐴 - 𝜇𝐵 ≠ 0
HA: 𝜇𝐴 - 𝜇𝐵 = 0
2. Decide the significance level
Here we select 𝛼 = 0.05 and the population standard deviation is not known
3. Identify the test statistic
We have two samples and we do not know the population standard deviation.
Sample sizes for both samples are not same. n1=36 n1=31
We use two sample t-test.
4. Calculate the p - value and test statistic
tstat 0.985249977839441
P Value 0.3284577916404776
5. Decide to reject or accept null hypothesis
We do not have enough evidence to reject the null hypothesis in favour of alternative hypothesis

Statistical Methods For Decision Making (SMDM) Project Report
100% (2)
Statistical Methods For Decision Making (SMDM) Project Report
22 pages
Nitin - Bilaye 05 Nov 2021
No ratings yet
Nitin - Bilaye 05 Nov 2021
10 pages
Arnab Chowdhury As1
No ratings yet
Arnab Chowdhury As1
12 pages
SMDM PROJECT REPORT Kriti
No ratings yet
SMDM PROJECT REPORT Kriti
6 pages
Project On Statistical Methods For Decision Making: by Ameya Udapure
No ratings yet
Project On Statistical Methods For Decision Making: by Ameya Udapure
32 pages
CMSU Student Survey Analysis Report
No ratings yet
CMSU Student Survey Analysis Report
26 pages
Problem 1
No ratings yet
Problem 1
5 pages
SMDM Project Instructions & Analysis
50% (2)
SMDM Project Instructions & Analysis
5 pages
Problem 1 - (Download Data) : Importing Nessceary Libraries
No ratings yet
Problem 1 - (Download Data) : Importing Nessceary Libraries
16 pages
Business Report SMDM
No ratings yet
Business Report SMDM
22 pages
CMSU Student Survey Analysis
No ratings yet
CMSU Student Survey Analysis
16 pages
2743021a949b2be20a570e94ff11f796 (1)
No ratings yet
2743021a949b2be20a570e94ff11f796 (1)
17 pages
Statistical Analysis by Kundan Sinha
0% (1)
Statistical Analysis by Kundan Sinha
4 pages
CMSU Student Survey Data Analysis
No ratings yet
CMSU Student Survey Data Analysis
16 pages
Akshaya SMDM Project Report
100% (1)
Akshaya SMDM Project Report
18 pages
Wholesale Customer Spending Analysis
No ratings yet
Wholesale Customer Spending Analysis
20 pages
SMDM Project SAMPLE REPORT
0% (2)
SMDM Project SAMPLE REPORT
7 pages
SMDM Project
0% (1)
SMDM Project
22 pages
SMDM Project Report: Data Analysis Insights
100% (1)
SMDM Project Report: Data Analysis Insights
15 pages
IS Extended Project Sri
No ratings yet
IS Extended Project Sri
7 pages
Spending Analysis of Retail Channels in Portugal
No ratings yet
Spending Analysis of Retail Channels in Portugal
16 pages
SMDM Project
100% (1)
SMDM Project
22 pages
Wholesale Customer Analysis & CMSU Survey
100% (1)
Wholesale Customer Analysis & CMSU Survey
19 pages
SMDM Project SAMPLE REPORT
No ratings yet
SMDM Project SAMPLE REPORT
7 pages
Annual Spending Analysis of Retailers in Portugal
No ratings yet
Annual Spending Analysis of Retailers in Portugal
12 pages
PDS Project SAMPLE REPORT
No ratings yet
PDS Project SAMPLE REPORT
7 pages
SMDM Project SAMPLE REPORT
No ratings yet
SMDM Project SAMPLE REPORT
7 pages
SMDM Project Sample Report
No ratings yet
SMDM Project Sample Report
7 pages
Business Report Project - Sheetal - SMDM
100% (1)
Business Report Project - Sheetal - SMDM
20 pages
Wholesale Customer & CMSU Data Analysis
No ratings yet
Wholesale Customer & CMSU Data Analysis
16 pages
Problem Statement 1
100% (1)
Problem Statement 1
17 pages
SMDM Project Sample Report
No ratings yet
SMDM Project Sample Report
7 pages
SMDM Project Report
100% (1)
SMDM Project Report
9 pages
Ashishpk 12-09 21
No ratings yet
Ashishpk 12-09 21
21 pages
SMDM Project SAMPLE REPORT
No ratings yet
SMDM Project SAMPLE REPORT
7 pages
Wholesale & University Data Insights
100% (2)
Wholesale & University Data Insights
21 pages
Nov 2024 p2 (1 of 3) Stats (Last Supper)
No ratings yet
Nov 2024 p2 (1 of 3) Stats (Last Supper)
7 pages
SMDM Assignment: Problem 1
0% (1)
SMDM Assignment: Problem 1
16 pages
Final Exam Review: Test Scores Frequency
100% (1)
Final Exam Review: Test Scores Frequency
10 pages
SMDM Project Sample Report
No ratings yet
SMDM Project Sample Report
8 pages
Workshop 18th - 20th May 23
No ratings yet
Workshop 18th - 20th May 23
6 pages
A Review of Basic Statistical Concepts: Answers To Problems and Cases 1
No ratings yet
A Review of Basic Statistical Concepts: Answers To Problems and Cases 1
94 pages
Prob & Stat (1) - 1
No ratings yet
Prob & Stat (1) - 1
7 pages
Worksheet II From Ch5 - 8
No ratings yet
Worksheet II From Ch5 - 8
5 pages
Statistical Methods for Decision Making
No ratings yet
Statistical Methods for Decision Making
15 pages
Statistics & Numerical Methods Q&A
No ratings yet
Statistics & Numerical Methods Q&A
13 pages
Business Stats Exercise Guide
No ratings yet
Business Stats Exercise Guide
53 pages
Advanced Statistics Business Report CMSU
No ratings yet
Advanced Statistics Business Report CMSU
25 pages
Wholesale Customer Data Analysis
100% (1)
Wholesale Customer Data Analysis
56 pages
Assighment Project 1
100% (3)
Assighment Project 1
18 pages
Google
No ratings yet
Google
170 pages
Probability and Statistics Practice Problems
No ratings yet
Probability and Statistics Practice Problems
10 pages
Worksheet Chapter 1-9
No ratings yet
Worksheet Chapter 1-9
7 pages
Dilla University: Page 1 of 6
100% (2)
Dilla University: Page 1 of 6
6 pages
Advanced Level Statistics Exam
No ratings yet
Advanced Level Statistics Exam
7 pages
Homework on Data Analysis Concepts
No ratings yet
Homework on Data Analysis Concepts
12 pages
Mas202 - 2022
No ratings yet
Mas202 - 2022
53 pages
1667750572final Assignment - ME06 Batch
No ratings yet
1667750572final Assignment - ME06 Batch
8 pages
Amanuel Project
No ratings yet
Amanuel Project
35 pages
Manual Xlstatpro
No ratings yet
Manual Xlstatpro
230 pages
Tutorial 5
No ratings yet
Tutorial 5
12 pages
Two-Sample Tests of Hypothesis: Mcgraw-Hill/Irwin
No ratings yet
Two-Sample Tests of Hypothesis: Mcgraw-Hill/Irwin
15 pages
Statistical Association Tests
No ratings yet
Statistical Association Tests
8 pages
Statistic and PRO. B. E. (Civil IV - Computer - E&C VI)
No ratings yet
Statistic and PRO. B. E. (Civil IV - Computer - E&C VI)
0 pages
Problem Solving A Statisticians Guide PDF
No ratings yet
Problem Solving A Statisticians Guide PDF
274 pages
Statistical Inference: Parametric vs Nonparametric Methods
No ratings yet
Statistical Inference: Parametric vs Nonparametric Methods
215 pages
Just in Time in Indian Context
No ratings yet
Just in Time in Indian Context
8 pages
Z-Test For 1 Sample Means
No ratings yet
Z-Test For 1 Sample Means
32 pages
ch15 PDF
No ratings yet
ch15 PDF
66 pages
Unit 3 DS
No ratings yet
Unit 3 DS
16 pages
Watts Rousseau Slippery Elm
No ratings yet
Watts Rousseau Slippery Elm
8 pages
Hypothesis Testing Cheat Sheet
No ratings yet
Hypothesis Testing Cheat Sheet
2 pages
Chi Square Test
No ratings yet
Chi Square Test
7 pages
Applied Statistics and Probability For Engineers
No ratings yet
Applied Statistics and Probability For Engineers
3 pages
DLL Stat 5th Week For COT
100% (1)
DLL Stat 5th Week For COT
5 pages
Research Methods
No ratings yet
Research Methods
250 pages
Indian Institute of Management Kashipur: Business Statistics, Term I, Academic Year 2021-2022 Syllabus
No ratings yet
Indian Institute of Management Kashipur: Business Statistics, Term I, Academic Year 2021-2022 Syllabus
8 pages
Statistical Tests in Data Analytics
No ratings yet
Statistical Tests in Data Analytics
23 pages
Humss A Pre and Post Test
No ratings yet
Humss A Pre and Post Test
14 pages
Statistical Data Analysis Overview
No ratings yet
Statistical Data Analysis Overview
53 pages
Advertising Appeal and Tone - Implications For Creative Strategy in TV Commercials
No ratings yet
Advertising Appeal and Tone - Implications For Creative Strategy in TV Commercials
16 pages
Alexithymia Questionnaire for Children
100% (1)
Alexithymia Questionnaire for Children
11 pages
ANOVA Test - Definition, Types, Examples
No ratings yet
ANOVA Test - Definition, Types, Examples
8 pages
Importance of Statistics in Various Fields
No ratings yet
Importance of Statistics in Various Fields
48 pages
Banking Behavior of Islamic Bank Custome
No ratings yet
Banking Behavior of Islamic Bank Custome
19 pages
Research Methods For Management: Dr.R.Prabhu
No ratings yet
Research Methods For Management: Dr.R.Prabhu
75 pages
Principles and Practice of Criminal Is Tics
88% (8)
Principles and Practice of Criminal Is Tics
393 pages
How To Write An Audit Report
100% (1)
How To Write An Audit Report
5 pages

Wholesale Spending Analysis in Portugal

Uploaded by

Wholesale Spending Analysis in Portugal

Uploaded by

1.

Problem 1: Wholesale Customers Analysis

Observations are 440 and variables are 9

Let’s check descriptive statistics

Let’s see if there are any null values present in dataset

2. Which Channel seems to spend more? – ‘Retail’

3. Which Region seems to spend less? – ‘Lisbon’

4. Which Channel seems to spend less? - ‘Hotel’

1.5 based on this report, what are the recommendations?

2.1.2. Gender and Grad Intention

2.1.3. Gender and Employment

2.1.4. Gender and Computer

GPA and Salary seem follow normal distributions

You might also like