Name :- Prachi
Course:- bsc.psc
Roll no:- 224525
University roll no:- 22024582047
Index
Practical no Sheet
Practical 1 Practical 1
Practical 2 Practical 2
Practical 3 Practical 3
Practical 4 Practical 4
Practical 5 Practical 5
Practical-1
Presentation and analysis of data by frequency table, Descriptive statistics a
Histogram
ques1 The distance (in km) of 20 students from their residence to their college were found as follows:
5 3 10 20 25
11 5 7 12 31
19 10 12 17 18
11 32 17 5 2
(a) Find the measures of central tendency (i.e. mean, median, mode).
(b) Find the measures of variability/spread (i.e. variance, standard deviation).
(c) Construct a frequency distribution table with class size 5.
(d) Construct a Histogram.
STEPS:
1. Go to File > Options > Add-ins > Excel Add-ins > Go, check Analysis ToolPak, and click OK.
2. Go to the Data tab, click Data Analysis, select Descriptive Statistics, and click OK.
3. Select Input Range(DATA), and select Output Range.
4. Check Summary Statistics, click OK
Distance DESCRIPTIVE STATISTICS
2
3 Mean 13.6
5 Standard Error 1.947197717
5 Median 11.5
7 Mode 5
10 Standard Deviation 8.70813292
10 Sample Variance 75.83157895
11 Kurtosis -0.07417383
11 Skewness 0.750817606
12 Range 30
12 Minimum 2
15 Maximum 32
17 Sum 272
17 Count 20
18
19
20
25
31
32
sum= 282
(c) CLASS INTERVAL BIN
0-5 5
9-Jan 10
15-Oct 15
15 - 20 20
20 - 25 25
25 - 30 30
30 - 35 35
Bin
5
10
15
20
25
30
35
More
DOT BLOT QUESTIONS
QUES1 1 Given the data set: [2, 4, 4, 5, 5, 5, 6, 7, 8, 8, 9], create a dot plot to represent the distribution o
DATA FREQUENCY
2 1
4 1
4 2
5 1
5 2
5 3
6 1
7 1
8 1
8 2
9 1
2 The heights (in cm) of 10 students in a class are [150, 152, 153, 150, 151, 152, 153, 154, 155,
HEIGHT FREQUENCY
150 1
150 2
150 3
151 1
152 1
152 2
153 1
153 2
154 1
155 1
BOX PLOT QUESTIONS
STEPS:
1. Enter your data in a single column.
2. Calculate the Minimum, Maximum, and Median values and the LOWER
CALCULATE USING MIN,=MAX,=MEDIAN functions.
3. Next, calculate the differences between each phase. Copy the minimum
4. Select all the data from the "Differences" and click Insert > Insert Colum
5. Right-click on the chart and click Select Data. Click Switch Row/Colum
6. Select the bottom part of the column.
7. Click Format > Current Selection > Format Selection. On the Fill tab, in
8. For WHISKERS, Select the new lower part. Then Click Design > Add C
9. Select the Upper part. Then repeat the same steps>Positive Error(CELL
10. BOX PLOT is done.
QUE 1 Given the test scores [55, 60, 65, 70, 75, 80, 85, 90, 95, 100], create a box plot and identify the
SCORE
55 MIN 55
60 LOWER FOURT 65
65 MEDIAN 77.5
70 UPPER FOURT 90
75 MAXIMAUN 100
80 FOURTH SPRE 25
85
90 QUARTILE 66.25
95 3rd QUARTILE 88.75
100 INTERQUARTILE 22.5
2 The age of company department employees is [25, 30, 35, 40, 45, 50, 55, 60, 65, 70]. Constr
AGE
25 MIN 25
30 LOWER FOURT 35
35 MEDIAN 47.5
40 UPPER FOURT 60
45 MAXIMAUN 70
50 FOURTH SPRE 25
55
60
65
70
Comparative Box Plot Questions
STEPS:
1. Enter your given data sets in two columns.
2. Calculate the Minimum, Maximum, and Median values and the LOWER
CALCULATE MIN,MAX,MEDIAN using functions.
3. Next, calculate the differences between each phase for both sets. Copy th
4. Select all the data from the "Difference A & Difference B" and click Ins
5. Right-click on the chart and click Select Data. Click Switch Row/Colum
6. Select the bottom parts of the columns.
7. Click Format> Current Selection> Format Selection. On the Fill tab, in t
8. For WHISKERS, Select the new lower parts. Then Click Design > Add
9. Select the Upper parts. Then repeat the same steps>Positive Error(CELL
10. Customise the axes, The result should look like a box plot.
que1 1. The heights (in cm) of male and female students in a class are given below:
a. Males: [160, 162, 165, 168, 170, 172, 175, 178, 180, 182]
b. Females: [150, 152, 155, 158, 160, 162, 165, 168, 170, 172]
Construct a comparative box plot to analyze the height distribution for males and females.
MALE FEMALE
160 150 MIN
162 152 LOWER FOURTH
165 155 MEDIAN
168 160 UPPER FOURTH
170 162 MAXIMAUN
172 165 FOURTH SPREAD
175 168
178 170
180 172
criptive statistics and Histogram.
found as follows:
, and click OK.
(A) MEAN 13.6
MEDIAN 11.5
MODE 5
(B) VARIANCE 75.83158
STANDARD 8.708133
Bin Frequency
5 5
10 3
15 4
20 5
25 1
30 0
35 2
More 0
Frequency
5 Histogram
3
12
4 10
Frequency
5 8
6 Frequency
1
4
0 2
2 0
0 Bin
o represent the distribution of the data
0, 151, 152, 153, 154, 155, 150]. Construct a dot plot for this data.
alues and the LOWER and UPPER FOURTH from the data set.
e. Copy the minimum values from the last table there directly.
k Insert > Insert Column Chart > Stacked Column.
ck Switch Row/Column.Then, Click OK
ion. On the Fill tab, in the Formal panel, select No Fill. Repeat the same step with the upper part of the column
Click Design > Add Chart Element > Error Bars >MORE Error bar options> Direction to Minus> Custom> N
>Positive Error(CELL WITH DIFFERENCE BETWEEN MAXIMUM AND UPPER FOURTH).
e a box plot and identify the median, quartiles, and interquartile range.
difference
55
10
12.5
12.5
10
, 50, 55, 60, 65, 70]. Construct a box plot for this data.
difference
25
10
12.5
12.5
10
alues and the LOWER and UPPER FOURTH from the data sets.
e for both sets. Copy the minimum values from the last table there directly.
rence B" and click Insert > Insert Column Chart > Stacked Column.
ck Switch Row/Column. Click OK
on. On the Fill tab, in the Formal panel, select No Fill. Repeat the same step with the upper parts of the column
n Click Design > Add Chart Element > Error Bars >MORE Error bar options> Direction to Minus> Custom> N
>Positive Error(CELLS WITH DIFFERENCE BETWEEN MAXIMUM AND UPPER FOURTH).
box plot.
re given below:
n for males and females.
male female DIFF(MALEDIFF(FEMALE)
160 150 160 150
165 155 5 5
170 162 5 7
175 168 5 6
180 172 5 4
upper part of the column.
n to Minus> Custom> Negative Error (CELL WITH DIFFERENCE BETWEEN THE LOWER FOURTH & M
FOURTH).
pper parts of the columns.
on to Minus> Custom> Negative Error (CELLS WITH DIFFERENCE BETWEEN THE LOWER FOURTH &
R FOURTH).
LOWER FOURTH & MINIMUM)> OK. Then No Fill the new lower part.
HE LOWER FOURTH & MINIMUM)> OK. Then, No Fill the new lower part.
Practical-2
Binomial distribution
Ques A coin is tossed 6 times in each of 120 experiments. The number of heads in each experimen
No. of
0 1 2 3 4 5 6
Heads
Frequenc 5 9 18 35 28 15 10
y
Fit a binomial distribution and find the expected frequencies
Steps 1Create a table to organize vour data.
2. Create a coluan for possible outcomes (x) and given frequencies) in different columns, And calculate
their sums.
3, Calculate the Mean using ( 2(x.0/21 )•
4. Calculate Probability of Success (p) (Use p» Mean/n, where n" ))
5. Calculate Binomial Distribution using BLYON. DISTs, n, P. FALSE)
6. Calculate Expected Frequencies Multiply each binomial probability by the total frequency (N-100)
7Round the expected frequencies 6. check their alignment with observed values.
X F X*F Binomial diExpected frequency
0 5 0 0.008151 0.978141
1 9 9 0.060112 7.21341
2 18 36 0.184709 22.16505
3 35 105 0.302701 36.32414
4 28 112 0.279038 33.48456
5 15 75 0.137186 16.46238
6 10 60 0.028103 3.372324
mean 3.308333
n 6
p 0.551389
Poisson distribution
Ques The number of printing errors on pages of a book was recorded as follows:
No. of
0 1 2 3 4 5
Errors (x)
Frequenc
30 25 20 15 5 5
y
Fit a Poisson distribution to the data and compare observed vs. expected values.
Steps 1.Create a table to organize your data.
2.Create a column for possible outcomes (x) and given frequencies in differeot columes, And calculate th
Caleulato the Afana mains ( E(x.D/E().
Set Up Poinon Dutribution «POISSON DISTix ,00, FALSE).
Maltiply each Poison probability by the total frequener (N-20)
Round the expected frequencies to match the observed data for easier compurtion.
No. of erro(F) X*F Poissan disExpected dFinal frequency
0 30 0 0.212248 21.2248 21
1 25 25 0.328984 32.89844 33
2 20 40 0.254963 25.49629 25
3 15 45 0.131731 13.17308 13
4 5 20 0.051046 5.104569 5
5 5 25 0.015824 1.582416 1
100
Mean 1.55
Normal Distribution Curve
ques The following data gives the marks (out of 100) obtained by students in a class:
Marks
0-10 20-Oct 20-30 30-40 40-50 50-60 60-70
Range
Frequenc
2 5 10 20 30 20 10
y
Fit a normal distribution using the mean and standard deviation, and calculate expected frequencie
steps Cater vour data seto la two colans
2. Calculata the Aiaimum, Aaximum, and Sfedian saluas and the LOWER and UPPER FOURTH from to dat
USE -MIN, - MAX. -MEDIAN functions.
3. Next, calculate the differences between each phase for both set. Copy the minimum values from the l
I. Select all the data from the "Difference A Ge Difference B' and elick Insert › Insert Columa Chart > Stack
• Rizlu elick on the chart aad click Suloct Data. Click Switch llow/Column. Click oK
et the bortoms part of the coluan
Class intervFrequency X F X*F (X*F)*F
of heads in each experiment is recorded as follows:
rent columns, And calculate
e total frequency (N-100)
reot columes, And calculate their sums.
70-80
3
calculate expected frequencies.
d UPPER FOURTH from to data ata
he minimum values from the last table theco dinetly.
t › Insert Columa Chart > Stacked Colucan
Practical-3
Central Limit Theorem
Ques Demonstrate Central Limit Theorem through 15 X 15 data.
steps
1. Uer de RANDBETWEENO function to create a column of random numbere. Repeat thie procee (or use
2. For cach sample, use the -AVERAGEO function to calculate the smaple mean
3. Create a range of bin values covering the range of the sample means.
4. Use the Data Analysis ToolPak - listogram to plot the smple means.
5. Selecs Input Renge (mean row) and Bin Range, specifv Output Range, and heck Chart Output.
sample 1sample 2sample 3sample 4sample 5sample6 sample 7sample 8sample 9
52 16 25 21 34 43 8 7 12
84 36 4 63 58 34 17 34 48
62 15 36 96 20 17 38 43 88
56 17 35 10 4 45 99 61 79
82 59 30 27 4 37 65 16 88
14 27 55 68 11 29 11 52 10
9 24 94 17 71 19 59 35 54
18 47 61 1 16 31 46 20 90
87 54 38 3 17 98 10 60 35
36 73 79 95 16 13 94 13 55
39 41 8 44 33 48 34 59 92
19 56 85 64 77 56 19 58 93
78 14 98 1 97 16 33 31 43
54 79 50 33 3 47 41 95 19
80 91 10 13 12 58 59 51 100
mean 51.3333 43.2667 47.2 37.0667 31.5333 39.4 42.2 42.3333 60.4
Bin Frequency
0 0 Frequency
9-Jan 22 40
20 36
35
30 16
40 27 30
50 25 25
60 30
70 15 20
80 19 15
90 15
100 20 10
0 5
0
0 10 20 30 40 50 60 70 8
re. Repeat thie procee (or use formulas to automate it for more sumples.
d heck Chart Output.
sample 1sample 1sample 1sample 1sample 1sample 15
57 58 60 92 31 12
18 83 26 75 43 21
74 94 68 46 83 70
61 66 71 46 41 52
59 41 7 18 87 3
54 81 92 75 36 41
100 89 64 82 17 67
60 48 9 38 44 85
80 30 38 1 72 50
27 38 73 31 46 75
19 40 79 94 100 41
59 23 27 63 60 65
72 20 27 11 34 16
98 6 47 7 20 56
30 10 52 22 79 90
57.8667 48.4667 49.3333 46.7333 52.8667 49.6
Bin Frequency
Frequency 0 0
10 0
20 0
30 0
40 0
50 7
60 3
70 5
80 0
90 0
100 0
More 0
40 50 60 70 80 90 100
Histogram
8
6
Frequency
4
2 Freq
0
0 10 20 30 40 50 60 70 80 90 00 r e
1 Mo
Bin
stogram
Frequency
50 60 70 80 90 100 ore
M
Bin
Practical -4
Fitting of regression line using principle of least squa
Ques 1 The table below shows a product's price (in ₹) and the corresponding quantity demanded. U
1. Plot a scatter plot with Price on the X-axis and Demand on the Y-axis.
2. Add a linear trendline.
3. Display the trendline equation and the R² value on the chart.
4. Use the equation to predict the demand when the price is ₹55.
Steps:
1. Enter the given x and y values in two columns in Excel.
2. Highlight x and y values and insert a scatter plot from the lasert tab.
3. Rightclick on the pointe - Add Trendline - Select "Linear".
4. Check the option to display the equation and R' value on the chart.
Chart Title
Price Quantity demanded
100
20 95
30 85 90 f(x) = − 1.07714285714286 x + 116.63809523809
R² = 0.993920675066143
40 72 80
50 65 70
60 50 60
70 42
50
40
a= 116.64
b= -1.0771 30
To predict the demand when the price is 55 20
x=55 10
y=-1.0771*55+116.64 0
y=57.3 10 20 30 40 50 6
Ques 2 The table shows the monthly income and monthly expenditure of different households. Use
Draw a scatter diagram. Add a linear trendline and get the equation. Predict the expenditur
Income Expenditure Chart Title
30000 22000 40000
35000 26000
35000
40000 29000 f(x) = 0.58 x + 5200
45000 31000 30000 R² = 0.987089201877934
50000 34000 25000
20000
15000
a= 5200 10000
b= 0.58
5000
to predict the expenditure :
x=55,000 0
25000 30000 35000 40000 45000
y=0.58*55000+5200
y=0.58*55000+5200
of least squares
g quantity demanded. Use Excel to:
Chart Title
85714286 x + 116.638095238095
5066143
40 50 60 70 80
erent households. Use Excel to:
Predict the expenditure for a monthly income of ₹55,000.
Chart Title
5200
9201877934
00 40000 45000 50000 55000
Practical-5
Ques 1 A researcher collects data on the number of
hours students study per week and their corresponding test scores. The data fo
6 students is as follows:
Hours studied: 5, 10, 15, 20, 25, 30
Test scores: 50, 55, 60, 65, 70, 75
Calculate the correlation coefficient to determine the strength and direction of
steps: create a table between sets for given data
Select the table and go on insert , then plot the graph(scatter plot)
Click anywhere on the graph , go on trendline add the best fit line by seeing the r2 value
To determine mine correlation coefficient use CORREL function
Hours stuTest scores
5 50 Chart Title
10 55 80
15 60
70 f(x) = x + 45
20 65 R² = 1
25 70 60
30 75 50
40
Correlation 1
30
20
10
0
0 5 10 15 20 25
Ques 2 A company tracks the monthly advertising expenditure (in thousands
Advertising expenditure: 10, 20, 30, 40, 50
Sales revenue: 80, 85, 90, 95, 100
Compute the correlation coefficient to assess the relationship between
expenditusales revenue
10 80 Chart Title
20 85 120
30 90
40 95 100
f(x) = 0.5 x + 75
50 100 R² = 1
80
Correlation 1
60
40
20
0
5 10 15 20 25 30
g test scores. The data for
trength and direction of the relationship between the number of hours studied and the test scores.
e by seeing the r2 value
Chart Title
15 20 25 30 35
penditure (in thousands of dollars) and the resulting monthly sales revenue (in thousands of dollars) for a product ov
the relationship between advertising expenditure and sales revenue.
Chart Title
0.5 x + 75
1
15 20 25 30 35 40 45 50 55
f dollars) for a product over 5 months: