0% found this document useful (0 votes)

42 views37 pages

Week 4 Bioscience

The document provides an overview of advanced study skills in biological sciences, focusing on data handling and statistics. It covers descriptive and inferential statistics, measures of central tendency, variability in biological data, and the impact of outliers on statistical measures. Additionally, it discusses standard deviation, standard error, and various statistical analyses such as T-tests and Chi-squared tests.

Uploaded by

ganievmuhammadaziz

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

42 views37 pages

Week 4 Bioscience

Uploaded by

ganievmuhammadaziz

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 37

Birmingham International Academy

Advanced Study Skills in

Biological Sciences:

Data Handling, Statistics &

Describing data.
Richard Banks
[email protected]
Statistics

• Statistics is a collection of mathematical techniques

that help to analyse and present data

• vital to the scientific method

- used to confirm or reject a hypothesis

• Classified into
‘Descriptive statistics’ and ‘Inferential statistics’
Descriptive Statistics
Used to summarise the basic features of a data set

• measures of central tendency

 mean, mode, median

• measures of spread
 range, standard deviation, standard error

• measures of distribution
 skewness

3
Variability in ‘biological’ data

Biological data often has a ‘Normal distribution’

i.e. A frequency distribution with the most frequent number near
the middle: central tendency
Frequency distribution.
• Number of times an observation occurs in the data set
• Often presented in a table or a histogram
• % Frequency can be calculated:
frequency of an observation X 100
total number of observations
Result Frequency
0 2
1 9 frequency of 0 = 2 / (2+9+26+25+10+3) x 100 = 2.67%
2 26 frequency of 2 = 26/(2+9+26+25+10+3) x 100 = 34.67%
3 25
4 10
5 3

• % Frequency can then be used to create a distribution histogram

Task:
• Calculate the % Frequency of the data set.
• Produce a sketch diagram of the percentage
distribution graph of the table from the
previous slide (also below):
Normal distribution: data with central tendency

Biological data often has a normal distribution

i.e. has a frequency distribution with the most frequent number
near the middle, i.e. central tendency
therefore, measuring of the "middle" value of the data set is useful
Measures of central tendency
Sum of observations
Σx
• Mean: x = n Number of observations

• Median: equal number of values above and

below (=Middle)

• Mode: Value with the highest frequency (=Most)

• A data set can be bimodal or even multimodal, with 2 or

more values being equally frequent.

*sample mean used as an estimate of the population mean

The Mean
• The mean (or average) is calculated by adding up all the
individual values and dividing the total by the number of values

The Median
• The median value is identified by putting all of the individual
values in size order (smallest to largest) to find the middle value
(if there are an even number of individual values, take the mean of the two
middle values)

The Mode
• The mode is the value that occurs most often in the data set
Task
• Make a start on
completing the
questions 1-5 on the
worksheet.

• 10 minutes
An example ‘data set’:
2 , 4 , 2 , 0 , 40 , 2 , 4 , 3 , 6

Calculate the mean, median and mode

Σx
Mean: x = n Σx = n= x =

Median: sort data

Mode: (Occurs the most times)

Which is most representative of the centre of the data?

An example ‘data set’:
2 , 4 , 2 , 0 , 40 , 2 , 4 , 3 , 6

Calculate the mean, median and mode

Σx 63
Mean: x = n Σx = 63 n = 9 x = 9 = 7

Is this an error / ‘real’ data point ?

Median: sort data 0 2 2 2 3 4 4 6 40

Mode: 2 (Occurs the most times)

Which is most representative of the centre of the data?

What happens if we exclude the ‘outlier’ ?

2 4 2 0 40 X2 4 3 6
New data set:
2 4 2 0 2 4 3 6
Σx 23
Mean: x = n Σx = 23 n = 8 x = 8 = 2.875 It was 7

Median: sort data 0 2 2 2 3 4 4 6 It was 3

Mode: 2 - occurs the most times It was 2

An ‘outlier’ can have a disproportionate effect on the mean

Median is a reasonably typical value (resistant to outliers)

Range: Difference between the maximum & minimum
• An estimate of the spread of the data (= ‘dispersion’)
e.g. experimental data of weight of lab rats
320 , 367, 423, 471, 480 grams

Range is calculated as ………………………….. 480 - 320 = 160 g

• Useful, BUT some data can be very different from other data
points – outliers
e.g. a small baby rat added to the data set
150, 320 , 367, 423, 471, 480 g 480 - 150 = 330 g

• So, not always an accurate description of the overall data set

Data with outliers: The mean and the range are altered
to a greater extent by outliers.
‘Normal’ distribution
Symmetrical
Mean = Median = Mode
Mean
Median
Mode ‘Skewed’ distributions
- caused by ‘outliers’

Positive (right) skew

Long upper tail (high values)
Mean > Median > Mode
Mean
Median
Mode
Mean is moved in the
direction of the skew
Negative (left) skew
Long lower tail (low values)
Mean < Median < Mode
Mean
Median
Mode
Task
• Make a start on
completing the
questions 6-8 on the
worksheet.

• 5 minutes.
Break
Standard deviation (SD)
A measure of how data is distributed about the mean

• Standard deviation is a measure of the distance of an

individual value from the overall sample mean

• Allows us to quantify the variability within the data

• Expressed as Mean  SD

• The lower the standard deviation, the less uncertainty

[or] More confidence in the experimental result
Standard deviation (SD)
A measure of how data is distributed about the mean
• Mean  SD
• Eg 55.3  3.3
• This means that 68% of the values in the data set lies within 6.6
of the mean value, ie from 52.0 to 58.6
• 95% of the values fall within 2SD ie 48.7 to 61.9
Standard deviation (SD)
a measure of how data is distributed about the mean

• Less spread of data around the

mean = small standard deviation
• More confidence in the data set.

• High spread around the mean =

higher standard deviation
• Lower confidence in the data set.
Standard Deviation and Variance

Sample Variance (S2)

x = each score/value
= mean (average)
n = number of scores/values
= sum of…
Standard Deviation = √ Variance
Standard deviation (SD):
Calculation Task:

•In a learning behaviour

study, rats had to press a
leaver to gain a food
reward.
•Number of leaver presses,
before rat gave up trying to
access food reward are
given on the next slide.
•Can you work out the
standard deviation of the
data set?
Standard deviation (SD)
calculation
Task 10 minutes:
Repetition of lab rat leaver pressing in a reward experiment:
Number of leaver presses: 9, 2, 5, 4, 12, 7, 8, 11, 9, 3, 7, 4, 12, 5, 4,
10, 9, 6, 9, 4.
n=20
To calculate SD

Step 1: Calculate the mean, . Add up all the numbers and divide by
the total number of data. = 7
Step 2: Subtract the mean from each data point and then square
each value.
Step 3: Calculate the sum of the squared values.
Step 4: To calculate the variance, divide the sum of the squared
values by n-1.
Step 5: The standard deviation is the square root of the variance.
Use a calculator to obtain this number.
Standard deviation (SD)
calculation
Task 10 minutes:
Usefulness of Standard Deviation

gives ‘reliability’ measure - 95% confidence interval (CI)

= 2 x SD
= range above and below the mean within which 95% of
the measurements lie
Expressing data points with SD error bars

Symbol or bar that indicates Mean value

Vertical line representing size of standard deviation
Standard Error (SE)

• SE is related to, but is not the same as, the

standard deviation (SD)

• SE = SD/√N

N = sample size

• expressed as Mean  SE, N= (sample size)

Overlap between error bars

If SE bars do not overlap this indicates differences in means are

meaningful
• Requires an appropriate Statistical Test to confirm
Task
• Make a start on
completing the
questions 9-13 on the
worksheet.

• 10 minutes.
Statistical Analyses
- A hypothesis can be confirmed by statistical approaches if sufficient
data has been collected.
- A statistical test confirms whether the difference between data sets is
statistically significant.
- The tests used depends on whether the data collected is independent
or matched/paired & the level of data collected (nominal, categorical,
ordinal, quantitative).
Statistical Analyses
T-Test
Can be used to test a hypothesis to
determine whether there is a significant
difference between the means of two
data sets that are normally distributed.

If the difference between the two data

sets is significant then the null
hypothesis can be rejected and the
alternative hypothesis, which always
states there is a significant difference
between the sets of data can be
accepted (see lecture 1, The Scientific
Method).
Statistical Analyses

Chi-Squared Test
Is used to determine whether there is a significant
difference between the observed set of data obtained
from an investigation is statistically significantly different
from that which was originally expected and stated in the
hypothesis.
The null hypothesis will always state there will be no
difference between the observed and expected values.
This test is often used in inheritance studies to see if
observable characteristics follow mendelian ratios
(eg 3:1 or 9:3:3:1)
Task
• Complete the questions
on the worksheet (14-
16), so you have them
ready for revision.

• Answer sheets will be

made available on
Canvas after the
session.
Useful links for more information
• University of Birmingham Academic Skills Gateway
http://libguides.bham.ac.uk/asg

• http://www.stats.gla.ac.uk/steps/glossary/presenting_data.
html

• http://explorable.com/statistics-tutorial
• http://www.engageinresearch.ac.uk/section_4/step_by_ste
p_statistics.shtml

• http://www.statstutor.ac.uk/topics/
• https://www.bmj.com/about-bmj/resources-readers/public

Biostatistics Revision DR - NJ
No ratings yet
Biostatistics Revision DR - NJ
67 pages
Understanding Standard Error in Statistics
No ratings yet
Understanding Standard Error in Statistics
14 pages
4x @6ote ) 'Btda2@m
No ratings yet
4x @6ote ) 'Btda2@m
55 pages
Intro Summary of Statistics PLTW Slide Show
No ratings yet
Intro Summary of Statistics PLTW Slide Show
47 pages
3 Measures of Central Tendency
No ratings yet
3 Measures of Central Tendency
30 pages
Central Tendency and Dispersion Explained
No ratings yet
Central Tendency and Dispersion Explained
9 pages
Stat 1101 4 7
No ratings yet
Stat 1101 4 7
18 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
34 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
29 pages
Math
No ratings yet
Math
6 pages
Statistics For Data Science
No ratings yet
Statistics For Data Science
93 pages
U3 IntroSummaryStatistics
No ratings yet
U3 IntroSummaryStatistics
47 pages
Central Tendency
No ratings yet
Central Tendency
11 pages
Week 7 - Basic Statistics-For Students
No ratings yet
Week 7 - Basic Statistics-For Students
22 pages
Basic Statistics Refresher For Business Analytics
No ratings yet
Basic Statistics Refresher For Business Analytics
5 pages
Unit 5 BRM
No ratings yet
Unit 5 BRM
17 pages
Lecture of BIOSTATISTICS 12.2022 RMDC
No ratings yet
Lecture of BIOSTATISTICS 12.2022 RMDC
85 pages
Standard Deviation Formulas Explained
No ratings yet
Standard Deviation Formulas Explained
10 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
19 pages
Data Analysis - Calculation of Spread
No ratings yet
Data Analysis - Calculation of Spread
37 pages
Health Statistics: Principles of Secondary Data Analysis
No ratings yet
Health Statistics: Principles of Secondary Data Analysis
61 pages
Measures of Central Tendency Dispersion
No ratings yet
Measures of Central Tendency Dispersion
30 pages
HNS 2321 Biostatistics Lecture 3 and 4 Descritive Statistics
No ratings yet
HNS 2321 Biostatistics Lecture 3 and 4 Descritive Statistics
36 pages
MCS Lecture 3
No ratings yet
MCS Lecture 3
57 pages
Lesson 6c, 7, 8
No ratings yet
Lesson 6c, 7, 8
46 pages
05 - Statistical Processing and Analysis of Medical Data
No ratings yet
05 - Statistical Processing and Analysis of Medical Data
14 pages
Statistics Basics for Data Science
100% (2)
Statistics Basics for Data Science
27 pages
Lecture 3-4descriptive Statistics Measures of Central Tendency
No ratings yet
Lecture 3-4descriptive Statistics Measures of Central Tendency
32 pages
Describing Data: Centre Mean Is The Technical Term For What Most People Call An Average. in Statistics, "Average"
No ratings yet
Describing Data: Centre Mean Is The Technical Term For What Most People Call An Average. in Statistics, "Average"
4 pages
Introduction to Statistical Analysis
No ratings yet
Introduction to Statistical Analysis
10 pages
Data Summarization
No ratings yet
Data Summarization
37 pages
Descriptive Statistics & Data Analysis
No ratings yet
Descriptive Statistics & Data Analysis
48 pages
Statistics, Statistical Modelling & Data Analytics
No ratings yet
Statistics, Statistical Modelling & Data Analytics
68 pages
CHAPTERS
No ratings yet
CHAPTERS
17 pages
Module I. Basic Calculations. Average, Standard Deviation by Excel
No ratings yet
Module I. Basic Calculations. Average, Standard Deviation by Excel
48 pages
Chapter2-Statistical Analysis
No ratings yet
Chapter2-Statistical Analysis
86 pages
Q & A - Unit 1 - Introduction To Statistics
No ratings yet
Q & A - Unit 1 - Introduction To Statistics
20 pages
Lecture 2-Summarizing Data - HSciences Biostats - 010232en
No ratings yet
Lecture 2-Summarizing Data - HSciences Biostats - 010232en
37 pages
Descreptive Statistics 1
No ratings yet
Descreptive Statistics 1
74 pages
Statistics and Standard Deviation Guide
No ratings yet
Statistics and Standard Deviation Guide
64 pages
Biostatistics: Khadeeja PK
0% (1)
Biostatistics: Khadeeja PK
27 pages
Chapter 4 Basic Statistics
No ratings yet
Chapter 4 Basic Statistics
22 pages
Week 4 Measures of Central Tendency
No ratings yet
Week 4 Measures of Central Tendency
29 pages
Unit II TYCS DS
No ratings yet
Unit II TYCS DS
176 pages
Summary Statistics
No ratings yet
Summary Statistics
28 pages
Lesson 6c, 7, 8-Print
No ratings yet
Lesson 6c, 7, 8-Print
5 pages
E Book - Unit 4
No ratings yet
E Book - Unit 4
12 pages
Descriptive Statistic
No ratings yet
Descriptive Statistic
37 pages
Lecture 6
No ratings yet
Lecture 6
84 pages
Measures of Central Tendency: Mean Median Mode
No ratings yet
Measures of Central Tendency: Mean Median Mode
20 pages
FDSA Unit 2
No ratings yet
FDSA Unit 2
44 pages
8614.educational Statitics Unit 4
No ratings yet
8614.educational Statitics Unit 4
34 pages
Central Tendency and Variation Measures
No ratings yet
Central Tendency and Variation Measures
29 pages
Lect 03 Measures of Central Tendency and Dispersion
No ratings yet
Lect 03 Measures of Central Tendency and Dispersion
12 pages
Term 3 Mathematics (Session 1 - 4) 2021 Learner Stats and Probability Final
No ratings yet
Term 3 Mathematics (Session 1 - 4) 2021 Learner Stats and Probability Final
52 pages
Statistics: Central Tendency & Variability
No ratings yet
Statistics: Central Tendency & Variability
8 pages
Lesson2 Shs
No ratings yet
Lesson2 Shs
4 pages
Chapter 2 Slides
No ratings yet
Chapter 2 Slides
19 pages
B.A./B.Sc. (STATISTICS)
No ratings yet
B.A./B.Sc. (STATISTICS)
34 pages
Stat MEM Chapter II Correlation
No ratings yet
Stat MEM Chapter II Correlation
5 pages
AMA1501 Ch1
No ratings yet
AMA1501 Ch1
6 pages
662b8d91f3edffinal Files QT
No ratings yet
662b8d91f3edffinal Files QT
13 pages
Business Statistics Assignment
No ratings yet
Business Statistics Assignment
5 pages
Maths Integration
No ratings yet
Maths Integration
7 pages
Statistics Question Bank
No ratings yet
Statistics Question Bank
4 pages
Analisis Kepadatan Penduduk dan COVID
No ratings yet
Analisis Kepadatan Penduduk dan COVID
5 pages
Presentation Mcqs
100% (2)
Presentation Mcqs
2 pages
Assignment No. 2: Assignment Submission Guidelines: Assignment Formatting Instructions
No ratings yet
Assignment No. 2: Assignment Submission Guidelines: Assignment Formatting Instructions
9 pages
Business Statistics Unit 4 Correlation and Regression
No ratings yet
Business Statistics Unit 4 Correlation and Regression
27 pages
Applied Statistics in Business & Economics: David P. Doane and Lori E. Seward
No ratings yet
Applied Statistics in Business & Economics: David P. Doane and Lori E. Seward
65 pages
Joker
No ratings yet
Joker
16 pages
Econometrics: Optimal GMM Weighting
No ratings yet
Econometrics: Optimal GMM Weighting
28 pages
2.5 - Normal Distribution
No ratings yet
2.5 - Normal Distribution
10 pages
Normal Distribution
No ratings yet
Normal Distribution
21 pages
Gr.12 August MEMO
No ratings yet
Gr.12 August MEMO
7 pages
2.1.1 Central Tendencies
No ratings yet
2.1.1 Central Tendencies
4 pages
Stat 101 LE1 Samplex
No ratings yet
Stat 101 LE1 Samplex
3 pages
Mps in Elementary Mathematics: First Quarter
No ratings yet
Mps in Elementary Mathematics: First Quarter
4 pages
2025 - ECN2331 - Lecture3 - Descriptive Statistics - Dispersion
No ratings yet
2025 - ECN2331 - Lecture3 - Descriptive Statistics - Dispersion
49 pages
Measures of Variability Worksheet
No ratings yet
Measures of Variability Worksheet
5 pages
Mathematics 10 Long Quiz on Position
No ratings yet
Mathematics 10 Long Quiz on Position
4 pages
Z-Score and Normal Distribution Review
No ratings yet
Z-Score and Normal Distribution Review
24 pages
Calculate Standard Deviation
0% (1)
Calculate Standard Deviation
3 pages
12-Statistics 35435028 2025 06 10 07 40
No ratings yet
12-Statistics 35435028 2025 06 10 07 40
23 pages
Lecture4B Slides
No ratings yet
Lecture4B Slides
8 pages
The Prediction Error of Bornhuetter/Ferguson: BY Homas ACK
No ratings yet
The Prediction Error of Bornhuetter/Ferguson: BY Homas ACK
17 pages

Week 4 Bioscience

Uploaded by

Week 4 Bioscience

Uploaded by

Birmingham International Academy

Advanced Study Skills in

Data Handling, Statistics &

• Statistics is a collection of mathematical techniques

• vital to the scientific method

• measures of central tendency

Biological data often has a ‘Normal distribution’

• % Frequency can then be used to create a distribution histogram

Biological data often has a normal distribution

• Median: equal number of values above and

• Mode: Value with the highest frequency (=Most)

• A data set can be bimodal or even multimodal, with 2 or

*sample mean used as an estimate of the population mean

Calculate the mean, median and mode

Median: sort data

Mode: (Occurs the most times)

Which is most representative of the centre of the data?

Calculate the mean, median and mode

Is this an error / ‘real’ data point ?

Median: sort data 0 2 2 2 3 4 4 6 40

Mode: 2 (Occurs the most times)

Which is most representative of the centre of the data?

Median: sort data 0 2 2 2 3 4 4 6 It was 3

Mode: 2 - occurs the most times It was 2

An ‘outlier’ can have a disproportionate effect on the mean

Median is a reasonably typical value (resistant to outliers)

Range is calculated as ………………………….. 480 - 320 = 160 g

• So, not always an accurate description of the overall data set

Positive (right) skew

• Standard deviation is a measure of the distance of an

• Allows us to quantify the variability within the data

• The lower the standard deviation, the less uncertainty

• Less spread of data around the

• High spread around the mean =

Sample Variance (S2)

•In a learning behaviour

gives ‘reliability’ measure - 95% confidence interval (CI)

Symbol or bar that indicates Mean value

• SE is related to, but is not the same as, the

• expressed as Mean  SE, N= (sample size)

If SE bars do not overlap this indicates differences in means are

If the difference between the two data

• Answer sheets will be

You might also like