
1. Think of three (3) different experiments where you could use three (3) different types of data transformation to demonstrate the importance of applying data transformation in the statistical analysis of the experiments. Your conclusion must show a distinction between before and after transformation.

If a measurement variable does not fit a normal distribution or has greatly different standard deviations in different groups, you should try a data transformation. Using a statistical test such as an ANOVA or a linear regression on such data may give a misleading result. In some cases, transforming the data will make it fit the assumptions better (Gomez and Gomez, 1984).
i. Log Transformation

This consists of taking the log of each observation. You can use either base-10 logs (LOG10 in SAS) or base-e logs, also known as natural logs (LOG in SAS). It makes no difference for a statistical test whether you use base-10 logs or natural logs, because they differ by a constant factor; the natural log of a number is just 2.303 times its base-10 log. You should specify which log you are using when you write up the results, as it will affect things like the slope and intercept in a regression. Base-10 logs are often preferred because it is possible to look at them and see the magnitude of the original number: log(1) = 0, log(10) = 1, log(100) = 2, etc.
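
As a quick check of that constant-factor relationship, a minimal SAS sketch (the value 100 and the variable names are only illustrative):

data _null_;
x = 100;
log10_x = log10(x); * base-10 log: 2 ;
ln_x = log(x); * natural log: about 4.60517 ;
ratio = ln_x / log10_x; * about 2.3026 for any positive x ;
put x= log10_x= ln_x= ratio=;
run;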
For example, a field study was conducted to evaluate the effect of different rates of poultry manure on the number of maize leaves 70 days after planting. The experimental design used was a randomized complete block design (RCBD). Six treatments were evaluated with three replicates each, as indicated below:
T1= 1200 g poultry manure + 50% fertilization
T2= 1000 g poultry manure + 50% fertilization
T3= 800 g poultry manure + 50% fertilization
T4= 600 g poultry manure + 50% fertilization
T5= 400 g poultry manure + 50% fertilization
T6= 200 g poultry manure + 50% fertilization

Table 1: Raw data from the field experiment showing the number of leaves per plant after 70 days
Treatment   Block 1     Block 2     Block 3     Standard    Mean         Means of X
            (Plant-1)   (Plant-1)   (Plant-1)   deviation
T1          16          17          16          0.5773503   16.3333333   17.3333333
T2          15          16          16          0.5773503   15.6666667   16.6666667
T3          14          15          15          0.5773503   14.6666667   15.6666667
T4          14          13          14          0.5773503   13.6666667   14.6666667
T5          13          13          12          0.5773503   12.6666667   13.6666667
T6          12          13          10          1.5275252   11.6666667   12.6666667

Table 2: SAS input before and after transformation.


Input before transformation

data log_before; * data set name cannot contain a blank ;
input trt $ blk leaf; * treatment, block, and number of leaves per plant ;
datalines;
T1 1 16
T1 2 17
T1 3 16
T2 1 15
T2 2 16
T2 3 16
T3 1 14
T3 2 15
T3 3 15
T4 1 14
T4 2 13
T4 3 14
T5 1 13
T5 2 13
T5 3 12
T6 1 12
T6 2 13
T6 3 10
;
proc anova data=log_before;
class trt blk;
model leaf = trt blk;
means trt blk / lsd;
run;
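
Before deciding to transform, the equal-variance assumption itself can be checked formally. A minimal sketch using Levene's test in PROC GLM (the data set name log_before follows the step above; the block term is dropped because the homogeneity-of-variance test applies to a one-way layout):

proc glm data=log_before;
class trt;
model leaf = trt; * one-way layout for the variance check only ;
means trt / hovtest=levene; * Levene's test of homogeneity of variances across treatments ;
run;
quit;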

Input for transformation

data log_after; * data set name cannot contain a blank ;
input trt $ blk leaf;
x = leaf + 1; * add 1 to each count before taking logs (Gomez and Gomez, 1984) ;
y = log(x); * LOG is the natural log in SAS ;
datalines;
T1 1 16
T1 2 17
T1 3 16
T2 1 15
T2 2 16
T2 3 16
T3 1 14
T3 2 15
T3 3 15
T4 1 14
T4 2 13
T4 3 14
T5 1 13
T5 2 13
T5 3 12
T6 1 12
T6 2 13
T6 3 10
;
proc print data=log_after;
run;
proc anova data=log_after;
class trt blk;
model y = trt blk;
means trt / duncan;
run;
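
If the base-10 form mentioned above were preferred, the same transformation could be derived from the data set just created; a minimal sketch (the names log10_after, y10 and check are only illustrative):

data log10_after;
set log_after; * reuse trt, blk, leaf, x and y from the step above ;
y10 = log10(x); * base-10 log of leaf + 1 ;
check = y / y10; * about 2.3026 for every observation, the constant factor noted earlier ;
run;
proc print data=log10_after;
run;

Either log base gives identical F tests and mean groupings; only the scale of the reported means differs by that constant factor.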

The ANOVA Procedure


Table 3: ANOVA table before transformation

Source             DF    Sum of Squares    Mean Square    F Value    Pr > F
trt                 5       47.77777778     9.55555556      14.58    0.0003
blk                 2        1.44444444     0.72222222       1.10    0.3695
Error              10        6.55555556     0.65555556
Corrected Total    17       55.77777778

R-Square    Coeff Var    Root MSE    leaf Mean
0.882470    5.737775     0.809664    14.11111

Means with the same letter are not significantly different.

t Grouping    Mean       trt
A             16.3333    T1
AB            15.6667    T2
BC            14.6667    T3
CD            13.6667    T4
DE            12.6667    T5
E             11.6667    T6

From the results of the ANOVA (Table 3), there is a significant difference between treatments at α = 0.05 (P = 0.0003), and there is no significant block effect (P = 0.3695 at α = 0.05) in this RCBD experiment. The R-square value is 88.25%.
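
As a check on where the summary statistics come from, a minimal SAS sketch that recomputes them from the sums of squares printed in Table 3:

data _null_;
ss_trt = 47.77777778;
ss_blk = 1.44444444;
ss_err = 6.55555556;
ss_tot = 55.77777778;
df_err = 10;
mean_leaf = 14.11111;
r_square = (ss_trt + ss_blk) / ss_tot; * = 0.882470 ;
root_mse = sqrt(ss_err / df_err); * = 0.809664 ;
coeff_var = 100 * root_mse / mean_leaf; * = 5.737775 percent ;
put r_square= root_mse= coeff_var=;
run;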

Table 4: Transformed data (SAS PROC PRINT output, where x = leaf + 1 and y = log(x))

Obs    trt    blk    leaf    x     y
1      T1     1      16      17    2.83321
2      T1     2      17      18    2.89037
3      T1     3      16      17    2.83321
4      T2     1      15      16    2.77259
5      T2     2      16      17    2.83321
6      T2     3      16      17    2.83321
7      T3     1      14      15    2.70805
8      T3     2      15      16    2.77259
9      T3     3      15      16    2.77259
10     T4     1      14      15    2.70805
11     T4     2      13      14    2.63906
12     T4     3      14      15    2.70805
13     T5     1      13      14    2.63906
14     T5     2      13      14    2.63906
15     T5     3      12      13    2.56495
16     T6     1      12      13    2.56495
17     T6     2      13      14    2.63906
18     T6     3      10      11    2.39790

The ANOVA Procedure


Table 5: ANOVA table after transformation
Dependent Variable: y
Source             DF    Sum of Squares    Mean Square    F Value    Pr > F
trt                 5        0.21983173     0.04396635      11.90    0.0006
blk                 2        0.00781452     0.00390726       1.06    0.3831
Error              10        0.03694465     0.00369446
Corrected Total    17        0.26459090

R-Square    Coeff Var    Root MSE    y Mean
0.860371    2.244301     0.060782    2.708287

Means with the same letter are not significantly different.

Duncan Grouping    Mean       N    trt
A                  2.85227    3    T1
A                  2.81301    3    T2
AB                 2.75108    3    T3
BC                 2.68505    3    T4
CD                 2.61435    3    T5
D                  2.53397    3    T6

This is the SAS output after transformation. The values under x are the leaf counts after adding 1 (because the counts are relatively small), and the values under y are the natural logs of x (Table 4). There is a significant difference between treatments at α = 0.05 (P = 0.0006), a higher P value than before transformation (P = 0.0003). There is no significant block effect (P = 0.3831 at α = 0.05), which is again higher than before transformation (P = 0.3695), so the experiment remains a valid RCBD analysis. The R-square value is 86.04%, which is also lower than before transformation (88.25%).

Table 6: Comparing the treatment means before and after transformation

Treatment    Mean before transformation    Mean after transformation
T1           16.3333 a                     17.3271 a
T2           15.6667 ab                    16.6599 a
T3           14.6667 bc                    15.6595 ab
T4           13.6667 cd                    14.6589 bc
T5           12.6667 de                    13.6029 cd
T6           11.6667 e                     12.6034 d

Note: Means with the same letter are not significantly different.
The means before transformation show that T1 is statistically higher than T3; after transformation, however, T1 is not statistically different from T3 (Table 6).
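
The means reported after transformation in Table 6 are the antilogs of the Duncan means of y from Table 5. A minimal sketch of that back-transformation (the data set name backtransform and variable names are only illustrative; one common convention also subtracts the 1 that was added before taking logs to return fully to the leaf scale):

data backtransform;
input trt $ y_mean; * Duncan means of y from Table 5 ;
x_back = exp(y_mean); * antilog: mean on the leaf + 1 scale, as reported in Table 6 ;
leaf_back = x_back - 1; * optionally remove the 1 added before taking the log ;
datalines;
T1 2.85227
T2 2.81301
T3 2.75108
T4 2.68505
T5 2.61435
T6 2.53397
;
proc print data=backtransform;
run;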

Conclusion
The P values before transformation are lower than those after transformation; however, if the data had not been transformed, we would have concluded that T1 is statistically different from T3 and committed a Type I error. The transformed data have a lower R-square value, but the analysis is more realistic and precise.
