This is Tutorial 3 for the course PH6205 at City University of Hong Kong.

PH6205 R Tutorial 3

Haoran Xue

(Last Updated February 18, 2025)

CHD Example
Let’s reproduce the results for the CHD example from our lecture.

Load Data into R

First, we need to load the CHD data into R. The dataset is named CHDData.csv. On my computer,
the path to this dataset is “/Users/x/Library/CloudStorage/Dropbox/PH6205/Slides3”; you need
to change the path to match the location of the dataset on your own computer when you read the data.
We use the function read.csv().
CHD.data = read.csv("/Users/x/Library/CloudStorage/Dropbox/PH6205/Slides3/CHDData.csv")

We use the head() function to show the first 6 rows of CHD data:
head(CHD.data)

##   ID AGE CHD    Sex
## 1  1  20   0 Female
## 2  2  23   0   Male
## 3  3  24   0   Male
## 4  4  25   0 Female
## 5  5  25   1   Male
## 6  6  26   0   Male
Extract variables CHD and Age:
CHD = CHD.data$CHD
Age = CHD.data$AGE
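
The loading and extraction steps can be tried without the real file. The sketch below is only an illustration: it writes the six rows shown by head() to a temporary file and reads them back. The names demo.file, demo.data, demo.CHD and demo.Age are hypothetical, and the real CHDData.csv contains more rows than these six.

```r
# Hypothetical stand-in for CHDData.csv: only the six rows printed by head() above
demo.file = tempfile(fileext = ".csv")
writeLines(c("ID,AGE,CHD,Sex",
             "1,20,0,Female",
             "2,23,0,Male",
             "3,24,0,Male",
             "4,25,0,Female",
             "5,25,1,Male",
             "6,26,0,Male"), demo.file)

# Read the file with read.csv(), as in the tutorial, but from the temporary path
demo.data = read.csv(demo.file)

# Extract columns with $, as done for CHD and Age above
demo.CHD = demo.data$CHD
demo.Age = demo.data$AGE

# Show the structure of the data frame that was read
str(demo.data)
```

Note that the column in the file is named AGE (upper case), so it must be extracted as demo.data$AGE; R names are case sensitive.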

Linear Regression Is Not Appropriate

Run the linear regression of CHD ∼ Age:
lm.CHDAge = lm(CHD~Age)

Now let’s draw the scatter plot of CHD versus Age, using red to denote samples with CHD and blue to
denote samples without CHD. (Note: this part of the code is not required.)
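# Indices of samples with CHD (1) and without CHD (0)
ind1 = which(CHD == 1)
ind2 = which(CHD == 0)
# Plot samples without CHD in blue
plot(Age[ind2], CHD[ind2],
     ylim = c(0, 1), xlim = range(Age), col = "blue",
     xlab = "Age", ylab = "CHD")
# Add samples with CHD in red
points(Age[ind1], CHD[ind1], col = "red")
# Overlay the fitted linear regression line
abline(lm.CHDAge)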

[Figure: scatter plot of CHD (y-axis, 0.0 to 1.0) versus Age (x-axis, 20 to 70), with the fitted linear regression line.]

Show the details of linear regression results:
summary(lm.CHDAge)

##
## Call:
## lm(formula = CHD ~ Age)
##
## Residuals:
## Min 1Q Median 3Q Max
## -0.85793 -0.33992 -0.07274 0.31656 0.99269
##
## Coefficients:
##              Estimate Std. Error t value Pr(>|t|)
## (Intercept) -0.537960   0.168809  -3.187  0.00193 **
## Age          0.021811   0.003679   5.929 4.57e-08 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 0.429 on 98 degrees of freedom
## Multiple R-squared: 0.264, Adjusted R-squared: 0.2565
## F-statistic: 35.15 on 1 and 98 DF, p-value: 4.575e-08
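
One way to see why the linear fit is problematic is to plug an age into the fitted line. The sketch below uses the coefficients copied from the summary output above; the variable names b0, b1 and pred.age20 are illustrative. At Age = 20, the age in the first row of the data, the fitted value is negative, which cannot be a probability.

```r
# Coefficients copied from the summary(lm.CHDAge) output above
b0 = -0.537960   # intercept
b1 = 0.021811    # slope for Age

# Fitted value from the linear model at Age = 20 (the first row of the data)
pred.age20 = b0 + b1 * 20
print(pred.age20)

# A fitted "probability" below 0 is not a valid probability
print(pred.age20 < 0)
```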

Simple Logistic Regression

Let’s fit the simple logistic regression of CHD ∼ Age:
logistic.CHDAge = glm(CHD ~ Age, family = binomial)
summary(logistic.CHDAge)

##
## Call:
## glm(formula = CHD ~ Age, family = binomial)
##

## Coefficients:
##             Estimate Std. Error z value Pr(>|z|)
## (Intercept) -5.30945    1.13365  -4.683 2.82e-06 ***
## Age          0.11092    0.02406   4.610 4.02e-06 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## (Dispersion parameter for binomial family taken to be 1)
##
## Null deviance: 136.66 on 99 degrees of freedom
## Residual deviance: 107.35 on 98 degrees of freedom
## AIC: 111.35
##
## Number of Fisher Scoring iterations: 4
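
To help interpret these estimates, the sketch below converts the Age coefficient into an odds ratio and computes a predicted probability. The coefficients are copied from the output above; the variable names and the choice of Age = 50 are for illustration only.

```r
# Coefficients copied from the summary(logistic.CHDAge) output above
beta0 = -5.30945   # intercept
beta1 = 0.11092    # coefficient of Age

# Odds ratio for a one-year increase in Age
or.age = exp(beta1)
print(or.age)

# Predicted probability of CHD at Age = 50 (an age chosen for illustration);
# plogis() is the inverse of the logit function
p.age50 = plogis(beta0 + beta1 * 50)
print(p.age50)
```

Unlike the fitted values from the linear regression, a probability computed this way always lies between 0 and 1. With the fitted object available, exp(coef(logistic.CHDAge)) gives the same odds ratio without copying numbers by hand.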

Multiple Logistic Regression

Let’s fit the multiple logistic regression of CHD ∼ Age + Sex:
Sex = CHD.data$Sex
logistic.CHDAgeSex = glm(CHD ~ Age + Sex, family = binomial)
summary(logistic.CHDAgeSex)

##
## Call:
## glm(formula = CHD ~ Age + Sex, family = binomial)
##
## Coefficients:
##             Estimate Std. Error z value Pr(>|z|)
## (Intercept) -5.43517    1.19000  -4.567 4.94e-06 ***
## Age          0.11041    0.02407   4.588 4.48e-06 ***
## SexMale      0.20176    0.54014   0.374    0.709
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## (Dispersion parameter for binomial family taken to be 1)
##
## Null deviance: 136.66 on 99 degrees of freedom
## Residual deviance: 107.21 on 97 degrees of freedom
## AIC: 113.21
##
## Number of Fisher Scoring iterations: 4
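
The SexMale row compares males with females (the reference level) at a fixed age. As a rough illustration, the sketch below turns that estimate into an odds ratio with an approximate 95% Wald confidence interval. The numbers are copied from the output above; the variable names are illustrative, and the Wald interval is one common choice, not necessarily the one used in the lecture.

```r
# Estimate and standard error for SexMale, copied from the output above
est = 0.20176
se = 0.54014

# Odds ratio for Male versus Female, holding Age fixed
or.sex = exp(est)

# Approximate 95% Wald confidence interval on the odds ratio scale
z = qnorm(0.975)
ci.sex = exp(c(est - z * se, est + z * se))
print(or.sex)
print(ci.sex)
```

The interval contains 1, which agrees with the large p-value (0.709) for SexMale in the summary.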

Compare Nested Models Using LRT Test

We use the function lrtest() from the R package lmtest to perform the likelihood ratio test (LRT). First, install the
package lmtest, in the same way we installed the package car in R Tutorial 2. After successfully
installing lmtest, load the package using the function library():
library(lmtest)

## Loading required package: zoo
##
## Attaching package: 'zoo'
## The following objects are masked from 'package:base':
##
##     as.Date, as.Date.numeric
Now we can use the function lrtest() to perform the LRT test:
lrtest(logistic.CHDAge, logistic.CHDAgeSex)

## Likelihood ratio test
##
## Model 1: CHD ~ Age
## Model 2: CHD ~ Age + Sex
##   #Df  LogLik Df Chisq Pr(>Chisq)
## 1   2 -53.677
## 2   3 -53.607  1  0.14     0.7083
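
The numbers in this table can be reproduced by hand. The sketch below uses the log-likelihoods copied from the output above (rounded to three decimals, so the result agrees only up to rounding); the variable names are illustrative.

```r
# Log-likelihoods copied from the lrtest() output above
loglik.reduced = -53.677   # Model 1: CHD ~ Age
loglik.full = -53.607      # Model 2: CHD ~ Age + Sex

# LRT statistic: twice the difference in log-likelihoods
lrt.stat = 2 * (loglik.full - loglik.reduced)

# Degrees of freedom: difference in the number of parameters (3 - 2)
lrt.df = 3 - 2

# p-value: upper-tail probability of the chi-square distribution
lrt.p = pchisq(lrt.stat, df = lrt.df, lower.tail = FALSE)
print(lrt.stat)
print(lrt.p)
```

The statistic also equals the drop in residual deviance between the two models (107.35 - 107.21 = 0.14).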

HL Test
We use the function hoslem.test() from the R package ResourceSelection to perform the Hosmer-Lemeshow
(HL) test. First, install the package ResourceSelection, in the same way we installed the
packages car and lmtest. After successfully installing ResourceSelection, load the package using the
function library():
library(ResourceSelection)

## ResourceSelection 0.3-6 2023-06-27

The HL test takes two sets of values: the first is the binary outcome, and the second is the
predicted probabilities p̂_i. The function fitted() can be used to get the predicted probabilities:
fitted(logistic.CHDAgeSex)

## 1 2 3 4 5 6 7
## 0.03816625 0.06333496 0.07020957 0.06447555 0.07776843 0.08606576 0.08606576
## 8 9 10 11 12 13 14
## 0.10509814 0.10509814 0.11594475 0.12775098 0.12775098 0.12775098 0.10690499
## 15 16 17 18 19 20 21
## 0.12775098 0.12775098 0.15444378 0.15444378 0.14288754 0.16941897 0.18552758
## 22 23 24 25 26 27 28
## 0.15695020 0.15695020 0.15695020 0.18552758 0.17211894 0.17211894 0.22123058
## 29 30 31 32 33 34 35
## 0.18842604 0.22123058 0.24083707 0.20589388 0.24083707 0.22453305 0.26159750
## 36 37 38 39 40 41 42
## 0.28347926 0.24434039 0.26529728 0.30643177 0.33038578 0.28736807 0.35525318
## 43 44 45 46 47 48 49
## 0.35525318 0.35525318 0.35525318 0.38092752 0.33461756 0.38092752 0.40728524
## 50 51 52 53 54 55 56
## 0.35963241 0.40728524 0.40728524 0.43418763 0.43418763 0.46148346 0.46148346
## 57 58 59 60 61 62 63
## 0.48901221 0.43887755 0.48901221 0.51660777 0.51660777 0.51660777 0.54410241
## 64 65 66 67 68 69 70
## 0.54410241 0.54410241 0.57133084 0.57133084 0.59813416 0.62436343 0.62436343
## 71 72 73 74 75 76 77
## 0.60270860 0.60270860 0.67457219 0.65420877 0.69832867 0.69832867 0.72106802
## 78 79 80 81 82 83 84
## 0.72106802 0.72106802 0.70233019 0.74272480 0.70233019 0.74272480 0.74272480
## 85 86 87 88 89 90 91
## 0.70233019 0.76325219 0.76325219 0.76325219 0.78262110 0.74635133 0.76668026

## 92 93 94 95 96 97 98
## 0.80081893 0.81784789 0.83372326 0.80384284 0.84847137 0.86212770 0.86212770
## 99 100
## 0.87473498 0.91568744
Now we perform the HL test. The first argument, CHD, is the set of binary outcomes, and the second argument,
fitted(logistic.CHDAgeSex), is the set of predicted probabilities:
hoslem.test(CHD,fitted(logistic.CHDAgeSex))

##
## Hosmer and Lemeshow goodness of fit (GOF) test
##
## data: CHD, fitted(logistic.CHDAgeSex)
## X-squared = 1.7167, df = 8, p-value = 0.9885
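
The reported p-value can be checked from the test statistic. This sketch assumes the default number of groups in hoslem.test(), g = 10, so that the reference distribution is chi-square with g - 2 = 8 degrees of freedom, which matches the df in the output above. The variable names are illustrative.

```r
# Test statistic copied from the hoslem.test() output above
hl.stat = 1.7167
g = 10            # assumed default number of groups in hoslem.test()
hl.df = g - 2     # matches df = 8 in the output

# p-value: upper-tail probability of the chi-square distribution
hl.p = pchisq(hl.stat, df = hl.df, lower.tail = FALSE)
print(hl.p)
```

A large p-value such as this one gives no evidence of lack of fit for the model CHD ~ Age + Sex.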
