MiniTab Introduction
CONTENTS
Preface
2 Looking at Data–Relationships
2.1 Scatterplots
2.2 Correlations
2.3 Regression
2.4 Transformations
2.5 Exercises
3 Producing Data
3.1 Generating a Random Sample
3.2 Sampling from Distributions
3.3 Exercises
5 Sampling Distributions
5.1 The Binomial Distribution
5.2 Simulating Sampling Distributions
5.3 Exercises
Appendices
A Projects
F References
Index
Preface
Part II follows the structure of the textbook. Each chapter is titled and
numbered as in IPS. The last two chapters are not in IPS but correspond to
optional material included on the CD-ROM. The Minitab commands relevant to
doing the problems in each IPS chapter are introduced and their use illustrated.
Each chapter concludes with a set of exercises, some of which are modifications
of or related to problems in IPS and many of which are new and specifically
designed to ensure that the relevant Minitab material has been understood.
There are also appendices dealing with some more advanced features of Minitab,
such as programming in Minitab and matrix algebra.
Minitab is available in a variety of versions and for different types of
computing systems. In writing the manual, we have used Version 13 for Windows, as
discussed in the references in Appendix F, but have tried to make the contents
of the manual compatible with earlier versions and for versions running under
other operating systems. The core of the manual is a discussion of the menu
commands while not neglecting to refer to the session commands. Overall, we
feel that the manual can be successfully used with most versions of Minitab.
This manual does not attempt a complete coverage of Minitab. Rather, we
introduce and discuss those concepts in Minitab that we feel are most relevant
for a student studying introductory statistics with IPS. We do introduce some
concepts that are, strictly speaking, not necessary for solving the problems in
IPS where we feel that they were likely to prove useful in a large number of
data analysis problems encountered outside the classroom. While the manual's
primary goal is to teach Minitab, generally we want to help develop strong data
analytic skills in conjunction with the text and the CD-ROM.
Thanks to Patrick Farace and Chris Spavins of W. H. Freeman and Company
for their help and consideration. Also thanks to Rosemary and Heather.
For further information on Minitab software, contact:
Minitab Inc.
3081 Enterprise Drive
State College, PA 16801 USA
ph: 814.238.3280
fax: 814.238.4383
email: Info@minitab.com
URL: http://www.minitab.com
Part I
New Minitab commands discussed in this part
Calc ▶ Calculator
Calc ▶ Column Statistics
Calc ▶ Make Patterned Data
Calc ▶ Row Statistics
Edit ▶ Copy Cells
Edit ▶ Cut Cells
Edit ▶ Paste Cells
Edit ▶ Select All Cells
Edit ▶ Undo Cut
Edit ▶ Undo Paste
Editor ▶ Enable Command Language
Editor ▶ Insert Cells
Editor ▶ Insert Columns
Editor ▶ Insert Rows
Editor ▶ Make Output Editable
File ▶ Exit
File ▶ New
File ▶ Other Files ▶ Export Special Text
File ▶ Open Worksheet
File ▶ Other Files ▶ Import Special Text
File ▶ Print Session Window
File ▶ Print Worksheet
File ▶ Save Current Worksheet
File ▶ Save Current Worksheet As
File ▶ Save Session Window As
Help
Manip ▶ Code
Manip ▶ Concatenate
Manip ▶ Copy Columns
Manip ▶ Display Data
Manip ▶ Erase Variables
Manip ▶ Rank
Manip ▶ Sort
Manip ▶ Stack
Manip ▶ Unstack
Window ▶ Project Manager
7 Minitab for Data Management
you start on Chapter II.1, however, you should read I.1-I.10 and leave I.11 for
later reading.
Minitab is a software package that runs on a variety of different types of
computers and comes in a number of versions. This manual does not try to
describe all the possible implementations or the full extent of the package. We
limit our discussion to those features common to the most recent versions of
Minitab and, in particular, Versions 12 and 13. Also, we present only those
aspects of Minitab relevant to carrying out the statistical analyses discussed in
IPS. Of course, this is a fairly wide range of analyses, but the full power of
Minitab is not necessary. Depending on the version of Minitab you are using,
there may be many more useful features, and we encourage you to learn and
use them. Throughout the manual, we point out what some of the additional
useful features of Minitab are and how you can go about learning how to use
them. Version 13 refers to the most current version of Minitab at the time of
writing this manual.
In this manual, special statistical or Minitab concepts will be highlighted in
italic font. You should be sure that you understand these concepts. We will
provide a brief explanation for any terms not defined in IPS. When a reference is
made to a Minitab session command or subcommand, its name will be in bold
font. Primarily, we will be discussing the menu commands that are available in
Minitab. Menu commands are accessed by clicking the left button of the mouse
on items in lists. We use a special notation for menu commands. For example,
A ▶ B ▶ C
is to be interpreted as left click the command A on the menu bar, then in the list
that drops down, left click the command B, and, finally, left click C. The menu
commands will be denoted in ordinary font (the actual appearance may vary
slightly depending on the version of Windows you use). Any commands that
we type and the output obtained will be denoted in typewriter font, as will
the names of any files used by Minitab, variables, constants, and worksheets.
At the end of each chapter, we provide a few exercises that can be used to
make sure you have understood the material. We recommend, however, that
whenever possible you use Minitab to do the problems in IPS. While many
problems can be done by hand, you will save a considerable amount of time and
avoid errors by learning to use Minitab effectively. We also recommend that
you try out the Minitab commands as you read about them, as this will ensure
full understanding.
In some cases, this may mean you type a command such as minitab at
a computer system prompt and then hit the Enter or Return key on the
keyboard after you have logged on, i.e., provided a login name and password to the
computer system being used in your course. Typically, you will see the prompt
MTB >
on your screen, and this indicates that you have started a Minitab session.
In most cases, you will double click an icon, such as that shown in Display
I.1, that corresponds to the Minitab program.
Alternatively, you can use the Start button and click on Minitab in the Programs
list. In this case, the program opens with a Minitab window, such as the one
shown in Display I.2. The Minitab window is divided into two sub-windows
with the upper window called the Session window and the lower one called the
Data window.
Left clicking the mouse anywhere on a particular window brings that window
to the foreground, i.e., makes it the active window, and the border at the top of
the window turns dark blue. For example, clicking in the Session window will
make the window containing the MTB > prompt active. Alternatively, you can
use the command Window ▶ Session in the menu bar at the top of the Minitab
window to make this window active. You may not see the MTB > prompt in
your Session window, and for this manual it is important that you do so. You
can ensure that this prompt always appears in your Session window by using
Edit ▶ Preferences, double clicking on Session Window in the Preferences list that
comes up, clicking on the Enable radio button under Command Language in
the Session Window Preferences, clicking on OK, and clicking on Save. Without
the MTB > prompt, you cannot type commands to be executed in the Session
window.
In the Session window, Minitab commands are typed after the MTB > prompt
and executed when you hit the Enter or Return key. For example, the first
command you should learn is exit, as this takes you out of your Minitab session
and returns you to the system prompt or operating system. Otherwise, you can
access commands using the menu bar (Display I.3) that resides at the top of the
Minitab window. For example, you can access the exit command using File ▶
Exit. In many circumstances, using the menu commands to do your analyses is
easy and convenient, although there are certain circumstances where typing the
session commands is necessary. You can also exit by clicking on the close symbol (×)
in the upper right-hand corner of the Minitab window. When you exit, you are
prompted by Minitab in a dialog window with the question, Save changes to
this Project before closing? You can safely answer no to this question unless
you are in fact using the Projects feature in Minitab as described in Appendix
A. In I.8, we will discuss how to save the contents of a Data window before
exiting. This is something you will commonly want to do.
Immediately below the menu bar in the Minitab window is the taskbar. The
taskbar consists of various icons that provide a shortcut method for carrying
out various operations by clicking on them. These operations can be identified
by holding the cursor over each in turn, and it is a good idea to familiarize
yourself with these. Of particular importance are the Cut Cells, Copy Cells,
and Paste Cells icons, which are available when a Data window is active. When
the operation associated with an icon is not available, the icon is faded.
Minitab is an interactive program. By this we mean that you supply Minitab
with input data, or tell it where your input data is, and then Minitab responds
instantaneously to any commands you give telling it to do something with that
data. You are then ready to give another command. It is also possible to run
a collection of Minitab commands in a batch program; i.e., several Minitab
commands are executed sequentially before the output is returned to the user.
The batch version is useful when there is an extensive number of computations
to be carried out. You are referred to Appendix C for more discussion of the
batch version.
c:\Program [Link]
or something similar. This path name indicates that the file [Link] is stored
on the C hard drive in the directory called Program Files\Mtbwin\Data. We
will discuss several different types of files in this chapter.
In many versions of Minitab, there are restrictions on file names. For
example, in earlier versions a file name can be at most eight characters in length
using any symbols except # and ' and the first character cannot be a blank.
There is no length restriction on file names in Versions 12 or 13. It is generally
best to name your files so that the file name reflects its contents. For example,
the file name marks may refer to a data set composed of student marks in a
number of courses.
4 Getting Help
At times, you may want more information about a command or some other
aspect of Minitab than this manual provides, or you may wish to remind yourself
of some detail that you have partially forgotten. Minitab contains an online
manual that is very convenient. You can access this information directly by
clicking on Help in the Menu bar and using the table of Contents or doing a
Search of the manual for a particular concept.
From the MTB > prompt, you can use the help command for this purpose.
Typing help followed by the name of the command of interest and hitting Enter
will cause Minitab to produce relevant output. For example, asking for help on
the command help itself via the command
MTB > help help
will give you an overview of what help information can be accessed on your
system. The help command should be used to find out about session commands.
5 The Worksheet
The basic structural component of Minitab is the worksheet. Basically, the
worksheet can be thought of as a big rectangular array, or matrix, of cells
organized into rows and columns as in the Data window of Display I.2. Each cell
holds one piece of data. This piece of data could be a number, i.e., numeric data,
or it could be a sequence of characters, such as a word or an arbitrary sequence
of letters and numbers, i.e., text data. Data often comes as numbers, such as
1.7, 2.3, . . . but sometimes it comes in the form of a sequence of characters,
such as black, brown, red, etc. Typically, sequences of characters are used as
identifiers in classifications for some variable of interest, e.g., color, gender. A
piece of text data can be up to 80 characters in length in Minitab. Version 13
also allows for date data, which is data especially formatted to indicate a date,
for example, 3/4/97. We will not discuss date data.
If possible, try to avoid using text data with Minitab, i.e., make sure all
the values of a variable are numbers, as dealing with text data in Minitab is
more difficult. For example, denote colors by numbers rather than by names.
Still, there will be applications where data comes to you as text data, e.g., in
a computer file, and it is too extensive to convert to numeric data. So we will
discuss how to input text data into a Minitab worksheet, but we recommend
that in such cases you convert this to numeric data, using the methods of I.11.3,
once it has been input. In Version 13 of Minitab it is somewhat easier to deal
with text data than in earlier versions, and this proviso is not as necessary.
Display I.4 provides an example of a worksheet. Notice that the columns are
labeled C1, C2, etc. and the rows are labeled 1, 2, 3, etc. We will refer to the
worksheet depicted in Display I.4 as the marks worksheet hereafter and will use
it throughout Part I to illustrate various Minitab commands and operations.
Data arises from the process of taking measurements of variables in some
real-world context. For example, in a population of students, suppose that we
are conducting a study of academic performance in a Statistics course.
Specifically, suppose that we want to examine the relationship between grades in
Statistics, grades in a Calculus course, grades in a Physics course, and gender.
So we collect the following information for each student in the study: student
number, grade in Statistics, grade in Calculus, grade in Physics, and gender.
Therefore, we have 5 variables: student number and the grades in the three
subjects are numeric variables, and gender is a text variable. Let us further
suppose that there are 10 students in the study.
Display I.4 gives a possible outcome from collecting the data in such a study.
Column C1 contains the student number (note that this is a categorical
variable even though it is a number). The student number primarily serves as an
identifier so that we can check that the data has been entered correctly. This is
something you should always do as a first step in your analysis. Columns C2-C4
contain the student grades in their Statistics, Calculus, and Physics courses
and column C5 contains the gender data. Notice that a column contains the
values collected for a single variable, and a row contains the values of all the
variables for a single student. Sometimes, a row is referred to as an observation
or case. Observe that the data for this study occupies a 10 × 5 subtable of the
full worksheet. All of the other blank entries of the worksheet can be ignored,
as they are undefined.
6 Minitab Commands
We will now begin to introduce various Minitab commands to get data into a
worksheet, edit a worksheet, perform various operations on the elements of a
worksheet, and save and access a saved worksheet. Before we do, however, it is
useful to know something about the basic structure of all Minitab commands.
Associated with every command is of course its name, as in File ▶ Exit and
Help. Most commands also take arguments, and these arguments are column
names, constants, and sometimes file names.
Commands can be accessed by making use of the File, Edit, Manip, Calc,
Stat, Graph and Editor entries in the menu bar. Clicking any of these brings
up a list of commands that you can use to operate on your worksheet. The lists
that appear may depend on which window is active, e.g., either a Data window
or the Session window. Unless otherwise specified, we will always assume that
the Session window is active when discussing menu commands. If a command
name in a list is faded, then it is not available.
Typically, using a command from the menu bar requires the use of a dialog
box or dialog window that opens when you click on a command in the list.
These are used to provide the arguments and subcommands to the command
and specify where the output is to go. Dialog boxes have various boxes that
must be filled in to correctly execute a command. Clicking in a box that needs
to be filled in typically causes a variable list to appear in the left-most box, of
all items in the active worksheet that can be placed in that box. Double clicking
on items in the variable list places them in the box, or, alternatively, you can
type them in directly. When you have filled in the dialog box and clicked OK,
the command is printed in the Session window and executed. Any output is
also printed in the Session window. Dialog boxes have a Help button that can
be used to learn how to make the entries.
For example, suppose that we want to calculate the mean of column C2
in the worksheet marks. Then the command Calc ▶ Column Statistics brings
up the dialog box shown in Display I.5. Notice that the radio button Sum is
filled in. Clicking the radio button labelled Mean results in this button being
filled in and the Sum button becoming empty. Whichever button is filled in will
result in that statistic being calculated for the relevant columns when we finally
implement the command by clicking OK.
Currently, there are no columns selected, but clicking in the Input variable
box brings up a list of possible columns in the display window on the left. The
Display I.5: Initial view of the dialog box for Column Statistics.
Display I.6: View of the dialog box for Column Statistics after selecting Mean and
bringing up the variable list.
Display I.7: Final view of the dialog box for Column Statistics.
Quite often, it is faster and more convenient to simply type your commands
directly into the Session window. Sometimes, it is necessary to use the Session
window approach, but for many commands the menu bar is available. So we
now describe the use of commands in the Session window.
The basic structure of such a command with n arguments is
command name E1, E2, ..., En
where Ei is the ith argument. Alternatively, we can write
command name E1 E2 ... En
if we don't want to type commas. Conveniently, if the arguments E1, E2, ..., En
are consecutive columns in the worksheet, we have the following short-form
command name E1-En
which saves even more typing and accordingly decreases our chance of making a
typing mistake. If you are going to type a long list of arguments and you don't
want them all on the same line, then you can type the continuation symbol &
where you want to break the line and then hit Enter. Minitab responds with
the prompt
CONT>
and you continue to type argument names. The command is executed when you
hit Enter after an argument name without a continuation character following
it.
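The three equivalent ways of listing arguments can be illustrated with a small Python sketch. This is not Minitab code: the helper name expand_args is hypothetical, and the sketch assumes the arguments are column names of the form c1, c2, and so on.

```python
import re

def expand_args(arg_string):
    """Expand a Minitab-style argument list into individual column names.

    Accepts comma- or blank-separated arguments and the short form Ci-Cj
    for consecutive columns. Hypothetical helper, for illustration only.
    """
    args = []
    for token in re.split(r"[,\s]+", arg_string.strip()):
        if not token:
            continue
        match = re.fullmatch(r"[cC](\d+)-[cC](\d+)", token)
        if match:
            # Short form: every column from the first to the last, inclusive.
            first, last = int(match.group(1)), int(match.group(2))
            args.extend(f"C{i}" for i in range(first, last + 1))
        else:
            args.append(token.upper())
    return args

print(expand_args("c2, c3, c4"))
print(expand_args("c2 c3 c4"))
print(expand_args("c2-c4"))
```

All three calls describe the same three columns, which is the point of the short form: fewer keystrokes for the same argument list.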
Many commands can, in addition, be supplied with various subcommands
that alter the behavior of the command. The structure for commands with
subcommands is
after you hit Enter. Clicking on it alternates between row-wise and
column-wise data entry. Certainly, this is an easy way to enter data when it is suitable.
Remember, columns are variables and rows are observations! Also, you can have
multiple data windows open and move data between them. Use the command
File ▶ New to open a new worksheet.
the Files of type box by selecting Text Files or perhaps All Files. Clicking on
[Link] results in the data being read into the worksheet.
Display I.8: Dialog box for importing data from external file.
Display I.9: Dialog box for selecting file from which data is to be read in.
Of course, this data set does not contain the text variable denoting the
student's gender. Suppose that the file [Link] contains the following
data exactly as typed.
12389 81 85 78 m
97658 75 72 62 m
53546 77 83 81 f
55542 63 42 55 m
11223 71 82 67 f
77788 87 56 * f
44567 23 45 35 m
32156 67 72 81 m
33456 81 77 88 f
67945 74 91 92 f
As this file contains text data in the fifth column, we must tell Minitab how
the data is formatted in the file. To access this feature we click on the Format
button in the dialog box shown in Display I.8. This brings up the dialog box
shown in Display I.10.
To indicate that we will specify the format, we click the radio button
User-specified format and fill the particular format into the box as shown in Display
I.11. The format statement says that we are going to read in the data
according to the following rule: a numeric variable occupying 5 spaces and with no
decimals, followed by a space, a numeric variable occupying 2 spaces with no
decimals, a space, a numeric variable occupying 2 spaces with no decimals, a
space, a numeric variable occupying 2 spaces with no decimals, a space, and
a text variable occupying 1 space. This rule must be rigorously adhered to or
errors will occur. So the rules you need to remember if you use formatted input
are that ak indicates a text variable occupying k spaces, kx indicates k spaces,
and fk.l indicates a numeric variable occupying k spaces, of which l are to the
right of the decimal point. Note that if a data value does not fill up the full
number of spaces allotted to it in the format statement, it must be right justified
in its field. Also, if a decimal point is included in the number, this occupies
one of the spaces allocated to the variable and similarly for a negative or plus
sign. There are many other features to formatted input that we will not discuss
here. Use the Help button in the dialog box for information on these features.
Finally, clicking on the OK button reads this data into a worksheet as depicted
in Display I.4. Typically, we try to avoid the use of formatted input because it
is somewhat cumbersome, but sometimes we must use it.
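To see why the format rule must be followed exactly, here is a minimal Python sketch of fixed-width reading. It is an illustration only, not how Minitab itself is implemented. The function parse_fixed and the list MARKS_FORMAT are hypothetical names. The sketch assumes * marks a missing value and that numbers carry any decimal point explicitly, so the l part of fk.l is ignored.

```python
def parse_fixed(line, fields):
    """Split one line of text according to (kind, width) field specs.

    kind is 'f' (numeric), 'a' (text) or 'x' (spaces to skip).
    A numeric field containing only '*' is returned as None (missing).
    """
    values, pos = [], 0
    for kind, width in fields:
        chunk = line[pos:pos + width]
        pos += width
        if kind == "x":
            continue                      # skipped spaces carry no data
        text = chunk.strip()
        if kind == "a":
            values.append(text)
        elif text == "*":
            values.append(None)
        else:
            values.append(float(text))
    return values

# Layout described in the text: f5.0, 1x, f2.0, 1x, f2.0, 1x, f2.0, 1x, a1
MARKS_FORMAT = [("f", 5), ("x", 1), ("f", 2), ("x", 1), ("f", 2),
                ("x", 1), ("f", 2), ("x", 1), ("a", 1)]

print(parse_fixed("12389 81 85 78 m", MARKS_FORMAT))
# The missing Physics grade is right justified in its 2-space field.
print(parse_fixed("77788 87 56  * f", MARKS_FORMAT))
```

Because each value is located purely by position, a value that is not right justified in its field shifts everything after it, which is the kind of error the text warns about.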
Display I.11: Dialog box for formatted input with the format filled in.
Display I.12: Dialog box for making patterned data with some entries filled in.
There is some shorthand associated with patterned data that can be very
convenient. For example, typing m:n in a Minitab command is equivalent to
typing the values m, m + 1, ..., n when m < n and m, m − 1, ..., n when m > n
and m when m = n. The expression m:n/d, where d > 0, expands to a list as
above but with the increment of d or −d, whichever is relevant, replacing 1 or
−1. If m < n then d is added to m until the next addition would exceed n and
if m > n then d is subtracted from m until the next subtraction would be lower
than n. The expression k(m:n/d) repeats m:n/d for k times while (m:n/d)l
repeats each element in m:n/d for l times. The expression k(m:n/d)l repeats
(m:n/d)l for k times.
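The expansion rules above can be written out as a short Python sketch. This is an illustration of the rules as stated, not Minitab's own implementation, and the function names expand and pattern are hypothetical. Exact fractions are used internally so that steps such as .2 do not accumulate rounding error.

```python
from fractions import Fraction

def expand(m, n, d=1):
    """Values of m:n/d, stepping by d toward n without passing it."""
    m, n, d = (Fraction(str(x)) for x in (m, n, d))
    if d <= 0:
        raise ValueError("the increment d must be positive")
    step = d if m <= n else -d
    values, x = [], m
    while (x <= n) if step > 0 else (x >= n):
        values.append(float(x))
        x += step
    return values

def pattern(m, n, d=1, k=1, l=1):
    """Values of k(m:n/d)l: each element l times, the whole list k times."""
    base = expand(m, n, d)
    return [x for _rep in range(k) for x in base for _dup in range(l)]

print(expand(1, 5))              # ascending, increment 1
print(expand(5, 1))              # descending, increment -1
print(expand(4, 3, 0.2))         # descending, increment -0.2
print(pattern(1, 3, k=2, l=2))   # 2(1:3)2
```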
The set command is available in the Session window to input patterned data.
For example, suppose we want C6 to contain the 10 entries 1, 2, 3, 4, 5, 5, 4, 3,
2, 1. The command
MTB > set c6
DATA> 1:5
DATA> 5:1
DATA> end
does this. Also, we can add elements in parentheses. For example, the command
MTB > set c6
DATA> (1:2/.5 4:3/.2)
DATA> end
creates the column with entries 1.0, 1.5, 2.0, 4.0, 3.8, 3.6, 3.4, 3.2, 3.0. The
multiplicative factors k and l can also be used in such a context. Obviously,
there is a great deal of scope for entering patterned data with set. The general
syntax of the set command is
set E1
where E1 is a column.
Display I.13: Dialog box for printing worksheet in the Session window.
The print command is available in the Session window and is often
convenient to use. The general syntax for the print command is
print E1 ... Em
where E1, ..., Em are columns and constants.
Display I.14: Filled in dialog box for assigning the constant k1 the value .5.
The let command is available in the Session window and is quite convenient.
The following commands make this assignment and then we check, using the
print command, that we have entered the constants correctly.
In the Session window, the name command is available for naming variables
and constants. For example, the commands
MTB > name c1 'studid' c2 'stats' c3 'calculus' &
CONT> c4 'physics' c5 'gender' &
CONT> k1 'weight1' k2 'weight2' k3 'weight3'
give the names studid to C1, stats to C2, calculus to C3, physics to C4,
gender to C5, weight1 to K1, weight2 to K2, and weight3 to K3. Notice that
we have made use of the continuation character & for convenience in typing in
the full input to name. When using the variables as arguments just enclose the
names in single quotes. For example,
MTB > print 'studid' 'calculus'
prints out the contents of these variables in the Session window.
Variable and constant names can be at most 31 characters in length, cannot
include the characters # and ' and cannot start with a leading blank or *. Recall
that Minitab is not case sensitive, so it does not matter if we use lower or upper
case letters when specifying the names.
MTB > info
   Column    Name      Count  Missing
A  C1        studid       10        0
   C2        stats        10        0
   C3        calculus     10        0
   C4        physics      10        1
A  C5        gender       10        0
   Constant  Name      Value
   K1        weight1   0.500000
   K2        weight2   0.250000
   K3        weight3   0.250000
Notice that the info command tells us how many missing values there are and
in what columns they occur and also the values of the constants.
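The Count and Missing figures reported by info can be reproduced by hand. The following Python sketch is an illustration only: it stores the marks data as a plain dictionary (a hypothetical layout, not Minitab's worksheet format), uses None for the missing Physics grade, and tallies each column with a hypothetical summarize function.

```python
# The marks data as listed in the data file; None stands for the missing value (*).
marks = {
    "studid":   [12389, 97658, 53546, 55542, 11223, 77788, 44567, 32156, 33456, 67945],
    "stats":    [81, 75, 77, 63, 71, 87, 23, 67, 81, 74],
    "calculus": [85, 72, 83, 42, 82, 56, 45, 72, 77, 91],
    "physics":  [78, 62, 81, 55, 67, None, 35, 81, 88, 92],
    "gender":   ["m", "m", "f", "m", "f", "f", "m", "m", "f", "f"],
}

def summarize(columns):
    """Return {name: (count, missing)} for each column."""
    return {name: (len(values), sum(v is None for v in values))
            for name, values in columns.items()}

for name, (count, missing) in summarize(marks).items():
    print(f"{name:<10}{count:>6}{missing:>9}")
```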
This information can also be accessed directly from the Project Manager
window via Window ▶ Project Manager.
Display I.16: Dialog box that determines how a block of copied cells is used, whether
being inserted into a worksheet or replacing a block of cells of the same size.
Display I.17: Dialog box for copying entries in columns and pasting them.
One can also delete selected rows from specified columns using Manip ▶ Delete
Rows and filling in the dialog box appropriately. Notice, however, that whenever
we delete a cell, the contents of the cells beneath the deleted one in that column
simply move up to fill the cell. The cell entry does not become missing; rather,
cells at the bottom of the column become undefined! If you delete an entire row,
this is not a problem because the rows below just shift up. For example, if we
delete the third row then in the new worksheet, after the deletion, the third row
is now occupied by what was formerly the fourth row. Therefore, you should be
very careful, when you are not deleting whole rows, to ensure that you get the
result you intended.
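The danger described here is easy to see in a small Python sketch, given purely as an illustration with columns held as plain lists. The helper delete_cell is a hypothetical name, and only the first five students' Statistics and Physics grades are used.

```python
from itertools import zip_longest

stats = [81, 75, 77, 63, 71]
physics = [78, 62, 81, 55, 67]

def delete_cell(column, position):
    """Return a copy of column with the entry at the 1-based position removed."""
    return column[:position - 1] + column[position:]

# Deleting the third cell of one column only: later entries move up,
# so rows no longer pair grades from the same student.
physics_cut = delete_cell(physics, 3)
misaligned = list(zip_longest(stats, physics_cut))  # None = undefined bottom cell
print(misaligned)

# Deleting the whole third row keeps every remaining row intact.
aligned = list(zip(delete_cell(stats, 3), delete_cell(physics, 3)))
print(aligned)
```

In the first result the third row pairs one student's Statistics grade with the next student's Physics grade, while in the second every remaining pair still belongs to a single student.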
Note that if you should delete all the entries from a column, this variable
is still in the worksheet, but it is empty now. If you wish to delete a variable
and all its entries, this can be accomplished from Manip ▶ Erase Variables and
filling in the dialog box appropriately. This is a good idea if you have a lot of
variables and no longer need some of them.
There are various commands in the Session window available for carrying
out these editing operations. For example, the restart command in the Session
window can be used to remove all entries from a worksheet. The let command
allows you to replace individual entries. For example,
MTB > let c2(2)=3
assigns the value 3 to the second entry in the column C2. The copy command
can be used to copy a block of cells from one place to another. The insert
command allows you to insert rows or observations anywhere in the worksheet.
The delete command allows you to delete rows. The erase command is
available for the deletion of columns or variables from the worksheet. As it is more
convenient to edit a worksheet by directly working on the worksheet and using
the menu commands, we do not discuss these commands further here.
in this dialog box work as described for the File ▶ Save Current Worksheet As
command, with the exception that we now type the name of the file we want to
open in the File name box and click on the Open button.
To print a worksheet, use the command File ▶ Print Worksheet. The dialog
box that subsequently pops up allows you to control the output in a number of
ways.
It may be that you would prefer to write out the contents of a worksheet to
an external file that can be edited by an editor or perhaps used by some other
program. This will not be the case if we save the worksheet as an .mtw file as
only Minitab can read these. To do this, use the command File ▶ Other Files
▶ Export Special Text, filling in the dialog box and specifying the destination
file when prompted. For example, if we want to save the contents of the marks
worksheet, this command results in the dialog box of Display I.21 appearing.
We have entered all five columns into the Columns to export box and have not
specified a format so the columns will be stored in the file with single blanks
separating the columns. Clicking the OK button results in the dialog box of
Display I.22 appearing. Here, we have typed in the name [Link] to hold the
contents. Note that while we have chosen a .dat type file, we also could have
chosen a .txt type file. Clicking on the Save button results in a file [Link]
being created in the folder data with contents as displayed in Display I.23.
Display I.21: Dialog box for saving the contents of a worksheet to an external
(non-Minitab) file.
Display I.22: Dialog box for selecting external file to hold contents of a worksheet.
In the Session window, the commands save and retrieve are available for
saving and retrieving a worksheet in the .mtw format and the command write
is available for saving a worksheet in an external file. We refer the reader to
help for a description of how these commands work.
10 Mathematical Operations
When carrying out a data analysis, a statistician is often called upon to transform
the data in some way. This may range from applying some simple transformation to
a variable to create a new variable (e.g., take the natural logarithm of every
grade in the marks worksheet) to combining several variables together to form
a new variable (e.g., calculate the average grade for each student in the marks
worksheet). In this section, we present some of the ways of doing this.
grade and placing the result in C6. Filling in the dialog box, corresponding to
Calc ▶ Calculator, as shown in Display I.24 accomplishes this when we click on
the OK button.
Note that we can either type the relevant expression into the Expression box or
use the buttons and double click on the relevant columns. Further, we type
the column where we wish to store the results of our calculation in the Store
result in variable box. These operations are done on the corresponding entries in
each column; corresponding entries in the columns are operated on according to
the formula we have specified, and a new column of the same length containing
all the outcomes is created. Note that the sixth entry in C6 will be * (missing)
because this entry was missing for C4.
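The entry-by-entry behavior, including how a missing value carries through, can be sketched in Python. This is an illustration only. It assumes the calculation in question is the average of the three grades in C2, C3 and C4, uses None in place of Minitab's * symbol, and the function name row_average is hypothetical.

```python
stats    = [81, 75, 77, 63, 71, 87, 23, 67, 81, 74]      # C2
calculus = [85, 72, 83, 42, 82, 56, 45, 72, 77, 91]      # C3
physics  = [78, 62, 81, 55, 67, None, 35, 81, 88, 92]    # C4, sixth entry missing

def row_average(*columns):
    """Average corresponding entries; a missing entry makes the result missing."""
    result = []
    for entries in zip(*columns):
        if any(e is None for e in entries):
            result.append(None)
        else:
            result.append(sum(entries) / len(entries))
    return result

c6 = row_average(stats, calculus, physics)
print(c6)
```

The new column has the same length as the inputs, and its sixth entry is missing because the Physics grade for that student is missing.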
These kinds of operations can also be carried out directly in the Session
window using the let command, and in some ways this is a simpler approach.
For example, the session command
accomplishes this.
We can also use these arithmetical operations on the constants K1, K2,
etc., and numbers to create new constants or use the constants as scalars in
operations with columns. For example, suppose that we want to compute the
weighted average of the Statistics, Calculus, and Physics grades where Statistics
gets twice the weight of the other grades. Recall that we created, as part of the
marks worksheet, the constants weight1 = .5, weight2 = .25, and weight3 =
.25 in K1, K2, and K3, respectively. So this weighted average is computed via
the command
and the result is placed in C7. We have used the continuation character & for
convenience in this computation. Alternatively, we could have used the Calc ▶
Calculator command as above for this.
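For readers who want to check such a computation outside Minitab, here is a minimal Python sketch of the weighted average. It is not Minitab code: the grade lists are hypothetical values rather than the marks worksheet, the function name weighted_average is introduced here for illustration, and treating a row with a missing grade as missing is an assumption.

```python
# Hypothetical grades for three students (not the marks worksheet).
statistics_marks = [81.0, 66.0, 92.0]
calculus_marks = [85.0, 63.0, 90.0]
physics_marks = [76.0, 71.0, None]   # None plays the role of Minitab's * (missing)

# Weights as stored in K1, K2, K3 in the text.
weight1, weight2, weight3 = 0.5, 0.25, 0.25

def weighted_average(s, c, p):
    """Return the weighted grade, or None if any grade is missing."""
    if s is None or c is None or p is None:
        return None
    return weight1 * s + weight2 * c + weight3 * p

# One result per row, like a new worksheet column.
c7 = [weighted_average(s, c, p)
      for s, c, p in zip(statistics_marks, calculus_marks, physics_marks)]
print(c7)
```

Because the three weights sum to 1, each result stays on the same scale as the original grades.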
Display I.25: Dialog box for mathematical calculations illustrating the use of the
natural logarithm function.
A complete list of such functions is given in the Functions window when All
functions is in the window directly above the list.
The same result can be obtained using the session command let and the
natural logarithm function loge. For example,
MTB >let c8=loge(c2)
calculates the natural log of every entry in C2 and places the results in C8. There
are a number of such functions and a complete list is provided in Appendix B.1.
These functions can be applied to numbers as well as constants. If you want to
know the sine of the number 3.4, then
MTB >let k4=sin(3.4)
MTB >print k4
K4 -0.255541
gives the value.
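As an independent check on such function values, the same two calculations can be sketched in Python. This is an illustration only, not Minitab syntax; the column c2 below holds hypothetical positive grades.

```python
import math

c2 = [81.0, 66.0, 92.0]            # hypothetical column of positive grades
c8 = [math.log(x) for x in c2]     # natural log, entry by entry
k4 = math.sin(3.4)                 # a function applied to a single number (radians)

print([round(v, 4) for v in c8])
print(round(k4, 6))                # compare with the K4 value printed by Minitab
```

Note that the sine is evaluated in radians, which is consistent with the value Minitab prints above.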
Notice that there are two choices for these operators; for example, use either
the symbol >= or the mnemonic ge.
The comparison and logical operators are useful when we have simple questions
about the worksheet that would be tedious to answer by inspection. This
feature is particularly useful when we are dealing with large data sets. For
example, suppose that we want to count the number of times the Statistics grade
was greater than the corresponding Calculus grade in the marks worksheet. The
command Calc ▶ Calculator gives the dialog box shown in Display I.28 where we
have put c6 in the Store result in variable box and c2 > c3 in the Expression
box. Clicking on the OK button results in the ith entry in C6 containing a 1
if the ith entry in C2 is greater than the ith entry in C3, i.e., the comparison
is true, and a 0 otherwise. In this case, C6 contains the entries: 0, 1, 0, 1, 0,
1, 0, 0, 1, 0, which the worksheet in Display I.4 verifies as appropriate. If we
use Calc ▶ Calculator to calculate the sum of the entries in C6, we will have
computed the number of times the Statistics grade is greater than the Calculus
grade.
These operations can also be simply carried out using session commands.
For example,
MTB >let c6=c2>c3
MTB >let k4=sum(c6)
MTB >print k4
K4 4.00000
accomplishes this.
The logical operators combine with the comparison operators to allow more
complicated questions to be asked. For example, suppose we wanted to calculate
the number of students whose Statistics mark was greater than their Calculus
mark and less than or equal to their Physics mark. The commands
MTB >let c6=c2>c3 and c2<=c4
MTB >let k4=sum(c6)
MTB >print k4
K4 1.00000
accomplish this. In this case, both conditions c2>c3 and c2<=c4 have to be
true for a 1 to be recorded in C6. Note that the observation with the missing
Physics mark is excluded. Of course, we can also implement this using Calc ▶
Calculator and filling in the dialog box appropriately.
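The logic of counting with comparison and logical operators can be sketched in Python as follows. This is not Minitab code: the marks are hypothetical, both_true is a name introduced here, and leaving a row out of the count when one of its marks is missing is an assumption chosen to mirror the remark above.

```python
# Hypothetical marks; None stands in for Minitab's missing value *.
c2 = [81, 66, 92, 75, 60]      # Statistics
c3 = [85, 63, 90, 70, 65]      # Calculus
c4 = [76, 71, None, 75, 50]    # Physics

def both_true(s, c, p):
    """1 if s > c and s <= p, 0 if not, None if a needed mark is missing."""
    if s is None or c is None or p is None:
        return None
    return int(s > c and s <= p)

c6 = [both_true(s, c, p) for s, c, p in zip(c2, c3, c4)]
k4 = sum(v for v in c6 if v is not None)   # missing rows are left out of the count
print(c6, k4)
```

Summing a column of 0s and 1s is what turns a row-by-row comparison into a count.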
Text variables can be used in comparisons where the ordering is alphabetical.
For example,
MTB >let c6=c5<"m"
puts a 1 in C6 whenever the corresponding entry in C5 is alphabetically smaller
than m.
11.1 Coding
The Manip ▶ Code command is used to recode columns. By this we mean that
data entries in columns are replaced by new values according to a coding scheme
that we must specify. You can recode numeric into numeric, numeric into text,
text into numeric, or text into text by choosing an appropriate subcommand.
For example, suppose in the marks worksheet we want to recode the grades in
C2, C3, and C4 so that any mark in the range 0–39 becomes an F, every mark
in the range 40–49 becomes an E, every mark in the range 50–59 becomes a
D, every mark in the range 60–69 becomes a C, every mark in the range 70–79
becomes a B, every mark in the range 80–100 becomes an A, and the results are
placed in columns C6, C7, and C8, respectively. Then the command Manip ▶
Code ▶ Numeric to Text brings up the dialog box shown in Display I.29. The
ranges for the numeric values to be recoded to a common text value are typed
in the Original values box, and the new values are typed in the New box. Note
that we have used a shorthand for describing a range of data values as discussed
in section 7.2. Because the sixth entry of C4 is *, i.e., it is missing, this value
is simply recoded as a blank. You can also recode missing values by including
* in one of the Original values boxes. If a value in a column is not covered by
one of the values in the Original values boxes, then it is simply left the same in
the new column.
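The coding scheme just described can be written out as a small Python sketch, which may help when checking a recoded column by hand. It is an illustration rather than Minitab's implementation: letter_grade and the sample marks are hypothetical, and an empty string is used here to represent the blank produced for a missing value.

```python
def letter_grade(mark):
    """Recode a numeric mark using the ranges given in the text."""
    if mark is None:            # missing value: recoded as a blank
        return ""
    bands = [(0, 39, "F"), (40, 49, "E"), (50, 59, "D"),
             (60, 69, "C"), (70, 79, "B"), (80, 100, "A")]
    for low, high, letter in bands:
        if low <= mark <= high:
            return letter
    return mark                 # values outside every range are left unchanged

c4 = [76, 39, 100, 55, 62, None, 45]      # hypothetical marks
c8 = [letter_grade(m) for m in c4]
print(c8)
```

With these ranges a fractional mark such as 39.5 falls in no range and is left as it is, so the ranges should be chosen to cover every value that can occur.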
Display I.29: Dialog box for recoding numeric values to text values.
Note that this menu command restricts the number of new code values to 8.
The session command code allows up to 50 new codes. For example, suppose
in the marks worksheet we want to recode the grades in C2, C3, and C4 so that
any mark in the range 0–9 becomes a 0, every mark in the range 10–19 becomes
10, etc., and the results are placed in columns C6, C7, and C8. The following
command
accomplishes this. Note the use of the continuation symbol &, as this is a long
command. The general syntax for the code command is
code (V1) to code1 ... (Vn) to coden for E1 ... Em put in Em+1 ... E2m
where Vi denotes a set of possible values and ranges for the values in columns
E1 ... Em that are all coded as the number codei, and the results of this coding
are placed in the columns Em+1 ... E2m, i.e., the recoded E1 is placed in Em+1,
etc.
where E1, ..., Em are text columns, and Em+1 is the target text column.
Display I.32: Dialog box for converting text column c5 of the marks worksheet into a
numeric column with the conversion table given in columns c6 and c7.
11.4 History
Minitab keeps a record of the commands you have used and the data you have
input in a session. This information can be obtained in the History folder of the
Project Manager window. The commands can be copied from wherever they are
listed and pasted into the Session window to be reexecuted, so that a number
of commands can be executed at once without retyping. These commands can
be edited before being executed again. This is very helpful when you have
implemented a long sequence of commands and realize that you made an error
early on. Note that even if you use the menu commands, a record is kept only
of the corresponding session commands.
The journal command is available in the Session window if you want to
keep a record of the commands in an external file. For example,
rank E1 E2
where E1 is the column whose ranks we want to compute, and E2 is the column
that will hold the computed ranks.
In the Session window, this same result can be obtained using the stack
command. The general syntax for the stack command is given by
stack E1 E2 ... Em into Em+1
where E1, E2, ..., Em denote the columns or constants to be stacked one on top
of the other, starting with E1, and with the result placed in column Em+1. If we
want to keep an index of where the values came from, then use the subcommand
subscripts Em+2
which results in index values being stored in column Em+2.
To unstack values in a column by the values in an index column we use the
Manip ▶ Unstack command. For example, given the columns C6 and C7 of
the marks worksheet as described above, the dialog box shown in Display I.36
unstacks C6 into three columns by the values in C7. The three columns are
C8, C9, and C10. Note that they are identical to columns C2, C3, and C4,
respectively. We must always specify a column containing the subscripts when
unstacking a column.
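To make the relationship between stacking and unstacking concrete, here is a minimal Python sketch. It is not Minitab code: the functions stack and unstack are written here for illustration, the marks are hypothetical, and using 1, 2, 3, ... as the subscript values is an assumption.

```python
def stack(*columns):
    """Stack columns top to bottom; also return a subscript for each value."""
    values, subscripts = [], []
    for index, column in enumerate(columns, start=1):
        values.extend(column)
        subscripts.extend([index] * len(column))
    return values, subscripts

def unstack(values, subscripts):
    """Split values into one list per distinct subscript, in sorted order."""
    groups = {}
    for value, key in zip(values, subscripts):
        groups.setdefault(key, []).append(value)
    return [groups[key] for key in sorted(groups)]

c2, c3, c4 = [81, 66], [85, 63], [76, 71]     # hypothetical marks
c6, c7 = stack(c2, c3, c4)                    # stacked values and their subscripts
c8, c9, c10 = unstack(c6, c7)                 # recovers the original columns
print(c6, c7)
```

The subscript column is what makes the operation reversible, which is why it must be supplied when unstacking.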
12 Exercises
1. The following data give the Hi and Low trading prices in Canadian dollars
for various stocks on a given day on the Toronto Stock Exchange. Create
a worksheet, giving the columns the same variable names, using any of
the methods discussed in I.7. Be careful to ensure that the value of the
variable stock starts with a letter. Print the worksheet to check that
you have successfully entered it. Save the worksheet giving it the name
stocks.
Stock   Hi       Low
ACR     7.95     7.80
MGI     4.75     4.00
BLD     112.25   109.75
CFP     9.65     9.25
MAL     8.25     8.10
CM      45.90    45.30
AZC     1.99     1.93
CMW     20.00    19.00
AMZ     2.70     2.30
GAC     52.00    50.25
CLV     1.85     1.78
SIL     34.00    34.00
AC      14.45    14.05
5. Using the worksheet created in Exercise 4, sort the stocks into alphabetical
order. Calculate the ranks of the individual stocks based on their Hi price,
and save the ranking in a new column. Save the worksheet.
6. Using the worksheet created in Exercise 5, calculate the average Hi price
of all the stocks beginning with A.
7. Using the worksheet created in Exercise 5, recode all the Low prices in the
range $0–9.99 as 1, in the range $10–39.99 as 2, and greater than or equal
to $40 as 3, and save the recoded variable in a new column.
8. Using patterned data input, place the values from -10 to 10 in increments
of .1 in C1. For each of the values in C1, calculate the value of the
quadratic polynomial $2x^2 + 4x - 3$ (i.e., substitute the value in each entry
in C1 into this expression) and place these values in C2. Using Minitab
commands and the values in C1 and C2, estimate the point in the range
from -10 to 10 where this polynomial takes its smallest value and what
this smallest value is. Using Minitab commands and the values in C1 and
C2, estimate the points in the range from -10 to 10 where this polynomial
is closest to 0.
9. Using patterned data input, place values in the range from 0 to 5 using an
increment of .01 in C1. Calculate the value of $1 - e^{-x}$ for each value in
C1, and place the result in C2. Using Minitab commands, find the largest
value in C1 where the corresponding entry in C2 is less than or equal to .5.
Note that $e^x$ corresponds to the exponentiate command (see Appendix
B.1) evaluated at x.
10. Using patterned data input, place values in the range from -4 to 4 using
an increment of .01 in C1. Calculate the value of
$$\frac{1}{\sqrt{2\pi}}\, e^{-x^2/2}$$
for each value in C1, and place the result in C2, where $\pi = 3.1415927$.
Using parsums (see Appendix B.1), calculate the partial sums for C2,
and place the result in C3. Multiply C3 times .01. Find the largest value
in C1 such that the corresponding entry in C3 is less than or equal to .25.
Part II
Chapter 1
Looking at
Data–Distributions
This chapter of IPS is concerned with the various ways of presenting and
summarizing a data set. By presenting data, we mean convenient and informative
methods of conveying the information contained in a data set. There are two
basic methods for presenting data, namely graphically and through tabulations.
Still, it can be hard to summarize exactly what these presentations are saying
about the data. So the chapter also introduces various summary statistics that
are commonly used to convey meaningful information in a concise way.
All of these topics can involve much tedious, error-prone calculation, if we
were to insist on doing them by hand. An important point is that you should
almost never rely on hand calculation in carrying out a data analysis. Not only
are there many far more important things for you to be thinking about, as the
text discusses, but you are also likely to make an error. On the other hand,
never blindly trust the computer! Check your results and make sure that they
make sense in light of the application. For this, a few simple hand calculations
can prove valuable. In working through the problems in IPS, you should try to
use Minitab as much as possible, as this will increase your skill with the package
and inevitably make your data analyses easier and more effective.
 28  22  36  26  28  28
 26  24  32  30  27  24
 33  21  36  32  31  25
 24  25  28  36  27  32
 34  30  25  26  26  25
-44  23  21  30  33  29
 27  29  28  22  26  27
 16  31  29  36  32  28
 40  19  37  23  32  29
 -2  24  25  27  24  16
 29  20  28  27  39  23
Next we used the Stat ▶ Tables ▶ Tally command, with the dialog box shown
in Display 1.2,
Display 1.2: Dialog box for tallying the variable C2 in the newcomb worksheet.
first use the sort command to sort the data in C1 from smallest to largest and
place the results in C3. The cumulative distribution is computed for the values
in C3 with the unique values in C3 stored in C4 and the cumulative distribution
at each of the unique values stored in C5 via the store subcommand to tally.
Display 1.3: Dialog box for computing basic descriptive statistics of a quantitative
variable.
If we wish to compute some basic statistics and store these values for later
use, then the Stat ▶ Basic Statistics ▶ Store Descriptive Statistics command is
available for this. For example, with the newcomb worksheet this command leads
to the dialog box shown in Display 1.4. Clicking on the Statistics button results
in the dialog box of Display 1.5 where we have checked First quartile, Median,
Third quartile, Interquartile range, and N nonmissing as the statistics we want
to compute. The result of these choices is that the next available variables in
the worksheet contain these values. So in this case, the values of C3–C7 are as
depicted in Display 1.6. Note that these variables are now named as well. Note
that many more statistics are available using this command.
Display 1.4: Dialog box for computing and storing various descriptive statistics.
Display 1.5: Dialog box for choosing the descriptive statistics to compute and store.
Display 1.6: Values obtained for descriptive statistics using dialog boxes in Figures
1.4 and 1.5.
describe E1 ... Em
where E1, ..., Em are columns of quantitative variables and the command is
applied to each column. A by subcommand can also be used. The stats
command is available in the Session window if we want to store the values of
statistics. We refer the reader to help for a description of this command.
1.2.1 Dotplots
The Graph ▶ Dotplot command is used with quantitative variables and produces
a plot of each data value as a dot along the x-axis so that you get a general
idea of the location of the data and how much scatter there is. Actually, the
data are grouped before plotting and multiple observations in a group are stacked
over the x-axis. The interval between successive tick (+) marks on the x-axis
is divided into 10 equal-length subintervals for the grouping. Typically, one
also looks for points that are far from the main scatter of points as these may
be identified as outliers and, as such, deleted from the data set for subsequent
analysis. For example, for the newcomb worksheet the dialog box in Display 1.7
results in the plot of Display 1.8.
The general syntax of the corresponding Session command dotplot is
dotplot E1 ... Em
where E1, ..., Em are columns, and a dotplot is produced for each. There are a
number of subcommands available. The same subcommand ensures the scales
of the dotplots are the same for each column. The by subcommand allows
plotting of a variable by the values of another variable with all plots having
the same scale. The increment subcommand allows for control of the distance
between the tick marks, and start and end allow you to specify where the
dotplot should begin and end. For example,
MTB >dotplot c1;
SUBC>increment=5;
SUBC>start=20 end=35.
puts the tick marks 5 units apart, starts the plot at 20, and ends it at 35, so
some points are not plotted in this case.
Stem-and-leaf of time N = 66
Leaf Unit = 1.0
1 -4 4
1 -3
1 -2
1 -1
2 -0 2
2 0
5 1 669
(41) 2 01122333444445555566666777777888888899999
20 3 0001122222334666679
1 4 0
which is a stem-and-leaf plot of the values in time. The first column gives the
depths for a given stem, i.e., the number of observations on that line and below
it or above it, depending on whether or not the observation is below or above
the median. The row containing the median is enclosed in parentheses ( ), and
the depth is only the observations on that line. If the number of observations is
even and the median is the average of values on different rows, then parentheses
do not appear. The second column gives the stems, as determined by Minitab,
and the remaining columns give the ordered leaves, where each digit represents
one observation. The Leaf Unit determines where the decimal place goes after
each leaf. So in this example, the first observation is -44.0, while it would be
-4.4 if the Leaf Unit were .1. Multiple stem-and-leaf plots can be carried out
for a number of columns simultaneously and also for a single variable by the
values of another variable.
1.2.3 Histograms
A histogram is a plot where the data are grouped into intervals, and over each
such interval a bar is drawn of height equal to the frequency of data values in
that interval or of height equal to the relative frequency (proportion) of data
values in that interval or of height equal to the density of points in that interval,
i.e., the proportion of points in the interval divided by the length of the interval.
The Graph ▶ Histogram command is used to obtain these plots.
For example, using this command with the newcomb worksheet produces
the dialog box shown in Display 1.9. We have placed the variable time in the
first X box to indicate we want a histogram of this variable. We can produce
multiple histograms by placing more variables in the X boxes. To select the type
of histogram to plot, we next click on the Options button, which produces the
dialog box of Display 1.10. Here, we have selected a density histogram and have
specified the intervals to use for grouping the data by specifying the cutpoints
-45, -30, -15, 0, 15, 30, 45, which prescribe the intervals [-45, -30), [-30, -15),
etc., for the grouping. Alternatively, we could have specified the midpoints of
the grouping intervals. The advantage with cutpoints is that subintervals of
unequal lengths can be specified. Clicking on the OK buttons in these boxes
produces the histogram shown in Display 1.11. As can be seen from the dialog
box of Display 1.9, there are a variety of methods for controlling the appearance
of the histogram produced, and we refer the reader to the Help button for a
description of these.
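The bar heights of a density histogram are easy to check by hand, and the calculation can be sketched in Python as follows. This is only an illustration, not what Minitab runs internally: density_heights is a name introduced here, the ten data values are illustrative, and intervals are taken to be closed on the left and open on the right as in the cutpoint description above.

```python
def density_heights(data, cutpoints):
    """Bar heights for a density histogram on [c0, c1), [c1, c2), ..."""
    n = len(data)
    heights = []
    for left, right in zip(cutpoints, cutpoints[1:]):
        count = sum(1 for x in data if left <= x < right)
        heights.append((count / n) / (right - left))   # proportion / interval length
    return heights

data = [-44, -2, 16, 16, 19, 24, 28, 29, 36, 40]       # ten illustrative values
cutpoints = [-45, -30, -15, 0, 15, 30, 45]
heights = density_heights(data, cutpoints)

# The areas of the bars (height times width) add up to 1.
total_area = sum(h * (r - l) for h, l, r in zip(heights, cutpoints, cutpoints[1:]))
print(heights, total_area)
```

Dividing by the interval length is what allows intervals of unequal lengths to be compared fairly, since the area of each bar, not its height, then represents the proportion of the data.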
Display 1.9: Dialog box for creating a histogram of the time variable in the newcomb
worksheet.
Display 1.10: Dialog box for selecting the type of histogram to plot.
Display 1.11: Density histogram of the time variable in the newcomb worksheet.
1.2.4 Boxplots
Boxplots are useful summaries of a quantitative variable and are obtained using
the Graph ▶ Boxplot command. Boxplots are used to provide a graphical
notion of the location of the data and its scatter in a concise and evocative way.
For example, in the newcomb worksheet this command produces the dialog box
shown in Display 1.12 and the plot in Display 1.13. The line in the center of the
box is the median. The line below the median is the first quartile, also called the
lower hinge, and the line above is the third quartile, also called the upper hinge.
The difference between the third and first quartiles is called the interquartile
range or IQR. The vertical lines from the hinges are called whiskers, and these
run from the hinges to the adjacent values. The adjacent values are given by the
greatest value less than or equal to the upper limit (the third quartile plus 1.5
times the IQR) and by the least value greater than or equal to the lower limit
(the first quartile minus 1.5 times the IQR). The upper and lower limits are
also referred to as the inner fences. The outer fences are defined by replacing
the multiple 1.5 in the definition of the inner fences by 3.0. Values beyond the
outer fences are plotted with a * and are called outliers. As with the plotting
of histograms, multiple boxplots can be plotted for comparison purposes, and
again, it is important to make sure that they all have the same scale.
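The quantities that make up a boxplot can be computed by hand as a check, and the following Python sketch shows one way to do so. It is an illustration under stated assumptions, not Minitab's own routine: boxplot_summary is a name introduced here, the data are illustrative, and the quartiles are computed with Python's "exclusive" method (interpolation at position (n + 1)p), which may differ slightly from the convention of a given package.

```python
import statistics

def boxplot_summary(data):
    """Quartiles, fences, adjacent values and points beyond the outer fences."""
    q1, median, q3 = statistics.quantiles(data, n=4, method="exclusive")
    iqr = q3 - q1
    inner = (q1 - 1.5 * iqr, q3 + 1.5 * iqr)     # lower and upper limits
    outer = (q1 - 3.0 * iqr, q3 + 3.0 * iqr)
    adjacent = (min(x for x in data if x >= inner[0]),
                max(x for x in data if x <= inner[1]))
    beyond_outer = [x for x in data if x < outer[0] or x > outer[1]]
    return q1, median, q3, inner, outer, adjacent, beyond_outer

data = [-44, -2, 16, 19, 24, 26, 27, 28, 29, 32, 36]   # illustrative values
q1, median, q3, inner, outer, adjacent, beyond_outer = boxplot_summary(data)
print(q1, median, q3, inner, outer, adjacent, beyond_outer)
```

Here the whiskers would run from the hinges to the adjacent values, and only the value beyond the outer fences would be flagged.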
Display 1.12: Dialog box for producing a boxplot of the time variable in the newcomb
worksheet.
Display 1.14: Dialog box for a time series plot of the variable time from the newcomb
worksheet.
Display 1.15: Time series plot of the variable time from the newcomb worksheet.
$$\frac{1}{\sqrt{2\pi}\,\sigma}\, e^{-\frac{1}{2}\left(\frac{z-\mu}{\sigma}\right)^{2}}$$
where E1, ..., Em are columns or constants containing numbers and Em+1, ...,
E2m are the columns or constants that store the values of the N(μ, σ) density
curve at these numbers and V1 = μ and V2 = σ. If no storage is specified, then
the values are printed. For example, if we want to compute the N(-.5, 1.2) density
curve at every value between -3 and 3 in increments of .01, the commands
MTB >set c1
DATA>-3:3/.01
DATA>end
MTB >pdf c1 c2;
SUBC>normal mu=-.5 sigma=1.2.
put the values between -3 and 3 in increments of .01 in C1 using the set
command. The pdf command with the normal subcommand calculates the
N(-.5, 1.2) density curve at each of these values and puts the outcomes in the
corresponding entries of C2. If we plot C2 against C1, we will have a plot of
the density curve of this distribution. For this, we use the scatterplot facilities
in Minitab as discussed in II.3. Note that with the normal subcommand we
must also specify the mean and the standard deviation via mu and sigma.
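As a cross-check on the density values, the same grid and density curve can be computed with a short Python sketch. This is not Minitab code; normal_pdf is a helper written here from the usual formula for the normal density, and the rounding of the grid values is a choice made to avoid floating-point drift.

```python
import math

def normal_pdf(z, mu, sigma):
    """Height of the N(mu, sigma) density curve at z."""
    return math.exp(-0.5 * ((z - mu) / sigma) ** 2) / (math.sqrt(2 * math.pi) * sigma)

mu, sigma = -0.5, 1.2
c1 = [round(-3 + 0.01 * i, 2) for i in range(601)]     # -3, -2.99, ..., 3
c2 = [normal_pdf(z, mu, sigma) for z in c1]

peak = max(c2)
print(len(c1), c1[c2.index(peak)], round(peak, 4))
```

The curve is highest at the mean, which gives a quick sanity check on the stored column.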
in this case, in the dialog box of Display 1.17 we select Inverse cumulative
probability instead. Making this change in the dialog box of Display 1.17 and
replacing 11 by .75 (recall that the argument to this function must be between
0 and 1), we get the output
P( X <= x ) x
0.7500 10.6745
in the Session window. This indicates that the area to the left of 10.6745
underneath the N(-.5, 1.2) density curve is .75.
The general syntax of the corresponding session command invcdf with the
normal subcommand is
invcdf E1 ... Em into Em+1 ... E2m;
normal mu = V1 sigma = V2.
where E1, ..., Em are columns or constants containing numbers between 0 and
1 and Em+1, ..., E2m are the columns or constants that store the values of the
percentiles of the N(μ, σ) density curve at these numbers and where V1 = μ
and V2 = σ. If no storage is specified, then the values are printed.
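The idea of an inverse cumulative probability can be illustrated with a minimal Python sketch using the standard library's NormalDist class. This is not Minitab code; normal_percentile is a name introduced here, and the standard normal distribution is used purely for illustration rather than the distribution of the example above.

```python
from statistics import NormalDist

def normal_percentile(p, mu, sigma):
    """Value x with area p to its left under the N(mu, sigma) density curve."""
    if not 0 < p < 1:
        raise ValueError("p must be strictly between 0 and 1")
    return NormalDist(mu, sigma).inv_cdf(p)

x = normal_percentile(0.75, 0.0, 1.0)       # standard normal, for illustration
area = NormalDist(0.0, 1.0).cdf(x)          # the cdf undoes the inverse cdf
print(round(x, 4), round(area, 4))
```

Applying the cumulative distribution function to the result recovers the original probability, which is a convenient way to verify any percentile.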
Display 1.19: Normal probability plot of the time variable in the newcomb worksheet.
1.4 Exercises
When the data for an exercise come from an exercise in IPS, the IPS exercise
number is given in parentheses ( ). All computations in these exercises are
to be carried out using Minitab, and the exercises are designed to ensure that
you have a reasonable understanding of the Minitab material in this chapter.
Generally, you should be using Minitab to do all the computations and plotting
required for the problems in IPS.
5. (1.29) Use Minitab commands for the stemplot and the time plot. Use
Minitab commands to compute a numerical summary of this data, and
justify your choices.
6. (1.30) Transform the data in this problem by subtracting 5 from each value
and multiplying by 10. Calculate the means and standard deviations,
using any Minitab commands, of both the original and transformed data.
Compute the ratio of the standard deviation of the transformed data to
the standard deviation of the original data. Comment on this value.
7. (1.30) Transform this data by multiplying each value by 3. Compute
the ratio of the standard deviation to the mean (called the coefficient of
variation) for the original data and for the transformed data. Justify the
outcome.
8. For the N(6, 1.1) density curve, compute the area between the interval
(3, 5) and the density curve. What number has 53% of the area to the left
of it for this density curve?
9. Use Minitab commands to verify the 68-95-99.7 rule for the N(2, 3) density
curve.
10. Calculate and store the values of the N(0, 1) density curve at each value
in [-3, 3] using an increment of .01. Put the values in the interval [-3, 3]
in C1 and the values of the density curve in C2. Using the command plot
C2*C1, plot the density curve. Comment on the shape of this curve.
11. Use Minitab commands to make the normal quantile plots presented in
Figures 1.31 and 1.32 of IPS.
Chapter 2
Looking at
Data–Relationships
In this chapter, Minitab commands are described that permit the analysis of
relationships between two variables. The methods are different depending on
whether both variables are quantitative, both variables are categorical,
or one is quantitative and the other is categorical. This chapter considers
relationships between two quantitative variables with the remaining cases discussed
in later chapters. Graphical methods are very useful in looking for relationships
among variables, and we examine various plots for this.
2.1 Scatterplots
A scatterplot of two quantitative variables is a useful technique when looking
for a relationship between two variables. By a scatterplot we mean a plot of
one variable on the y-axis against the other variable on the x-axis. For example,
consider Example 2.4 in IPS, where we are concerned with the relationship
between the length of the femur and the length of the humerus in an extinct
species. Suppose that we have input the data so that length of the femur
measurements are in C1, which has been named femur, and the length of the
humerus measurements are in C2, which has been named humerus, of the
worksheet archaeopteryx. The command Graph ▶ Plot produces the dialog box of
Display 2.1, where we have placed femur into the first box for the y variable
and humerus in the first box for the x variable. This produces the plot shown in
Display 2.2. Note that we could alter the plotting symbol using the dialog box
that appears when we click on the Edit Attributes box. Using the dialog box
that appears when you click on the Annotation button, it is possible to give the
plot a title, label plotted points, etc. Using the dialog box that appears when
you click on the Frame button, you can change the labels on the axes. Rather
than just plotting the points in a scatterplot, you can add connection lines (join
the points with lines), add projection lines (drop a line from each point to the
x-axis), and add areas (fill in the area under a polygon joining the points).
Also, you can employ the scatterplot smoother lowess to plot a piecewise linear
continuous curve through the scatter of points. These features are available via
Graph ▶ Plot ▶ Display. There are a number of other features that allow you
to control the appearance of the plot.
[Plot: femur on the y-axis against humerus on the x-axis.]
Display 2.2: Scatter plot of femur length (C1) versus humerus length (C2) of
Example 2.4 in IPS.
It is also possible to have multiple scatterplots on the same plot. For example,
suppose that C3 in the archaeopteryx worksheet contains the natural log
of the femur variable. We obtained the plot of Display 2.3 by adding another
pair of variables to the second Graph variables box as in Display 2.1 with C3
as the y variable and humerus as the x variable. To put these scatterplots on
the same plot use Frame ▶ Multiple Graphs and click on the Overlay graphs
on the same page radio button.
[Plot: overlaid scatterplots with the y-axis labeled femur and humerus on the x-axis.]
The technique of brushing is available after obtaining the plot to see which
observations (rows) the points correspond to. This is helpful in identifying the
points that correspond to outliers. Brushing is accessed from the toolbar just
below the menu bar by clicking on the brush when the Graph window is active.
The corresponding session command is plot. For example,
MTB > plot femur*humerus
produces the plot of Display 2.2. Note that the first variable is plotted along the
y-axis, and the second variable is plotted along the x-axis. There are various
subcommands that can be used with plot, and we refer the reader to Help for
a description of these.
There are a number of additional plots available in Minitab that are related
to the scatterplot. For example, a marginal plot of two variables is a scatterplot
of one variable against the other where in addition histograms, dotplots or
boxplots are plotted along the sides of the scatterplot for each variable. These
are available via the menu command Graph ▶ Marginal Plot. Draftsman plots
allow you to produce a number of scatterplots in a rectangular array so that
they can be compared. For example, you may want to plot C1 against C3, C2
against C3, C1 against C4, and C2 against C4 and see all of these in a common
plot. This capability is available via the menu command Graph ▶ Draftsman
Plot and filling in the dialog box. Matrix plots provide a mechanism for placing
a number of scatterplots in a rectangular array or matrix so that they can be
directly compared or examined for relationships. Matrix plots are available via
2.2 Correlations
While a scatterplot is a convenient graphical method for assessing whether or
not there is any relationship between two variables, we would also like to assess
this numerically. The correlation coefficient provides a numerical summarization
of the degree to which a linear relationship exists between two quantitative
variables, and this can be calculated using the Stat ▶ Basic Statistics ▶
Correlation command. For example, applying this command to the femur and
humerus variables of the worksheet archaeopteryx, i.e., the data of Example
2.4 in IPS and depicted in Display 2.2, we obtain the output
Pearson correlation of femur and humerus = 0.994
P-Value = 0.001
in the Session window. For now, we ignore the number recorded as P-Value.
The general syntax of the corresponding session command correlate is given
by
correlate E1 ... Em
where E1, ..., Em are columns corresponding to numerical variables, and a
correlation coefficient is computed between each pair. This gives m(m - 1)/2
correlation coefficients. The subcommand nopvalues is available if you want
to suppress the printing of P-values.
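A hand check of a correlation coefficient is straightforward, and the following Python sketch shows the calculation. It is not Minitab code: the function correlation is written here from the usual formula, and the five femur and humerus lengths are assumed values that are not listed in this section, so treat them as illustrative.

```python
import math

def correlation(x, y):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

# Assumed values for five specimens; they do not appear in this section.
femur = [38, 56, 59, 64, 74]
humerus = [41, 63, 70, 72, 84]
r = correlation(femur, humerus)
print(round(r, 3))
```

The correlation is symmetric in its two arguments, so exchanging the columns gives the same value.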
2.3 Regression
Regression is another technique for assessing the strength of a linear relationship
existing between two variables and it is closely related to correlation. For this,
we use the Stat ▶ Regression command.
As noted in IPS, the regression analysis of two quantitative variables involves
computing the least-squares line y = a + bx, where one variable is taken to be
the response variable y and the other is taken to be the explanatory variable
x. Note that the least-squares line is different depending upon which choice is
made. For example, for the data of Example 2.4 in IPS and plotted in Display
2.2, letting femur be the response and humerus be the predictor or explanatory
variable, the Stat ▶ Regression ▶ Regression command leads to the dialog box
of Display 2.4, where we have made the appropriate entries in the Response and
Predictors boxes. Clicking on the OK button leads to the output of Display
2.5 being printed in the Session window. This gives the least-squares line as
y = 3.70 + .826x, i.e., a = 3.70 and b = .826, which we also see under the Coef
column in the first table. In addition, we obtain the value of the square of the
correlation coefficient, also known as the coefficient of determination, as R-Sq
= 98.8%. We will discuss the remaining output from this command in II.10.
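The remark that the least-squares line depends on which variable is taken as the response can be checked with a small Python sketch. It is not Minitab code: least_squares is a helper written here from the usual formulas, and the five pairs of lengths are assumed values that are not listed in this section.

```python
def least_squares(x, y):
    """Intercept a and slope b of the least-squares line y = a + b*x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((u - mean_x) * (v - mean_y) for u, v in zip(x, y))
    sxx = sum((u - mean_x) ** 2 for u in x)
    b = sxy / sxx
    return mean_y - b * mean_x, b

# Assumed values for five specimens; they do not appear in this section.
humerus = [41, 63, 70, 72, 84]
femur = [38, 56, 59, 64, 74]

a, b = least_squares(humerus, femur)           # femur as the response
a_rev, b_rev = least_squares(femur, humerus)   # humerus as the response
print(round(a, 2), round(b, 3))
print(round(a_rev, 2), round(b_rev, 3))
print(round(b * b_rev, 3))                     # product of slopes = squared correlation
```

The two fitted lines differ, but the product of their slopes equals the square of the correlation coefficient, which ties the regression output back to the correlation.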
Glvsod| 51:= Gldorj er{ iru frqwuroolqj rxwsxw iru d uhjuhvvlrq dqdo|vlv1
You will probably want to keep these values for later work. In this case, clicking
on the Storage button of Display 2.4 and filling in the ensuing dialog box as
in Display 2.8 results in these quantities being saved in the next two available
columns (in this case, C3 and C4) with the names resi1 and fits1 for the
residuals and fits, respectively.
Display 2.8: Dialog box for storing various quantities computed as part of a
regression analysis.
Even more likely is that you will want to plot the residuals as part of assessing
whether or not the assumptions that underlie a regression analysis make sense
in the particular application. For this, click on the Graphs button in the dialog
box of Display 2.4. The dialog box of Display 2.9 becomes available. Notice that
we have requested that the standardized residuals (each residual divided by
its standard error) be plotted, and this plot appears in Display 2.10. All the
standardized residuals should be in the interval (-3, 3), and no pattern should
be discernible. In this case, this residual plot looks fine. From the dialog box of
Display 2.9, we see that there are many other possibilities for residual plots.
Display 2.9: Dialog box for selecting various residual plots as part of a regression
analysis.
Display 2.10: Plot of the standardized residuals versus humerus after regressing
femur against humerus in the archaeopteryx worksheet.
2.4 Transformations
Sometimes, transformations of the variables are appropriate before we carry
out a regression analysis. This is accomplished in Minitab using the Calc ▶
Calculator command and the arithmetical and mathematical operations
discussed in I.10.1 and I.10.2. In particular, when a residual plot looks bad,
sometimes this can be fixed by transforming one or more of the variables using a
simple transformation, such as replacing the response variable by its logarithm
or something else. For example, if we want to calculate the cube root (i.e., x^(1/3))
of every value in C1 and place these in C2, we use the Calc ▶ Calculator
command and the dialog box as depicted in Display 2.11. Alternatively, we
could use the session command let as in
MTB > let c2=c1**(1/3)
which produces the same result.
2.5 Exercises
When the data for an exercise come from an exercise in IPS, the IPS exercise
number is given in parentheses ( ). All computations in these exercises are
to be carried out using Minitab, and the exercises are designed to ensure that
you have a reasonable understanding of the Minitab material in this chapter.
Generally, you should be using Minitab to do all the computations and plotting
required for the problems in IPS.
1. (2.10) Calculate the least-squares line and make a scatterplot of Fuel used
against Speed together with the least-squares line. Plot the standardized
residuals against Speed. What is the squared correlation coefficient
between these variables?
2. (2.11) Make a scatterplot of Rate against Mass where the points for
different Sexes are labeled differently (use Minitab for the labeling, too) and
with the least-squares line on it. Hint: Make use of the stack command
discussed in I.11.7.
3. Place the values 1 through 100 with an increment of .1 in C1 and the
square of these values in C2. Calculate the correlation coefficient between
C1 and C2. Multiply each value in C1 by 10, add 5, and place the results
in C3. Calculate the correlation coefficient between C2 and C3. Why are
these correlation coefficients the same?
4. Place the values 1 through 100 with an increment of .1 in C1 and the
square of these values in C2. Calculate the least-squares line with C2 as
response and C1 as explanatory variable. Plot the standardized residuals.
If you see such a pattern of residuals, what transformation might you use
to remedy the problem?
5. (2.54) For the data in this problem, numerically verify the algebraic
relationship that exists between the correlation coefficient and the slope of
the least-squares line.
6. For Example 2.17 in IPS, calculate the least-squares line and reproduce
Display 2.21. Calculate the sum of the residuals and the sum of the
squared residuals and divide this by the number of data points minus 2.
Is there anything you can say about what these quantities are equal to in
general?
7. (2.62) Use Minitab to do all the calculations in this problem.
8. Place the values 1 through 10 with an increment of .1 in C1, and place
exp(1 + 2x) of these values in C2. Calculate the least-squares line using
C2 as the response variable, and plot the standardized residuals against
C1. What transformation would you use to remedy this residual plot?
What is the least-squares line when you carry out this transformation?
Chapter 3
Producing Data
This chapter is concerned with the collection of data, perhaps the most
important step in a statistical problem, as this determines the quality of whatever
conclusions are subsequently drawn. A poor analysis can be fixed if the data
collected are good by simply redoing the analysis. But if the data have not been
appropriately collected, then no amount of analysis can rescue the study. We
discuss Minitab commands that enable you to generate samples from
populations and also to randomly allocate treatments to experimental units.
Minitab uses computer algorithms to mimic randomness. Still, the results
are not truly random. In fact, any simulation in Minitab can be repeated, with
exactly the same results being obtained, using the Calc ▶ Set Base command.
For example, in the dialog box of Display 3.1 we have specified the base, or seed,
random number as 1111089. The base can be any integer. When you want to
repeat the simulation, you give this command, with the same integer. Provided
you use the same simulation commands, you will get the same results. This can
also be accomplished using the session command base S, where S is an integer.
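The role of the base can be illustrated outside Minitab with any seeded pseudorandom generator. The following Python sketch is an analogy only: Python's generator is not Minitab's, so the numbers drawn will differ from Minitab's for the same integer, and the function name draw_sample and the population size of 1000 are our own assumptions.

```python
import random

def draw_sample(seed, population_size=1000, sample_size=50):
    """Sample labels 1..population_size without replacement (hypothetical helper)."""
    rng = random.Random(seed)   # private generator started from the given seed
    return rng.sample(range(1, population_size + 1), sample_size)

first = draw_sample(1111089)
second = draw_sample(1111089)   # same seed and same commands
other = draw_sample(1111090)    # a different seed

print(first == second)   # True: the simulation is repeated exactly
print(first == other)    # almost surely False
```

The first comparison is the point of setting a base: the same integer followed by the same commands yields the same sample.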
Display 3.1: Dialog box for setting base or seed random number.
Display 3.2: Dialog box for generating a random sample without replacement.
Printing this sample gives the output
MTB > print c2
C2
441 956 87 736 185 515 883 957 690
438 205 760 246 16 321 371 493 393
538 348 70 54 362 492 182 841 287
277 112 610 890 503 332 413 886 798
764 584 566 495 547 488 206 557 263
414 613 618 685 864
in the Session window. So now we go to the population and select the elements
labeled 441, 956, 87, etc. The algorithm that underlies this command is such
that we can be confident that this sample of 50 is like a random sample.
replacement if we check the Sample with replacement box in the dialog box of
Display 3.2.
Display 3.3: Dialog box for generating a sample of 100 from the Bernoulli(.5)
distribution.
Display 3.4: Dialog box for generating a sample of 200 from a N(5.2, 1.3)
distribution.
3.3 Exercises
When the data for an exercise come from an exercise in IPS, the IPS exercise
number is given in parentheses ( ). All computations in these exercises are
to be carried out using Minitab, and the exercises are designed to ensure that
you have a reasonable understanding of the Minitab material in this chapter.
Generally, you should be using Minitab to do all the computations and plotting
required for the problems in IPS.
If your version of Minitab places restrictions such that the value of the
simulation sample size N requested in these problems is not feasible, then substitute
a more appropriate value. Be aware, however, that the accuracy of your results
is dependent on how large N is.
2. (3.32) Use the Manip ▶ Sort command described in I.11.6 to order the
subjects by weight. Use the values 1–5 to indicate five blocks of equal
length in a separate column, and then use the Manip ▶ Unstack command
described in I.11.7 to put the blocks in separate columns. Generate a
random permutation of each block.
4. Suppose you wanted to carry out stratified sampling where there are 3
strata, with the first stratum containing 500 elements, the second stratum
containing 400 elements, and the third stratum containing 100 elements.
Generate a stratified sample with 50 elements from the first stratum, 40
elements from the second stratum, and 10 elements from the third stratum.
When the strata sample sizes are the same proportion of the total sample
size as the strata population sizes are of the total population size, this is
called proportional sampling.
In this chapter the concept of probability is introduced more formally than
previously in the book. Probability theory underlies the powerful computational
methodology known as simulation, which we introduced in Chapter 3.
Simulation has many applications in probability and statistics and also in many other
fields, such as engineering, chemistry, physics, and economics.
x            1    2    3    4
probability  .1   .2   .3   .4
in columns C1 and C2, with the values in C1 and the probabilities in C2. The
Calc ▶ Calculator command with the dialog box as in Display 4.1 computes the
cumulative distribution function in C3 using Partial Sums.
Display 4.1: Dialog box for computing partial sums of entries in C2 and placing these
sums in C3.
Display 4.2: Dialog box for generating a sample from a discrete distribution with
values in C1 and probabilities in C2 and storing the sample in C3.
Mean of C4 = 0.33000
in the Session window. Repeating this with a sample of size 1000, we obtained
Mean of C4 = 0.28100
which we can see is a bit closer to the true value of .3. Repeating this with a
sample of size 10,000 from this distribution, we obtained
Mean of C4 = 0.29300
which is closer still. It would appear that the relative frequency of 1 is indeed
converging to .3.
We can generate a randomly chosen point from the line interval (a, b), where
a < b, using Calc ▶ Random Data ▶ Uniform. For example, the dialog box
of Display 4.4 generates a sample of 1500 from the uniform distribution on the
interval (3.0, 6.3). With this distribution, the probability of any subinterval (c, d)
of (a, b) is given by (d - c)/(b - a), i.e., the length of (c, d) over the length of
(a, b). Of course, we can estimate this probability by just counting the number
of times the generated response falls in the interval (c, d) and dividing this by
the total sample size. For example, using the outcomes from the dialog box
of Display 4.4 and estimating the probability of the interval (4, 5), we get the
relative frequency 0.30867, which is close to the true value of (5 - 4)/(6.3 - 3) =
0.30303.
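The counting estimate just described can be sketched in a few lines of Python as a cross-check outside Minitab. This is an illustration under stated assumptions: the seed 2024 is arbitrary, the generator is Python's rather than Minitab's, and so the relative frequency obtained will not equal the 0.30867 reported above.

```python
import random

rng = random.Random(2024)    # arbitrary seed, used only for repeatability
a, b = 3.0, 6.3              # endpoints of the sampling interval
c, d = 4.0, 5.0              # subinterval whose probability is estimated
n = 1500                     # sample size, as in the dialog box

sample = [rng.uniform(a, b) for _ in range(n)]

# Proportion of generated values falling in (c, d).
relative_frequency = sum(c < x < d for x in sample) / n
true_value = (d - c) / (b - a)   # length of (c, d) over length of (a, b)

print(f"estimate = {relative_frequency:.5f}, true value = {true_value:.5f}")
```

Rerunning with larger n should bring the estimate closer to the true value 0.30303, in the same way as the relative frequencies discussed earlier.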
Probability: The Study of Randomness
Display 4.4: Dialog box for generating a sample of 1500 from the uniform
distribution on the interval (3.0, 6.3).
P(.1 ≤ X1 + X2 ≤ .3)
when X1, X2 are both independent and follow the uniform distribution on the
interval (0, 1). The session commands
MTB > random 1000 c1 c2;
SUBC> uniform 0 1.
MTB > let c3=c1+c2
MTB > let c4 = .1<=c3 and c3<=.3
MTB > let k1=sum(c4)/n(c4)
MTB > print k1
K1 0.0400000
MTB > let k2=sqrt(k1*(1-k1)/n(c4))
MTB > print k2
K2 0.00619677
MTB > let k3=k1-3*k2
MTB > let k4=k1+3*k2
MTB > print k3 k4
K3 0.0214097
K4 0.0585903
generate N = 1000 independent values of X1, X2 and place these values in C1
and C2, respectively, then calculate the sum X1 + X2 and put these values in
C3. Using the comparison operators discussed in I.10.4, a 1 is recorded in C4
every time .1 ≤ X1 + X2 ≤ .3 is true and a 0 is recorded there otherwise. We
then calculate the proportion of 1's in the sample as K1, and this is our estimate
p̂ of the probability. We will see later that a good measure of the accuracy of
this estimate is the standard error of the estimate, which in this case is given
by
$\sqrt{\hat{p}(1 - \hat{p})/N}$
and this is computed in K2. Actually, we can feel fairly confident that the true
value of the probability is in the interval
$\hat{p} \pm 3\sqrt{\hat{p}(1 - \hat{p})/N}$
which, in this case, equals the interval (0.0214097, 0.0585903). So we know the
true value of the probability with reasonable accuracy. As the simulation size
N increases, the Law of Large Numbers says that p̂ converges to the true value
of the probability.
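The same simulation can be mirrored outside Minitab as a check on the logic of the session commands. The Python sketch below is illustrative only: the seed is arbitrary, the variable names are ours, and because the generator differs from Minitab's the estimate will not in general equal the K1 value printed above.

```python
import math
import random

rng = random.Random(12345)   # arbitrary seed
N = 1000                     # simulation sample size

hits = 0
for _ in range(N):
    total = rng.random() + rng.random()   # X1 + X2, each uniform on (0, 1)
    if 0.1 <= total <= 0.3:               # event recorded as a 1 in C4
        hits += 1

p_hat = hits / N                                  # proportion of 1's (K1)
std_err = math.sqrt(p_hat * (1 - p_hat) / N)      # standard error (K2)
lower, upper = p_hat - 3 * std_err, p_hat + 3 * std_err   # K3 and K4

print(p_hat, std_err, (lower, upper))
```

For this particular event the exact probability is available, since P(X1 + X2 ≤ s) = s²/2 for 0 ≤ s ≤ 1, giving (.3² - .1²)/2 = .04, so the interval can be compared with a known answer.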
4.5 Exercises
When the data for an exercise come from an exercise in IPS, the IPS exercise
number is given in parentheses ( ). All computations in these exercises are
to be carried out using Minitab, and the exercises are designed to ensure that
you have a reasonable understanding of the Minitab material in this chapter.
Generally, you should be using Minitab to do all the computations and plotting
required for the problems in IPS.
If your version of Minitab places restrictions such that the value of the
simulation sample size N requested in these problems is not feasible, then substitute
a more appropriate value. Be aware, however, that the accuracy of your results
is dependent on how large N is.
x            1     2     3     4     5
probability  .15   .05   .33   .37   .10
x            1     2     3     4     5
probability  .15   .05   .33   .37   .10
Generate a sample of 1000 from the truncated distribution, and use the
sample to approximate its mean.
10. Suppose that X is a random variable and follows an N(0, 1) distribution.
Simulate N = 1000 values from the distribution of Y = X^2, and plot these
values in a histogram with cutpoints 0, .5, 1, 1.5, ..., 15. Approximate the
mean of this distribution. Generate Y directly from its distribution, which
is known to be a Chisquare(1) distribution. In general, the Chisquare(k)
distribution can be generated via the command Calc ▶ Random Data
▶ Chi-Square, where k is specified as the Degrees of freedom in the dialog
box. Plot the Y values in a histogram using the same cutpoints. Comment
on the two histograms. Note that you can plot the density curve of these
distributions using Calc ▶ Probability Distributions ▶ Chi-Square and
evaluating the probability density at a range of points as we discussed in
II.2 for the normal distribution.
11. If X1 and X2 are independent random variables with X1 following a
Chisquare(k1) distribution and X2 following a Chisquare(k2) distribution,
then it is known that Y = X1 + X2 follows a Chisquare(k1 + k2)
distribution. For k1 = 1, k2 = 1, verify this empirically by plotting
histograms with cutpoints 0, .5, 1, 1.5, ..., 15, based on simulations of size
N = 1000.
12. If X1 and X2 are independent random variables with X1 following an
N(0, 1) distribution and X2 following a Chisquare(k) distribution, then
it is known that
$Y = \dfrac{X_1}{\sqrt{X_2/k}}$
follows a Student(k) distribution. The Student(k) distribution can be
generated using the command Calc ▶ Random Data ▶ t, where k
is the Degrees of freedom and must be specified in the dialog box. For
k = 3, verify this result empirically by plotting histograms with cutpoints
-10, -9, ..., 9, 10, based on simulations of size N = 1000.
13. If X1 and X2 are independent random variables with X1 following a
Chisquare(k1) distribution and X2 following a Chisquare(k2) distribution,
then it is known that
$Y = \dfrac{X_1/k_1}{X_2/k_2}$
follows an F(k1, k2) distribution. The F(k1, k2) distribution can be
generated using the command Calc ▶ Random Data ▶ F, where k1
is the Numerator degrees of freedom and k2 is the Denominator degrees
of freedom, both of which must be specified in the dialog box. For k1 = 1,
k2 = 1, verify this empirically by plotting histograms with cutpoints 0, .5,
1, 1.5, ..., 15, based on simulations of size N = 1000.
Chapter 5
Sampling Distributions
Once data have been collected, they are analyzed using a variety of statistical
techniques. Virtually all of these involve computing statistics that measure
some aspect of the data concerning questions we wish to answer. The answers
determined by these statistics are subject to the uncertainty caused by the fact
that we typically do not have the full population but only a sample from the
population. As such, we have to be concerned with the variability in the answers
when different samples are obtained. This leads to a concern with the sampling
distribution of a statistic.
Sometimes, the sampling distribution of a statistic can be worked out exactly
through various mathematical techniques, e.g., in Chapter 5 of IPS it is seen
that the number of 1's in a sample of n from a Bernoulli(p) distribution is
Binomial(n, p). Often, however, this is not possible, and we must resort to
approximations. One approximation technique is to use simulation. Sometimes,
however, the statistics we are concerned with are averages, and, in such cases,
we can typically approximate their sampling distribution via an appropriate
normal distribution.
is the probability that Y takes the value k for 0 ≤ k ≤ n. When n and k are small,
this formula could be used to evaluate this probability but it is almost always
better to use software like Minitab to do it, and when these values are not small,
it is necessary. Also, we can use Minitab to compute the Binomial(n, p)
cumulative probability distribution (the probability contents of intervals (-∞, x])
and the inverse cumulative distribution (percentiles of the distribution).
For individual probabilities, we use the Calc ▶ Probability Distributions ▶
Binomial command. For example, suppose we have a Binomial(30, .2)
distribution and want to compute the probability P(Y = 10). This command, with
the dialog box as in Display 5.1, produces the output
Binomial with n = 30 and p = 0.200000
x P( X = x )
10.00 0.0355
in the Session window, i.e., P(Y = 10) = .0355.
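The individual probability can also be evaluated directly from the binomial formula as a check on the Minitab output. The following Python sketch uses only the standard library; the function names binomial_pmf and binomial_cdf are our own and are not Minitab commands.

```python
from math import comb

def binomial_pmf(k, n, p):
    """P(Y = k) for Y following a Binomial(n, p) distribution."""
    return comb(n, k) * p ** k * (1 - p) ** (n - k)

def binomial_cdf(k, n, p):
    """P(Y <= k), obtained by summing the individual probabilities."""
    return sum(binomial_pmf(j, n, p) for j in range(k + 1))

# Binomial(30, .2) probability of exactly 10, as in the example above.
print(round(binomial_pmf(10, 30, 0.2), 4))   # 0.0355
```

The helper binomial_cdf gives the cumulative probabilities in the same way, by accumulating the individual terms up to k.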
Display 5.2: Dialog box for computing cumulative probabilities for the
Binomial(n, p) distribution.
Suppose we want to compute the first quartile of this distribution. The Calc
▶ Probability Distributions ▶ Binomial command, with the dialog box as in
Display 5.3, produces the output
in the Session window. This gives the values x that have cumulative probabilities
just smaller and just larger than the value requested. Recall that with a discrete
distribution, such as the Binomial(n, p), we will not in general be able to obtain
an exact percentile.
Display 5.4: Dialog box for generating 10 columns of 1000 Bernoulli(.75) values.
Display 5.5: Dialog box for computing the proportion of 1's in each of the 1000
samples of size 10.
Display 5.6: Dialog box for computing the empirical distribution function of p̂.
In Display 5.7, we have plotted a histogram of the 1000 values of p̂. Based
on N = 800, the following empirical distribution was obtained:
C11 CumPct
0.4 1.20
0.5 7.20
0.6 22.20
0.7 47.80
0.8 78.20
0.9 95.00
1.0 100.00
Because these values are reasonably close to those obtained with N = 1000, we
stopped at N = 1000.
Display 5.7: Histogram of the 1000 values of p̂ in C11 (vertical axis: Frequency).
and these might seem like an easier way to implement the simulation.
In Chapter 5 of IPS we saw that the sampling distribution of p̂ can be
determined exactly, i.e., there are formulas to determine this, and we can simulate
directly from the sampling distribution, so this simulation can be made much
more efficient. In effect, this entails using the Calc ▶ Random Data ▶ Binomial
command with dialog box as in Display 5.8 and dividing each entry in C1 by 10.
This generates N = 1000 values of p̂ but uses a much smaller number of cells.
Still, there are many statistics for which this kind of efficiency reduction is not
available, and, to get some idea of what their sampling distribution is like, we
must resort to the more brute force form of simulation of generating directly
from the population distribution.
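Because the sampling distribution of p̂ is exactly that of a Binomial(10, .75) count divided by 10, the empirical cumulative percentages tabulated earlier can be compared with exact values. The Python sketch below is a cross-check outside Minitab; the dictionary name exact_cum_pct is our own.

```python
from math import comb

n, p = 10, 0.75   # sample size and Bernoulli success probability

# Accumulate the exact Binomial(n, p) probabilities into cumulative percentages,
# keyed by the count k so that p-hat = k / n.
cumulative = 0.0
exact_cum_pct = {}
for k in range(n + 1):
    cumulative += comb(n, k) * p ** k * (1 - p) ** (n - k)
    exact_cum_pct[k] = 100 * cumulative

# Print the same rows as the empirical table (p-hat from 0.4 to 1.0).
for k in range(4, n + 1):
    print(f"{k / n:.1f}  {exact_cum_pct[k]:6.2f}")
```

The exact cumulative percentage at p̂ = 0.7, for instance, is about 47.44, against 47.80 in the simulated table, which gives a sense of the simulation error at that sample size.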
Sometimes, more sophisticated simulation techniques are needed to get an
accurate assessment of a sampling distribution. Within Minitab, there are
programming techniques, which we do not discuss in this manual, that can be
applied in such cases. For example, it is clear that if our simulation required
the generation of 10^6 cells (and this is not at all uncommon for some harder
problems), the simulation approach we have described would not work within
Minitab, as the worksheet would be too large.
Display 5.8: Dialog box for generating 1000 values from the sampling distribution of
10p̂ using the Binomial(10, .75) distribution.
5.3 Exercises
When the data for an exercise come from an exercise in IPS, the IPS exercise
number is given in parentheses ( ). All computations in these exercises are
to be carried out using Minitab, and the exercises are designed to ensure that
you have a reasonable understanding of the Minitab material in this chapter.
Generally, you should be using Minitab to do all the computations and plotting
required for the problems in IPS.
If your version of Minitab places restrictions such that the value of the
simulation sample size N requested in these problems is not feasible, then substitute
a more appropriate value. Be aware, however, that the accuracy of your results
is dependent on how large N is.
1. Calculate all the probabilities for the Binomial(5, .4) distribution and the
Binomial(5, .6) distribution. What relationship do you observe? Can you
explain this and state a general rule?
2. Compute all the probabilities for a Binomial(5, .8) distribution and use
these to directly calculate the mean and variance. Verify your answers
using the formulas provided in IPS.
11. Generate N = 1000 values of X1, X2, where X1 follows a N(3, 2)
distribution and X2 follows a N(1, 3) distribution. Compute Y = X1 - 2X2
for each of these pairs and plot a histogram for Y using the cutpoints
-20, -15, ..., 25, 30. Generate a sample of N = 1000 from the appropriate
distribution of Y and plot a histogram using the same cutpoints.
12. Plot the density curve for the Exponential(3) distribution (see Exercise
II.4.7) between 0 and 15 with an increment of .1. Generate N = 1000
samples of size n = 2 from the Exponential(3) distribution and record
the sample means. Standardize the sample of x̄ using μ = 3 and σ = 3.
Plot a histogram of the standardized values using the cutpoints -5, -4,
..., 4, 5. Repeat this for n = 5, 10. Comment on the shapes of these
histograms.
13. Plot the density of the uniform distribution on (0,1). Generate N = 1000
samples of size n = 2 from this distribution. Standardize the sample of x̄
using μ = .5 and σ = $\sqrt{1/12}$. Plot a histogram of the standardized values
using the cutpoints -5, -4, ..., 4, 5. Repeat this for n = 5, 10. Comment
on the shapes of these histograms.
14. The Weibull(β) has density curve given by $\beta x^{\beta - 1} e^{-x^{\beta}}$ for x > 0, where
β > 0 is a fixed constant. Plot the Weibull(2) density in the range 0 to
10 with an increment of .1 using the Calc ▶ Probability Distributions ▶
Weibull command. Generate a sample of N = 1000 from this distribution
using the command Calc ▶ Random Data ▶ Weibull, where β
is the Shape parameter and the Scale parameter is 1. Plot a probability
histogram and compare with the density curve.
Chapter 6
Introduction to Inference
In this chapter, the basic tools of statistical inference are discussed. There
are a number of Minitab commands that aid in the computation of confidence
intervals and for carrying out tests of significance.
(3) We have a large sample from a distribution with unknown mean μ and
unknown standard deviation σ, and the sample size is large enough so that
$z = \dfrac{\bar{x} - \mu}{s/\sqrt{n}}$
Display 6.1: First dialog box for producing the z-confidence interval for μ.
Display 6.2: Second dialog box for producing the z-confidence interval. Here we
specify the confidence level.
6.2 z-Tests
The Stat ▶ Basic Statistics ▶ 1-Sample Z command is used when we want to
test the hypothesis that the unknown mean μ equals a value μ0 and one of the
situations (1), (2), or (3) as discussed in II.10.1 is appropriate. The test is based
on computing a P-value using the observed value of
$z = \dfrac{\bar{x} - \mu_0}{\sigma/\sqrt{n}}$
and the N(0, 1) distribution as described in IPS.
Suppose the sample 2.0, 0.4, 0.7, 2.0, 0.4, 2.2, 1.3, 1.2, 1.1, 2.3 is stored in
C1, and we are asked to test the null hypothesis H0 : μ = 0 against the
alternative Ha : μ > 0 and it makes sense to take σ = 1. The Stat ▶ Basic Statistics
▶ 1-Sample Z command together with the dialog boxes of Displays 6.3 and 6.4
produces the output
Variable 99.0% Lower Bound Z P
C1 0.284 3.23 0.001
in the Session window. This specifies the P-value for this test as .001, and so we
reject the null hypothesis in favor of the alternative. In the first dialog box, we
specified where the data is located, the value of σ as before, and that we want
to test H0 : μ = 0 by entering 0 in the Test mean box. We brought up the second dialog
box by clicking on the Options button. In the second dialog box, we specified
that we want to test this null hypothesis against the alternative Ha : μ > 0 by
selecting greater than in the Alternative box. The other choices are not equal,
which selects the alternative Ha : μ ≠ 0, and less than, which selects the
alternative Ha : μ < 0.
Display 6.3: First dialog box for testing a hypothesis concerning the mean using a
z-test.
Display 6.4: Second dialog box for testing a hypothesis using the z-test.
Figure 6.1: Plot of 90% confidence intervals for the mean when sampling from the
N(1, 2) distribution with n = 5 (vertical axis: C7; horizontal axis: C10). The lower
end-point is open and the upper end-point is closed.
The simulation just carried out simply verifies a theoretical fact. On the
other hand, when we are computing approximate confidence intervals (i.e., we
are not sampling necessarily from a normal distribution), it is good to do some
simulations from various distributions to see how much reliance we can place
in the approximation at a given sample size. The true coverage probability of
the interval, i.e., the long-run proportion of times that the interval covers the
true mean, will not in general be equal to the nominal confidence level. Small
deviations are not serious, but large ones are.
where Z is a N(0, 1) random variable. This is equivalent to saying that the null
hypothesis is rejected whenever
$\left| \dfrac{\bar{x} - \mu_0}{\sigma/\sqrt{n}} \right|$
is greater than or equal to the 1 - α/2 percentile for the N(0, 1) distribution.
For example, if α = .05, then 1 - α/2 = .975 and this percentile can be obtained
using the command Calc ▶ Probability Distributions ▶ Normal and the inverse
distribution function, which gives the output
Normal with mean = 0 and standard deviation = 1.00000
P( X <= x) x
0.9750 1.9600
in the Session window, i.e., the .975 percentile of the N(0, 1) distribution is 1.96.
Denote this percentile by z*. If μ = μ1, then
$\dfrac{\bar{x} - \mu_0}{\sigma/\sqrt{n}}$
is a realized value from the distribution of Y = (X̄ - μ0)/(σ/√n) when X̄ is distributed
N(μ1, σ/√n). Therefore, Y follows a N((μ1 - μ0)/(σ/√n), 1) distribution. The power of
the two-sided test at μ = μ1 is
$P(|Y| > z^*)$
and this can be evaluated exactly using the command Calc ▶ Probability
Distributions ▶ Normal and the distribution function, after writing
$P(|Y| > z^*) = P(Y > z^*) + P(Y < -z^*) = P\left(Z > -\dfrac{\mu_1 - \mu_0}{\sigma/\sqrt{n}} + z^*\right) + P\left(Z < -\dfrac{\mu_1 - \mu_0}{\sigma/\sqrt{n}} - z^*\right)$
in the Session window. This gives the power for testing H0 : μ = μ0 versus
Ha : μ ≠ μ0 at |μ1 - μ0| = .1 and |μ1 - μ0| = .2 when n = 10, σ = 1.3, and
α = .05. These powers are given by .0568 and .0775, respectively. Clicking on
the Options button allows you to choose other alternatives and specify other
values of α in the Significance level box.
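The power formula for the two-sided z-test can be evaluated directly as a check on these values. The following Python sketch relies on the standard library's statistics.NormalDist; the function name two_sided_z_power and its argument names are our own and do not correspond to any Minitab command.

```python
from math import sqrt
from statistics import NormalDist

def two_sided_z_power(diff, sigma, n, alpha):
    """Power of the two-sided z-test when the true mean is mu0 + diff."""
    std_normal = NormalDist()                        # the N(0, 1) distribution
    z_star = std_normal.inv_cdf(1 - alpha / 2)       # 1 - alpha/2 percentile
    shift = diff / (sigma / sqrt(n))                 # (mu1 - mu0) / (sigma / sqrt(n))
    upper_tail = 1 - std_normal.cdf(z_star - shift)  # P(Z > -shift + z*)
    lower_tail = std_normal.cdf(-z_star - shift)     # P(Z < -shift - z*)
    return upper_tail + lower_tail

# The two cases discussed above: n = 10, sigma = 1.3, alpha = .05.
for diff in (0.1, 0.2):
    print(diff, round(two_sided_z_power(diff, sigma=1.3, n=10, alpha=0.05), 4))
```

With diff equal to .1 and .2 this gives approximately .0568 and .0775, in agreement with the powers quoted from the Session window. Because the test is two-sided, the sign of diff does not affect the result.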
Display 6.5: Dialog box for calculating powers and minimum sample sizes.
We will see applications of the chi-square distribution later in the book but
we mention one here. In particular, if x1, ..., xn is a sample from a N(μ, σ)
distribution, then $(n - 1)s^2/\sigma^2 = \sum_{i=1}^{n} (x_i - \bar{x})^2/\sigma^2$ is known to follow a
Chisquare(n - 1) distribution, and this fact is used as a basis for inference
about σ (confidence intervals and tests of significance). Because of the
nonrobustness of these inferences to small deviations from normality, these inferences
are not recommended.
6.6 Exercises
When the data for an exercise come from an exercise in IPS, the IPS exercise
number is given in parentheses ( ). All computations in these exercises are
to be carried out using Minitab, and the exercises are designed to ensure that
you have a reasonable understanding of the Minitab material in this chapter.
Generally, you should be using Minitab to do all the computations and plotting
required for the problems in IPS.
If your version of Minitab places restrictions such that the value of the
simulation sample size N requested in these problems is not feasible, then substitute
a more appropriate value. Be aware, however, that the accuracy of your results
is dependent on how large N is.
II.4.12), and calculate the proportion of .90 confidence intervals for the
mean, using the sample standard deviation for σ, that cover the value
μ = 0. It is possible to obtain very bad approximations in this example
because the central limit theorem does not apply to this distribution. In
fact, it does not have a mean.
7. Suppose we are testing H0 : μ = 3 versus Ha : μ ≠ 3 when we are sampling
from a N(μ, σ) distribution with σ = 2.1 and the sample size is n = 20.
If we use the critical value α = .01, determine the power of this test at
μ = 4.
8. Suppose we are testing H0 : μ = 3 versus Ha : μ > 3 when we are
sampling from a N(μ, σ) distribution with σ = 2.1. If we use the critical
value α = .01, determine the minimum sample size so that the power of
this test at μ = 4 is .99.
9. The uniform distribution on the interval (a, b) has mean μ = (a + b)/2
and standard deviation σ = $\sqrt{(b - a)^2/12}$. Calculate the power at μ = 1
of the two-sided z-test at level α = .95 for testing H0 : μ = 0 when the
sample size is n = 10, σ is the standard deviation of a uniform distribution
on (10, 12), and we are sampling from a normal distribution.
10. Suppose that we are testing H0 : μ = 0 in a two-sided test based on
a sample of 3. Approximate the power of the z-test at level α = .1 at
μ = 5 when we are sampling from the distribution of Y = 5 + W, where
W follows a Student(6) distribution (see Exercise II.4.12) and we use
the sample standard deviation to estimate σ. Note that the mean of the
distribution of Y is 5.