Advanced Data Science

This document contains Stata code for seven problem sets from an advanced microeconometrics course. It includes: 1) Code to load and summarize housing price data, estimate OLS regressions of price on house characteristics, and calculate ceteris paribus effects via the Frisch-Waugh result. 2) Code analyzing county-level election results, including removing an outlier, adding a dummy variable, testing for heteroskedasticity, and running weighted regressions. 3) Code using an instrumental variables approach to relate IRAs to 401(k) participation, including tests of instrument strength and a comparison of OLS and IV estimators. 4) Code fitting probit and logit models of whether an individual has any medical expenditure, and calculating average marginal effects. 5) Code fitting Tobit and Heckman selection models of ambulatory expenditure and comparing their predictions. 6) Code estimating fixed-effects and random-effects panel models of firm investment, with a Hausman test. 7) Code comparing ridge, lasso, elastic net, and partialed-out lasso estimators on the IRA data.

******************************************************************************

* Advanced Fundamentals of Microeconometrics and Data Science (AFMD)

* Vietnamese German University
* Fall Semester 2023
* Prof. Michael Binder, Ph.D.
*
* Problem Set 1: House Prices
******************************************************************************

cls // clear display in results window

clear // clear previous work out of memory
// change working directory

cd "/Users/macbook/Desktop/AFMD 30.9_PhD.Binder/Day 2"

use "[Link]" // Load Data

* Data Description
summarize price housesize bdrms lotsize assess
* Histograms
* House Prices
label variable price " "
histogram price, normal bin(20) ///
title("House Prices for 88 Homes") ///
name(histprices, replace)
* Housesize
label variable housesize " "
histogram housesize, normal bin(20) ///
title("House Sizes for 88 Homes") ///
name(histhousesize, replace)
* Bedrooms
label variable bdrms " "
histogram bdrms, normal bin(20) ///
title("Number of Bedrooms for 88 Homes") ///
name(histbdrms, replace)
* Lotsize
label variable lotsize " "
histogram lotsize, normal bin(20) ///
title("Lot Sizes for 88 Homes") ///
name(histlotsize, replace)
* Assessment Value
label variable assess " "
histogram assess, normal bin(20) ///
title("Assessment Values for 88 Homes") ///
name(histassess, replace)

* OLS Estimation
* First Model
regress price housesize bdrms
predict pricehat1, xb
list price pricehat1 in 1
estat ic
* Second Model
regress price housesize bdrms lotsize assess
estat ic
predict pricehat2, xb
list price pricehat2 in 1

* Calculating Ceteris Paribus Effect (Frisch-Waugh Result)

* Auxiliary Regression 1
quietly regress price lotsize assess
predict residpriceadj, resid
// changes in price not driven by changes in lotsize and assess
* Auxiliary Regression 2a
quietly regress housesize lotsize assess
predict residhsadj, resid
// changes in housesize not driven by changes in lotsize and assess
* Auxiliary Regression 2b
quietly regress bdrms lotsize assess
predict residbradj, resid
// changes in bdrms not driven by changes in lotsize and assess
* Ceteris Paribus Effect
regress residpriceadj residhsadj residbradj
// marginal effect of changes in housesize and bedrooms on price
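* Check (a sketch): by the Frisch-Waugh result, the slopes from the residual
* regression should match the coefficients on housesize and bdrms in the
* second model. The scalar names b_hs_fwl and b_br_fwl are illustrative and
* not part of the original problem set.
quietly regress residpriceadj residhsadj residbradj
scalar b_hs_fwl = _b[residhsadj]
scalar b_br_fwl = _b[residbradj]
quietly regress price housesize bdrms lotsize assess
display "housesize: FWL = " b_hs_fwl "  full model = " _b[housesize]
display "bdrms:     FWL = " b_br_fwl "  full model = " _b[bdrms]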
******************************************************************************
* Advanced Fundamentals of Microeconometrics and Data Science (AFMD)
* Vietnamese German University
* Fall Semester 2023
* Prof. Michael Binder, Ph.D.
*
* Problem Set 2: Election Recount
******************************************************************************

cls // clear display in results window

clear // clear previous work out of memory
cd "/Users/macbook/Desktop/AFMD 30.9_PhD.Binder/Day 3" // change working directory

use "[Link]" // Load Data

* Data Description
summarize

* OLS Estimation
* First Model
regress BUCHANAN GORE
* Scatter Plot with OLS Regression Line
twoway (scatter BUCHANAN GORE) (lfitci BUCHANAN GORE, level(90)), ///
title("67 Counties") name(scatterlfitci1, replace)

* Remove Outlier and redo regression analysis

destring OBS, generate(COUNTY)
drop if COUNTY == 50
* First Model
regress BUCHANAN GORE
* Scatter Plot with OLS Regression Line
twoway (scatter BUCHANAN GORE) (lfitci BUCHANAN GORE, level(90)), ///
title("66 Counties") name(scatterlfitci2, replace)

* Return to Full Data Set

clear
use "[Link]"
destring OBS, generate(COUNTY)
* Create Dummy Variable for Palm Beach County
generate PBC = 0
replace PBC = 1 if COUNTY == 50
* Second Model
regress BUCHANAN GORE PBC
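test _b[PBC] = 975
* sqrt(r(F)) is |t|; restore the sign before computing one-sided p-values
local tstat = sign(_b[PBC] - 975)*sqrt(r(F))
display "H0: coef <= 975 (H1: coef > 975) p-value = " ttail(r(df_r), `tstat')
display "H0: coef >= 975 (H1: coef < 975) p-value = " 1-ttail(r(df_r), `tstat')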

* Heteroskedasticity
* Breusch-Pagan Test
quietly regress BUCHANAN GORE
estat hettest TOTAL, mtest // iid normal disturbance terms
estat hettest TOTAL, iid // iid disturbance terms
quietly regress BUCHANAN GORE PBC
estat hettest TOTAL, mtest
estat hettest TOTAL, iid

* Fraction of Votes
generate FBUCHANAN = BUCHANAN/TOTAL
generate FGORE = GORE/TOTAL
* First Model
regress FBUCHANAN FGORE
* Scatter Plot with OLS Regression Line
twoway (scatter FBUCHANAN FGORE) (lfitci FBUCHANAN FGORE, level(90)), ///
title("67 Counties") name(scatterlfitci3, replace)
* Breusch-Pagan Test
generate TOTALINV = TOTAL^(-1)
estat hettest TOTALINV, mtest
estat hettest TOTALINV, iid
* Second Model
regress FBUCHANAN FGORE PBC
estat hettest TOTALINV, mtest
estat hettest TOTALINV, iid

* Feasible GLS Estimation

generate TOTAL2 = TOTAL^2
* First Model
regress BUCHANAN GORE [aw = 1/TOTAL2] // variance of the i-th observation
// is assumed to be Var(eps)*TOTAL2; the i-th observation
// is divided by TOTAL
regress BUCHANAN GORE [aw = 1/TOTAL2], vce(hc3)
* Second Model
regress BUCHANAN GORE PBC [aw = 1/TOTAL2]
regress BUCHANAN GORE PBC [aw = 1/TOTAL2], vce(hc3)
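* A sketch of why the weights 1/TOTAL2 amount to dividing each observation by
* TOTAL: OLS without a constant on the transformed equation
*   BUCHANAN/TOTAL = a*(1/TOTAL) + b*(GORE/TOTAL) + eps/TOTAL
* should reproduce the weighted first model, with the coefficient on TOTALINV
* playing the role of the intercept. This reuses FBUCHANAN, FGORE, and
* TOTALINV generated above; the scalar name b_wls is illustrative.
quietly regress BUCHANAN GORE [aw = 1/TOTAL2]
scalar b_wls = _b[GORE]
regress FBUCHANAN TOTALINV FGORE, noconstant
display "WLS slope = " b_wls "  transformed OLS slope = " _b[FGORE]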
******************************************************************************
* Advanced Fundamentals of Microeconometrics and Data Science (AFMD)
* Vietnamese German University
* Fall Semester 2023
* Prof. Michael Binder, Ph.D.
*
* Problem Set 3: Individual Retirement Accounts (IRAs)
******************************************************************************

cls // clear display in results window

clear // clear previous work out of memory
cd "/Users/macbook/Desktop/AFMD 30.9_PhD.Binder/Day 4" // change working directory

use "[Link]" // Load Data

* Data Description
summarize

* OLS Estimation
regress pira p401k inc incsq age agesq

* Instrument Analysis
regress p401k e401k inc incsq age agesq

* Just-Identified IV Estimation
global xlistexo inc incsq age agesq
ivregress 2sls pira $xlistexo (p401k = e401k), first
ivregress 2sls pira $xlistexo (p401k = e401k), first vce(robust)
* Quality of Instruments
estat firststage
* Comparing OLS and IV Estimators (Hausman Test)
estat endogenous
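* A sketch of the regression-based (control-function) form of this test: add
* the first-stage residual to the structural equation and test its
* coefficient. A significant coefficient is evidence against exogeneity of
* p401k, and the coefficient on p401k in this regression equals the 2SLS
* point estimate. The variable name v1hat is illustrative and not part of the
* original problem set.
quietly regress p401k e401k $xlistexo
predict v1hat, residuals
regress pira p401k $xlistexo v1hat, vce(robust)
test v1hat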
******************************************************************************
* Advanced Fundamentals of Microeconometrics and Data Science (AFMD)
* Vietnamese German University
* Fall Semester 2023
* Prof. Michael Binder, Ph.D.
*
* Problem Set 4: Deciding Whether to See a Medical Doctor
******************************************************************************

cls // clear display in results window

clear // clear previous work out of memory
cd "/Users/macbook/Desktop/AFMD 30.9_PhD.Binder/Day 5" // change working directory

use "[Link]" // Load Data

* dmed: 1 if annual USD medical expenditure (excluding dental and outpatient
* mental expenditure) > 0, 0 otherwise
* linc: logarithm of annual USD family income
* lc: log(coinsrate+1) where coinsurance rate is 0 to 100
* idp: 1 if individual deductible plan
* lpi: log(annual participation incentive payment) or 0 if no payment
* fmde: log(max(medical deductible expenditure (mde))) if idp=1 and mde>1
* or 0 otherwise.
* ndisease: number of chronic diseases
* physlim: 1 if physical limitation
* hlthg: 1 if good health
* hlthf: 1 if fair health
* hlthp: 1 if poor health (omitted is excellent)
* lfam: log of family size
* educdec: years of schooling of decision maker
* age: exact age
* black: 1 if black
* female: 1 if female
* child: 1 if child
* femchild: 1 if female child

* Data Description
summarize

* Define Independent Variable List

global xlist1 linc lc lpi fmde idp ndisease physlim ///
hlthg hlthf hlthp lfam educdec age black female child femchild

* Probit Model
probit dmed $xlist1
estimates store ProbitEst
* Average Marginal Effects
margins, dydx(*)
* Logit Model
logit dmed $xlist1
estimates store LogitEst
estimates table ProbitEst LogitEst, b
* Average Marginal Effects
margins, dydx(*)
* Odds Ratios
logistic dmed $xlist1
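* A sketch of what margins, dydx() computes after probit for a regressor that
* is not declared with the i. prefix: the sample average of
* normalden(x'b)*b_k. Shown here for linc; its mean should match the linc row
* of the probit margins output. The names xb_probit and me_linc are
* illustrative and not part of the original problem set.
estimates restore ProbitEst
predict xb_probit, xb
generate me_linc = normalden(xb_probit)*_b[linc]
summarize me_linc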
******************************************************************************
* Advanced Fundamentals of Microeconometrics and Data Science (AFMD)
* Vietnamese German University
* Fall Semester 2023
* Prof. Michael Binder, Ph.D.
*
* Problem Set 5: Ambulatory Expenditure
******************************************************************************

cls // clear display in results window

clear // clear previous work out of memory
cd "/Users/macbook/Desktop/AFMD 30.9_PhD.Binder/Day 8" // change working directory

use "[Link]" // Load Data

* y: ambulatory expenditure
* lny: logarithm of ambulatory expenditure, with zeros replacing NA's
* dy: 1 if ambulatory expenditure is greater than zero
* educ: educational attainment, in years
* age: age / 10
* income: income
* female: 1 if female
* totch: number of chronic diseases
* blhisp: 1 if black or hispanic ethnicity
* ins: 1 if insured

* Data Description
summarize

* Define Independent Variable List

global xlist1 ins totch age age2 educ blhisp

* Tobit I Model
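tobit y $xlist1 income, ll // left-censoring limit set to the minimum of y
* Marginal Effects on the censored mean E(max(0,y)|x)
* (predict(e(0,.)) would instead give effects on E(y|x,y>0))
margins, dydx(*) predict(ystar(0,.))

* Tobit II Model
heckman lny $xlist1 income, select(dy = $xlist1 income)
// A large Prob > chi2 in the LR test of independent equations means we fail to
// reject H0: corr(e1, e2) = 0, although correlation may still be present
* Marginal Effects on E(max(0,lny)|x)
* (predict(ycond) would instead give effects on E(lny|x,dy=1))
margins, dydx(*) predict(ystar(0,.))

* Tobit II Model With Exclusion Restrictions

heckman lny $xlist1, select(dy = $xlist1 income)
* Marginal Effects on E(max(0,lny)|x)
margins, dydx(*) predict(ystar(0,.))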

* Two-Part Model Predictions (Probit + OLS on Logs)
* Part 1: Probit
probit dy $xlist1 income
predict probpart1, p
* Part 2: OLS for lny corresponding to y>0
regress lny $xlist1 income if y>0
scalar sert1 = e(rmse)
predict predlnaepos1, xb
generate predaepos1 = exp(predlnaepos1+(sert1^2)/2)
generate predaeall1 = probpart1*predaepos1
* First Set of Tobit II Predictions
heckman lny $xlist1 income, select (dy = $xlist1 income)
scalar sert2 = e(sigma)
predict predlaepos2, ycond
predict probpart2, psel
generate predaepos2 = exp(predlaepos2+(sert2^2)/2)
generate predaeall2 = probpart2*predaepos2
* Second Set of Tobit II Predictions
heckman lny $xlist1, select (dy = $xlist1 income)
scalar shml3 = e(sigma)
predict predlaepos3, ycond
predict probpart3, psel
generate predaepos3 = exp(predlaepos3+(shml3^2)/2)
generate predaeall3 = probpart3*predaepos3
* Prediction Results
summarize y predaepos1 predaepos2 predaepos3 if y>0
summarize y predaeall1 predaeall2 predaeall3
******************************************************************************
* Advanced Fundamentals of Microeconometrics and Data Science (AFMD)
* Vietnamese German University
* Fall Semester 2022
* Prof. Michael Binder, Ph.D.
*
* Problem Set 6: Firm Investment
******************************************************************************

cls // clear display in results window

clear // clear previous work out of memory
cd "/Users/macbook/Desktop/AFMD 30.9_PhD.Binder/Day 9" // change working directory

use "[Link]" // Load Data

* Generate identifiers for cross-sectional and time-series observation numbers

generate id = _n

generate firm = 1 if id <= 20

replace firm = 2 if id > 20 & id <= 40
replace firm = 3 if id > 40 & id <= 60
replace firm = 4 if id > 60 & id <= 80
replace firm = 5 if id > 80

generate t = _n if id <= 20
replace t = _n-20 if id > 20 & id <= 40
replace t = _n-40 if id > 40 & id <= 60
replace t = _n-60 if id > 60 & id <= 80
replace t = _n-80 if id > 80

* Declare identifiers for cross-sectional and time-series observation numbers

xtset firm t // panel variable is the firm, not the observation number

* Data Description
summarize I F C

* Generate Firm Dummies

generate Dum1 = (_n <= 20)
generate Dum2 = (_n > 20 & _n <= 40)
generate Dum3 = (_n > 40 & _n <= 60)
generate Dum4 = (_n > 60 & _n <= 80)
generate Dum5 = (_n > 80 & _n <= 100)

* Alternatively (Generate Firm Dummies):

drop Dum1 Dum2 Dum3 Dum4 Dum5
tabulate firm, gen(Dum) // easier to do

* Fixed Effects Model

* Least Squares Dummy Variables
regress I Dum1 Dum2 Dum3 Dum4 Dum5 F C, nocons // all five dummies, no intercept
regress I Dum1 Dum2 Dum3 Dum4 F C // all five dummies plus an intercept is not
// possible because of perfect multicollinearity

/*
regress I Dum1 Dum2 Dum3 Dum4 F C

      Source |       SS           df       MS      Number of obs   =       100
-------------+----------------------------------   F(6, 93)        =    232.32
       Model |  6659149.41         6  1109858.23   Prob > F        =    0.0000
    Residual |  444288.411        93  4777.29474   R-squared       =    0.9375
-------------+----------------------------------   Adj R-squared   =    0.9334
       Total |  7103437.82        99  71751.8972   Root MSE        =    69.118

------------------------------------------------------------------------------
           I |      Coef.   Std. Err.      t    P>|t|     [95% Conf. Interval]
-------------+----------------------------------------------------------------
        Dum1 |  -168.6053   41.50614    -4.06   0.000    -251.0282    -86.1823
        Dum2 |  -121.9121   29.05297    -4.20   0.000    -179.6056   -64.21868
        Dum3 |  -334.7093   22.01613   -15.20   0.000     -378.429   -290.9896
        Dum4 |   -150.438   29.19629    -5.15   0.000     -208.416   -92.45992
           F |   .1059799    .015891     6.67   0.000     .0744236    .1375363
           C |   .3466596   .0241612    14.35   0.000     .2986803    .3946388
       _cons |   92.53855   33.23551     2.78   0.006     26.53941    158.5377
------------------------------------------------------------------------------
*/
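
* A sketch of the within (demeaning) transformation behind the LSDV slopes:
* subtracting firm means from I, F, and C and running OLS without a constant
* should give the same coefficients on F and C as the dummy-variable
* regressions. The standard errors differ because this regression does not
* account for the degrees of freedom used up by the firm means. The mean_*
* and dm_* variable names are illustrative.
foreach v of varlist I F C {
    egen double mean_`v' = mean(`v'), by(firm)
    generate double dm_`v' = `v' - mean_`v'
}
regress dm_I dm_F dm_C, noconstant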

* Stata Panel Data Command Syntax

xtreg I F C, fe i(firm) // F test strongly rejects H0: all individual effects = 0,
// so the firm effects are nonzero

* Random Effects Model

xtreg I F C, mle i(firm)
xtreg I F C, re i(firm) // FE and RE estimates differ ==> strongly reject Corr(u_i, X) = 0

* Hypothesis Testing
* Individual Effects
quietly regress I Dum1 Dum2 Dum3 Dum4 Dum5 F C, nocons
testparm Dum1-Dum5, equal

* Hausman Test
quietly xtreg I F C, fe i(firm)
estimates store fixed
quietly xtreg I F C, re i(firm)
estimates store random
hausman fixed random // lower standard errors (RE is the efficient estimator under H0)

* b - B: positive
* Var(b) - Var(B): depends on the sign of the variance difference; if it is not
* positive definite, the Hausman statistic can turn out negative, and the test
* then fails
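* If the statistic comes out negative, one commonly used remedy is the
* sigmamore option of hausman, which bases both covariance matrices on the
* disturbance variance estimate from the efficient (random-effects) estimator
* and so makes a non-positive-definite difference much less likely. This is a
* sketch of that variant, not part of the original problem set.
hausman fixed random, sigmamore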
******************************************************************************
* Advanced Fundamentals of Microeconometrics and Data Science (AFMD)
* Vietnamese German University
* Fall Semester 2023
* Prof. Michael Binder, Ph.D.
*
* Problem Set 7: Individual Retirement Accounts (IRAs)
******************************************************************************

cls // clear display in results window

clear // clear previous work out of memory
cd "/Users/macbook/Desktop/AFMD 30.9_PhD.Binder/Day 11" // change working directory

use "[Link]" // Load Data

* Setting up Lists of Regressors

global xlistexo inc incsq age agesq
global rlist c.($xlistexo)##c.($xlistexo)
* Note: Stata will remove redundant regressors
regress pira $rlist

* Predictions
* Create Training and Prediction Samples
splitsample pira, generate(train) split(1 4) values(0 1) rseed(90210)
* training sample: train = 1 (about 80% of observations)
* prediction (test) sample: train = 0 (about 20% of observations)
tabulate train
* Ridge
elasticnet linear pira p401k $rlist if train==1, alpha(0)
lassoinfo
predict y_ridge, xb
* Lasso
lasso linear pira p401k $rlist if train==1
lassoinfo
predict y_lasso, xb
* Elastic Net
elasticnet linear pira p401k $rlist if train==1
lassoinfo
predict y_elanet, xb
* Multiple Linear Regression Model
regress pira p401k $xlistexo if train==1 // fit on the training sample only
predict y_mlr, xb
* Compare Predictive Performance
foreach var of varlist y_ridge y_lasso y_elanet y_mlr {
    * squared prediction error, averaged separately over each sample
    quietly generate `var'errorsq1 = (`var'-pira)^2
    quietly summarize `var'errorsq1 if train == 1
    quietly scalar mse`var'train1 = r(mean)
    quietly summarize `var'errorsq1 if train == 0
    quietly scalar mse`var'test1 = r(mean)
    display "Predictor: " "`var'" _col(21) ///
        " Train MSE = " %8.6f mse`var'train1 " Test MSE = " %8.6f mse`var'test1
}
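
* Stata's lassogof command offers a built-in alternative to the loop above. A
* sketch, assuming the three penalized fits are re-estimated and stored under
* the illustrative names ridge, lasso, and elanet. Because cross-validation
* folds are drawn at random, the selected penalty (and hence the MSE) can
* differ from the fits above unless rseed() is specified.
quietly elasticnet linear pira p401k $rlist if train==1, alpha(0)
estimates store ridge
quietly lasso linear pira p401k $rlist if train==1
estimates store lasso
quietly elasticnet linear pira p401k $rlist if train==1
estimates store elanet
lassogof ridge lasso elanet, over(train)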

* For Comparison: OLS Estimation of Multiple Linear Regression

regress pira p401k inc incsq age agesq
* Partialed-Out Lasso-OLS
poregress pira p401k, controls($rlist)
lassoinfo
predict y_lassools, xb
* For Comparison: IV Estimation of Multiple Linear Regression
ivregress 2sls pira inc incsq age agesq (p401k = e401k)
* Partialed-Out Lasso-IV
poivregress pira (p401k=e401k), controls($rlist)
lassoinfo
predict y_lassoiv, xb
* Compare Predictive Performance
foreach var of varlist y_ridge y_lasso y_elanet y_mlr y_lassools y_lassoiv {
    * squared prediction error, averaged separately over each sample
    quietly generate `var'errorsq2 = (`var'-pira)^2
    quietly summarize `var'errorsq2 if train == 1
    quietly scalar mse`var'train2 = r(mean)
    quietly summarize `var'errorsq2 if train == 0
    quietly scalar mse`var'test2 = r(mean)
    display "Predictor: " "`var'" _col(21) ///
        " Train MSE = " %8.6f mse`var'train2 " Test MSE = " %8.6f mse`var'test2
}
