Stata Class Notes: Modifying Data http://www.ats.ucla.edu/stat/stata/notes/modifying12.htm

Stata Class Notes

Modifying Data

1.0 Stata commands in this unit

codebook          Show codebook information for the file
order             Order the variables in a data set
label data        Apply a label to a data set
label variable    Apply a label to a variable
label define      Define value labels for a categorical variable
label values      Apply value labels to a variable
encode            Create a numeric version of a string variable
list              List the observations
rename            Rename a variable
recode            Recode the values of a variable
notes             Apply notes to the data file
generate          Create a new variable
replace           Replace values of an existing variable
egen              Extended generate - has special functions that can be used when creating a new variable

2.0 Demonstration and explanation

use http://www.ats.ucla.edu/stat/data/hs0, clear

Let's use the codebook command to see what our variables look like. Because we have not listed any
variables after the command, Stata will show us the codebook for all of the variables.

codebook

First, let's order the variables in a way that makes sense. While there are several possible orderings
that are logical, we will put the id variable first, followed by the demographic variables, such as
gender, ses and prgtype. We will put the variables regarding the test scores at the end. Note that
order moves the listed variables to the front and leaves the others in their existing order.

order id gender

Now let's include some variable and value labels so that we know a little more about the variables.

label variable schtyp "type of school"

label define scl 1 public 2 private
label values schtyp scl
codebook schtyp
list schtyp in 1/10
list schtyp in 1/10, nolabel
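
The two-step pattern above (define a label, then attach it to a variable) can be tried on a small made-up data set. This is only a sketch: the three observations are invented rather than taken from hs0, and it should be run in a fresh Stata session because clear drops the data in memory.

```stata
* Toy data, not from hs0: three made-up school types
clear
input schtyp
1
2
1
end

* Step 1: define the mapping. Step 2: attach it to the variable.
label define scl 1 "public" 2 "private"
label values schtyp scl

* label list shows the stored mapping; nolabel shows the underlying codes
label list scl
list schtyp
list schtyp, nolabel
```

The variable still holds the numbers 1 and 2; the labels only change how they are displayed.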

Now let's create a new numeric version of the string variable prgtype. We will call our new variable
prog.

encode prgtype, gen(prog)

label variable prog "type of program"


codebook prog
list prog in 1/10
list prog in 1/10, nolabel
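
To see which numeric codes encode assigns, it can help to run it on a few made-up strings. The sketch below uses invented values (not the hs0 data) and should be run in a fresh session, since clear drops the data in memory.

```stata
* Toy data, not from hs0
clear
input str10 prgtype
vocational
academic
general
academic
end

* encode numbers the distinct strings 1, 2, 3, ... in alphabetical order
* and stores the original strings as value labels named after the new variable
encode prgtype, gen(prog)

label list prog
list prgtype prog, nolabel
```

Here academic becomes 1, general becomes 2, and vocational becomes 3, regardless of the order in which they appear in the data.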

The variable gender may give us trouble in the future because it is difficult to know what the 1s and
2s mean. We will rename it to female and recode it so that 1 means female and 0 means male.

rename gender female

recode female (1=0)(2=1)
label define fm 1 female 0 male
label values female fm
codebook female
list female in 1/10
list female in 1/10, nolabel
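
One way to check a recode like this is to keep the original variable and cross-tabulate old against new. The sketch below does that with the generate() option of recode instead of rename; it is an alternative to the steps in these notes, shown on made-up data, and should be run in a fresh session because clear drops the data in memory.

```stata
* Toy data, not from hs0
clear
input gender
1
2
2
1
end

* Recode into a new variable so the original stays available for checking
recode gender (1=0) (2=1), generate(female)
label define fm 0 "male" 1 "female"
label values female fm

* Every gender == 1 should fall under male, every gender == 2 under female
tabulate gender female
```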

Let's recode the value 5 in the variable race to be missing.

list race if race == 5

recode race 5 = .
list race if race == .

Now let's create a variable that is a total of some of the test scores.

generate total = read + write + math + science

summarize total

Note that there are five missing values of total because there are five missing values of science.
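
The reason is that generate returns a missing result whenever any term in the expression is missing. A minimal sketch on three made-up observations (not the hs0 data; run it in a fresh session because clear drops the data in memory):

```stata
* Toy data, not from hs0: the third observation has no science score
clear
input read write math science
57 52 41 47
68 59 53 63
44 33 54 .
end

generate total = read + write + math + science

* Any missing component makes the whole sum missing
count if missing(total)
list if missing(total)
```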

Now let's see if we can assign some letter grades to these test scores.

recode total (0/140=0 F) (141/180=1 D) (181/210=2 C) (211/234=3 B) (235/300=4 A), gen(grade)

label variable grade "combined grades of read, write, math, science"
codebook grade
list read write math science total grade in 1/10
list read write math science total grade in 1/10, nolabel
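
To see how the range boundaries behave, the same recode can be run on a handful of made-up totals. This sketch is illustrative only (invented values, fresh session assumed because clear drops the data in memory). Ranges are inclusive at both ends and the rules assume whole-number totals: a value such as 140.5 would match no rule and be copied unchanged.

```stata
* Toy data, not from hs0: totals chosen to sit on the range boundaries
clear
input total
120
141
210
211
250
.
end

recode total (0/140=0 F) (141/180=1 D) (181/210=2 C) (211/234=3 B) (235/300=4 A), gen(grade)

* The missing total matches no rule, so grade is missing there as well
list total grade
list total grade, nolabel
```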

Let's label the dataset itself so that we will remember what the data are. We can also add some notes
to the data set.

label data "High School and Beyond"

notes female: the variable gender was renamed to female

notes race: values of race coded as 5 were recoded to be missing
notes

Stata has another way of generating new variables called egen, which stands for extended generate. The egen command is a
useful tool for many specialized situations.

In our first example, we will use egen to create standard scores for the variable read.

egen zread = std(read)

summarize zread
list read zread in 1/10
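
A standard score is the value minus the mean, divided by the standard deviation. The sketch below repeats that calculation by hand next to egen's std() on five made-up scores, so the two can be compared. The variable name zcheck is introduced here for illustration only, and the block should be run in a fresh session because clear drops the data in memory.

```stata
* Toy data, not from hs0
clear
input read
57
68
44
63
47
end

egen zread = std(read)

* Same calculation by hand, using the results summarize leaves in r()
summarize read
generate zcheck = (read - r(mean)) / r(sd)

* The two columns should agree up to rounding
list read zread zcheck
summarize zread
```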

Next we will create a variable that has the mean of read for each level of ses.

egen readmean = mean(read), by(ses)

list read ses readmean in 1/10
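
Group means can also be requested with the bysort prefix instead of the by() option. The sketch below shows both forms side by side on made-up data; readmean2 is a name chosen here for illustration, and the block should be run in a fresh session because clear drops the data in memory.

```stata
* Toy data, not from hs0: two ses groups
clear
input ses read
1 40
1 50
2 60
2 70
2 80
end

* Option form, as used in these notes
egen readmean = mean(read), by(ses)

* Prefix form; bysort sorts on ses first, then computes within each group
bysort ses: egen readmean2 = mean(read)

* Every observation carries the mean of its own group
list ses read readmean readmean2
```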

Now we will compute the average of several variables for each observation. Please note that there will be a mean for observation 9
even though it has a missing value for science.

egen row_mean = rowmean(read write math science)

list read write math science row_mean in 1/10
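
The rowmean() function averages over the nonmissing values only, so an observation with one missing score is averaged over the remaining three. A minimal sketch on two made-up observations follows; the rowmiss() call and the name n_missing are additions for illustration, not part of these notes, and the block should be run in a fresh session because clear drops the data in memory.

```stata
* Toy data, not from hs0: the second observation has no science score
clear
input read write math science
57 52 41 47
44 33 54 .
end

egen row_mean = rowmean(read write math science)

* rowmiss() counts how many of the listed variables are missing in each row
egen n_missing = rowmiss(read write math science)

list
```

Keeping a count such as n_missing next to the mean makes it easy to spot rows whose average rests on fewer scores.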

These are just a few of the many useful egen functions built into Stata.

Finally, we will save our data and continue on to the next unit.

save hs1
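
By default save will not overwrite an existing file, so running these commands a second time would stop with an error at this step. A common pattern, sketched here on made-up data with the hypothetical file name hs1_demo, is to add the replace option. Run it in a fresh session because clear drops the data in memory.

```stata
* Toy data; hs1_demo is a made-up file name used only for this sketch
clear
input id
1
2
end

* replace allows save to overwrite hs1_demo.dta if it already exists
save hs1_demo, replace

* Read the file back to confirm it was written
use hs1_demo, clear
describe
```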


3.0 For more information

Data Management Using Stata: A Practical Handbook, Chapters 4-5
Statistics with Stata 12, Chapter 2
Gentle Introduction to Stata, Revised Third Edition, Chapter 3
Data Analysis Using Stata, Third Edition, Chapter 5
An Introduction to Stata for Health Researchers, Third Edition, Chapters 7-8

Stata Learning Modules
    Labeling data
    Creating and recoding variables

Stata Frequently Asked Questions
    How can I quickly convert many string variables into numeric variables?
    How can I quickly recode continuous variables into groups?
    How do I standardize variables in Stata?
