0% found this document useful (0 votes)

25 views9 pages

Stata Review

Uploaded by

laura.tello

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views9 pages

Stata Review

Uploaded by

laura.tello

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Econometrics I

Stata review

Prof. Miguel Ángel Borrella Mas1

1
School of Economics and Business Administration , Universidad de Navarra

February 9, 2023

1
1 Directories and folders

What Stata looks like:

To specify working directory:

• cd “c:/Stata15” −→ Change directory to “c:/Stata15”

• mkdir stata review −→ Create a new directory within the current one (here, C:/Stata15/stata review)

• dir −→ List contents of directory or folder

Notice! Stata is case sensitive, so it will not recognise the command CD or Cd. To work in an

already created folder you have to specify the root −→ cd “C:/Stata15/stata review”

Playing with data into Stata

DATA: Table of numeric and string (non-numeric) variables. Usually each row is an observation;

each column is a variable.

To read the files saved in Stata-format (ending with .dta) in your current directory, you can use

several alternatives —the clear option will clear the revised dataset currently in memory before

opening the other one:

2
1. use caschool.dta, clear

2. use “C:/Stata15/stata review/caschool.dta”, clear

3. use caschool, clear

4. Or just open the file using the button

To import files saved in other format (for example, an excel file) in your current directory, you

can use the option “Archive/Import/” and then select the corresponding option according to the

structure of your original file. Notice! Once you have done this procedure the first time, you can

copy/paste the command from the review windows in order to save it for the future.

Once opened the data, to LOOK at the data use the command browse or just press the corresponding

button . Notice! Most commands can be abbreviated, which saves some typing. For example,

browse can be abbreviated: br or b. The abbreviations are noted in the Stata manuals.

To EDIT new values or change the current ones, write edit (or just ed ) o just press the corresponding

button . Notice! You have to close the data window to continue working in Stata.

2 Keeping track of things

Stata has a number of tools to help you keep track of what work you did to datasets.

Log-files

All output appearing in the Results window can be captured in a log file. To start a log −→ log

using Stata review log, text replace, where the “text replace” allows overwriting the existing log-file

in text format. To pause and resume a log:

• log off −→ Temporarily suspends log file

• log on −→ Resumes log file

These commands can be useful to create a log that contains only results and not intermediate

programming. Finally, to close it (at the end of the do-file): log close.

3
Do-files

Instead of typing commands one-by-one interactively, you can type them all in one go within a

do-file and simply run the do-file once. The results of each command can be recorded in a log-file

for review when the do-file is finished running. Do-files can be written in any text editor, such as

Notepad or Word (making sure to save it as a “Text Only” file). Stata also has its own editor built in

—to open it, click the corresponding button . An example is provided in the following image:

You can write notes along the do-file, so that when you look back over it, you know what you

were trying to achieve with each command or set of commands. You can insert notes in two different

ways:

1. Exercise 1 −→ Stata will ignore a line if it starts with an asterisk

2. Or you can place notes after a command by inserting it inside these pseudo-parentheses, for

example: /* linear regression */

Notice! If your line is too long, you can “cut it” by using the environment delimit. For instance,

the following commands are equivalent:

• reg testscr str avginc comp stu expn stu if gr span==“KK-08”, r

• #delimit;

reg testscr str avginc

comp stu expn stu if gr span==“KK-08”, r;

#delimit cr

4
3 Examining the data

To examine the data within result window −→ list. Notice! To see the further info you have to

press more in the result window. To stop all the information we have to press the button “cancel”.

However, there is a “trick” to block this issue −→ Write at the beginning of the do-file: set more

off. We have different options here:

• list testscr str comp stu −→ To see specific variables

• list testscr str comp stu in 1/5 −→ Or list just some of the observations specifying the numbers

• list testscr str if comp stu==0 −→ To list the variables satisfying some additional conditions

(test scores and str with 0 computers per student are reported)

• list testscr if str==22 & comp stu==0 −→ The test score for districts with str==22 and no

computers per student are reported

Notice! To add variables in command window you can just click on them in variables window.

To report some basic information about the dataset and its variables −→ describe or des or just d.

To describe subset of variables: d testscr str comp stu. Notice! You can save retyping commands

by clicking on them in the Review Window —they will then appear in the Command Window. You

can also cycle back and forth through previous commands using the PageUp and PageDown keys

on your keyboard.

To get extra information on the variables, such as summary statistics of numerics, example data-

points of strings, details of missing values, data ranges, and so on −→ codebook.

To get summary statistics, such as means, standard deviations, minimum and maximum values −→

summarize or sum. Notice! To get additional information about the distribution of the variable

use the detail option: sum testscr, detail.

To produce a frequency table of one variable −→ tabulate gr span or tab gr span. Notice! You must

specify here a variable. If you write two variables, then you obtain the contingency table between

two (categorical) variables: tab gr span county.

5
To calculate and display the correlation or covariance matrix −→ correlate or corr. Notice! You

can use this command to obtain the correlation coefficient between two variables: corr testscr str.

If you want to check if the correlation is significant, then use: pwcorr testscr str, sig.

Remember! It is always useful to plot your data before doing any regression. There are several

alternatives:

• histogram testscr, bin(10) normal −→ To study the distribution of a variable, with the option

normal to check if it can be normally distributed

• scatter testscr str −→ To draw a two-way scatterplot (this is the figure shown in class)

• gr twoway scatter testscr str, by(gr span) −→ To plot graphs for different categories, you can

create a matrix of scatterplots

Notice! You can customize your graph adding a lot of options. See the do-file for an example.

After you are happy with your graph, you can save it using: gr export name.pdf, replace.

4 Generating variables

To create a new variable that is an algebraic expression of other variables −→ generate or gen. For

example, to obtain the interaction between the number of computers and the number of teachers:

gen comp teachers=computer*teachers. Notice! If you write in the previous line of the do-file cap

drop comp teachers, you can re-run the do-file multiples times without obtaining a mistake.

To creates new variables based on summary measures, such as sum, mean, min and max −→ egen:

• egen testscr mean=mean(testscr), by (gr span) −→ To obtain the mean value of test scores by

type of school

• egen totaltestscr=total(testscr), by(gr span) −→ To obtain the total sum of test scores by type

of school

• egen maxtestscr=max(testscr) −→ To obtain the max value of test scores

6
To create a dummy variable, we can follow two alternatives:

1. Use gen and replace: gen small=0 /* create vector of zeros */

replace small=1 if str < 20 /* replace =1 for corresponding values*/

2. Directly: gen small=(str < 20) if str!=.

In any case, it is always useful to see the results using the command tab with the option m (of

missing); and to label the variable with a simple explanation in order to be able to quickly identify

the variable in the future.

Saving the dataset

The command is simply save: save caschool.dta, replace, or save “C:/Stata15/stata review/caschool.dta”,

replace. Notice! The replace option overwrites any previous version of the file in the directory you

try saving to. The only way to alter the original file permanently is to save the revised dataset.

Thus, if you make some changes but then decide you want to restart, just re-open the original file.

5 Linear estimation

To estimate the simple OLS regression −→ regress or reg with the next structure:

reg y x1 x2 . . . xn if, options

For example: reg testscr str. How to read the output table:

• The first variable listed after the regress command is the dependent variable, and all subse-

quently listed variables are the independent variables

• Stata automatically adds the constant term or intercept to the list of independent variables

(type reg, noconstant if you want to exclude it)

• The top-left corner gives the ANOVA decomposition of the sum of squares in the dependent

variable (Total) into the explained (Model) and unexplained (Residual)

• The top-right corner gives the statistical significance results for the model as a whole, e.g.

R-squared

7
• The bottom section gives the results for the individual independent variables, e.g. standard

errors

• To display the estimated variance-covariance matrix of the estimator −→ matrix list e(V)

Notice! You can obtain different CI by writing the option level(n). In addition, if you write eret list

after obtaining the regression, you get general information about the last estimation. Finally, to take

into consideration possible heteroskedasticity, you have to add the robust or r option: reg testscr str,

r. When doing so, you get robust-to-heteroskedasticity standard errors and the subsequent tests are

also hetersokedasticity-robust. To see the estimated variance-covariance matrix of the coefficients

you use the same command: matrix list e(V), but in this case the displayed variance-covariance is

the robust-to-heteroskedasticity matrix.

Estimation by subsamples and interactions

To specify the particular subsample: reg testscr str if gr span==“KK-08”, r. Finally, we can use

three different alternatives to estimate a more sophisticated model with interactions —notice that

you cannot use a string variable in the regression framework, but you can (almost) always create a

numerical variable (type) from a string variable (gr span):

• reg testscr c.str##type, r

• reg testscr str type c.str#type, r

• cap drop str type

gen str type=str*type

reg testscr str type str type, r

Saving regressions

There are several ways to store the output from a regression in a txt, word, excel or LaTeX file.

Among the alternatives, you can use: outreg2 and esttab. Check the problem set related to the stata

class to see an example.

8
6 Hypothesis testing and confidence intervals

The results of each estimation automatically include for each independent variable a t-test for linear

regressions on the null hypothesis that the “true” coefficient is equal to zero. You can also do

this test by writing just after the estimation: test str. In addition, you can do other tests, for

example: test str=-2.

To obtain a confidence interval, we have several alternatives:

• ci means testscr, level(90) −→ CI for the mean of test scores, with α = 10%

• ci means testscr, level(99) −→ α = 1%. Notice that this CI is wider with respect to the

previous one

• bysort small: ci means testscr −→ CI for test scores by small/large class size

Notice! They are only correct if the variable is distributed normally, and asymptotically correct

for all other distributions satisfying the conditions of the CLT

Stata Tutorial
No ratings yet
Stata Tutorial
63 pages
BASH Guide - Joseph DeVeau
100% (2)
BASH Guide - Joseph DeVeau
227 pages
Stata Notes
No ratings yet
Stata Notes
7 pages
Stata Application Part I
No ratings yet
Stata Application Part I
27 pages
Stata Absolute Beginners
No ratings yet
Stata Absolute Beginners
38 pages
Using Stata With The Fundamentals of Political: Science Research
No ratings yet
Using Stata With The Fundamentals of Political: Science Research
20 pages
Stata Prirucnik
No ratings yet
Stata Prirucnik
75 pages
An Introduction To Stata For Economists: Data Management
No ratings yet
An Introduction To Stata For Economists: Data Management
49 pages
Gravity13 Stata
No ratings yet
Gravity13 Stata
80 pages
Stata Excel
No ratings yet
Stata Excel
25 pages
Introduction Stata Slides 2
No ratings yet
Introduction Stata Slides 2
25 pages
Econometrics Computer Exercise Week 1: Introduction Stata + Simple Regression Model
No ratings yet
Econometrics Computer Exercise Week 1: Introduction Stata + Simple Regression Model
4 pages
Advanced Stata
No ratings yet
Advanced Stata
54 pages
Stata: A Brief Introduction
No ratings yet
Stata: A Brief Introduction
9 pages
Applied Econometrics Using Stata
100% (2)
Applied Econometrics Using Stata
100 pages
Tutorial of Stata
No ratings yet
Tutorial of Stata
11 pages
Introduction To Stata: 1 Data Manipulation
No ratings yet
Introduction To Stata: 1 Data Manipulation
6 pages
Lecture 1-2 Applied Econometrics
No ratings yet
Lecture 1-2 Applied Econometrics
68 pages
Software Material
No ratings yet
Software Material
13 pages
Introduction To STATA: Introduction To STATA About STATA Basic Operations Regression Analysis Panel Data Analysis
No ratings yet
Introduction To STATA: Introduction To STATA About STATA Basic Operations Regression Analysis Panel Data Analysis
27 pages
Stata - Tutorial MATERIAL
No ratings yet
Stata - Tutorial MATERIAL
3 pages
A I S ECMT1020: N Ntroduction To Tata
No ratings yet
A I S ECMT1020: N Ntroduction To Tata
15 pages
Stat A Guide
No ratings yet
Stat A Guide
16 pages
A Short Introduction To STATA
No ratings yet
A Short Introduction To STATA
8 pages
An Introduction To Stata For Economists: Data Analysis
No ratings yet
An Introduction To Stata For Economists: Data Analysis
48 pages
A Short Guide To Stata 10 For Windows
No ratings yet
A Short Guide To Stata 10 For Windows
7 pages
Stata
No ratings yet
Stata
6 pages
Applied Econometrics Using Stata
100% (1)
Applied Econometrics Using Stata
100 pages
Computing For Research I: Spring 2012
No ratings yet
Computing For Research I: Spring 2012
34 pages
Training at Gudar Campus
No ratings yet
Training at Gudar Campus
83 pages
Research Methods: Wiji Arulampalam
No ratings yet
Research Methods: Wiji Arulampalam
45 pages
STATA Commands
No ratings yet
STATA Commands
42 pages
Introduction To Stata Software, MaU, 2022
No ratings yet
Introduction To Stata Software, MaU, 2022
93 pages
Basics of STATA Software
No ratings yet
Basics of STATA Software
67 pages
Getting Started With Your Data: Using Stata
No ratings yet
Getting Started With Your Data: Using Stata
32 pages
CH - 1 - Introduction To Econometrics Software Stata
No ratings yet
CH - 1 - Introduction To Econometrics Software Stata
35 pages
Intro To Stata 2022
No ratings yet
Intro To Stata 2022
36 pages
Stata Excel Spreadsheet
No ratings yet
Stata Excel Spreadsheet
43 pages
Command Window: Stata Results Window: Variables Window: Review Window
No ratings yet
Command Window: Stata Results Window: Variables Window: Review Window
3 pages
Stataguide
No ratings yet
Stataguide
17 pages
Stata Manual Introduction
No ratings yet
Stata Manual Introduction
24 pages
Introduction To STATA With Econometrics in Mind: January 2010
No ratings yet
Introduction To STATA With Econometrics in Mind: January 2010
47 pages
Stata Basics13
No ratings yet
Stata Basics13
23 pages
STATA Capacity Building March 8
No ratings yet
STATA Capacity Building March 8
15 pages
STATA
No ratings yet
STATA
26 pages
A Short Guide To Stata 15: Version: 20-9-2021, 22:10
No ratings yet
A Short Guide To Stata 15: Version: 20-9-2021, 22:10
17 pages
Using Stata: The Opening Display
No ratings yet
Using Stata: The Opening Display
16 pages
Stata Slides
No ratings yet
Stata Slides
45 pages
Stata An Introduction Summer 2020
No ratings yet
Stata An Introduction Summer 2020
60 pages
Stata For Survey Analysis
No ratings yet
Stata For Survey Analysis
164 pages
Stata Excel
No ratings yet
Stata Excel
44 pages
Basic Tutorial Stata PDF
No ratings yet
Basic Tutorial Stata PDF
5 pages
Stat A Tutorial
No ratings yet
Stat A Tutorial
40 pages
STATAfor Econ Workshop 3
No ratings yet
STATAfor Econ Workshop 3
12 pages
Manual
No ratings yet
Manual
14 pages
The Mac Terminal Reference and Scripting Primer
From Everand
The Mac Terminal Reference and Scripting Primer
Jay Docherty
4.5/5 (3)
The Project Gutenberg RST Manual
From Everand
The Project Gutenberg RST Manual
Marcello Perathoner
No ratings yet
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet
Base SAS Interview Questions You'll Most Likely Be Asked
From Everand
Base SAS Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
Python: Advanced Guide to Programming Code with Python: Python Computer Programming, #4
From Everand
Python: Advanced Guide to Programming Code with Python: Python Computer Programming, #4
Charlie Masterson
No ratings yet
Bash Command Line Pro Tips
From Everand
Bash Command Line Pro Tips
Jason Cannon
4.5/5 (8)
Value Creation Through Mergers and Acquistion - Eicher Motors
No ratings yet
Value Creation Through Mergers and Acquistion - Eicher Motors
21 pages
DAY 6 PATHFit 1
No ratings yet
DAY 6 PATHFit 1
34 pages
Analysis of Organic Acids 2370 PDF
No ratings yet
Analysis of Organic Acids 2370 PDF
22 pages
PowerPoint Presentation
No ratings yet
PowerPoint Presentation
60 pages
(FREE PDF Sample) Mostly Codeless Game Development: New School Game Engines Robert Ciesla Ebooks
100% (2)
(FREE PDF Sample) Mostly Codeless Game Development: New School Game Engines Robert Ciesla Ebooks
55 pages
Procurement Profile
No ratings yet
Procurement Profile
18 pages
Brand Personality
No ratings yet
Brand Personality
3 pages
Abitha J - Resume
No ratings yet
Abitha J - Resume
1 page
Week 1 - Lecture Prinsip Perakaunan Principles of Accounting (Bt11003)
No ratings yet
Week 1 - Lecture Prinsip Perakaunan Principles of Accounting (Bt11003)
30 pages
CLS Aipmt-18-19 XIII Bot Study-Package-1 SET-1 Chapter-1 PDF
No ratings yet
CLS Aipmt-18-19 XIII Bot Study-Package-1 SET-1 Chapter-1 PDF
38 pages
Ultrasonic Calculator
No ratings yet
Ultrasonic Calculator
6 pages
He Sas 1
No ratings yet
He Sas 1
3 pages
Banyuhay: Katutubong Sayaw Sa Makabagong Pananaw Playbill
No ratings yet
Banyuhay: Katutubong Sayaw Sa Makabagong Pananaw Playbill
18 pages
What Is Failure Mode Effects Analysis
No ratings yet
What Is Failure Mode Effects Analysis
6 pages
Mechatronics Q & A
No ratings yet
Mechatronics Q & A
3 pages
Open Silicon Pakistan Brochure
No ratings yet
Open Silicon Pakistan Brochure
1 page
Morphology of Flowering Plants Learn Cbse
No ratings yet
Morphology of Flowering Plants Learn Cbse
6 pages
Quran & Prime Numbers - Part 2
No ratings yet
Quran & Prime Numbers - Part 2
6 pages
FM Modulators: Experiment 7
100% (2)
FM Modulators: Experiment 7
17 pages
Buckling of Orthotropic, Curved, Sandwich Panels Subjected To Edge Shear Loads
No ratings yet
Buckling of Orthotropic, Curved, Sandwich Panels Subjected To Edge Shear Loads
4 pages
3GPP TS 36.331 V10.12.0 (2013-12)
No ratings yet
3GPP TS 36.331 V10.12.0 (2013-12)
312 pages
Test 03a
No ratings yet
Test 03a
4 pages
Off-Line Programming Techniques For Multirobot Cooperation System
No ratings yet
Off-Line Programming Techniques For Multirobot Cooperation System
17 pages
Principles of Public Speaking Syllabus - Ms. Catherine Linobo
No ratings yet
Principles of Public Speaking Syllabus - Ms. Catherine Linobo
7 pages
Guidelines To Fill Student Data University Marksheet v2.9
No ratings yet
Guidelines To Fill Student Data University Marksheet v2.9
6 pages
A Detailed Lesson Plan in Science Grade 7
No ratings yet
A Detailed Lesson Plan in Science Grade 7
10 pages
Module 4 - Provide Valet Services To Guest
No ratings yet
Module 4 - Provide Valet Services To Guest
5 pages
Paradoxes
No ratings yet
Paradoxes
528 pages
StayingStrong Extract
100% (5)
StayingStrong Extract
13 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Stata Review

Uploaded by

Stata Review

Uploaded by

Econometrics I

Prof. Miguel Ángel Borrella Mas1

What Stata looks like:

To specify working directory:

• cd “c:/Stata15” −→ Change directory to “c:/Stata15”

• dir −→ List contents of directory or folder

Playing with data into Stata

each column is a variable.

opening the other one:

2. use “C:/Stata15/stata review/caschool.dta”, clear

3. use caschool, clear

4. Or just open the file using the button

2 Keeping track of things

in text format. To pause and resume a log:

• log off −→ Temporarily suspends log file

• log on −→ Resumes log file

1. *Exercise 1 −→ Stata will ignore a line if it starts with an asterisk *

example: /* linear regression */

the following commands are equivalent:

• reg testscr str avginc comp stu expn stu if gr span==“KK-08”, r

reg testscr str avginc

comp stu expn stu if gr span==“KK-08”, r;

off. We have different options here:

• list testscr str comp stu −→ To see specific variables

computers per student are reported

points of strings, details of missing values, data ranges, and so on −→ codebook.

use the detail option: sum testscr, detail.

two (categorical) variables: tab gr span county.

normal to check if it can be normally distributed

create a matrix of scatterplots

• egen maxtestscr=max(testscr) −→ To obtain the max value of test scores

1. Use gen and replace: gen small=0 /* create vector of zeros */

replace small=1 if str < 20 /* replace =1 for corresponding values*/

2. Directly: gen small=(str < 20) if str!=.

the variable in the future.

Saving the dataset

reg y x1 x2 . . . xn if, options

quently listed variables are the independent variables

(type reg, noconstant if you want to exclude it)

variable (Total) into the explained (Model) and unexplained (Residual)

also hetersokedasticity-robust. To see the estimated variance-covariance matrix of the coefficients

the robust-to-heteroskedasticity matrix.

Estimation by subsamples and interactions

numerical variable (type) from a string variable (gr span):

• reg testscr c.str##type, r

• reg testscr str type c.str#type, r

• cap drop str type

gen str type=str*type

reg testscr str type str type, r

class to see an example.

example: test str=-2.

To obtain a confidence interval, we have several alternatives:

for all other distributions satisfying the conditions of the CLT

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

1. Exercise 1 −→ Stata will ignore a line if it starts with an asterisk