0% found this document useful (0 votes)

9 views

stata codes

The document provides a comprehensive guide on using Stata for data analysis, covering commands for data inspection, summarization, statistical tests, and graphing. It includes practical examples for commands like 'describe', 'summarize', 'ttest', and 'regress', as well as techniques for data manipulation such as creating and recoding variables. Additionally, it discusses handling missing values and subsetting data to streamline analysis.

Uploaded by

asim

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views

stata codes

Uploaded by

asim

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

You are on page 1/ 8

clear

set more off

*close log
*log using SB2, replace

*cd"D:\1 SANDEE\standee"
*insheet using SMOKE.csv, clear

*Fundamentals of Using Stata (part I)

use "D:\0 Stata training\auto.dta", clear

*The describe command shows you basic information about a Stata data file.

describe

*The codebook command is a great tool for getting a quick overview of the variables
in the data file.
*It produces a kind of electronic codebook from the data file. Have a look at what
it produces below.

codebook

*Another useful command for getting a quick overview of a data file is the inspect
command. Here is what the inspect command produces for the auto data file.
inspect

*The list command is useful for viewing all or a range of observations. Here we
look at make, price, mpg, rep78 and foreign for the first 10 observations.
list make price mpg rep78 foreign in 1/10

*Creating tables
*The tabulate command is useful for obtaining frequency tables. Below, we make a
table for rep78 and a table for foreign. The command can also be shortened to tab.

tabulate rep78
tabulate foreign

*The tab1 command can be used as a shortcut to request tables for a series of
variables
*(instead of typing the tabulate command over and over again for each variable of
interest).

tab1 rep78 foreign

*We can use the plot option to make a plot to visually show the tabulated values.

tabulate rep78, plot

tabulate rep78 foreign

tabulate rep78 foreign, column

tabulate rep78 foreign, column nofreq

*Generating summary statistics with summarize

summarize mpg

summarize mpg, detail

*To get these values separately for foreign and domestic, we could use the by
foreign: prefix as shown below.
*Note that we first had to sort the data before using by foreign:.

sort foreign
by foreign: summarize mpg // not an efficient way so use

tabulate foreign, summarize(mpg)

tabulate rep78, summarize(price)

///////////////////////////////////////////////////////////////////////////////////
//////////////////

*USING IF WITH STATA COMMANDS | STATA LEARNING MODULES

Most Stata commands can be followed by if, for example

*Summarize if rep78 equals 2

summarize if rep78 == 2

*Summarize if rep78 is greater than or equal to 2

summarize if rep78 >= 2

*Summarize if rep78 greater than 2

summarize if rep78 > 2

*Summarize if rep78 less than or equal to 2

summarize if rep78 <= 2

*Summarize if rep78 less than 2

summarize if rep78 <2

*Summarize if rep78 not equal to 2

summarize if rep78 != 2

*If expressions can be connected with

*| for OR
*& for AND

*Missing Values

*Missing values are represented as '.' and are the highest value possible.
Therefore, when values are missing, be careful with commands like

summarize if rep78 > 3

summarize if rep78 >= 3
summarize if rep78 != 3

*to omit missing values, use

summarize if rep78 > 3 & !missing(rep78)

summarize if rep78 >= 3 & !missing(rep78)
summarize if rep78 != 3 & !missing(rep78)

//////////////////////////////////////////////////////////////////////////////

some common statistical tests in Stata

*t-tests
*Let�s do a t-test comparing the miles per gallon (mpg) of foreign and domestic
cars.

ttest mpg , by(foreign)

*Chi-square
*Let�s compare the repair rating (rep78) of the foreign and domestic cars. We can
make a crosstab of rep78 by foreign.
*We may want to ask whether these variables are independent. We can use the chi2
option to request
*a chi-square test of independence as well as the crosstab.

tabulate rep78 foreign, chi2

*The chi-square is not really valid when you have empty cells. In such cases when
you have empty cells,
*or cells with small frequencies, you can request Fisher�s exact test with the
exact option.

tabulate rep78 foreign, chi2 exact

*Correlation
*We can use the correlate command to get the correlations among variables. Let�s
look at the correlations among price mpg weight and rep78.
*(We use rep78 in the correlation even though it is not continuous to illustrate
what happens when you use correlate with variables with missing data.)

correlate price mpg weight rep78

correlate price mpg weight rep78, obs

*Regression
*Let�s look at doing regression analysis in Stata. For this example, let�s drop the
cases where rep78 is 1 or 2 or missing.

drop if (rep78 <= 2) | (rep78==.)

regress mpg price weight

*Analysis of variance
*If you wanted to do an analysis of variance looking at the differences in mpg
among the three repair groups, you can use the oneway command to do this.

oneway mpg rep78

oneway mpg rep78, tabulate

*If you want to include covariates, you need to use the anova command. The
continuous(price weight) option tells Stata that those variables are covariates.
anova mpg rep78 c.price c.weight

///////////////////////////////////////////////////////////////////////////////////
///////////////////////////////////////////////////

*AN OVERVIEW OF STATA SYNTAX

summarize
summarize mpg price
summarize mpg price if (foreign == 1)
summarize mpg price if foreign == 1 & mpg <30
summarize mpg price if foreign == 1 & mpg <30 , detail
summarize in 1/10

sort foreign
by foreign: summarize

///////////////////////////////////////////////////////////////////////////////////
///////////////////////////////////////////////////

*INTRODUCTION TO GRAPHS IN STATA

histogram mpg

*If you are creating a histogram for a categorical variable such as rep78, you can
add the option discrete.

hist rep78, percent discrete

*The graph box command can be used to produce a boxplot which can help you examine
the distribution of mpg.
*If mpg were normally distributed, the line (the median) would be in the middle of
the box

graph box mpg

*The boxplot can be done separately for foreign and domestic cars using the by( )
or over( ) option.

graph box mpg, by(foreign)

graph box mpg, over(foreign) noout \\ remove outliers

*pie chqrts

graph pie, over(rep78) plabel(_all name) title("Repair Record 1978")

*Scqtter Plot
graph twoway scatter mpg weight

twoway scatter mpg weight

twoway lfit mpg weight

twoway (scatter mpg weight) (lfit mpg weight)

twoway (scatter mpg weight, mlabel(make) ) (lfit mpg weight)

twoway (scatter mpg weight, mlabel(make) mlabangle(45)) (lfit mpg weight)

graph matrix mpg weight price \\ matrix graph

\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\
\\\\\

Reading Data in STATA

des
sum

generate price2 = 2*price

save auto

generate price3 = 3*price

save auto2 \\\ give error

save auto, replace

\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\
\\\

*LABELING DATA

describe

label data "This file contains auto data for the year 1978"

label variable rep78 "the repair record from 1978"

label variable price "the price of the car in 1978"
label variable mpg "the miles per gallon for the car"

\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\
\\\\

*CREATING AND RECODING VARIABLES

*The variable length contains the length of the car in inches. Below we see summary
statistics for length.

summarize length

generate len_ft = length / 12

summarize length len_ft

generate length2 = length^2

summarize length2
generate loglen = log(length)

summarize loglen

summarize length

generate zlength = (length - 187.93) / 22.27

summarize zlength

*Recoding new variables using generate and replace

*Suppose that we wanted to break mpg down into three categories. Let�s look at a
table of mpg to see where we might draw the lines for such categories.

tabulate mpg

*Let�s convert mpg into three categories to help make this more readable. Here we
convert mpg into three categories using generate and replace.

generate mpg3 = .

replace mpg3 = 1 if (mpg <= 18)

replace mpg3 = 2 if (mpg >= 19) & (mpg <=23)

replace mpg3 = 3 if (mpg >= 24) & (mpg <.)

tabulate mpg mpg3

tabulate mpg3 foreign, column

*Recoding variables using recode

generate mpg3a = mpg

recode mpg3a (min/18=1) (19/23=2) (24/max=3)

tabulate mpg mpg3a

\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\
\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\

*SUBSETTING DATA

*Keeping and dropping variables

*Suppose we want to just have make mpg and price, we can keep just those variables,
as shown below.

keep make mpg price

*now if we are not interested in the variables displ and gear_ratio. We can get rid
of them using the drop command shown below.
drop displ gear_ratio

*Keeping and dropping observations

*he variable rep78 has values 1 to 5, and also has some missing values, as shown
below.

tabulate rep78 , missing

drop if missing(rep78)

*The keep if command can be used to eliminate observations, except that the part
after the keep if specifies which observations should be kept.

keep if (rep78 <= 3)

\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\
\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\\

SAP ABAP Performance Tuning
From Everand
SAP ABAP Performance Tuning
May
4.5/5 (28)
Stata Cheat Sheets
100% (1)
Stata Cheat Sheets
6 pages
Azure App Service
No ratings yet
Azure App Service
1,959 pages
AZ-104 Study Guide
0% (1)
AZ-104 Study Guide
14 pages
Topic 3-SPSS and STATA
100% (1)
Topic 3-SPSS and STATA
73 pages
Computing New Variables Using Generate and Replace
No ratings yet
Computing New Variables Using Generate and Replace
9 pages
Data analysis using stata
No ratings yet
Data analysis using stata
13 pages
Cheat Sheet: With Stata 15
No ratings yet
Cheat Sheet: With Stata 15
1 page
Lec11-Stata Regression
No ratings yet
Lec11-Stata Regression
9 pages
AllCheatSheets Stata v15 PDF
No ratings yet
AllCheatSheets Stata v15 PDF
6 pages
Cheat Sheet: With Stata 15
No ratings yet
Cheat Sheet: With Stata 15
6 pages
AllCheatSheets Stata v15
100% (1)
AllCheatSheets Stata v15
6 pages
AllCheatSheets_Stata_v15
No ratings yet
AllCheatSheets_Stata_v15
6 pages
All Cheat Sheets
No ratings yet
All Cheat Sheets
5 pages
Stata Demo 3 Econ 396A F2016
No ratings yet
Stata Demo 3 Econ 396A F2016
12 pages
Cheat Sheet: With Stata
No ratings yet
Cheat Sheet: With Stata
6 pages
6 Stata-1
No ratings yet
6 Stata-1
2 pages
Introduction To Stata: 1 Data Manipulation
No ratings yet
Introduction To Stata: 1 Data Manipulation
6 pages
Getting Started With Stata 11.2
No ratings yet
Getting Started With Stata 11.2
136 pages
Summary of Basic STATA Commands and Syntax
No ratings yet
Summary of Basic STATA Commands and Syntax
5 pages
Statacheatsheets
No ratings yet
Statacheatsheets
6 pages
StataCheatSheet Analysis
No ratings yet
StataCheatSheet Analysis
1 page
Introduction To Stata: Li-Pin Juan
No ratings yet
Introduction To Stata: Li-Pin Juan
41 pages
Stata
No ratings yet
Stata
6 pages
Data Analysis
No ratings yet
Data Analysis
1 page
Creating New Variables: Generate and Replace
No ratings yet
Creating New Variables: Generate and Replace
7 pages
Stata Datawork
No ratings yet
Stata Datawork
22 pages
An Introduction To Stata Graphics
No ratings yet
An Introduction To Stata Graphics
53 pages
Computing For Research I: Spring 2012
No ratings yet
Computing For Research I: Spring 2012
34 pages
Introduction To Stata and Data Management
No ratings yet
Introduction To Stata and Data Management
30 pages
Stat A Cheat Sheets
No ratings yet
Stat A Cheat Sheets
6 pages
STATA
No ratings yet
STATA
26 pages
Data_Wrangling Analysis
No ratings yet
Data_Wrangling Analysis
26 pages
stata data managment ALEX final
No ratings yet
stata data managment ALEX final
44 pages
Stata Reference Suz Release 7 Stata Press instant download
No ratings yet
Stata Reference Suz Release 7 Stata Press instant download
77 pages
Stata Introduction To Stata
No ratings yet
Stata Introduction To Stata
12 pages
Stata Reference Manual: What You Should Know About Stata After Taking The Stata Introduction Course
No ratings yet
Stata Reference Manual: What You Should Know About Stata After Taking The Stata Introduction Course
26 pages
Stata Cheat Sheet: Command in "User" Menu Useful For What? Additional Options More Info
No ratings yet
Stata Cheat Sheet: Command in "User" Menu Useful For What? Additional Options More Info
2 pages
Getting Started With Your Data: Using Stata
No ratings yet
Getting Started With Your Data: Using Stata
32 pages
rtabstat
No ratings yet
rtabstat
6 pages
Stata - Tips PDF
100% (1)
Stata - Tips PDF
114 pages
ECON6067 Stata (I) 2022
No ratings yet
ECON6067 Stata (I) 2022
28 pages
Stat A Tutorial
No ratings yet
Stat A Tutorial
40 pages
An Introduction To Stata For Economists: Data Management
No ratings yet
An Introduction To Stata For Economists: Data Management
49 pages
stata notes
No ratings yet
stata notes
7 pages
Introduction To STATA
No ratings yet
Introduction To STATA
57 pages
Advanced Stata Workshop
No ratings yet
Advanced Stata Workshop
59 pages
Stata Application Part I
No ratings yet
Stata Application Part I
27 pages
MGMT 469 Helpful Stata Commands
No ratings yet
MGMT 469 Helpful Stata Commands
8 pages
Gsu
No ratings yet
Gsu
150 pages
software material
No ratings yet
software material
13 pages
STATAforEconWorkshop3
No ratings yet
STATAforEconWorkshop3
12 pages
6.1_stata
No ratings yet
6.1_stata
62 pages
Introduction To Stata 2012 - Econ4150
No ratings yet
Introduction To Stata 2012 - Econ4150
17 pages
Stata Basics13
No ratings yet
Stata Basics13
23 pages
Introduction To Stata Data Management: Chang Y. Chung Office of Population Research Princeton University September 2013
100% (1)
Introduction To Stata Data Management: Chang Y. Chung Office of Population Research Princeton University September 2013
24 pages
Stataguide
No ratings yet
Stataguide
17 pages
Advanced Stata Skills
No ratings yet
Advanced Stata Skills
10 pages
Lisp Programming Language
From Everand
Lisp Programming Language
Faiz ul haque Zeya
No ratings yet
Projects With Microcontrollers And PICC
From Everand
Projects With Microcontrollers And PICC
Guillermo Perez Guillen
5/5 (1)
Graphs with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
From Everand
Graphs with MATLAB (Taken from "MATLAB for Beginners: A Gentle Approach")
Peter Kattan
4/5 (2)
CSS Grid Layout
From Everand
CSS Grid Layout
Abdelfattah Ragab
No ratings yet
Galliyat Forest
No ratings yet
Galliyat Forest
19 pages
statistics - solutions
No ratings yet
statistics - solutions
3 pages
Presentation1
No ratings yet
Presentation1
12 pages
2017 Econ Intro Lecture 1
No ratings yet
2017 Econ Intro Lecture 1
43 pages
Immediate download INFORMATION RETRIEVAL a biomedical and health perspective 4th Edition William Hersh ebooks 2024
100% (3)
Immediate download INFORMATION RETRIEVAL a biomedical and health perspective 4th Edition William Hersh ebooks 2024
55 pages
Kmu Mdcat 2023 With Keys
No ratings yet
Kmu Mdcat 2023 With Keys
15 pages
Toshiba Machine Co., LTD.: User's Manual Product SHAN5 Version 1.12
No ratings yet
Toshiba Machine Co., LTD.: User's Manual Product SHAN5 Version 1.12
39 pages
ED550EL
No ratings yet
ED550EL
12 pages
Solar Battery Catalogue
No ratings yet
Solar Battery Catalogue
16 pages
How To Replace Timing Belt On Peugeot 307 2.0 HDi 2005-2007
No ratings yet
How To Replace Timing Belt On Peugeot 307 2.0 HDi 2005-2007
8 pages
Website Development and Hosting
No ratings yet
Website Development and Hosting
60 pages
Strategic Management 2nd Edition Rothaermel Solutions Manual 1
100% (68)
Strategic Management 2nd Edition Rothaermel Solutions Manual 1
19 pages
Ez 1 Timer Dims
No ratings yet
Ez 1 Timer Dims
1 page
MGP1-TPS-AOS-MS-2105-0001 DATA SHEET FOR DIESEL STORAGE TANK with comments code B
No ratings yet
MGP1-TPS-AOS-MS-2105-0001 DATA SHEET FOR DIESEL STORAGE TANK with comments code B
4 pages
E-Brochure BRABUS Exterior
No ratings yet
E-Brochure BRABUS Exterior
10 pages
Microsoft Word - Handling Precautions and Guideline User Manual
No ratings yet
Microsoft Word - Handling Precautions and Guideline User Manual
3 pages
BS en 12201 3
No ratings yet
BS en 12201 3
32 pages
Final Submission
No ratings yet
Final Submission
28 pages
S257a (s40) Requirement To Provide Biometrics
No ratings yet
S257a (s40) Requirement To Provide Biometrics
4 pages
Case Diagram ... 11 Class Diagram ..13 ER Diagram ..14 Database Design ... 17
No ratings yet
Case Diagram ... 11 Class Diagram ..13 ER Diagram ..14 Database Design ... 17
81 pages
The Complete Guide To Marketing Automation
No ratings yet
The Complete Guide To Marketing Automation
33 pages
SMP-SE-012 Temporary Repair
No ratings yet
SMP-SE-012 Temporary Repair
16 pages
Tax Invoice Cum Acknowledgement Receipt of PAN Application (Form 49A)
No ratings yet
Tax Invoice Cum Acknowledgement Receipt of PAN Application (Form 49A)
1 page
L3-Manufacturing Considerations
No ratings yet
L3-Manufacturing Considerations
24 pages
X-Screen 3D R[48]
No ratings yet
X-Screen 3D R[48]
2 pages
Submission of Assessment For Midas Labs Internship Cum PPO Recruitment Drive-2024 Graduating Batch
No ratings yet
Submission of Assessment For Midas Labs Internship Cum PPO Recruitment Drive-2024 Graduating Batch
25 pages
Model Paper 1 MCQ
No ratings yet
Model Paper 1 MCQ
8 pages
Solar Street Lights-1
No ratings yet
Solar Street Lights-1
23 pages
logitech-wireless-combo-mk345
No ratings yet
logitech-wireless-combo-mk345
16 pages
Module 3 SW5
No ratings yet
Module 3 SW5
13 pages
Comp Rog
No ratings yet
Comp Rog
2 pages
Solidworks Flow Simulation
0% (1)
Solidworks Flow Simulation
7 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

stata codes

Uploaded by

stata codes

Uploaded by

clear

set more off

*Fundamentals of Using Stata (part I)

use "D:\0 Stata training\auto.dta", clear

tab1 rep78 foreign

tabulate rep78, plot

tabulate rep78 foreign

tabulate rep78 foreign, column

tabulate rep78 foreign, column nofreq

*Generating summary statistics with summarize

summarize mpg, detail

tabulate foreign, summarize(mpg)

*USING IF WITH STATA COMMANDS | STATA LEARNING MODULES

Most Stata commands can be followed by if, for example

*Summarize if rep78 equals 2

*Summarize if rep78 is greater than or equal to 2

*Summarize if rep78 greater than 2

*Summarize if rep78 less than or equal to 2

*Summarize if rep78 less than 2

*Summarize if rep78 not equal to 2

*If expressions can be connected with

summarize if rep78 > 3

*to omit missing values, use

summarize if rep78 > 3 & !missing(rep78)

some common statistical tests in Stata

ttest mpg , by(foreign)

tabulate rep78 foreign, chi2

tabulate rep78 foreign, chi2 exact

correlate price mpg weight rep78

drop if (rep78 <= 2) | (rep78==.)

oneway mpg rep78

*AN OVERVIEW OF STATA SYNTAX

*INTRODUCTION TO GRAPHS IN STATA

hist rep78, percent discrete

graph box mpg

graph box mpg, by(foreign)

graph box mpg, over(foreign) noout \\ remove outliers

graph pie, over(rep78) plabel(_all name) title("Repair Record 1978")

twoway scatter mpg weight

twoway lfit mpg weight

twoway (scatter mpg weight) (lfit mpg weight)

twoway (scatter mpg weight, mlabel(make) mlabangle(45)) (lfit mpg weight)

graph matrix mpg weight price \\ matrix graph

Reading Data in STATA

generate price2 = 2*price

generate price3 = 3*price

save auto2 \\\ give error

save auto, replace

label variable rep78 "the repair record from 1978"

*CREATING AND RECODING VARIABLES

generate len_ft = length / 12

summarize length len_ft

generate length2 = length^2

generate zlength = (length - 187.93) / 22.27

*Recoding new variables using generate and replace

replace mpg3 = 1 if (mpg <= 18)

replace mpg3 = 2 if (mpg >= 19) & (mpg <=23)

replace mpg3 = 3 if (mpg >= 24) & (mpg <.)

tabulate mpg mpg3

tabulate mpg3 foreign, column

*Recoding variables using recode

generate mpg3a = mpg

recode mpg3a (min/18=1) (19/23=2) (24/max=3)

tabulate mpg mpg3a

*Keeping and dropping variables

keep make mpg price

*Keeping and dropping observations

tabulate rep78 , missing

keep if (rep78 <= 3)

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.