0% found this document useful (0 votes)

31 views36 pages

Intro To Stata 2022

The document provides an introduction to Stata, a statistical software package used for data analysis, emphasizing its capabilities in data collection, manipulation, and analysis. It outlines the various forms of Stata, key commands for data management and analysis, and the structure of data within the software. Additionally, it explains how to load, explore, and modify datasets, as well as the importance of documentation and reproducibility in research.

Uploaded by

kassekas7

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views36 pages

Intro To Stata 2022

Uploaded by

kassekas7

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 36

Chapter One:

Introduction to
Softwares
Debark University
Department of Economics
Introduction
 Currently the dynamic nature of the world leads
to question among people in their daily lives.
 To answer these questions, the collection,
organization, analysis and interpretation of data is
critical.
 Data are the information that you collect to learn,
draw conclusions, and test hypotheses.
 This data can be collected and stored in numerous
ways, depending on
 the type of data,
 source & context,
 study design,
 data volume & turnaround time and
 data security.
Cont.…
 The field of economic statistics and econometrics
is rapidly changing.
 Increasing data availability combined with
powerful computing and advanced software
allows research to address issues of statistical
inference and analysis in innovative ways.
 Statistical skills enable you to intelligently
collect, analyze and interpret data relevant to
decision-making.
Cont.…
Some of the software
packages for analysis and
collection of data
INTRODUCTION TO
STATA
What is Stata?
 Stata is a general purpose Statistical software
package which is created in 1985 by economists
 Stata is a statistical analysis package, used for
exploring, graphing, summarizing and
manipulating data files.
 The word Stata is a combination of the words
`statistics' and `data.'
 Stata is not an acronym and should not appear all
letters capitalized.
Cont..
 Stata is an integrated statistical analysis
packaged designed for research professionals and
handling and manipulating large data sets.
 It is a multi-purpose statistical package to help
you explore, summarize and analyze datasets.
 Stata utilizes command line interface so users
can type commands to perform specific tasks.
 Users can also run commands in batch using a
do-file.
Cont..
 In addition, Stata has menus and dialog boxes
that give the user access to nearly all built-in
commands.
 Stata is case-sensitive; thus, it distinguishes
between lower and upper case letters.
 Most Stata built-in commands are lower case, a
convention most programmers follow.
Cont..
Forms or ‘flavors’ of Stata
 There are 4 flavors':
 STATA MP (multi-processor) which is the most powerful
 STATA SE (special edition) extended
 STATA IC (Intercooled)
 Small STATA

 Most features are shared by the other

flavors of Stata.
 The version differ basically in terms of
 the number of variables handled
 the speed of processing
Why Stata?
 Documentation and reproducibility of data
and results
 Manipulating data, carrying out statistical
analyses, and producing publication
quality graphics
 Time and energy saver for advanced user
Steps in data analysis
 Locate or gather data

 Load data into software package

 Manipulate as needed

 Analyze
“Data”
 A set of numbers and/or text describing
specific phenomena
 Mortality, drug effectiveness, economy,
weather, traffic, pollution levels, etc.

 Data always organized in rectangular way:

 columns contain “variables”
 rows contain “observations”
Stata windows or
interface
 When Stata is started, a screen opens as shown in Figure
containing four windows labeled:

History
Variable
s

Results

Command line
interface
Windows Cont’d
 Each of the Stata windows can be resized
and moved around in the usual way
 To bring a window forward that may be
obscured by other windows, make the
appropriate selection in the Window
menu.
Ways to use Stata
 Point & click

 Command line interface

 Batch file (called a “do-file”)

Cont..
 Stata has a Graphical User Interface (GUI) that
allows almost all commands to be accessed via
point-and-click.
 Simply start by clicking into the Data, Graphics,
or Statistics menus, make the relevant
selections, fill in a dialog box, and click OK.
 Stata then behaves exactly as if the
corresponding command had been typed with the
command appearing in the Stata Results and
Datasets
 Stata datasets have the .dta extension and
can be loaded into Stata in the usual way
through the File menu
 Data is a set of numbers and/or text
describing specific phenomena
 Mortality, drug effectiveness, economy, weather,
traffic, pollution levels, etc.

 Always rectangular:
Stata file types
 Stata uses and creates many types of files, which are
distinguished by extensions at the end of the filename. The
extensions used by Stata are
 .ado Programs that add commands to Stata, such as the
SPost commands.
 .do Batch files that execute a set of Stata commands.
 .dta Data files in Stata’s format.
 .gph Graphs saved in Stata’s proprietary format.
 .hlp The text displayed when you use the help command.
For example, fitstat.hlp has help for fitstat.
 .log Output saved as plain text by the log using command.
Loading data into Stata
 The dataset may be viewed as a spreadsheet by opening the
Data Browser with the button and edited by clicking to open
the Data Editor

 Stata command:
 use file path/file name.dta, clear
 e.g. use "C:\Users\Malede\Desktop\data.dta", clear
 A command is typed in the Stata Command window and
executed by pressing the Return (or Enter) key.
 working directory: using data, saving data, or logging output.
 type cd in the Command Window and to change use: cd "C:\Users\Malede\
Desktop"
Do-files

Double click
Editor window
Log files
 log allows you to make a full record of your Stata session. A
log is a file containing what you type and Stata's output.
 At the beginning of a Stata session, Press the
button , type a filename into the dialog box, and choose
Save.
 By default, this produces a SMCL (Stata Markup and
Control Language, pronounced ‘smicle’) file with
extension .smcl, but an ordinary ASCII text file can be
produced by selecting the .log extension.
 Log files can also be opened, viewed, and closed by
selecting Log from the File menu, followed by Begin...,
View..., or Close.
 log using mylog, replace
 log using mylog2, name(mylog2)
 . log using firstfile, name(log1) text

 . log using secondfile, name(log2) smcl

Getting help
 Select Stata Command

 Keywords search and press OK from Frequently Asked

Questions (FAQs) are available
 search keywords
 help Keywords
Data input and output
 Stata has its own data format with default extension .dta.
 Reading and saving a Stata file are straightforward.
 use “file path/file name”
 save “file path/file name”
 There are essentially two kinds of variables in Stata: string and
numeric.
 The storage types are byte, int, long, float, and double for
numeric variables and str1 to str80 for string variables of
different lengths.
 Besides the storage type, variables have associated with them a
name, a label, and a format.
Entering Data
 Insheet: Read ASCII (text) data created by a spreadsheet (.csv
files only)
 Infile: Read unformatted ASCII (text) data (space delimited
files)
 Input: Enter data from keyboard
 Describe: Describe contents of data in memory or on disk
 Compress: Compress data in memory
 Save: Store the dataset currently in memory on disk in Stata
data format
 Count: Show the number of observations
 List: List values of variables
Exploring data
 Describe: Describe a dataset
 List List the contents of a dataset
 Codebook: Detailed contents of a dataset
 Log: Create a log file
 Summarize: Descriptive statistics
 Tabstat: Table of descriptive statistics
 Table: Create a table of statistics
 Stem: Stem-and-leaf plot
 Graph: High resolution graphs
 Kdensity: Kernal density plot
 Sort: Sort observations in a dataset
 Histogram: Histogram for continuous and categorical variables
 Tabulate: One- and two-way frequency tables
 Correlate: Correlations
 Pwcorr: Pairwise correlations
 Type: Display an ASCII file
Modifying Data
 label data: Apply a label to a data set
 Order: Order the variables in a data set
 label variable: Apply a label to a variable
 label define: Define a set of a label for the levels of a categorical
variable
 label values: Apply value labels to a variable
 List: Lists the observations
 Rename: Rename a variable
 Recode: Recode the values of a variable
 Notes: Apply notes to the data file
 Generate: Creates a new variable
 Replace: Replaces one value with another value
Managing Data
 Pwd: Show current directory (pwd=print working
directory)
 dir or ls: Show files in current directory
 cd Change directory
 keep if: Keep observations if condition is met
 Keep: Keep variables (dropping others)
 Drop: Drop variables (keeping others)
 append using: Append a data file to current file
 Merge: Merge a data file with current file
Analyzing Data
 Ttest: t-test
 Regress: Regression
 Predict: Predicts after model estimation
 Kdensity: Kernel density estimates and graphs
 Pnorm: Graphs a standardized normal plot
 Qnorm: Graphs a quantile plot
 Rvfplot: Graphs a residual versus fitted plot
 Rvpplot: Graphs a residual versus individual
predictor plot
 Xi: Creates dummy variables during model estimation
 Test: Test linear hypotheses after model estimation
 Oneway: One-way analysis of variance
 Anova: Analysis of variance
 Logistic: Logistic regression
Must-Know Commands

 System  Data Management

 clear  Use
 exit  sysuse
 log  Infile, infix
 set  list
 # delimit  describe
 net  keep, drop
 search  generate, replace, rename
 help  save, out file
Must-Know Commands

 Data Analysis
 summarize  Statistical Analysis
 correlate
 regress
 graph
 predict
 two way, scatter,…

 hist
 test
 dwstat
 hettest
Comments and Notes
 Stata treats lines that begin with an asterisk * or are
located between a pair of /* and */ as comments that are
simply echoed to the output
 If a command continues over two lines, we use /* at the
end of the first line and */ at the beginning of the second
line to make Stata ignore the line break.

 An alternative would be to use /// at the end of the line.

 Variable names are case-sensitive.
Missing value
 A missing values in a numeric variable is represented by a
period ‘.’ (system missing values), or by a period followed by
a letter, such as .a,.b. etc.
 Missing values are interpreted as very large positive
numbers with . < .a < .b, etc.
 Note that this can lead to mistakes in logical expressions.
 Numerical missing value codes (such as ‘−99’) may be
converted to missing values (and vice versa) using the
command mvdecode.
 mvdecode x, mv(-99)
Data management
 Looking at your data
 Browse: opens a spreadsheet in which you can scroll to
look at the data, but you cannot change the data.
 Edit : You can look and change data
 List : creates a list of values of specified variables and
observations.
Cont..
 Getting information about variables
 describe: provides information on the size of
the dataset and the names, labels, and types of
variables.
 codebook summarizes a variable in a format
designed for printing a codebook.
 summarize: provides summary statistics. By
default, summarize presents the number of non
missing observations, the mean, the standard
deviation, the minimum values, and the
maximum. Adding the detail option includes
additional information. Eg. . sum age, detail
 tabulate: creates the frequency distribution for
a variable. If you do not want the value labels

Slab Design Eurocode
100% (2)
Slab Design Eurocode
6 pages
Stata Training Course
No ratings yet
Stata Training Course
43 pages
Unit One Software Aplication in Economics
No ratings yet
Unit One Software Aplication in Economics
34 pages
Introduction To Stata Software, MaU, 2022
No ratings yet
Introduction To Stata Software, MaU, 2022
93 pages
Stata Application Part I
No ratings yet
Stata Application Part I
27 pages
Basics of STATA Software
No ratings yet
Basics of STATA Software
67 pages
Stata
No ratings yet
Stata
6 pages
Introduction Stata Slides 2
No ratings yet
Introduction Stata Slides 2
25 pages
Software Material
No ratings yet
Software Material
13 pages
Training at Gudar Campus
No ratings yet
Training at Gudar Campus
83 pages
Gravity13 Stata
No ratings yet
Gravity13 Stata
80 pages
Stata: A Brief Introduction
No ratings yet
Stata: A Brief Introduction
9 pages
Stata 1
No ratings yet
Stata 1
24 pages
Computing For Research I: Spring 2012
No ratings yet
Computing For Research I: Spring 2012
34 pages
Stata Notes
No ratings yet
Stata Notes
7 pages
Introduction To Stata: Ucla Idre Statistical Consulting Group
No ratings yet
Introduction To Stata: Ucla Idre Statistical Consulting Group
119 pages
Compiled by Solomon Kebede
No ratings yet
Compiled by Solomon Kebede
136 pages
Chapter Three
No ratings yet
Chapter Three
100 pages
Intro Stata
No ratings yet
Intro Stata
126 pages
An Introduction To Stata For Economists: Data Management
No ratings yet
An Introduction To Stata For Economists: Data Management
49 pages
Applied Econometrics Using Stata
100% (2)
Applied Econometrics Using Stata
100 pages
Applied Econometrics Using Stata
100% (1)
Applied Econometrics Using Stata
100 pages
Manual
No ratings yet
Manual
14 pages
6.1 Stata
No ratings yet
6.1 Stata
62 pages
A I S ECMT1020: N Ntroduction To Tata
No ratings yet
A I S ECMT1020: N Ntroduction To Tata
15 pages
Introduction To Stata For Data Management
No ratings yet
Introduction To Stata For Data Management
7 pages
Stata Manual Introduction
No ratings yet
Stata Manual Introduction
24 pages
Introduction To Statistical Computing in Clinical Research: Biostatistics 212
No ratings yet
Introduction To Statistical Computing in Clinical Research: Biostatistics 212
39 pages
Stata Introduction To Stata
No ratings yet
Stata Introduction To Stata
12 pages
Stata - Tutorial MATERIAL
No ratings yet
Stata - Tutorial MATERIAL
3 pages
Introduction To Stata 2024-06-18 Handout
No ratings yet
Introduction To Stata 2024-06-18 Handout
52 pages
CH - 1 - Introduction To Econometrics Software Stata
No ratings yet
CH - 1 - Introduction To Econometrics Software Stata
35 pages
Presentation 1
No ratings yet
Presentation 1
23 pages
STATAfor Econ Workshop 1
No ratings yet
STATAfor Econ Workshop 1
12 pages
Stata0 2008 Quique Moral Benito
No ratings yet
Stata0 2008 Quique Moral Benito
8 pages
Introduction To Stata: 1 Data Manipulation
No ratings yet
Introduction To Stata: 1 Data Manipulation
6 pages
Stata Basics13
No ratings yet
Stata Basics13
23 pages
Stata Prirucnik
No ratings yet
Stata Prirucnik
75 pages
Computing Stata Notes
No ratings yet
Computing Stata Notes
5 pages
Stat A Guide
No ratings yet
Stat A Guide
10 pages
Using Stata: The Opening Display
No ratings yet
Using Stata: The Opening Display
16 pages
Stata Tutorial
No ratings yet
Stata Tutorial
44 pages
What Is Stata?
No ratings yet
What Is Stata?
16 pages
Stata Cheat Sheet: Command in "User" Menu Useful For What? Additional Options More Info
No ratings yet
Stata Cheat Sheet: Command in "User" Menu Useful For What? Additional Options More Info
2 pages
23-24 M4 RM TA Basic Stata Use
No ratings yet
23-24 M4 RM TA Basic Stata Use
19 pages
A Short Introduction To STATA
No ratings yet
A Short Introduction To STATA
8 pages
Getting Started With Your Data: Using Stata
No ratings yet
Getting Started With Your Data: Using Stata
32 pages
Stata Tutorial
No ratings yet
Stata Tutorial
42 pages
Stata Datawork
No ratings yet
Stata Datawork
22 pages
Getting Started With Stata 11.2
No ratings yet
Getting Started With Stata 11.2
136 pages
ECON6067 Stata (I) 2022
No ratings yet
ECON6067 Stata (I) 2022
28 pages
Stata Tutoriel
No ratings yet
Stata Tutoriel
28 pages
ECN3311 - Stata Workshop 1-1
No ratings yet
ECN3311 - Stata Workshop 1-1
12 pages
Zorn - Stata 4 Dummies - 2007
No ratings yet
Zorn - Stata 4 Dummies - 2007
12 pages
Stat A Guide
No ratings yet
Stat A Guide
16 pages
STATA Notes 2022
No ratings yet
STATA Notes 2022
25 pages
SPSS For Beginners: An Illustrative Step-by-Step Approach to Analyzing Statistical data
From Everand
SPSS For Beginners: An Illustrative Step-by-Step Approach to Analyzing Statistical data
Hunt Robert D.
No ratings yet
Crystal Reports Introduction: Versions 2008-2016
From Everand
Crystal Reports Introduction: Versions 2008-2016
Seth Bonder
No ratings yet
Tableau 8.2 Training Manual: From Clutter to Clarity
From Everand
Tableau 8.2 Training Manual: From Clutter to Clarity
Larry Keller
No ratings yet
Study Guide MO-500 Certification Exam Microsoft Access Expert ( Office 2019)
From Everand
Study Guide MO-500 Certification Exam Microsoft Access Expert ( Office 2019)
Anand Vemula
No ratings yet
The Data Detective's Toolkit: Cutting-Edge Techniques and SAS Macros to Clean, Prepare, and Manage Data
From Everand
The Data Detective's Toolkit: Cutting-Edge Techniques and SAS Macros to Clean, Prepare, and Manage Data
Kim Chantala
No ratings yet
UNIT TWO Economics of Agriculture
No ratings yet
UNIT TWO Economics of Agriculture
13 pages
Chapter 4
No ratings yet
Chapter 4
43 pages
Chapter 3
No ratings yet
Chapter 3
42 pages
Institutional & Behavioral Eco CH 2
No ratings yet
Institutional & Behavioral Eco CH 2
21 pages
CH 4 Agricultural Devt LL
No ratings yet
CH 4 Agricultural Devt LL
51 pages
International Eco II Ch-2
No ratings yet
International Eco II Ch-2
47 pages
UNIT ONE Economics of Agriculture
No ratings yet
UNIT ONE Economics of Agriculture
11 pages
Chapter 5
No ratings yet
Chapter 5
40 pages
Chapter 6
No ratings yet
Chapter 6
23 pages
International Economics II Ch-3
No ratings yet
International Economics II Ch-3
62 pages
Crisis and Trauma
No ratings yet
Crisis and Trauma
115 pages
Oral Evidence and Real Evidence
No ratings yet
Oral Evidence and Real Evidence
112 pages
CH 3 Migration & Devt
No ratings yet
CH 3 Migration & Devt
43 pages
CH 1 Development Economics I
No ratings yet
CH 1 Development Economics I
54 pages
Time Series Econometrics 2
No ratings yet
Time Series Econometrics 2
15 pages
Brand Management 1
No ratings yet
Brand Management 1
50 pages
Econometrics II Chapter 4 Panel Data Econometrics
No ratings yet
Econometrics II Chapter 4 Panel Data Econometrics
31 pages
Basic Writing Skills (EnLa 2012)
No ratings yet
Basic Writing Skills (EnLa 2012)
43 pages
International Economics II Ch-1
No ratings yet
International Economics II Ch-1
80 pages
Financial Economics-1
No ratings yet
Financial Economics-1
7 pages
Rule 37-38 Full Text Cases PDF
No ratings yet
Rule 37-38 Full Text Cases PDF
202 pages
Otondro Prohori, Guarding Who, Against What
No ratings yet
Otondro Prohori, Guarding Who, Against What
10 pages
Gail India Ltd. Report
No ratings yet
Gail India Ltd. Report
8 pages
Report On Rural Haat
83% (6)
Report On Rural Haat
22 pages
Agri-Fishery Arts: Module 1: Importance of Planting Trees
No ratings yet
Agri-Fishery Arts: Module 1: Importance of Planting Trees
22 pages
Eagle Point
100% (1)
Eagle Point
5 pages
Term Paper Topic:"Parking Management System"
No ratings yet
Term Paper Topic:"Parking Management System"
8 pages
Soil PH-WPS Office
No ratings yet
Soil PH-WPS Office
2 pages
Community Consultation On The Response Actions (CORA) For COVID-19 - 1
No ratings yet
Community Consultation On The Response Actions (CORA) For COVID-19 - 1
35 pages
Woman-Centered Coaching Revolution - Lesson 1 - Handout
No ratings yet
Woman-Centered Coaching Revolution - Lesson 1 - Handout
28 pages
DS4510 5010
100% (1)
DS4510 5010
2 pages
Republic Act No 11479
No ratings yet
Republic Act No 11479
2 pages
Schischek Product Catalogue en PUB113 001 00
No ratings yet
Schischek Product Catalogue en PUB113 001 00
76 pages
Assignment 3 BTF3363
No ratings yet
Assignment 3 BTF3363
5 pages
新电影评论和评分
100% (2)
新电影评论和评分
7 pages
WORKBOOK - Product Design Workshop-2
No ratings yet
WORKBOOK - Product Design Workshop-2
34 pages
Phillies 03 - 17 - 2020
No ratings yet
Phillies 03 - 17 - 2020
80 pages
Inventor Tutorials
100% (3)
Inventor Tutorials
1,264 pages
Edu 210 Quiz
No ratings yet
Edu 210 Quiz
4 pages
000400000007AF00
No ratings yet
000400000007AF00
7 pages
Opa 2863
No ratings yet
Opa 2863
52 pages
Iron Ore Mining Feasibility Study Word
No ratings yet
Iron Ore Mining Feasibility Study Word
13 pages
Winback - en Brochure Rshock Version J3 Mars 2021 A
100% (1)
Winback - en Brochure Rshock Version J3 Mars 2021 A
12 pages
Global Maritime Distress and Safety System (GMDSS) : Companies Can Opt For Block Booking
100% (1)
Global Maritime Distress and Safety System (GMDSS) : Companies Can Opt For Block Booking
1 page
7th Sem Mech Internal Question Papers
No ratings yet
7th Sem Mech Internal Question Papers
16 pages
Backface Removal
No ratings yet
Backface Removal
4 pages
Groundnut
No ratings yet
Groundnut
64 pages
Excel MCQ
No ratings yet
Excel MCQ
29 pages
Commonwealth Chess Championship 2023 24 Regulations
No ratings yet
Commonwealth Chess Championship 2023 24 Regulations
5 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Intro To Stata 2022

Uploaded by

Intro To Stata 2022

Uploaded by

Chapter One:

 Most features are shared by the other

 Load data into software package

 Data always organized in rectangular way:

 Command line interface

 Batch file (called a “do-file”)

 . log using secondfile, name(log2) smcl

 Keywords search and press OK from Frequently Asked

 System  Data Management

 An alternative would be to use /// at the end of the line.

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.