Corollary: if we are sure there is nothing in the error term correlated with the explanatory variable, we can interpret the coefficient causally.
Exploratory data analysis:
o library(GGally) : ggpairs
o ggpairs(data_set)
o for selecting variables in ggpairs: data_set %>% select(var1, var2, var3, …) %>% ggpairs()
o library(mosaic)
o inspect(data_set)
Editing data:
o library(dplyr) : select
o new_df <- select(data_set, var1, var2, var3, …)
o adding a new column: mutate(data_set, new_var = var * 2) (or any other function)
o filtering data: filter(data_set, filtering conditions)
Graphing to see the relation:
o library(ggplot2)
o scatterplot: ggplot(data_set, aes(x = x_var, y = y_var)) + geom_point()
Classic linear regression:
o model1 <- lm(y_var ~ x_var, data = data_set)
o summary(model1); use the multiple R-squared to report.
To compare models:
o library(stargazer)
o stargazer(model1, model2, type = 'text', report = 'vc*sp')
Omitted Variable Bias:
o only occurs if x2 and y are related AND x2 and x1 are related
o the sign of the bias is the product of the sign of the x2–x1 correlation and the sign of the x2–y correlation
Interpreting categorical variables:
o all of the coefficients R shows are interpreted against the reference category, the one that is not shown.
Analyzing models without the effect of heteroskedasticity:
o make your model (model1)
o use summary to get R-squared: summary(model1)
o use a coefficient test with robust standard errors:
  library(sandwich)
  library(lmtest)
  coeftest(model1, vcov = vcovHC, type = 'HC1')
Interpreting interaction terms (class 6):
o if one of the terms is a dummy variable, work through scenarios or eyeball it.
o to see the result of different combinations use:
  library(margins)
  margins(model1, variables = 'var1', at = list(dummy = c(0, 1)))
Quadratic models:
o use when the ggpairs output implies a quadratic relation between variables
o both the linear and quadratic terms being significant means that you need a quadratic specification
o model1 <- lm(y_var ~ x_var + I(x_var^2), data = data_set)
o interpreting the coefficient of the quadratic term: take the partial derivative to see where the change in y with respect to x_var slows down.
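The quadratic recipe above can be sketched on R's built-in mtcars data (the variables mpg and hp are just a convenient illustration, not from the course):

```r
# Quadratic fit: fuel efficiency (mpg) as a function of horsepower (hp)
model_quad <- lm(mpg ~ hp + I(hp^2), data = mtcars)
summary(model_quad)  # check whether both hp and I(hp^2) are significant

# Partial derivative: d(mpg)/d(hp) = b1 + 2 * b2 * hp
# The slope flattens out (or reverses) at hp = -b1 / (2 * b2)
b <- coef(model_quad)
turning_point <- -b["hp"] / (2 * b["I(hp^2)"])
turning_point
```

Note that the squaring must happen inside I(): writing I(x_var)^2 outside the wrapper does not add a quadratic term to the formula.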
Logarithmic models:
o use to interpret effects in percentage terms, and to make large outliers less problematic.
o effective when a variable is significantly right-skewed but strictly positive.
o use log models only if the variable's values are greater than zero.
o log-log models: both changes are in percentages. Interpretation: a 1% change in x_var is associated with a coefficient% change in y_var.
o log-linear models: interpretation: each additional unit of x_var is associated with a coefficient*100 % change in y_var.
o linear-log models: interpretation: a 1% change in x_var is associated with a coefficient*0.01 change in y_var.
Logistic Regression:
o use when the y-variable is binary or categorical
o model1 <- glm(y_var ~ x_var, family = binomial(link = 'logit'), data = data_set)
o summary(model1)
o interpretation: exp(coef(model1)) gives odds ratios
  for an odds ratio < 1: increasing x_var by one unit makes the odds of y_var (1 - exp(coef))*100 % lower, on average, than they would be at the same x_var, cet. par.
  for an odds ratio > 1: increasing x_var by one unit makes the odds of y_var (exp(coef) - 1)*100 % higher than they would have been if x_var did not increase, cet. par.
o make predictions:
  data_set <- data_set %>% mutate(predictions = predict(model1, data_set, type = 'response'))
o create scenarios:
  library(tidyr)
  scenarios <- expand_grid(x_var1 = seq(val1, val2, val3), x_var2 = seq(val1, val2))
  scenarios <- scenarios %>% mutate(prediction = predict(model1, scenarios, type = 'response'))
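The logistic workflow above, from model to scenario predictions, can be sketched end to end on the built-in mtcars data (am is a 0/1 transmission dummy; the variable choice is illustrative only):

```r
library(dplyr)
library(tidyr)

# Binary outcome: manual transmission (am = 1) as a function of car weight
model_logit <- glm(am ~ wt, family = binomial(link = 'logit'), data = mtcars)
summary(model_logit)

# Odds ratios: exp(coef) for wt is below 1, so each extra 1000 lbs of
# weight lowers the odds of a manual transmission, cet. par.
exp(coef(model_logit))

# Scenarios: predicted probability of a manual transmission over a weight grid
scenarios <- expand_grid(wt = seq(1.5, 5.5, by = 0.5))
scenarios <- scenarios %>%
  mutate(prediction = predict(model_logit, scenarios, type = 'response'))
scenarios
```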
Fixed Effects Models:
o use for panel data, when there are entity-level effects you want to hold constant.
o EDA: ggplot(data_set, aes(x = x_var, y = y_var, color = categorical_var)) + geom_line()
o some variables do not change instantly (e.g. police numbers); get lagged data for those:
  library(dplyr)
  data_set <- data_set %>%
    group_by(categorical_var) %>%
    mutate(lag_var = dplyr::lag(var, order_by = ordering_var)) %>%   # ordering_var is e.g. time
    ungroup()
o pooled model: a regression with every data point in it, no grouping or filtering; function: lm
o fixed effects models are used to eliminate differences between units over the time of the study, such as differences in average income across states
o creating the model holding one variable constant:
  library(plm)
  model1 <- plm(y_var ~ x_var, data = data_set, index = 'categorical_var', model = 'within')
o creating the model holding two variables constant:
  model1 <- plm(y_var ~ x_var, data = data_set, index = c('entity_var', 'time_var'), model = 'within', effect = 'twoways')
o checking for time variation and individual variation: pvar(data_set, index = c('entity_var', 'time_var'))
o interpreting fixed effects models: coeftest(model1, vcov = vcovHC, type = 'HC1')
Dif-in-Dif:
o key: there has to be a treatment group and a control group selected at random
o the parallel trends assumption must hold
o model1 <- lm(y_var ~ x_var * treatment_var + control, data = data_set)
o interpreting dif-in-dif models: coeftest(model1, vcov = vcovHC, type = 'HC1')
Regression Discontinuity:
o key: there is a threshold that determines whether you are in the treatment group or not.
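The fixed effects steps above can be sketched on the Grunfeld investment panel that ships with plm (the dataset and variables are an illustration, not from the course):

```r
library(plm)
library(lmtest)
library(sandwich)

# Classic firm-year panel: investment (inv) vs firm value and capital stock
data("Grunfeld", package = "plm")

# Entity fixed effects only (each firm gets its own intercept)
fe_entity <- plm(inv ~ value + capital, data = Grunfeld,
                 index = c("firm", "year"), model = "within")

# Entity AND time fixed effects
fe_twoway <- plm(inv ~ value + capital, data = Grunfeld,
                 index = c("firm", "year"), model = "within", effect = "twoways")

# Check the panel has both individual and time variation
pvar(Grunfeld, index = c("firm", "year"))

# Robust inference, as in the notes
coeftest(fe_entity, vcov = vcovHC, type = "HC1")
```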
o the threshold variable is called the assignment variable
o EDA: make a scatterplot, color the treatment group, add lines of best fit, and add a vertical line at the cutoff:
  data_set %>% ggplot(aes(x = x_var, y = y_var, color = (assignment condition, e.g. age < 21))) + geom_point() + geom_smooth(method = 'lm', se = FALSE) + geom_vline(xintercept = cutoff_value)
  if there is a shift (a discontinuity), it is an indication that this model is a good match
o making the model:
  without centering the variable: model1 <- lm(y_var ~ treatment_var + control, data = data_set)
  with centering the variable: model1 <- lm(y_var ~ I(x_var - cutoff_value) * treatment_var + control, data = data_set)
o interpreting models: coeftest(model1, vcov = vcovHC, type = 'HC1')
  in both cases you interpret the treatment variable, not the interaction term
Instrumental variable (2SLS):
o an instrumental variable needs to be correlated with x_var but must affect y_var only through x_var (i.e. be uncorrelated with the error term)
o use it when there is a lottery or other random treatment assignment but no guarantee the treatment group actually got the treatment (non-compliance); it recovers the effect of the treatment on y_var despite that.
o creating the model:
  library(ivreg)
  model1 <- ivreg::ivreg(y_var ~ x_var + control1 + control2 | instrument_var + control1 + control2, data = data_set)
o interpreting results: summary(model1, vcov. = vcovHC)
o a good instrumental variable:
  one instrument: weak instruments test; a p-value near 0 means the instrument is not weak
  more than one instrument: Sargan test; a small p-value means at least one of the instruments is not exogenous
  R-squared doesn't matter because it becomes unreliable in 2SLS
Graphing & Visualization:
o graphing for logistic scenarios:
  scenarios %>% ggplot(aes(x = x_var, y = prediction, color = as.factor(categorical_var))) + geom_point() + geom_line() + facet_wrap(~x_var, ncol = 5)
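The 2SLS logic above can be sketched on simulated data with non-compliance; every variable here (z, d, y, u) is invented for the illustration:

```r
library(ivreg)
library(sandwich)

set.seed(1)
n <- 1000
z <- rbinom(n, 1, 0.5)                      # random lottery assignment (the instrument)
u <- rnorm(n)                               # unobserved confounder
d <- rbinom(n, 1, plogis(-1 + 2 * z + u))   # actual take-up: depends on z AND on u
y <- 1 + 2 * d + u + rnorm(n)               # true treatment effect is 2

ols  <- lm(y ~ d)        # biased upward: d is correlated with u
tsls <- ivreg(y ~ d | z) # instrumenting d with the lottery z

summary(tsls, vcov. = vcovHC)
coef(tsls)["d"]          # should land near the true effect of 2
```

The instrument z is valid here by construction: it shifts take-up d but enters y only through d, which is exactly the exogeneity condition stated above.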