Bayesian Multiple Linear Regression
Imagine you're trying to predict how much money a family spends on food (LOGFOODEXP) based on several
factors like their income (LOGHINC), household size (HSIZE), and age of the household head (HHAGE).
Traditional regression gives you a single estimate of how each factor affects food spending. It's like getting a
single number that says, "For every extra dollar in income, food spending increases by this much." Bayesian
regression does the same thing but also tells you how confident you should be in those estimates. It's like getting a
range of possible numbers instead of just one, so you can see how likely each possible effect is.
Uncertainty: It helps you understand how sure you are about your predictions. This is important because real-world
data can be messy and uncertain.
Prior Knowledge: If you already know something about the problem, such as how income affects spending, Bayesian
regression lets you use that knowledge to improve your predictions.
Small Data: It works well even when you don't have a lot of data, which is common in many fields.
Suppose you want to predict food spending based on income. Bayesian regression might tell you: "For every extra
dollar in income, food spending probably increases by between $0.20 and $0.50." This range shows the uncertainty
in the estimate. It's like having a more nuanced understanding of how things work, which can be really helpful in
making decisions.
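To make that example concrete, here is a minimal sketch of how such a posterior range could be computed, assuming the PyMC and ArviZ libraries and a hypothetical DataFrame df holding the four variables; the file name and prior scales are illustrative assumptions, not choices from this analysis.

```python
import pandas as pd
import pymc as pm
import arviz as az

# Assumed: a CSV with the variables named in the text (hypothetical file name).
df = pd.read_csv("household_survey.csv")

with pm.Model() as model:
    # Weakly informative priors -- illustrative scales, not prescriptive ones.
    intercept = pm.Normal("intercept", mu=0, sigma=10)
    b_income = pm.Normal("b_LOGHINC", mu=0, sigma=1)
    b_hsize = pm.Normal("b_HSIZE", mu=0, sigma=1)
    b_age = pm.Normal("b_HHAGE", mu=0, sigma=1)
    sigma = pm.HalfNormal("sigma", sigma=1)

    mu = (intercept
          + b_income * df["LOGHINC"].values
          + b_hsize * df["HSIZE"].values
          + b_age * df["HHAGE"].values)
    pm.Normal("LOGFOODEXP", mu=mu, sigma=sigma, observed=df["LOGFOODEXP"].values)

    idata = pm.sample(1000, tune=1000)  # MCMC (NUTS) draws from the posterior

# The posterior gives a range rather than a point: a 95% credible interval
# for the income effect, analogous to the "$0.20 to $0.50" range above.
print(az.hdi(idata, var_names=["b_LOGHINC"], hdi_prob=0.95))
```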
Assumptions
6. Valid Priors: Prior distributions are properly specified and reflect genuine beliefs about the parameters before seeing the data.
7. Fixed Predictors: Predictors are treated as fixed constants rather than random variables.
9. Correct Model Specification: The likelihood function correctly represents the data-generating process.
DIAGNOSTIC TESTS
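As a sketch of how the heteroskedasticity discussed below can be detected, assuming statsmodels and the same hypothetical df as above, a Breusch-Pagan test on an initial OLS fit:

```python
import statsmodels.api as sm
from statsmodels.stats.diagnostic import het_breuschpagan

# Assumed: df is the hypothetical DataFrame from the sketch above.
X = sm.add_constant(df[["LOGHINC", "HSIZE", "HHAGE"]])
ols_res = sm.OLS(df["LOGFOODEXP"], X).fit()

# Breusch-Pagan: tests whether squared residuals vary with the predictors;
# a small p-value points to heteroskedastic errors.
lm_stat, lm_pval, f_stat, f_pval = het_breuschpagan(ols_res.resid, X)
print(f"Breusch-Pagan LM p-value: {lm_pval:.4g}")
```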
Given the size of the dataset (147,717 observations) and the heteroskedasticity observed in the
ordinary least squares (OLS) regression, Bayesian multiple regression may not be the most suitable
approach for this analysis. Bayesian methods, while offering flexibility and the ability to
incorporate prior knowledge, are computationally intensive on large datasets. Markov chain
Monte Carlo (MCMC) sampling, the standard fitting method in Bayesian regression, becomes
inefficient as the number of observations grows, leading to longer fitting times and potential
convergence problems (Gelman et al., 2013). For a dataset of this size, computational resources
can become the limiting factor.
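If a Bayesian fit were still wanted at this scale, one commonly used workaround is variational inference rather than MCMC; a hedged sketch, reusing the hypothetical PyMC model from the earlier block and assuming PyMC's ADVI routine:

```python
# Reuses the hypothetical PyMC `model` sketched earlier in this section.
with model:
    # ADVI fits an approximate posterior by optimization instead of sampling;
    # typically far cheaper than MCMC on ~150,000 rows, at some accuracy cost.
    approx = pm.fit(n=20000, method="advi")
    idata_vi = approx.sample(1000)  # draws from the fitted approximation
```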
In this instance, I believe that OLS is a better fit for my research, given the large dataset
(147,717 observations) and the heteroskedasticity observed in the initial OLS regression. Despite
the heteroskedasticity, OLS remains a widely used method for regression analysis because of its
simplicity, its efficiency, and the availability of robust methods that address violations of its
assumptions (Wooldridge, 2010). For large datasets, OLS still provides unbiased and consistent
coefficient estimates, and robust standard errors allow valid inference even when
heteroskedasticity is present (White, 1980).
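As a sketch, assuming statsmodels and the hypothetical df from the diagnostic example above, applying White-style robust standard errors is a one-argument change to the OLS fit:

```python
import statsmodels.api as sm

# Assumed: df as in the diagnostic sketch above.
X = sm.add_constant(df[["LOGHINC", "HSIZE", "HHAGE"]])
# cov_type="HC0" is White's (1980) heteroskedasticity-consistent estimator.
robust_res = sm.OLS(df["LOGFOODEXP"], X).fit(cov_type="HC0")
print(robust_res.summary())  # same coefficients as plain OLS; robust SEs
```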
The heteroskedasticity in my data violates the OLS assumption of constant error variance, but it
can be addressed with White's heteroskedasticity-consistent standard error estimator, which
restores valid inference without transforming the model (White, 1980). Alternatively,
generalized least squares (GLS) could be employed if the heteroskedasticity is suspected to be
systematic, though OLS with robust standard errors remains the simpler and usually sufficient
solution for addressing this issue.
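For the GLS alternative, here is a feasible GLS (weighted least squares) sketch under the assumption that the error variance can be modeled from the same predictors; the auxiliary regression below is one standard way to estimate the weights, not the procedure used in this analysis:

```python
import numpy as np
import statsmodels.api as sm

# Assumed: df as in the earlier sketches.
X = sm.add_constant(df[["LOGHINC", "HSIZE", "HHAGE"]])
y = df["LOGFOODEXP"]

ols_res = sm.OLS(y, X).fit()
# Auxiliary regression: model log squared residuals to estimate the variance.
aux = sm.OLS(np.log(ols_res.resid ** 2), X).fit()
var_hat = np.exp(aux.fittedvalues)                     # fitted error variances
fgls_res = sm.WLS(y, X, weights=1.0 / var_hat).fit()   # reweighted (FGLS) fit
print(fgls_res.summary())
```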
While Bayesian methods offer flexibility in modeling uncertainty and prior information, they are
computationally demanding, particularly with large datasets. According to Gelman et al. (2013),
Bayesian models require substantial computational resources because of MCMC sampling, which
may not be efficient for datasets of this size. Moreover, the need to specify priors and to
interpret posterior distributions adds complexity that may not yield substantial advantages over
OLS when working with large, well-behaved datasets.
Therefore, unless there is a strong requirement for prior information or complex uncertainty
modeling, I find that OLS with robust standard errors or GLS is a more efficient and practical
choice for my research.
References
Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A., & Rubin, D. B. (2013).
Bayesian data analysis (3rd ed.). CRC Press.
Kruschke, J. K. (2015). Doing Bayesian data analysis: A tutorial with R, JAGS, and Stan (2nd
ed.). Academic Press.
White, H. (1980). A heteroskedasticity-consistent covariance matrix estimator and a direct test
for heteroskedasticity. Econometrica, 48(4), 817-838.
Wooldridge, J. M. (2010). Econometric analysis of cross section and panel data (2nd ed.). MIT
Press.