
MTH 511a - 2020: Lecture 28

Instructor: Dootika Vats

The instructor of this course owns the copyright of all the course materials. This lecture
material was distributed only to the students attending the course MTH511a: “Statistical
Simulation and Data Analysis” of IIT Kanpur, and should not be distributed in print or
through electronic media without the consent of the instructor. Students can make their own
copies of the course materials for their use.
In this lecture, we will focus only on the Bayesian linear regression model, and discuss
accept-reject algorithms that sample from the posterior.

1 Bayesian linear regression


Consider a Bayesian version of the linear regression model, where prior distributions
are assigned to both the regression coefficient $\beta$ and the variance $\sigma^2$. Recall that the
likelihood is
\[
y = (y_1, \dots, y_n) \mid \beta, \sigma^2 \sim N\!\left(X\beta,\; \sigma^2 I_n\right).
\]
The parameters of interest are $\beta$ and $\sigma^2$, and popular prior distributions assume
\[
\beta \mid \sigma^2 \sim N_p\!\left(0,\; \sigma^2 I_p\right) \quad \text{and} \quad \sigma^2 \sim \text{Inverse Gamma}(a, b)\,.
\]
Note that the Inverse Gamma distribution has density
\[
\pi(\sigma^2) \propto \left(\frac{1}{\sigma^2}\right)^{a+1} e^{-b/\sigma^2}\,.
\]

The posterior distribution is
\begin{align*}
\pi(\beta, \sigma^2 \mid y)
&\propto \pi(\beta, \sigma^2) \prod_{i=1}^{n} f(y_i \mid \beta, \sigma^2) \\
&= \pi(\beta \mid \sigma^2)\, \pi(\sigma^2) \prod_{i=1}^{n} f(y_i \mid \beta, \sigma^2) \\
&= \left(\frac{1}{\sigma^2}\right)^{a+1} e^{-b/\sigma^2}
\left(\frac{1}{2\pi\sigma^2}\right)^{p/2} \exp\left\{ -\frac{\beta^T \beta}{2\sigma^2} \right\}
\left(\frac{1}{2\pi\sigma^2}\right)^{n/2} \exp\left\{ -\frac{(y - X\beta)^T (y - X\beta)}{2\sigma^2} \right\} \\
&\propto \left(\sigma^2\right)^{-n/2 - p/2 - a - 1}
\exp\left\{ -\frac{(y - X\beta)^T (y - X\beta) + \beta^T \beta}{2\sigma^2} - \frac{b}{\sigma^2} \right\}\,.
\end{align*}

The above is the $(p + 1)$-dimensional posterior distribution, and we want to obtain
samples from it using accept-reject. We already know that accept-reject does not
work well in higher dimensions, so any implementation here should be expected to struggle.
To run an accept-reject sampler, consider the proposal distribution $q(\beta, \sigma^2) = \pi(\beta \mid \sigma^2)\, \pi(\sigma^2)$.
That is, the proposal distribution is the same as the prior distribution. Then, if the MLE
exists,
\[
\frac{\tilde{\pi}(\beta, \sigma^2 \mid y)}{\pi(\beta \mid \sigma^2)\, \pi(\sigma^2)}
= \frac{\pi(\beta \mid \sigma^2)\, \pi(\sigma^2) \prod_{i=1}^{n} f(y_i \mid \beta, \sigma^2)}{\pi(\beta \mid \sigma^2)\, \pi(\sigma^2)}
= \prod_{i=1}^{n} f(y_i \mid \beta, \sigma^2)
\leq \prod_{i=1}^{n} f\!\left(y_i \mid \hat{\beta}_{\mathrm{MLE}},\, \hat{\sigma}^2_{\mathrm{MLE}}\right) =: M\,.
\]

So an accept-reject sampler is theoretically possible to implement. However, the
dimensionality of the problem will certainly impede efficiency: AR will be very
inefficient here, and it may be close to impossible to obtain even one draw from the
posterior distribution in reasonable time.
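To see this concretely, here is a minimal sketch of the accept-reject sampler above on simulated stand-in data. The design matrix X, response y, prior parameters a and b, and all sample sizes below are invented for illustration; they are not the cars example that appears later.

```r
# Sketch only: simulated data standing in for a real (X, y).
set.seed(42)
n <- 50; p <- 2
X <- cbind(1, rnorm(n))
beta.true <- c(2, 3)
y <- X %*% beta.true + rnorm(n, sd = 5)
a <- 1; b <- 1   # prior parameters, chosen arbitrarily

# log-likelihood of the normal linear model
loglik <- function(beta, sig2) {
  sum(dnorm(y, mean = X %*% beta, sd = sqrt(sig2), log = TRUE))
}

# MLEs give the bound M (kept on the log scale for stability)
beta.mle <- solve(t(X) %*% X, t(X) %*% y)
sig2.mle <- sum((y - X %*% beta.mle)^2) / n
logM <- loglik(beta.mle, sig2.mle)

# Propose from the prior, accept with probability (likelihood / M)
tries <- 1e4
accepted <- 0
for (t in 1:tries) {
  sig2.prop <- 1 / rgamma(1, shape = a, rate = b)        # sig2 ~ IG(a, b)
  beta.prop <- rnorm(p, mean = 0, sd = sqrt(sig2.prop))  # beta | sig2 ~ N(0, sig2 I)
  if (log(runif(1)) <= loglik(beta.prop, sig2.prop) - logM) {
    accepted <- accepted + 1
  }
}
accepted / tries   # acceptance rate is typically minuscule
```

Even in this small example the acceptance rate is tiny, since a prior draw rarely lands near the region where the likelihood is large; the problem only worsens as $p$ grows.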

1.1 Linchpin variable samplers

As we have discussed plenty of times now, it is difficult to implement AR when the
target is high-dimensional or when the upper bound is hard to get. In the first case, a
linchpin variable trick can be very useful. Suppose the target density is
\[
\pi(x, y)\,.
\]
Then, we can split the joint distribution as the product of a conditional times a marginal.
That is,
\[
\pi(x, y) = \pi(x \mid y)\, \pi(y)\,.
\]
If $X \mid Y$ is known in closed form and we can sample from it, then we may try to get
samples from the marginal distribution of $Y$. This is beneficial since the dimension of
$Y$ is smaller than that of $(X, Y)$, and implementing AR on a smaller-dimensional problem will
be much easier. So the algorithm would be:
• Generate Y ⇠ ⇡(y)
• Generate X ⇠ X|Y
• Output (X, Y ).
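As a toy illustration of these three steps, consider a joint density built from a Gamma marginal and a Normal conditional; both pieces are invented purely for this sketch, and together they define a valid $\pi(x, y)$.

```r
# Toy linchpin sampler:
#   Y ~ Gamma(shape = 3, rate = 2)       (the linchpin, sampled directly)
#   X | Y = y ~ N(0, 1/y)                (closed-form conditional)
set.seed(1)
N <- 1e4
Y <- rgamma(N, shape = 3, rate = 2)        # Step 1: draw the linchpin variable
X <- rnorm(N, mean = 0, sd = sqrt(1 / Y))  # Step 2: draw X | Y
draws <- cbind(X, Y)                       # Step 3: (X, Y) are joint draws
```

Note that these are exact iid draws from the joint distribution; no accept-reject step and no Markov chain is involved once both pieces can be sampled directly.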

The variable $Y$ is called the linchpin variable with target density $\pi(y)$. We can use
this quite easily in Bayesian linear regression.

Example 1 (Bayesian linear regression). Recall the posterior distribution in Bayesian
linear regression:
\[
\pi(\beta, \sigma^2 \mid y) \propto \left(\sigma^2\right)^{-n/2 - p/2 - a - 1}
\exp\left\{ -\frac{(y - X\beta)^T (y - X\beta) + \beta^T \beta + 2b}{2\sigma^2} \right\}\,.
\]
First, note that we prefer $\sigma^2$ to be the linchpin variable since it is univariate, while $\beta$
is $p$-variate. So we need to find the conditional distribution $\beta \mid \sigma^2, y$ and the marginal distribution of
$\sigma^2 \mid y$. Let $A = (X^T X + I)$.

\begin{align*}
\int \pi(\beta, \sigma^2 \mid y)\, d\beta
&\propto \int \left(\sigma^2\right)^{-n/2 - p/2 - a - 1}
\exp\left\{ -\frac{y^T y - 2\beta^T X^T y + \beta^T X^T X \beta + \beta^T \beta}{2\sigma^2} - \frac{b}{\sigma^2} \right\} d\beta \\
&= \left(\sigma^2\right)^{-n/2 - p/2 - a - 1}
\exp\left\{ -\frac{y^T y}{2\sigma^2} - \frac{b}{\sigma^2} \right\}
\int \exp\left\{ -\frac{\beta^T (X^T X + I)\, \beta - 2\beta^T X^T y}{2\sigma^2} \right\} d\beta \\
&= \left(\sigma^2\right)^{-n/2 - p/2 - a - 1}
\exp\left\{ -\frac{y^T y}{2\sigma^2} - \frac{b}{\sigma^2} \right\}
\int \exp\left\{ -\frac{(\beta - A^{-1} X^T y)^T A\, (\beta - A^{-1} X^T y) - y^T X A^{-1} X^T y}{2\sigma^2} \right\} d\beta \\
&= \left(\sigma^2\right)^{-n/2 - p/2 - a - 1}
\exp\left\{ -\frac{y^T (I - X A^{-1} X^T)\, y}{2\sigma^2} - \frac{b}{\sigma^2} \right\}
\int \exp\left\{ -\frac{(\beta - A^{-1} X^T y)^T A\, (\beta - A^{-1} X^T y)}{2\sigma^2} \right\} d\beta\,,
\end{align*}
where the third equality completes the square in $\beta$.

So $\beta \mid \sigma^2, y$ is a multivariate normal distribution,
\[
\beta \mid \sigma^2, y \sim N_p\!\left( A^{-1} X^T y,\; \sigma^2 A^{-1} \right),
\]
and the remaining integral integrates to a known constant, $(2\pi\sigma^2)^{p/2} \det(A)^{-1/2}$.

\begin{align*}
\int \pi(\beta, \sigma^2 \mid y)\, d\beta
&\propto \left(\sigma^2\right)^{-n/2 - p/2 - a - 1}
\exp\left\{ -\frac{y^T (I - X A^{-1} X^T)\, y}{2\sigma^2} - \frac{b}{\sigma^2} \right\}
\cdot \left(\sigma^2\right)^{p/2} \det(A)^{-1/2} \\
&\propto \left(\sigma^2\right)^{-n/2 - a - 1}
\exp\left\{ -\frac{y^T (I - X A^{-1} X^T)\, y/2 + b}{\sigma^2} \right\}\,.
\end{align*}

So the marginal posterior distribution of $\sigma^2 \mid y$ is
\[
\sigma^2 \mid y \sim \text{Inverse Gamma}\!\left( \frac{n}{2} + a,\; \frac{y^T (I - X A^{-1} X^T)\, y}{2} + b \right).
\]
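Sampling from this Inverse Gamma reduces to a single `rgamma` call, since $\sigma^2 \sim \text{IG}(\alpha, \lambda)$ is equivalent to $1/\sigma^2 \sim \text{Gamma}(\alpha, \text{rate} = \lambda)$. A small sketch, with $\alpha$ and $\lambda$ set to placeholder values rather than the $n/2 + a$ and $y^T(I - XA^{-1}X^T)y/2 + b$ of the regression problem:

```r
# sig2 ~ IG(alpha, lambda)  <=>  1/sig2 ~ Gamma(shape = alpha, rate = lambda)
# alpha and lambda here are placeholder values for illustration
set.seed(1)
alpha <- 5; lambda <- 10
sig2.draws <- 1 / rgamma(1e5, shape = alpha, rate = lambda)
mean(sig2.draws)   # should be close to lambda / (alpha - 1) = 2.5
```

This is exactly the trick the implementation below uses for the cars data.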

We have just done the following decomposition:
\[
\pi(\beta, \sigma^2 \mid y) = \pi(\beta \mid \sigma^2, y)\, \pi(\sigma^2 \mid y)\,,
\]
where both of those densities are available in closed form and samples can be generated
easily from them in the following way:
1. Generate $\sigma^2 \sim$ Inverse Gamma as indicated above.
2. Generate $\beta \mid \sigma^2 \sim$ Normal distribution as indicated above.
3. $(\beta, \sigma^2)$ is one draw from the posterior. Repeat for many draws, and estimate
posterior means and quantiles.
We now implement Bayesian linear regression for the cars dataset.
###########################################
## Linchpin variable sampler
## for Bayesian linear regression for cars
###########################################
set.seed(1)

# loading the dataset


data(cars)
n <- dim(cars)[1]
X <- cbind(1, cars$speed)
y <- cars$dist
p <- dim(X)[2]
a <- 1 # prior parameters
b <- 1 # prior parameters

Drawing the samples is easy, since no AR step is required

# We implement Monte Carlo sampling using the linchpin sampler


N <- 1e4
A <- t(X)%*%X + diag(p)
A.inv <- solve(A)

sig2 <- numeric(length = N)


beta <- matrix(0, nrow = N, ncol = p)

rate.sig <- ( t(y) %*% (diag(1,n) - X %*% A.inv %*% t(X)) %*% y )/2 + b

# sampling Inverse Gamma for sigma2


sig2 <- 1 / rgamma(N, shape = n/2 + a, rate = rate.sig)

# Sampling beta from multivariate normal


# mean + sqrt(covariance) %*% rnorm
foo <- svd(A.inv) #Singular values decomposition of A^{-1}
Ainv.sqrt <- foo$u %*% diag(foo$d^(1/2)) %*% t(foo$v)

for(i in 1:N)
{
  # draw beta_i from N(A^{-1} X^T y, sig2[i] * A^{-1})
  beta[i,] <- A.inv %*% t(X) %*% y + Ainv.sqrt %*% rnorm(p, sd = sqrt(sig2[i]))
}

We can view the posterior marginal density plots of the samples


par(mfrow = c(1,3))
plot(density(sig2), main = expression(sigma^2))
plot(density(beta[,1] ), main = expression(beta[1]))
plot(density(beta[,2] ), main = expression(beta[2]))

[Figure: posterior marginal density plots of $\sigma^2$, $\beta_1$, and $\beta_2$, each estimated from the N = 10000 posterior draws.]

We can also find the posterior means and quantiles:


poster <- cbind(sig2, beta)
colMeans(poster)
# sig2
#232.404843 -14.713624 3.766138

apply(poster, 2, quantile, c(.025, .975))

# sig2
#2.5% 157.9957 -27.370729 2.993072
#97.5% 342.2522 -1.772828 4.539354

Note that the posterior credible intervals for both $\beta_1$ and $\beta_2$ do not contain 0,
implying both regression coefficients are important and should be treated as non-zero.

2 Questions to think about


• Implement accept-reject for the cars dataset and see for yourself how well the
algorithm works here.
• Suppose the marginal posterior distribution of $\sigma^2 \mid y$ was not from a nice known
family. What could we have done then?
• What is the MAP estimator of $\beta$ in this problem?
