0% found this document useful (0 votes)

101 views11 pages

ETC 2420/5242 Lab 10 2016: Purpose

1) The lab calculates conditional probabilities to analyze a spam filter using Bayesian inference. 2) Simulation and graphical analysis is used to examine the behavior of the posterior distribution under different priors and sample sizes. 3) The exact posterior density of the probability of heads coming up less than 3 times in 10 coin flips is derived and plotted. 4) The beta prior and posterior distributions for the proportion of Californians supporting the death penalty are derived and plotted based on survey data.

Uploaded by

Ishara

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

101 views11 pages

ETC 2420/5242 Lab 10 2016: Purpose

Uploaded by

Ishara

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

ETC 2420/5242 Lab 10 2016

Souhaib Ben Taieb

Week 10

Purpose

This lab is to compute conditional probabilities and practice Bayesian inference.

Question 1

A situation where Bayesian analysis is routinely used is your spam filter in your mail server. The message is
scrutinized for the appearance of key words which make it likely that the message is spam. Let us describe
how one one of these filters might work. We imagine that the evidence for spam is that the subject message
of the mail contains the sentence “check this out”. We define events spam (the message is spam) and check
this out (the subject line contains this sentence).
From previous experience we know that 40% of emails are spam, 1% of spam email have “check this out” in
the subject line, and .4% of non-spam emails have this sentence in the subject line.
Explain the different steps to compute the conditional probability P(spam | check this out).
P (check this out|spam)P (spam)
P (spam|check this out) = P (check this out)

P (spam) = 0.4
check this out|spam = 0.01

P (check this out) = P (check this out|spam)P (spam) + P (check this out|not spam)P (not spam)
= 0.01 × 0.4 + 0.004 × 0.6 = 0.0064

P (spam|check this out) = 0.004

0.0064 = 5
8 = 0.625

Question 2

Let X1 , . . . , Xn ∼ N (θ, 9).

a. If θ ∼ N (µ, τ 2 ), what is π(θ|x1 , . . . , xn )?

b. What is the posterior mean E[θ|x1 , . . . , xn ]?
c. What is the MLE estimate θ̂MLE ?

See the slides of week 9

Suppose the “true” value is θ = 2. Consider (1) µ = 5 and τ = 1, and (2) µ = 2 and τ = 2.
For n ∈ {1, 10, 20, 50, 100, 10000}:

a. Simulate a data set consisting of n observations

b. Plot on the same graphic π(θ), π(θ|x1 , . . . , xn ) and θ̂MLE .

Discuss the behavior of π(θ|x1 , . . . , xn ) as n increases and the impact of the prior distribution.

1
set.seed(1986)
theta <- 2
sigma_0 <- 3

alln <- c(1, 2, 5, 10, 100, 10000)

for(case in c(1, 2)){
if(case == 1){
prior_mu <- 2
prior_tau <- 2
}else if(case == 2){
prior_mu <- 5
prior_tau <- 1
}

for(n in alln){
x <- rnorm(n, mean = theta, sd= sigma_0)
x_bar <- mean(x)

a <- (n * x_bar)/sigma_0^2 + prior_mu/prior_tau^2

b <- n/sigma_0^2 + 1/prior_tau^2

post_mu <- a/b

print(post_mu)
post_sigma <- 1/(n/sigma_0^2 + 1/prior_tau^2)

xx <- seq(-5, 5, by = 0.001)

xx_prior <- xx * prior_tau + prior_mu
xx_post <- xx * post_sigma + post_mu

Y <- cbind(dnorm(xx_prior, mean = prior_mu, sd= prior_tau), dnorm(xx_post, mean = post_mu, sd = post
X <- cbind(xx_prior, xx_post)
matplot(X, Y, type = 'l', lty = 1, main = paste("n = ", n))
abline(v = x_bar, lty = 1)
}
}
# [1] 1.957306

n= 1
0.20
0.10
Y

0.00

−10 0 5 10

# [1] 2.376356

2
n= 2

0.20
0.10
Y

0.00
−5 0 5 10

# [1] 1.561813

n= 5
0.30
0.15
Y

0.00

−5 0 5 10

# [1] 2.718445

n = 10
0.0 0.2 0.4
Y

−5 0 5 10

# [1] 2.307785

3
n = 100

0 1 2 3 4
Y
−5 0 5 10

# [1] 2.028544

n = 10000
400
200
Y

−5 0 5 10

# [1] 4.495602

n= 1
0.4
0.2
Y

0.0

0 2 4 6 8 10

# [1] 4.879569

4
n= 2

0.4
Y

0.2
0.0
0 2 4 6 8 10

# [1] 3.914996

n= 5
0.6
0.3
Y

0.0

0 2 4 6 8 10

# [1] 2.632301

n = 10
0.8
0.4
Y

0.0

0 2 4 6 8 10

# [1] 2.529208

5
n = 100

4
Y

2
0
0 2 4 6 8 10

# [1] 2.069093

n = 10000
400
200
Y

0 2 4 6 8 10

Question 3

Suppose there is a Beta(4, 4) prior distribution on the the probability θ that a coin will yield a “head” when
spun in a specified maner. The coin is independently spun ten times, and “heads” appear fewer than 3 times.
You are not told how many heads were seen, only that the number is less than 3. Calculate your exact
posterior density (up to a proportionality constant) for θ and plot it.
Prior density:
π(θ) ∝ θ3 (1 − θ)3
Likelihood:

10 0 10 1 10 2

f (data|θ) = θ (1 − θ)1 0 + θ (1 − θ)9 + θ (1 − θ)8
0 1 2
= (1 − θ)10 + 10θ(1 − θ)9 + 45θ2 (1 − θ)8

Posterior density:
π(θ|data) ∝ θ3 (1 − θ)13 + 10θ4 (1 − θ)12 + 45θ5 (1 − θ)11

6
theta <- seq(0, 1, .01)
dens <- theta^3 * (1-theta)^13 + 10 * theta^4 * (1-theta)^12 + 45 * theta^5 * (1-theta)^11
plot (theta, dens, ylim=c(0,1.1*max(dens)), type="l", xlab="theta", ylab="", xaxs="i",yaxs="i", yaxt="n"

0.0 0.4 0.8

theta

Question 4

Suppose your prior distribution for θ, the proportion of Californians who support the deat penalty, is beta
with mean 0.6 and standard deviation 0.3.

a. Determine the parameters α and β of your prior distribution. Plot the prior density function.
b. A random sample of 1000 Californians is taken, and 65% support the death penalty. What are your
posterior mean and variance for θ? Plot the posterior density function.

E[θ](1−E[θ])
α+β = var(θ) − 1 = 1.67
α = (α + β)E[θ] = 1
β = (α + β)(1 − E[θ]) = 0.67

theta <- seq(0,1,.001)

dens <- dbeta(theta,1,.67)
plot (theta, dens, xlim=c(0,1), ylim=c(0,3),
type="l", xlab="theta", ylab="", xaxs="i",
yaxs="i", yaxt="n", bty="n", cex=2)
lines (c(1,1),c(0,3),col=0)
lines (c(1,1),c(0,3),lty=3)

0.0 0.4 0.8

theta

7
Posterior distribution:
π(θ|data) = Beta(α + 650, β + 350) = Beta(651, 350.67)
E(θ|data) = 0.6499
sd(θ|data) = 0.015

theta <- seq(0,1,.001)

dens <- dbeta(theta,651,350.67)
cond <- dens/max(dens) > 0.001
plot (theta[cond], dens[cond],
type="l", xlab="theta", ylab="", xaxs="i",
yaxs="i", yaxt="n", bty="n", cex=2)

0.60 0.64 0.68

theta

Question 5

10 Prussian cavalry corp were monitored for 20 years (200 Corp-Years) and the number of fatalities due to
horse kicks were recorded:

x = # Deaths Number of Corp-Years with x Fatalities

0 109
1 65
2 22
3 3
4 1

i.i.d
Let xi , i = 1, . . . , 200, be the number of deaths in observation i. Assume that xi ∼ Poisson(θ).

a. Compute the MLE estimate θ̂MLE ?

θ̂MLE = x̄ = 122
200 = 0.61
Suppose θ ∼ Gamma(α, β).

a. What is the prior mean and variance.

E[θ] = α
β

V ar[θ] = α
β2

8
b. What is the posterior distribution π(θ|x)?

Gamma(α + n ∗ x̄, β + n)

c. What is the posterior mean and variance.

E[θ|x] = α+n∗x̄
β+n

V ar[θ|x] = α+n∗x̄
(β+n)2

Plot on the same graphic π(θ), π(θ|x) and θ̂MLE for

a. α=β = 0.5
b. α=β =1
c. α=β = 10
d. α=β = 100

n <- 200
DT <- data.frame(c(0, 1, 2, 3, 4), c(109, 65, 22, 3, 1))
xbar <- sum(DT[, 1] * DT[, 2])/n

x <- seq(0, 2, by = 0.01)

for(case in c(1, 2, 3, 4)){

if(case == 1){
alpha <- beta <- 0.5
}else if(case == 2){
alpha <- beta <- 1
}else if(case == 3){
alpha <- beta <- 10
}else if(case == 4){
alpha <- beta <- 100
}

dens <- dgamma(x, shape = alpha, rate = beta)

alpha_posterior <- alpha + n * xbar

beta_posterior <- beta + n
dens_posterior <- dgamma(x, shape = alpha_posterior, rate = beta_posterior)

matplot(x, cbind(dens, dens_posterior), lty = 1, type = 'l', ylab = "Density", xlab = "theta")
abline(v = xbar)

9
6
Density

4
2
0
0.0 0.5 1.0 1.5 2.0

theta

6
Density

4
2
0

0.0 0.5 1.0 1.5 2.0

theta
6
Density

4
2
0

0.0 0.5 1.0 1.5 2.0

theta
8
6
Density

4
2
0

0.0 0.5 1.0 1.5 2.0

theta

10
TURN IN

• Your .Rmd file

• Your Word (or pdf) file that results from knitting the Rmd.
• Make sure your group members are listed as authors, one person per group will turn in the report
• DUE: Wednesday after the lab, by 7am, loaded into moodle

Resources

• Lecture slides on Bayesian reasoning

Practical Projects
100% (30)
Practical Projects
478 pages
Injection Engine Control System. VAZ 21213, 21214 (Niva)
No ratings yet
Injection Engine Control System. VAZ 21213, 21214 (Niva)
3 pages
Problem Set 1 Sol
No ratings yet
Problem Set 1 Sol
7 pages
Solutions 308
No ratings yet
Solutions 308
13 pages
19-Bayesian 2
No ratings yet
19-Bayesian 2
39 pages
Problem Set 1
No ratings yet
Problem Set 1
3 pages
W9PS
No ratings yet
W9PS
9 pages
MIT18 05S14 ps6 PDF
No ratings yet
MIT18 05S14 ps6 PDF
5 pages
Part A Statistics HT 2017 Problem Sheet 4
No ratings yet
Part A Statistics HT 2017 Problem Sheet 4
2 pages
ST903 Week9sol
No ratings yet
ST903 Week9sol
2 pages
CH 5
No ratings yet
CH 5
45 pages
Fuskpaper Bayes
No ratings yet
Fuskpaper Bayes
51 pages
Assignment 5 Stat Inf b3 2022 2023 PDF
No ratings yet
Assignment 5 Stat Inf b3 2022 2023 PDF
16 pages
University of Toronto Scarborough Department of Computer and Mathematical Sciences Final Exam, Winter - 2015
No ratings yet
University of Toronto Scarborough Department of Computer and Mathematical Sciences Final Exam, Winter - 2015
13 pages
Bayesian Statistics
No ratings yet
Bayesian Statistics
3 pages
Introduction To Bayesian Methods: Jessi Cisewski Department of Statistics Yale University
No ratings yet
Introduction To Bayesian Methods: Jessi Cisewski Department of Statistics Yale University
53 pages
Final Sol
No ratings yet
Final Sol
3 pages
Lecture4 More Bayes
No ratings yet
Lecture4 More Bayes
24 pages
T10 Sol..ol
No ratings yet
T10 Sol..ol
8 pages
Slides 1
No ratings yet
Slides 1
73 pages
Prints PDF
No ratings yet
Prints PDF
106 pages
Assign 1
No ratings yet
Assign 1
5 pages
Homework 3few
No ratings yet
Homework 3few
2 pages
238 03242024 - Final 课后
No ratings yet
238 03242024 - Final 课后
10 pages
Stat 111
No ratings yet
Stat 111
7 pages
Lecture 2 - 4 Prior
No ratings yet
Lecture 2 - 4 Prior
51 pages
20 Bayesian2
No ratings yet
20 Bayesian2
50 pages
Math2830 Chapter 08
No ratings yet
Math2830 Chapter 08
9 pages
Intro Bayes Time Series 1
No ratings yet
Intro Bayes Time Series 1
72 pages
Bayes
No ratings yet
Bayes
3 pages
hw10 Sol
No ratings yet
hw10 Sol
3 pages
Introduction To Bayesian Methods With An Example
No ratings yet
Introduction To Bayesian Methods With An Example
25 pages
Chapter 5. Bayesian Statistics (II)
No ratings yet
Chapter 5. Bayesian Statistics (II)
30 pages
MA40189 20 Open
No ratings yet
MA40189 20 Open
6 pages
Lecture Notes For Probability and Statistics
No ratings yet
Lecture Notes For Probability and Statistics
7 pages
Lecture 5 - 8 Bayesian Estimation
No ratings yet
Lecture 5 - 8 Bayesian Estimation
65 pages
Week 10
No ratings yet
Week 10
2 pages
W10 Notes
No ratings yet
W10 Notes
2 pages
ProblemSheet1 23
No ratings yet
ProblemSheet1 23
5 pages
STAT 135 Solutions To Homework 4:: 30 Points
No ratings yet
STAT 135 Solutions To Homework 4:: 30 Points
9 pages
The University of Nottingham: Do NOT Turn Examination Paper Over Until Instructed To Do So
No ratings yet
The University of Nottingham: Do NOT Turn Examination Paper Over Until Instructed To Do So
6 pages
Chap 2
No ratings yet
Chap 2
28 pages
STAT 830 Bayesian Estimation: Richard Lockhart
No ratings yet
STAT 830 Bayesian Estimation: Richard Lockhart
23 pages
Tutorial 2
No ratings yet
Tutorial 2
16 pages
1 Solution To Problem 8.1
No ratings yet
1 Solution To Problem 8.1
16 pages
Lecture 3
No ratings yet
Lecture 3
4 pages
5B Bayesian Inference: Class Problems
No ratings yet
5B Bayesian Inference: Class Problems
9 pages
DS 630 - Lec 4 - ST
No ratings yet
DS 630 - Lec 4 - ST
27 pages
MIT18 05S14 Class14 Slides
No ratings yet
MIT18 05S14 Class14 Slides
26 pages
BT Wk3 LectureNotes
No ratings yet
BT Wk3 LectureNotes
16 pages
Predição em Modelos de Tempo de Falha Acelerado Com Efeito Aleatório para Avaliação de Riscos de Falha - (JoaoBC)
No ratings yet
Predição em Modelos de Tempo de Falha Acelerado Com Efeito Aleatório para Avaliação de Riscos de Falha - (JoaoBC)
22 pages
Single Parameter Models
No ratings yet
Single Parameter Models
37 pages
Introduction To Bayesian Statistics
No ratings yet
Introduction To Bayesian Statistics
33 pages
Homework 8
100% (1)
Homework 8
6 pages
Bayesian Inference Slides 2021
No ratings yet
Bayesian Inference Slides 2021
37 pages
Lecture 20 - Bayesian Analysis
No ratings yet
Lecture 20 - Bayesian Analysis
4 pages
Chapter 1 B
No ratings yet
Chapter 1 B
35 pages
MLESA v2024 Week10 Assignment Solution
No ratings yet
MLESA v2024 Week10 Assignment Solution
7 pages
Bayesian Workshop1 Solution
No ratings yet
Bayesian Workshop1 Solution
3 pages
Lecture 4
No ratings yet
Lecture 4
7 pages
Computer Solved Differential Equations
From Everand
Computer Solved Differential Equations
Joe J.
No ratings yet
Differentiation (Calculus) Mathematics Question Bank
From Everand
Differentiation (Calculus) Mathematics Question Bank
Mohmmad Khaja Shareef
4/5 (1)
Short Answer Question #5: Midterm 3 Page 1 of 1 Spring 2020
No ratings yet
Short Answer Question #5: Midterm 3 Page 1 of 1 Spring 2020
1 page
D2L Quiz Portion
No ratings yet
D2L Quiz Portion
3 pages
Document 2292686 4256229 PDF
No ratings yet
Document 2292686 4256229 PDF
1 page
Short Answer Question #5: Midterm 3 Page 1 of 1 Spring 2020
No ratings yet
Short Answer Question #5: Midterm 3 Page 1 of 1 Spring 2020
1 page
Danzig Wolfe Decomposition
No ratings yet
Danzig Wolfe Decomposition
21 pages
Short Answer Question #5: Midterm 3 Page 1 of 1 Spring 2020
No ratings yet
Short Answer Question #5: Midterm 3 Page 1 of 1 Spring 2020
1 page
Chapter 2: Financial Returns: August 25, 2016
No ratings yet
Chapter 2: Financial Returns: August 25, 2016
20 pages
Exercises: Applied Bayesian Analysis and Numerical Methods (STK4021)
No ratings yet
Exercises: Applied Bayesian Analysis and Numerical Methods (STK4021)
30 pages
Judo Physiological Profile Sportsmedicine Franchini
No ratings yet
Judo Physiological Profile Sportsmedicine Franchini
21 pages
Portfolio Grade 1 Math Lesson
No ratings yet
Portfolio Grade 1 Math Lesson
1 page
Strat Sim
No ratings yet
Strat Sim
289 pages
77 4001 StaSaf
No ratings yet
77 4001 StaSaf
20 pages
B10x Technical Reference 1.4
No ratings yet
B10x Technical Reference 1.4
29 pages
Mkt350 Final Report The Art of Potano
No ratings yet
Mkt350 Final Report The Art of Potano
30 pages
Mabini Colleges, Inc.: College of Nursing and Midwifery
No ratings yet
Mabini Colleges, Inc.: College of Nursing and Midwifery
2 pages
Bio Metrics
No ratings yet
Bio Metrics
23 pages
The Relationship of Endodontic-Periodontic Lesions
No ratings yet
The Relationship of Endodontic-Periodontic Lesions
7 pages
Day 4 Plastic Pollution Ielts Nguyenhuyen
No ratings yet
Day 4 Plastic Pollution Ielts Nguyenhuyen
1 page
Aluminum and Glass Company in Qatar
No ratings yet
Aluminum and Glass Company in Qatar
5 pages
Sustainable Architecture Wiki
No ratings yet
Sustainable Architecture Wiki
9 pages
My Musicals
No ratings yet
My Musicals
4 pages
Internship Jntuh 160425 With Schedule
No ratings yet
Internship Jntuh 160425 With Schedule
3 pages
Expansion of Theme
100% (2)
Expansion of Theme
10 pages
Om Namah Shivaya
100% (1)
Om Namah Shivaya
17 pages
SL 1297 - Rudder Tube Assembly Inspection 2021-10-22
No ratings yet
SL 1297 - Rudder Tube Assembly Inspection 2021-10-22
4 pages
Buzz Marketing For Movies
No ratings yet
Buzz Marketing For Movies
9 pages
Resumen Productos Datalogic SENSORES
No ratings yet
Resumen Productos Datalogic SENSORES
219 pages
Trabajo Final de Ingles Técnico
No ratings yet
Trabajo Final de Ingles Técnico
5 pages
Worksheet 3 LS6 - MIANO, REYMARK
No ratings yet
Worksheet 3 LS6 - MIANO, REYMARK
1 page
Manual Phonic
0% (1)
Manual Phonic
46 pages
Notes Summer 2024 - Finance and Economics Summary
No ratings yet
Notes Summer 2024 - Finance and Economics Summary
3 pages
Computer Vision NN Architecture
No ratings yet
Computer Vision NN Architecture
19 pages
SonarQube Users (Archive) - Java - lang.OutOfMemoryError - Java Heap Space PDF
No ratings yet
SonarQube Users (Archive) - Java - lang.OutOfMemoryError - Java Heap Space PDF
9 pages
Chapter-4: Operations, Material and Maketing Management: Definition & Importance of Operational Management
No ratings yet
Chapter-4: Operations, Material and Maketing Management: Definition & Importance of Operational Management
47 pages
Pickle Brand Auditing and Strengthening
No ratings yet
Pickle Brand Auditing and Strengthening
34 pages
Medical Astrology - Medicine by The Stars
No ratings yet
Medical Astrology - Medicine by The Stars
4 pages
A.Datum Case Study
No ratings yet
A.Datum Case Study
23 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

ETC 2420/5242 Lab 10 2016: Purpose

Uploaded by

ETC 2420/5242 Lab 10 2016: Purpose

Uploaded by

ETC 2420/5242 Lab 10 2016

Souhaib Ben Taieb

This lab is to compute conditional probabilities and practice Bayesian inference.

P (spam|check this out) = 0.004

Let X1 , . . . , Xn ∼ N (θ, 9).

a. If θ ∼ N (µ, τ 2 ), what is π(θ|x1 , . . . , xn )?

See the slides of week 9

a. Simulate a data set consisting of n observations

alln <- c(1, 2, 5, 10, 100, 10000)

a <- (n * x_bar)/sigma_0^2 + prior_mu/prior_tau^2

post_mu <- a/b

xx <- seq(-5, 5, by = 0.001)

0.0 0.4 0.8

theta <- seq(0,1,.001)

0.0 0.4 0.8

theta <- seq(0,1,.001)

0.60 0.64 0.68

x = # Deaths Number of Corp-Years with x Fatalities

a. Compute the MLE estimate θ̂MLE ?

a. What is the prior mean and variance.

c. What is the posterior mean and variance.

Plot on the same graphic π(θ), π(θ|x) and θ̂MLE for

x <- seq(0, 2, by = 0.01)

for(case in c(1, 2, 3, 4)){

dens <- dgamma(x, shape = alpha, rate = beta)

alpha_posterior <- alpha + n * xbar

0.0 0.5 1.0 1.5 2.0

0.0 0.5 1.0 1.5 2.0

0.0 0.5 1.0 1.5 2.0

• Your .Rmd file

• Lecture slides on Bayesian reasoning

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.