0% found this document useful (0 votes)

20 views9 pages

MATH3806 Project 1

This project involves statistical analysis using Python, focusing on mean, variance, and outlier detection in datasets. Key findings include optimal power transformations for two datasets and the rejection of the null hypothesis regarding treatment differences. Additionally, the analysis indicates that male birds tend to have larger tails than females, although the comparison remains inconclusive.

Uploaded by

vymotu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views9 pages

MATH3806 Project 1

Uploaded by

vymotu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

Project 1

Chun Kit Bruce Lam

MATH 3806
01/04/2025
April 3, 2025

Foreword
All computations in this project are done solely using python, in particular,
the numpy, pandas, matplotlib, and scipy.stats packages. The equations
documented are ones that are technically required to compute the results.
However, computations such as finding the optimal power transformation is
done using the algorithms employed by the respective packages. Thank you
for your time and effort in advance.

1
Question 1
a.

Mean: 1 −1 · x̄ = 19.272
− 9.42 = 9.852
1
Variance: 1 −1 · S · = s21 + s22 − 2cov(x1 , x2 ) =
−1
14.139166666666666 + 62.23876666666666 − 13.472667 = 49.4326

b.
xc = x2 − x1
1
Pn
Mean: n
xc,i = 9.852
i=1 P
1 n
Variance: n−1
¯c )2
i=1 (xc,i − x = 49.4326

The means and the variances are the same, showing that there is no dif-
ference between the approaches when it comes to computing the mean and
variance of a dataset

c.
d2j = (x − x̄)⊤ S −1 (x − x̄) ∼ χ2df,α
By calculating the squared distance of each point and comparing them to a
χ2 distribution with 2 degrees of freedom and a 95% confidence level χ22,0.05 =
5.991464547107979, observations 4 (d24 = 7.749818085576154) and 20 (d220 =
9.373870986979817) are shown to be outliers.

2
d.
n
n X
MLE of Power Transformation: L(λ) = − ln(s2 ) + (λ − 1) ln(yi )
2 i=1

−1 i − 0.5
Q-Q Plot Construction: zi = Φ
n
Optimal λ for x1 = max(L(λ)) = 0.05449653698671072

y 0.05449653698671072 − 1
y (λ) =
0.05449653698671072

e.
Optimal λ for x2 = max(L(λ)) = −0.7013811554461303

y − 0.7013811554461303 − 1
y (λ) =
−0.7013811554461303

3
f.
T
By appending x1 with x2 such that x = x1 x2 , then repeating the pro-
cedures in part d and e:
Optimal λ for the bivariate case = max(L(λ)) = −0.02000817559628896

y −0.02000817559628896 − 1
y (λ) =
−0.02000817559628896

4
Question 2
a.
n
1 X
S= (xi − x̄)(xi − x̄)⊤
n − 1 i=1

T
124054.67241379 361620.44827586
x̄ = 1860.500000 8354.133333 S =
361620.44827586 3486333.15402299

Half Length : 901.6522227 140.51187343

−0.10573993 −0.99439382
Eigenvector Matrix :
−0.99439382 0.10573993
s
c2 λi
Axis = ∗ ei = Half Lengthi ∗ Eigenvector Matrix,i
ni
(n − 1)p
Fp,n−p (α) := c2
n−p
No. [2000, 10000], represented by the pink dot, is not within the area of the
confidence region. Therefore, using a confidence level of 95%, the proposed
mean does not align with the given data.

5
c.
Pn
j=1 (x(j) − x̄)(q(j) − q̄)
Correlation Coefficient = qP qP
n 2 n 2
j=1 (x (j) − x̄) j=1 (q(j) − q̄)

Since the data for both x1 and x2 follow the normality assumption line pretty
closely, the bivariate assumption can be comfortably assumed to be true.
This is further supported by the correlation coefficients of both the x1 and x2
normal probability plots being 0.9892631529453173 and 0.9883208213778977,
respectively. Plus, there are no visible outliers from the scatterplot.

6
Question 3
a.

b.
−1 (n − 1)p
T 2 = n(x̄ − µ0 )⊤ Spooled (x̄ − µ0 ) ≤ Fp,n−p (α) := c2
n−p
Pk
(ni − 1)Si
Spooled = Pi=1
k
i=1 (ni − 1)
1 1
Coefficient Vector = ( ∗ S1 + ∗ S2 )−1 (µ1 − µ2 )
n1 n2

Case Observation 31 = 184:

T 2 : 25.662530996663882 Processed F : 6.273885668660057

Coefficient Vector : −3.57426836 2.12202034

Case Delete Observation:

T 2 : 24.96490074510203 Processed F : 6.2772565319529265

Coefficient Vector : −3.49023807 2.07954999

7
The null hypothesis is rejected either way, so both treatments in our case did
not cause any major differences.

c.
Simultaneous Confidence Interval=
s r s r
p(n − 1) sjj p(n − 1) sjj
x̄j − Fp,n−p (α) ≤ µj ≤ x̄j + Fp,n−p (α)
n−p n n−p n
Case Observation 31 = 184:

x1 = −11.90907531 −1.15759136 x2 = −6.16866246 8.34644023

Half Length : 6.32119702 2.24142036

−0.55889415 −0.82923901
Eigenvector Matrix :
−0.82923901 0.55889415

Case Delete Observation:

x1 = −11.89886251 −1.02740012 x2 = −6.16232369 8.51585904

8

Half Length : 6.35396193 2.254200933

−0.55881027 −0.82929554
Eigenvector Matrix :
−0.82929554 0.55881027

d.
Male birds generally have larger tails than females according to the given
data. The comparison of tail sizes remains inconclusive.

Book Solution Applied Multivariate Statistical Analysis Solution Manual 6th Edition PDF
100% (2)
Book Solution Applied Multivariate Statistical Analysis Solution Manual 6th Edition PDF
369 pages
Applied Multivariate Statistical Analysis 6E【课后习题答案】
81% (80)
Applied Multivariate Statistical Analysis 6E【课后习题答案】
369 pages
SOLUTIONS 2022 Intro Stats Exam2
No ratings yet
SOLUTIONS 2022 Intro Stats Exam2
13 pages
Solusi Soal Bab 4
No ratings yet
Solusi Soal Bab 4
9 pages
Genetica Mensua Jose Luis PDF
100% (1)
Genetica Mensua Jose Luis PDF
808 pages
Research Report On Pakistan Post Office
No ratings yet
Research Report On Pakistan Post Office
29 pages
2018dec 02402 Solution en
No ratings yet
2018dec 02402 Solution en
31 pages
Question Bank
No ratings yet
Question Bank
6 pages
Lecture BDS 1 23 24 Print
No ratings yet
Lecture BDS 1 23 24 Print
15 pages
Solutions (Stats)
No ratings yet
Solutions (Stats)
20 pages
1.12.2024-BSC-301-CSBS-class Note - 2024-25
No ratings yet
1.12.2024-BSC-301-CSBS-class Note - 2024-25
58 pages
Question 1
No ratings yet
Question 1
23 pages
MAI 102 Mathematics II ETE 2023 24
No ratings yet
MAI 102 Mathematics II ETE 2023 24
28 pages
Module 2 - 1
No ratings yet
Module 2 - 1
18 pages
Actl 20025101 Finalexamsolutions 2006
No ratings yet
Actl 20025101 Finalexamsolutions 2006
15 pages
Assign20153 Sol
No ratings yet
Assign20153 Sol
47 pages
SM Unit 3, 2
No ratings yet
SM Unit 3, 2
25 pages
2019 Final
No ratings yet
2019 Final
20 pages
Formula Sheet
No ratings yet
Formula Sheet
6 pages
1B40 Practical Skills: Weighted Mean
No ratings yet
1B40 Practical Skills: Weighted Mean
7 pages
Formula B.SC (CS& AI)
No ratings yet
Formula B.SC (CS& AI)
2 pages
STA302 Final 2011S
No ratings yet
STA302 Final 2011S
21 pages
Solution
No ratings yet
Solution
148 pages
HASTS215 - HSTS215 NOTES Chapter5
No ratings yet
HASTS215 - HSTS215 NOTES Chapter5
18 pages
QUESTION 1 (3 + 12 + 5 = 20 marks) :, … ,Y Y μ and V Y σ
No ratings yet
QUESTION 1 (3 + 12 + 5 = 20 marks) :, … ,Y Y μ and V Y σ
4 pages
Problem Set 1 - Answers
No ratings yet
Problem Set 1 - Answers
7 pages
CS1B Nov 24 Solution
No ratings yet
CS1B Nov 24 Solution
11 pages
Prob-Stat - 222 Final
No ratings yet
Prob-Stat - 222 Final
41 pages
Prob-Stat - 222 Final - DUNG NGUYEN
No ratings yet
Prob-Stat - 222 Final - DUNG NGUYEN
41 pages
GU4291 GR5291 Homework1 23079925
No ratings yet
GU4291 GR5291 Homework1 23079925
3 pages
Endsem ML Makeup AK - 1
No ratings yet
Endsem ML Makeup AK - 1
7 pages
MIT2 086F12 Quiz3 Samples
No ratings yet
MIT2 086F12 Quiz3 Samples
14 pages
Final Formulas
No ratings yet
Final Formulas
5 pages
Midterm - EE511 - Part B: K K K K
No ratings yet
Midterm - EE511 - Part B: K K K K
8 pages
Joining Instructions Lisboa
No ratings yet
Joining Instructions Lisboa
8 pages
Questions For CET
No ratings yet
Questions For CET
11 pages
MAI 102 ETE Solutions
No ratings yet
MAI 102 ETE Solutions
25 pages
Example Class One
No ratings yet
Example Class One
4 pages
Chapter 23 Correlation and Linear Regression Tutorial Solutions With Comments
No ratings yet
Chapter 23 Correlation and Linear Regression Tutorial Solutions With Comments
21 pages
Data Analaysis and Visualization - 49Q
No ratings yet
Data Analaysis and Visualization - 49Q
28 pages
Homework 1: Statistics 109 Due February 17, 2019 at 11:59pm EST
No ratings yet
Homework 1: Statistics 109 Due February 17, 2019 at 11:59pm EST
23 pages
FormulaSheet FinalExam
No ratings yet
FormulaSheet FinalExam
8 pages
Weather Wax Hastie Solutions Manual
No ratings yet
Weather Wax Hastie Solutions Manual
18 pages
Weekly Homework X
No ratings yet
Weekly Homework X
15 pages
2018may 02402 Solution en
No ratings yet
2018may 02402 Solution en
36 pages
2022 CS244 End Sem Soln
No ratings yet
2022 CS244 End Sem Soln
6 pages
2010 Apr QMT500
No ratings yet
2010 Apr QMT500
8 pages
Data Analysis Exam Help
No ratings yet
Data Analysis Exam Help
8 pages
Stats 205 Hw1
No ratings yet
Stats 205 Hw1
4 pages
Assignment 1
No ratings yet
Assignment 1
18 pages
Multivariate Normal - Chi Square
No ratings yet
Multivariate Normal - Chi Square
19 pages
Assignment 3 (Sol.) : Introduction To Machine Learning Prof. B. Ravindran
No ratings yet
Assignment 3 (Sol.) : Introduction To Machine Learning Prof. B. Ravindran
8 pages
SML-SET 1-Batch 1-Answer Key
No ratings yet
SML-SET 1-Batch 1-Answer Key
8 pages
(Amir Hussain Shah) (Amir Hussain Shah) (Amir Hussain Shah) : Course Code Tutor Address Tutor Address Tutor Address
No ratings yet
(Amir Hussain Shah) (Amir Hussain Shah) (Amir Hussain Shah) : Course Code Tutor Address Tutor Address Tutor Address
25 pages
A Survey of Probability Concepts
No ratings yet
A Survey of Probability Concepts
42 pages
Prelim Intro To Multimedia Chap 1
No ratings yet
Prelim Intro To Multimedia Chap 1
38 pages
SMTP Diag Tool
No ratings yet
SMTP Diag Tool
6 pages
Critical Path: T.S T.S F.S F.S ES EF ES EF LS Duration LF LS Duration LF Total Slack Free Slack Total Slack Free Slack
No ratings yet
Critical Path: T.S T.S F.S F.S ES EF ES EF LS Duration LF LS Duration LF Total Slack Free Slack Total Slack Free Slack
21 pages
PDF
100% (2)
PDF
39 pages
ECOFLOW RIVER 2 - User Manual (EU-EN) V1.0 - 1673942476392-20230127
No ratings yet
ECOFLOW RIVER 2 - User Manual (EU-EN) V1.0 - 1673942476392-20230127
16 pages
C Programming Strings
No ratings yet
C Programming Strings
9 pages
Structured Network Cabling Baguio
No ratings yet
Structured Network Cabling Baguio
5 pages
Chapter 11 - Dynamic-Object-Modeling
No ratings yet
Chapter 11 - Dynamic-Object-Modeling
32 pages
Unit 4 Notes CC Ramadevi
No ratings yet
Unit 4 Notes CC Ramadevi
31 pages
PREPOSITIONS OF PLACE - Quizizz
No ratings yet
PREPOSITIONS OF PLACE - Quizizz
6 pages
Indian Ins Titut e of Technology M Adras: (Sep'2016 - Present)
No ratings yet
Indian Ins Titut e of Technology M Adras: (Sep'2016 - Present)
1 page
LED Lightboxes Specs
No ratings yet
LED Lightboxes Specs
14 pages
(Ebooks PDF) Download Triple Focus A New Approach To Education The Full Chapters
100% (3)
(Ebooks PDF) Download Triple Focus A New Approach To Education The Full Chapters
21 pages
1187-1996 - Farm Milk Cooling and Storage Systems
No ratings yet
1187-1996 - Farm Milk Cooling and Storage Systems
19 pages
Unit-4 (STLD) Lecture2
No ratings yet
Unit-4 (STLD) Lecture2
21 pages
Inertia Chassis Dyno Quick Start Guide
No ratings yet
Inertia Chassis Dyno Quick Start Guide
20 pages
The First Crusade The Call From The East Peter Frankopan Download
100% (1)
The First Crusade The Call From The East Peter Frankopan Download
18 pages
2000 Procedimientos Industriales - Formoso
100% (2)
2000 Procedimientos Industriales - Formoso
1,219 pages
66 Easy
No ratings yet
66 Easy
10 pages
NCS Expert Tutorial - How To Code Features in Your Car.
100% (1)
NCS Expert Tutorial - How To Code Features in Your Car.
10 pages
Instaliranje Total War
No ratings yet
Instaliranje Total War
2 pages
FP5207
No ratings yet
FP5207
13 pages
DevOps Engineer
No ratings yet
DevOps Engineer
2 pages
Low Power Square and Cube Architectures Using Vedic
100% (1)
Low Power Square and Cube Architectures Using Vedic
18 pages
dt209x Manual
No ratings yet
dt209x Manual
68 pages
ES - Lecture2 - Aug 2
No ratings yet
ES - Lecture2 - Aug 2
37 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

MATH3806 Project 1

Uploaded by

MATH3806 Project 1

Uploaded by

Project 1

Chun Kit Bruce Lam

Case Observation 31 = 184:

T 2 : 25.662530996663882 Processed F : 6.273885668660057

Case Delete Observation:

T 2 : 24.96490074510203 Processed F : 6.2772565319529265

Case Delete Observation:

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.