0% found this document useful (0 votes)
34 views8 pages

Small Résumé Stats

The document discusses different types of variables in statistics: 1) Nominal variables are categorical variables like gender, race, or airline name that have no intrinsic ordering. 2) Ordinal variables are ordered categorical variables like education level, rating scores, or passenger class that can be ranked but the differences between ranks are not fixed. 3) Discrete variables take countable numeric values like number of passengers but the values are restricted to whole numbers.

Uploaded by

cameratalorenzo
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
34 views8 pages

Small Résumé Stats

The document discusses different types of variables in statistics: 1) Nominal variables are categorical variables like gender, race, or airline name that have no intrinsic ordering. 2) Ordinal variables are ordered categorical variables like education level, rating scores, or passenger class that can be ranked but the differences between ranks are not fixed. 3) Discrete variables take countable numeric values like number of passengers but the values are restricted to whole numbers.

Uploaded by

cameratalorenzo
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8

statistics 2022

Types 0f variables =
Quantiles

µ
Nominal (Nordet ) :
} Quantile Qlp) of a sample is the value

Sex, race , haire airline name Definition :
the P
-

,
CATEGORKAL
.

Ordinal (arder) level ofeduc rating passager class such that at mosta proportion
score ,
p is smaller than Qlp)
• :
,
,

Discrete canon / taire articula values but can be Infinite & at most ( ) of the data
larger than Qlp)

}
: is
/ tp

y ,
avant, #µ
.

& canon Guay by fixed amounts siæofpop no of


passagers
: .

,
Or
calcul .

_
) sort the data (
ascendingoider)
qq.nu, µ, ,,
ang.am, aaawn.in,, my ,,, ,, , gmpw.eu
,
an , =p ,n.gg ,

un , ,
,
ange
,

point /if not interpolation)


temperature , responsa time , BMI ,
incarne , frelons . 3) Ifrinteger QLP ,
' is A data .
: linear
'
=

Measures of Center

)
Properties ( unimodal distribution)
Mode : mot frequent value (for discrete variables )
highest frequency (if classes of same width) médian
class interval with symmetric distrib mean
• : =
.

relative t'Queney in the class


class Interval with highest density ( Class width ) •

right stewed - " : mode < mediante mean

Médian M QIO 5) left skewed " mode > médian >


:
=
: •
mean
-

n
✗et + ✗n
☒ f-
- - -

Mean : or average : =

,
☒ =
n

Measures of dispersion Frequency table


/ ✗si ( Xs ✗n )
Range )
classtp.r.gg#mf9--l9
• : Max . .

, ✗n -
min ,
. . .
, Frequency f- Rel .

req .

Interquartile IQR Q3 Qd
range
• : = -
+
+
n


Variance : s' =
¥1 Et ;
✗i -
☒Î
"

• Standard deviation :
s = v52 Total fr9 .
Total 1
=

Histograms graphical translation of :


a
fsequency table
freqencyorrel.frequescyrel.fr
Area d- each proportion al
rectangle is to each class

Height
"
is lqual to the
density
=

amplitude amplitude width of class


eq
,
where
-
-

Boxplots Q Q plots
-

1) Box between quartiles 01=010,25) and ☒3=010,75) A Q-Q plot (quantile-quantile plot) is a graphical method to compare two
2) Line at the médian M -
-
QU' 5) probability distributions by plotting their quantiles against each other.
The two distribution to be compared can be either theoretical or empirical.
3) Lower bound LB =
A1 -

1.5 ( Q3 -
Q 1) Most often, one is interested to compare an empirical distribution (from the
1.5 (Q3 Q 1) sample) to a theoretical one.
4) Upper bound UB =
Q3 + -

In this case, the ordered sample is plotted against the quantiles


5) Ends howett tatum
'

of F- l 5) fort 1 in
"
whiskers biggest than LB =
: . . -

,
.

smaller than UB where F is the cumulative distribution function of the theoretical distribution.
highest datum If the two distributions being compared are similar, the points in the Q-Q plot
6) Data outside symbol
of Whiskers represented by a .

=
will approximately lie on the 45 degree line.

Sampling schèmes


Simple : random sampling with equal probabilités ( Ex age classes: with proportions of individuels
• Stratified wrt / one or more variables : then simple random
sampling within each stratum in each class
,
tatethatpvp.to obtain sample of
n °O° )
Clustered divide
sample the groups ( all the members vept in the sample )
:

into then
population groups , we


Systematic :
tatie observations at regular spacing
Estimatoire 1 E- (✗ 4) = ✗ -
ELU)

Est imatoref µ ECXI


= ) En ÊXI 2 ENTV) Elu) + ELV) =

Properties : E- (E)
µ = =
EH Ê ) ? ÎE .ËH=±ÊEHil=âlµ+ÂÏ÷ˢ)
.
✗i -
-
En -

µ
=
µ 3 Var (XU) = à Var ( U )

Var =
=

varfn.E.it?H?varl?.xil--1-n.ElvarHiD--n- ITÎËÏTY fini -


-

a Valu + v) Valu) + VWLV ) =


if U and V are independent
Central Limit Theorem
Reminder if ZNNIO 1) then ✗ : ,
=
a + BZ ~ N / a) 62)
let ✗ 1 ,
. . _

,
✗n be iid ~ F. We demonstrated : E- (E) =
µ Val ,
= ¥ ,
with µ
= Et ✗i) and t' = Var ki) < •
,
i -1
-

, . . .

,
n

# N µ E)
À N / 0,1 ) À ( ( LT States that I benares like Normal random variable
Then µ therefore ifor large sample
a
sites :
-
:
,

oyrn •
× .

for large values of n / K) ,


the number of variables involved in ✗ .

{
1 with prob p
Estimating proportions ×; Bin H , P)
-

=
Xi with n =L trial
prob d- p)
-
o with .

Here µ ELXI) =p 02 Valk) =p( 1-


p)
-
-
=

Î À
" "

=
is the sample proportion of 1 ,
and for large sample sites :

Îappîix N ( p PI ) P'
-

Other estimation for µ exist , for example :

Ethel truncated -
or d- trimned meam : the meam after discording proportion a ✗ of the observations at the
high and low end distribution .

• if the distribution F is
sym metric then
µ
= EN :) = m where mis the population médian .

mediunli) ) )
, ,

À = médian ( Xi ,
. . .

,
✗n ) can also be used as am estimation of µ .
( similar / y ,
other estimation for t'exist : MADHI ,
. ._
Nn) medias (I Xi
-
_
-

.
Chi sq distribution .
.
:
if Ki , .
. .

,
Xn ~ N /
µ, o
'
) ,
we have
T' Van Xi )
Estimating Ê
"
2 alternative estimations ×
E)
=

/ ~
yn -
r

Êz
i

Ê
-1
-

s'
=p ? , ( Xi It and J' =L / ✗i À Voulx) 2d
nyj ECX) =D
- -

, If ✗ ,
then ,
-
-

=
#[ n ✗ 2- ni ] =
# { ✗ 2- j']
of and 52
'
There is result on the distribution
no
general s

Duyn?
( ) 52
isdistributedlitedozkn.se)
n l
From aboule Yn therefore s'
-

:
~
i with ,
oz
-

similar / : n J2
Ùn and Tis distribuée litre Dorln
y
~
,
p
.

Estimation of the variance 02

Let ✗n , . .
,
✗ v. 6e kiidr.ir .
~ N ( µ , 02)

Propertg:* Ô=Ïç( ×
Youpi ,
→ ECQ~t.li-1
Consequence regarding the 5 estimated
Remind : chi -

square distribution
" Z, -2k iidr.VN N / 0,1)
From the about E( 5) 0 , ,
. . .

Proper tg
: =

Def : a =
ËZÎ is distribue d. according to I distrib with kdegenotlreebn
EIÂ)=E(¥Ê( x-p K
.

s'
Ë ai ☒Y ✗i 1 anti
- -
- -
= -

EIQ.tk
Properties
:
,

ÊEÇÊ ai x-p ) K 1
Var ( Q) -2k -

- -
-
-

v11 EIÊHi ) 02 ( 1- ✗121=0.95


'
I)
Typical ✗ 0,10 and Zi 1.65
-
=
=
= -
an

Ets ) =P
[ # Épi À/
,


=
0,05 ( n d / 21=0.975 and Zr 1.96
02 alz
- =

-

=L - =

Values :


=
0,01 ( 1- ✗ 12 ) :
0.995 and 2- 1- * =
2-58
Confidence Interval
Pf µ ☒ ≤
} 1. Ipe
-

2- ≤ 2- 1- ✗
Confidence Interval for µ Elxi) when O2 known
prob 1- ✗
=

µ with
-

-
-

,
Lovers .
Nrk

CI [☒ ± Ze E €] Xn , X , Xn EdF distrib P { -
Z -

Ê ≤
µ ☒ ≤ Z
Ê} 1- a /+ I
µ
-
=
E
-

=
-

- ..

,
.

INN / µ , E) →
Ê- NNIO 1) ,
PH -

z -

E ≤ µ ≤
x-tz.EE }
-
-

1- ✗
Covers

À =
f.Ê ✗i
P { [ ☒ ± z E ]ȵ } -
=
1- ✗ ☐☆
.

2 Confidence interval for µ=E(✗i ) when ✗Unknown


tu student distribution with ki degree offieedom
by its estimators2-n.IE/i-x-)2
~

Replace
, ← >

CI =/ ☒ ± tn IH E) Fn ] ⇐ exact confidence
'

c- level (ta) where tn is the Student distribution with n -1


degieesoffrædom
- '

µ
-

,
.
, .

If inSleat of tn 2-e- an the CI Stil valid but with level of approximation


we use
,
is ,
ahigher .

3 Confidence interval for a proportion p (Bernoulli distribution Blp)) Eki) =p , Varlxi) =p(tp)
[☒ ± P '^ P) ] P¥P)
-

'
e (I = 2- 1- ✗ if p known unbiased ELI) Var (5)
n
=p
=

p F- ,

CI =
[À ± -21 an ] if pis Unknown : ÙÂ matos I. ( LT :

Ifm »
N (
p , PÈN)
RE
-

✗ Ê density function
,

Confidence interval for ,Xk~iidN ( µ À


ËÏÎ
4 the variance 02 ( normally distribued data :X , . . . ,

[0 HE
1) S2
)
'
J2 E (I (n
] ( 1- d) confidence level (" s' from pwpesty
-

=
,
~
aboie *
: , ( a)
S2 yn
(Knil i ËÊ]
l)
✗i. irai
n
T'
-

C- CI =
-

5 Confidence Interval for the difference of Mears Xn , . . .

,
Xnx Fid Fx ,
Yn , - -

,
Ynx Faty
1) same variances µ ✗ =
ELXI) , peu =
ELY;) Var (Xi) -
_
Va /Yi ) = 02
,

c- CI ( I I) [ ± 2- 1- au
spa %) ] il t'× & Fy N then
replace the Ze alz normal
quantile
= ~
µ× µ , + we
-
-
-

( nx 1) SÎHNY 1) SI with the tnxtny (1-012) quantile ( exact distribution)


sp
-2
- -

where =

nxtny
-
2

2) diff . variances
À NN replacethe 2- nan normal
]
¥ if t'✗ & Fy then we
quantile
CI =

[ ( i 5) ± -

Zeon
nx
+

My with the
tminlnx.im,
,

-
i
) ( 1- N2) quantile .

Theory of estimation we assume ✗ni . . _


An d F
if bias -0 -

theestimator is unbiased .

Bias bias (Tlk , . . .

,
Xn) O) ,
=
EITIX , , . . .

,
✗ n) ) -

O if bias < 0 Dis underestimated .

if bias > ◦ 0 is over estimated


• Bias of I for µ ECXI ) bias / ☒ µ ) =D = :

of s and J2 for T when Xs ✗ Y N/


' '
• Bias
;D µ ,
. .
.
, n ,


Elsa )=E( n
! Él Xi -
It ) =
n (Ë f) =
Es ln -

a) =
02 bias (s' p ) Eté) E- 0
'
-
-
-
unbiased (+ consistent)

• E- (5) =
¥02
=D bias (5,04--5-152) - t' =
02-02=-1--02 bias that deweases with n

estimation smaller variance


we also prefer with a

• Val 5) =
Vain? ÊGIXI ,
-

il
'

) =

(m )
^
ator /È / ,
"
f) =µÎ un D= -

n
?, on
and similar by
Un 1)
Var ( J2 ) on
-

• =
h2

"" "
Given that n ? >
nn ( À) .
> À Vol < vols)
Meam Squared Error MSE (TIK , . .

.gl/n),0)=E(lTlXn...,Xn) -
O ))
MSE MSEIÔ O )
Decomposition Var IÔ) + biasio O) .
=
,

MSEIS F) Var / 5)
'
? T = =
"

"ᵈ
n , ,

( ¥04? n2 ton
« t'
MSE ( 52,02) Var (5) + biasls :o) !
2N n -

"
-

☐ = +
n2

we can check that ¥ > YÉ for V3 talwegs !) therefore MSE / 5,04 > n > , MSE (52,02)

MSEI 52,02) ÉE where à ÎÉ (


: J E- ÊË Ai -
x-p =
-
-

E
bi / J' v2 )
as El J ) O2 E ( ÊQ)
=
¥ EIÔ) Ê( ) F k
'
-
= -

O
'
= - = K -
i - =
-

Ê " '
"
Var ( EE) (Ê ) UK 1) ¥
"
var / 54 vor O
= = -
=
=
-

MSEI 52,04 Val 5) tbias452.si) -

2¥:"
1- E) 2k¥02 =
o
"
+ =

'"
É )

Efficiency of estimation Twrt / u :
eff (IU) =

MSELT
( if both unbiaxd :
elf ( Tir) =

Var (T )
)
Bydel : most efficient estimation is the one with the Smallest MSE

If effltiu ) > 1 Sag that Tlk , Nn) is more efficient


,
we . . -
than UH ,
. . .
,Xn)
If eff(TN ) < 1 we
Sag that Tlk , Nn) is less efficient
,
. . -
than UH ,
. . .
,Xn)
Consistence :
if HE > O PC / TCK , An) il > E) ¥0 :
. . _
-

( Media is more robust than meant

/
constructions estimations • µ
-
-
EN =
% ✗ flxldx Maximum Likelihood
Method d- Moments ÈH)
• E-
www..jo?lx-peYfHldx--ElX ) -

1) we farm the livelihood function as a


produit
µ Et ✗
"
/ =
§ ✗
"
f-(Hdx initial moment of Kader LI ) =

Ëf f. (Xj) ,
where f. is the
probability density function .

( phi FNÈXÎ
A

) ( EIXIÎFCXIDX
-
oo

E- (X EIN)
"
centered moment ,Koidu

{Ênifo
pu x
-
- - -

-
as
1/01/11 , ✗n) =
Hit folk ) .

folk) .

.. .
-

folxn) : Xi continuous
ta :p ,
. . .

Poki ) Polk -
-
✗i
-
-
-
-
)
H :
'

Pdki ✗ ) n : xi discret
empirical sample of Koidu
. .

and .
- '
moment

I
Compute the moments needed of random variable 2) We talk the
log of the function
p a

First moment : EH
µ
= lol Xp / Xn ) log ( V01 Xp ✗ ))
. . . = . . .

,
n =

{ ËÊ log
second moment : EUH Vaut = + EÎX) = J' + pi = log fohi ) ( ) ✗i continuous
2) In the (Pol ✗i )) discrete
equations aboue replace ✗i
-

xi
-

, ,

• EH) by ÎÇÎ , Xj 3) We desir ate and =D ,


( solve the first oder conditions)
ÎÎÉXI J log folk )
n


ELXY by I
Ë XÎ
'

• • =
Âmn Sl / 01×1 , ✗ a) € so ✗i continuous
,

. . .

= =
,

µ by ÛMM ÷:* . :*. .. .. "" " " .

Jo

6g âµm
a
so
,
• T' 68 Ômm Illustration Xn , . . .
,
✗ n Ea
Exp (b) . f- (X) =
dexptbx) (PDF)
◦ Hain Âum =
ÊËÉXJ log( f-( x )) =
log ( d) -
DX

Fin =

I. Ë Xi ¥ ÊX;) -
?
=
ÊÏÉÇ ;
I)
-
'
l (d) ✗ii. →
✗n) =Ë( hogld) -
Hi ) nlogld)
= -
d. Ê ×:

În
.

Î Ê
,

.""station :
Doivative : -
✗i
=
°
,
=

ê÷ ¥ =

Letti , .in?iaExp(d) density ftp.jexpfyx)


,
-
. . with : ×≥◦ i =)
,
Here ad ,
p -1
Hypothesis testing
galper) ¥
.

We have that
µn Elxi)
= : ± , therefore On = D= =

Binomial test proportion


That impies that Inn =

¥ =
¥
:

Ho :p -0.5
☒ we test the null
Hypothesis
-

on
against the alternative hypothesis Ha :p > 0.5
• The test statistic used is X .
The distribution of ✗ unde Ho is
Bin / n'-25 p -0.5 ) ( or NI 12.512.52 )
approximation by a
-

• The
sample value of the test statistic is v. dos .

Es The p value is completed vndet Ho


-
:

)
,

( 5) Pto /

[¥ ≥
> v.dos -12s
PHO / × ≥ v. 06s) =P µ Z>
.
=

.
≈ . . .
. . .

2.

2- NN / 0.1 )
Type I and Type I error • une conclusion : .
. .

TOTALITES
i p value the
propability under Ho of Observing Somethin
:
-

false positive at least as extreme as what has been observed in the sample
B
The conclusion :

false négative 5%)


☒ small
p value
-

leg < :
evidence
against Ho
we define probabilities associated with each situation
rejet Ho (testis significant )
:

✗ =P / eject Ho / Ho true) =P " ( eject to) ← Worse !

against Ho

bah no evidence
large value :

and
p =P (don't eject Hot Houot tue) PHA ( don't eject to) p
-

don't
eject (test is inconclusive)
Critica value
2ᵉ
"
if µ > ,
ka :(1- x) -

quantile of distribution of I undotZ '"


the
kx-ipeot-zi.at/rn
rejection region :X > kx :
PH.LK > ka) =P, ↳ ( ÉTÉ > =L -
Y÷ )
• if µ < 2 Ka : X -

quantile .
,

K¥10 )
Zx Mrn
☒ ka PHOLI < ka) =P,"/ < a kx≈µo +

rejection region <


-
_
: :
where 2- ✗ = -
2- 1- a

if µ -1-2 , Kin and KI we the # 2) and H N2) -

quantiles
PHOLILKÎ ) -42 -
and Ptidx > kà ) -
-
N2
Relation ship between
p value and ka
-

smaller values of ✗
larger values ofkx :
rejet Ho less often but reduce Type I error
with p value
:

} TMP
-

☆ if p value < ✗ -
then to is rejected -
value is the minimal value d- ✗ which
gives significant test
a .

• it p value ≥ ✗ -

I then to is not rejected


) ] equivalent p value # pltbistrue)
[
{Tas )
ftp.IY
are
and < × -

The statements > ka

Power d- atest la " d- test son to the ' et ' d- the s' aphiwhee points stat
]
-

[
: .

the right ,
left ) Small grows the slowest to
big giowsthefaskst (to the
n=Eû÷:)
n' : :

=p,→(rejette )
-

: ,
n
-

M' Ho
TypeIegor
)
_

Ne fix ✗ Ï
: -

1-01-21
: .

Z
-2ps
- _

, a

:p Ptaldoritejecf +62f
- -
-
x
Mrn
#
Type error =

Ka)
wedefinethepowoof.ae/-estas:power=1-p--1-PHn-ldoritejectHo)-- Ptnlreject Ho) =P HAUT >

level
For
prefer test with
large power

a
given ,
we statistics
Tests
• Test
of location proportion Special case
:

☒ →
1)
°


i~B.in/1.p),p--Iusedtoestinatep.Ho:p--po,TeststatisticT= po -

( 1- po) Ff0 ON ""


'
sonne
tej reg
. here

y
. .

'
B T test
-
: T known n

{
E- µ
2-
qq.Z~NIO.tl rejectionHoregion µ : Z > 2- 1.x ; QCZ , a) 1- x
.
>
=
, µ . -
=

( of )
tecteo : 2- C -
Z, -
✗ or 2- < 2- ✗ ; (Zx ) - X

µ =/ Moi 12-1>12-1 -
€1 ; QLZ , a) _
= 1- ✗ 12

↳ studentt test
for location ( T' Unknown)
-

K, . . .
,
✗ n Fa N ( µ , )
☒ -

Ho

{
1- T~tn.se T > Truc x )
if
:
Ho true µ> peu
=

s / rn ;
5- ¥1 .Ë( ✗i -
x-P Ka quantile of tn distrib , .
rejection region tuto :
T < -
Tn ka) -

str can also be replaced by Fa if Voris gives ( or standard deviation ) ttttlo :/Tl >
Tn, (E)
À Wilcoxon rank test for location Xp Anita . . -
F
,
Funknown , symmetiicaround µ=E( Xi)
Ë Ni

{
1-
Âw×N(◦ 1) itejectionHoregion 2- 2- 1.x ; (2- a)
=
> 1- ✗
' µ
>
Ho : =

nrlnrtykn.tl) ( of )
n
tecteo : 2- C -
Z, -
✗ oe 2- < 2- ✗ ; (Zx ) - X

✗i 127 129 131 132 135 138 µ =/ plo : 12-1>12-1 ¥ / -

; Q (Z , _
a) = 1- ✗ 12
✗i -

pho -8 -
G -
U -
3 0 3
Hi -

Mo / 8 6 4 3 3
tank / Xi Hot 4 1.5 1.5 meanof 1 & 2
5 3 ←
-

signcxi pro ) -
- - -
-

signed tankini -5 -

y -3 -1.5 l -5
is 5 be one is =
H
n,

fric)
{
'
02--002 %
"

Testing >
t
>
'

☐ µ unknown °
, ,
'
'
Ien 1) s
02<0! ;
# X < (1- ×)
-

✗ %
~
' rejection region
040!
; fn.im#cy'n-il1- E)
• 2 sample t test -
Xy . - -
✗ nx
,
%, . . .

/ Yny Eki)=µ× El Yi) ,


-

µ , , same var o? unknown Ho :(1×414 , HA :µx > µ>


~N ~N

Î 0>0
☒ or µ µ, -0
-
-

tn ,

{
1- x)
T
tnxtny
=
-2
tu µ -1
:
> (
rejection region
" no >
,

qing + n1 / , ,

µ
<
psy : TC tnxtng-r.CN
=/ nx 1) SÎ +1ns 1) si

Sf
-
-

nxtny
-
2 µ✗ =/ ftp://T >
tnxmj-zlxc-bc.ua:{ + = ✗

}
SÎ=n± ÎÉCXI l'

dfwekh-salterwhailecorr.sy-nf-IE.LY
,

Êt (Ï

È
À } ;
-

Il
"

now the same ,


but -1-0
,
unknown Ho :µx=µy ap .
+

o : o :

{
I - I µ×> µ >
: T >
tmltl ni ( nx -

1)
+

njcny -

n¥yapptmincnxm.IN
' pej :

µ ✗
<
µ
: 1- <
tms)
+

µ, :/ Tl tmt )
=/ >
µ✗
continued :L
sample t test -

K ,
. . .
,Xnx~N(µ×i # j' ai YnyNN(plus ;h ) . . _ variances are known
I

{
I Z -21 Attia) ✗
N°11)
> 1-
t'× > µ]
- -
:

z
-
-

= ~

rejection region
0¥ ÇI µx< fly Zx qlza) n x
2- : < - -

µ×≠µy : 12-1>12-1 Et Qltn-42fr-42 -

Xn ,
. . .
,Xn×~N(µ×i # j' y . . _

YnyNN(plus ;h ) variances are Unknown Ho :µ×=µz


I I
NN/ 0,1)
-

2- =

¥ Ésaü -
.
.

sÎ=n¥Ê:( ×; -
x-p
same
tej les -
as above

2
m "
Kaboul : mears are known
Sy ¥ ¥ ( y J)
- '
- -

, ,
,

test :
0×2=05 ,µ×,µy unknown K ,
. .

.im/~N(ttxi0x)jYy...YnyNNlpts;On )
Fisher
5×2 ←

{
À > Ty
'
F =

gyz
~
Fm , -1
; ng
-

,
whol SÎ >
si F >
T-nx-yny.IN
Sissy rejection region F4
T-ny-unx.tk )
Nicoya + =

§;
F > Fnmailinmin (E) -
'
Emoi
0×2 # OÙ f- =
F < F 4- E
5min
. "

• 2
comparing groups
:
testot proportions PEPY in and My we
big tensihundreds
,
..
.

{
-
I "" " > :
Z > 2- ta (tra) -
t -

px py
ÊÉ
-2 = "

# rejection region alex)


"
2- < Zx ✗
pu p )
:
pxcpy
=
_
+

Px =/ Py : 12-1>12-1 %) QLZI.sn/z)-- 1- %
Îtngy
-

Â
m'
=

nxtny
← Mann Whitney Wilcoxon Xn , about of the data
rank test Xnxifdt-x.Yy.in> Faty Strong assumption the distribution
'

no
-
- -
- i.

Ho 0=0 Ha :D > 0 CDF :


Fx CDF Fyly)=F×(ytL)
↳ not necessary normal
=µ× µ y
: : -

Étape N ( MEI
nxnylnxtny +1)
)
nx

N' [(Ri -
-
i ) -
_
ËRI -

2
,
12
i -1- l' =\

Riront of Xi . in combine d
sample k×=n¥ˢ+ -211 -
x ) F -

☒ Composing ANOVA ] values in group ( columns)


groups ( lines
than 2. I ) test :
Quality of of
group variances
: each
→ we
more
groups ,

normally the errors

testing
:
Ho :

µ , -_µz= . .
.
-
-

µ,
=

µ Yij =
µ + ai +
Eij i=1 , . . -
I , j' 1 ,
. . .

,
]
meam group random
or ✗ n' ✗ 2ᵉ -
. . _
=✗I= 0 effect effect error Eij~NLO.tt ) ,
Edi =D ( all di -
-
O )
]

Ê
I

( /
numeiato' ( top )
SSB ✗
Yij € § Iii
I -1 %# = '
; ✗** = ,

È FI
,

F =
-

1. Il ] y
-

y 1. y

( 1))
Ssw

Il ]
group meam
general meam

denominator (left )
-

ÉE Y# * [ JËIY } ÈÊ( y
ÎËJÊYÏÎLÎËË IÀ
i ï»"*
i
( Yij -
=
:*
- "
** + ;
sspi →
g.
-

ç
= ,
:p -1
*
qu
.

J I -
J
total variation between inside each
group
IËLÈYÏÎ groups
vor var

Igg yig?
. .

ECJ 1) si
adf.tt] l)
ggw =
trick : ssw =
- -
.
-

÷, , i :|

{
]
FI (N
F >
rejet Ho
SST=Êz yij ËËÊYÏ) df :( 1) Il] rejection region ; :( ] 1)
-
I -

I. + -1)
1) (X) do
÷,
F < FEI ; Il] not
eject to
-

,
I J
-

rejection region
interval
:
if -0.05
d-

[FI-i.us v10.95 )
,

-
ksi-FI-i.IU-yk.SI )
; ta
] therefore wea -0.05
- ! !! ËÎ
F K

Goodness of fit - -

À test K categoriaalvars.lk ,
with
apiobability Pj associated to each Cj / Épi =L , sample size n

Ho observe 0s , 0k items G, La respective/y


categories
:
pi-pi pk we in
-
-
. . . .

/
. .
. . .

Éloi Et
?

☒ Éloi
z
eil ifth.is tue ✗ tends totale high positive values
×?
-

appuyez
~
-

= > ,
Ho
in ei i"
Ê ande ≥ g-
,

rejection region :[ Kaito )


ei-expeckdno.fi/emsincatef01gCiundvHo
,

( otherwisemerge categories
(i 1. K) ↳
XE.ilo.SN
-

"

pi ¥ e.
# with ntotalnoofikns
-
_
=

DE ✗ test of independence 2uategoricalvasAtB-Tcategou.es/orA ] categories for B Cij possible combinations


withpijpwba.Zpii-1Hoipij-pi.pj.tt/;j
, .

sample ) in size n
"
, we observe 011,0m . . .
D= ] items incategoriescn.cn C ,
. .
. ,,

Qi)
? £2 ( Q; ° '

Dii Ei;)
ÉÉ
' -
,


-

jappwtf-T-D.CI
= ~
Oit -1)
i=1j= , Eij i- l
j =/
-

Aj
154.95=2826 ;]
"
E;
;
- n

sumol-allcells.in table
Econometrics relevant notes
More random Stoff
( ou ( X 4) ,
= E -

END /Y EH . -

]
↳ car ( × , 4) =
E- (XY) -

Ecx) Et )

if X,Y are
independent : cou / ×, 4) =D
Vork + 7) =
ta ✗ + Va y +2calXY )

Distribution density
FI (
→¥ËË ,

f- ✗ ( x) =
x )

Properties

f- (x) ≥ 0
OÇ -1-1×1<1 f- × =
P(✗ < x )
• Place ✗ ≤

6) =
) fcxldx
a


Flx ) =

-
f fltldt
o

he Method d- Moments Coul Xu) El Xu) =D


-

Elu × } Elu] =D impies


-
-

FÉE À Â FÊTE
{ Efxu
Elu] =
EIY Po A. ×] =D -
-

→ -
-
=
0 a)
] =
EIXIY po FX)] =D -

n'
§
" " ? 0 (2)
pion ? yi p
-

xiyi n × =
- -

from (1) À =
g- _

PÎI (3) •
Il ✗ and Y aie positive'y Willard Slope will 6e positive
← sample
:
"

§ Ë
"

Replace Blink) :
↳✓ ( × ,
y, •
If X and Y negativdywielated Slope Will
are : be
négative
fris sample
=
=
cou
ÇÎ Weneed ✗
touareg au" the sample
" "
2
par / × ,
.

between ✗ &Y E) ( ✗i -
← sample •

Ordinary Least Squares ! ( yi f- ÊÛÎ Fn !


2
min
1N E)
'
( Ûi
>
pi pixi )- = = -
-

Po ft i =\ ;
, ;

{§ ÊÊÛÎ =
ÇÎ ( yi -

pi pix;)- =D ÉÉTÈS minimise :


find l
"
ado condition :
dérivative :O

i "
solve : same esti matois as MM
→ .

S
Û ÊÛÎ =
% ÇÎXI / yi § PÎX;) =D - -
Assomptions : o ' const , µ ~ Nco , ☐ 2)
SPÎ i = ,
MLE OLS = =
MM

Maximum Livelihood Approach Must seek the value of the parameter function fly / )
rector
p that maximises the livelihood ×

04K ) )
'

If we assume that Y ~ N ( ✗
p , ,
we maximise

Max : ln f- 191×1 =

€ 1- ln (21704×1)
,
-

204k)
^
.
( Yi -

Po p x ;)
-

,
?
Note ifhomoskedasticu &
:
normalityof
MLE =
as = MM

{§ / f- (YIN) =D dim ( )

il-u~NHHY-MLE-l-OLS-MMPe.ir
ln of
no equations =

p
farming a test
In the common case where Ho :
pi
=
O
( Ha usually =/ 0 : twosided ,
unless spécifie
Two -
sided test C :
tn ki ✗%

Ê

level
-

rejet to at the a
Rejet toit t stat ≤ -
-
C or t -
stat ≥ c • If we ,
we
Sag is Statistical
y significant .

One sided test level


§; insignifiant at the ✗

It fait statistical
-

* we toi eject Ho is
rejet toit wesaj
.

Upper tail p; > 0 : t stat ≥ c


-

,
g
Lower tail p;
< 0 :
rejet Ho if t stat ≤
-
- C
P -
value :
if p
-
value < level of test eject

Confidence interval Norma / E :


µ Voir :O
"

selfs;) ; Ê selfs;)]
"

A- d) % CI :
[ §; -

c- + C- Uniforme :{tatb ) var :


¥16 -

a)
where tn
✗[
✗ 12
C
percentile of distrib
:
1- k 2K
E :k
i
var
-

:
-

If Ho :p , O and O is not in CI rejet


-

Ho stat
significant
-

Poisson E :b Var d
Bernoulli E :P Var :

pls pt
-
-

pq
"
Generic faim forte moment et
Ufa b)
IË ai 6k
- i ,

mk

m ,
=
{ (atb )
Mz
=

§ (à tab +62 )
tab 63 )
'

Ê(
> tab +
m> =
a

)
?
(oîtâbtàb tabs
"

ma + 6
Power of a test

Example 's

5=22 n=8

0=10
✗= 0.05

Ho :
µ -30
-

HA µ 30 Ha µ -25
: < : -

if Ha :
µ
> 30
1) Hoist rue critica value kx : :
=

flot ZM En =
30-1.645 =
24.20
≤ -0.22 )
-

1- plz
2h20 25 and use -21
/ -0.2261=0.409

-
-

2) under Ha :

10158
=
-0.226 P 2- ≤

Example Effect on Heatment : Ai lpie) Bi (post ) Di Bi


-
-
-

Ai niid N( µ ,
02 )
onesided t test Ho :µ☐=0 (no effect ) H tes > 0 (
positive effet )
:

2=0.05
-
:
,

In O
NN / ) In f- ÊDI Â / Tn
st.ovr.jnlarge-sweappwx.bz
normal
-

µ >, À
:

Tn where =
=

qqn i. |

Tn 1.64 -20.95
Rejection region : > =

Now : suppose Hais true ,


and µ >, =D .
Then ,
the
power
is

Blot Pua ( Tn. > 1.64)

=P Hal > 1.64


)
=

Pna ( In -0+0>1.64)
ÔD / Tn

Pmf 'ËË > 1.64 -

Ë) horloge n' -

Ê• NNIO 1) ,

BIO) ≈ 1- ∅ / 1.64 q%n )-

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy