Unit-17 IGNOU STATISTICS
Unit-17 IGNOU STATISTICS
Structure
17.1 Introduction
Objectives
17.2 Some Concepts
17.3 Neyman-Pearson Lemma
17.4 Likelihood-ratio Tests
17.5 Summary
17.6 Solutions and Answers
In Unit 15,we introduced some basic notions about testing of hypothesis. There we
described some concepts and definitions useful in testing of hypothesis problems: In
this unit, we shall discuss the problem of testing of hypothesis in greater detail. To
begin with, we shall introduce some concepts and definitions. Next, we shall
describe, an important result (Neyman-Pearson Lemma) for constructing critical
regions for testing a simple hypothesis against a simple alternative. We also discuss
the likelihood ratio test. The usage of these two procedures of testing are illustrated.
Objeetives
After reading this unit, you should be able to:
is called a test of I-&-, against H1 and C is called a critical region of the test. The
statistic used in the specification of C is called a test statistic. In such a test .
procedure, one might commit two types of error. The probability of type 1 error is
a ( 8 ) = Pe (Reject &),when 8 E Q,
Elements ot Statistlcal and probability of type I1 error is
Inference
- p0 ( c ) when 8.ESal
- -
Tbe function y ( 8 ) 1 $ ( 8 ) as a function of 8 Is called the power h c t i o n
of the test. In the construction of a test-procedure,we fix the probability of the
Type 1error to a desired small level and choose one for which y ( 8 )'is maximum
(or, equivalently, $ ( 8 ) is minimum). Thus, given 0 s a s 1,our interest is thed to
construct a test procedure for which
-
- -
-
8 E GI, ). A test of null hypothesis & 8 E % against H1 8 E Qlis said to -
and y ( 8 ) 1 $ ( 8 ) is as large as possible (or $ ( 8 )) is as small as possible,
Sup a (8)
eeQ0
- a. A
If Sup a (8)
e E 0,
- a, then the level of significance and size of critical region, C, both
equal a.On the other hand, if Sup a (8) < a then the size of aitical region C is
eE %
smaller than its level of significance a. If & is a simple hypothesis, then it is clear
that a (8), 8 E Q, is the size of the aitical region C, which may or may not equal a
given significance level a.
We now define a criterion for selecting a test statistic for testing I-&, : 8 E Q, against
-
H1: 8 E Q1, if H1is a composite hypothesis. A test with critical region of size .
a Sup a (8) is said to be Uniformly Most PowerhI of size a of testing & if it
e E Qo
has the maxhum power among all critical regions C of the same size. In other
words, C,is the best (Uniformly Most Powerful) if for all tests C with size a
(which is the size of q),
the inequality Pe ( C,) r Pe (C)
Uniformly most powerful tests do not exist for many hypothesis testing problems.
Even when they do exist, they are often not easy to find. We now describe a test
procedure (equivalently, obtain a critical region ) for testing a simple hypothesis
against a simple alternative. In this case, the power function, y (8) reduces to a ~Gtlng
ofHypothe#
single number, so that the "uniformly", in uniformly most powerful, becomes
redundant, and we examine the question of the existence of a most powerfbl test of
given significance level a. \
S f i "X
C
- ai (given), i - l,2,. .. ...(1)
.
Further, let there exist constants kl, h, . . such that for the region within which
.
f, z k,f, + kf, + . . . .outside which fo < k, f, + k2 f, + . .. . . ,
We now describe the application of Lemma 1to the problem of testing of simple
hypothesis against a simple alternative.
For a fixed positive integer n, let XI,. . . . .X,, denote a random sample from a
.
density f ( 5 0 ). Let XI,. . . . X,, denote the observed sample. Then the density of
-
X - (XI,. . ... &) is Pe - n
0): Let Peo( 5 ) and Pel ( 5 ) be the densities of
ll f (Xj,
j-1
g under Hoand H1respectively. The problem is that of determining a critical region .
C, such that
and
provided there exists a k such that (5) is satisfied. The test can thus be written as
and
-42 & - b y ]
a kexp
I 20 i-1
Case I
Let p, > p,,. Then the critical region is
P,{X. kl) - a
where Z is distributed as N (0, 1). Therefore choose k, so that
c, - {xlTI>yp+o/\/;;za]
Xdh
where k2is choosen such that
where - Za
distribution.
- Z1 -,is the lower a probability point of the standard normal
.
It may be seen that the test 2 k, ( s 4 ) is uniformly most powerful for the
class of alternatives p1 (pi c h)because k, ( k2 ) is independent of y But
there is no uniformly most powerful test for the entire class of alternatives p1 rr h.
- [
1-P Z r -
k1- P1
0/6 ]
+
where (.) is the distribution function of a standard normal distribution.
thenull hypothesis in the light of the observation X, we compare the highest value
.PI of he likelihood under &, i.e. @
b
i
B Thecritical region of the size-a likelihood ration test of Hg against HIis
:tf
r
whye & is determined by the condition
gr
E)
een,
SUP P ~ ( x I * ( x ) < ~ Ja -
Thdcritical value of the ratio is determined by consideration of the size of the test.
i
t. E
dis'ibution of A is continuous as, then any size a is attainable. If, however, the
f
6 di8;ibution of h is discrete it is difficult to find a likelihood ratio test whose size is
ex@ eqdtQ a,It k,however, possible to obtain a likelihoodratio test of size
by nnh& , d u r n which we shall not discusshere. We may
als&oose the largest C sudr that
, Elements of Stetbtkd Example 2: We consider here the problem of testing Ho : p = against all its
Inference 2
alternativca in sampling from N (p, o '1, where both p and o are unknown. In this
C8Se
Q,- {(p,,,02);02>0]and
n- {(h,C,o
2
);-m<p~a,o >O
I.
We shall write = (p, 02).
Under -
G2 -n1
n
E(X~-~)~.
i-1
Thus, Sup L @ 1 J
X =
0€Q,
Now I
Thus
! The'likelihoodratio rejects & if
n
Since A (X)is a decreasing function of II ( X - h )2/2 - )2,we reject H,, if
(Xi
that is, if
statistic t ( ) =
fi(z-b)
i S
has a Student's t distribution with (n - 1)d.f under H,, : ~r = h,but under
H1: p # h,t ( - X ) has a non-central t-distribution with (n - 1)d.f. and
non-centrality parameter 8 = (v- %)/a. Thus the critical region is
Let ,,
=. - a/;! in accordance with the distribution oft ( X ) under H,,. Thus the
two sided test obtained here is
Suppose we now consider the.problemof testing I-&-, :p = bagainst the class of
alternatives H1: u = p1> b.I6 this case
and
n
-
- 2 (xi-2l.
Sup L@(xJ
e E no
- Sup
P>I+, a(=)"
1
exP - i
2a
A.
Thus
4
Sup L(e(xJ =
0EP (Wn, ($,x,-,,r
1
e-* if
Thus
._
X
Hence we consider those x for which r %. Proceeding on the same lines as for Twtlng d I i y p o t h e ~
the set of alternatives HI: p h,we get the test as
X
In the preceeding illustrations, A (X) was a simple function of and s2whose
distribution is known. In general, however, there is no guarantee that some such
nice relationship to a familiar variable will exist. Then we must use whatever tools
available to find the distribution of A. Fortunately, for large samples there is a good
approximation to the distribution of A which eliminates the necessity for finding the
distribution of A in situations where this is difficult to find. Under certain regularity
2
conditions, the random variable - 2 log, A has an asymptotic x -distribution.The
degrees of freedom equals the number of unknown parameters under S2 minus the
number of unknown parameters under q.
E3) Let XI, . . . ,X,,be a random sample from the Bernoulli distribution with
parameter p, 0 s p s 1. Construct a level a likelihood ratio test of I-&, : p s po
against HI : p > po.
17.5 SUMMARY
In this unit we have
1. briefly introduced the problem of testing of hypothesis,
2. discussed the Neyman-Pearson Lemma for testing a simple hypothesis against
a simple alternative,
3. described the likelihood ratio test for testing hypothesis
and
Poo(X) - 1
- exp
80
[ - X1/eo 1.
Using Neyman-Pearson Lrnrna, the best critical region is obtained as
XI (8, - 80) 2 kl
Ekmeatr of Statbtical CASE I: Let el > €lo
Inference
The test is
\
X1 b
where kZ is to be determined such that
1
that is, -
OD
$ exp (-xl/eo)
li
dx, - a
or exp (- k2/e0) - a
= 4 - 1% (i/aT4
The critical region is thus
- { X lx1 a log ( l / a ) l }
The power of the test is
- -1
81
m
Case I1
Let 8, < 8,
The test is
s k3
where k,is to determined such that
Pe0(X, 5 4) -
that is
, 1
-
14
$ cxp (- xl/$)
0
dxl - a
- 1- exp (- k3/e1)
S
i E2) Since both the densities (under I-&, and HI)are campletely specified, it is a case
of testing a simple hypothesis against a simple alternative. Using
Neyman-Pearson Lemma, the test is obtained as
Let
I - I
It is clearly seen that T (x) is an increasing function
T (x) r k if and only if I X I 1 r kt. It
form
which means that if either a very large or a very small value of X is observed,
then we suspect that H,is true and I-& is false. The size of the test is
Now
sup pr(l-p)n-r
Ospsl
- ($ (I-;)
8 -I
Also
r r r
The maximum likelihood estimate of p is po if po < - and is - if.po z -.
n' n n
Thus,
Sup pr(l-p)8*r = p ~ ( ~ - p ~ ) ~
if - po<r/n
'
PSPo
k
Thus for a preassigned a, 0 < a < 1, choose c' so that
I Since r has a discrete distribution no c' may exist for which we get the exact
probability a. In this case, choose c' such that
4
and