Probability Essentials (Jacod J., Protter P.)
Jean Jacod
Université de Paris VI
Laboratoire de Probabilités
4, place Jussieu - Tour 56
75252 Paris Cedex 05, France
e-mail: jj@ccr.jussieu.fr

Philip Protter
School of Operations Research and Industrial Engineering
Cornell University
219 Rhodes Hall
Ithaca, NY 14853, USA
e-mail: protter@orie.cornell.edu

Sketch of Carl Friedrich Gauß (by J. B. Listing; Nachlass Gauß, Posth. 26) by kind permission of Universitätsbibliothek Göttingen. Photograph of Paul Lévy by kind permission of Jean-Claude Lévy, Denise Piron, and Marie-Hélène Schwartz. Photograph of Andrei N. Kolmogorov by kind permission of Albert N. Shiryaev.

Mathematics Subject Classification (2000): 60-01, 60E05, 60E10, 60G42

Cataloging-in-Publication Data applied for.
Bibliographic information published by Die Deutsche Bibliothek. Die Deutsche Bibliothek lists this publication in the Deutsche Nationalbibliografie; detailed bibliographic data is available in the Internet.

ISBN 3-540-43871-8 2nd Edition Springer-Verlag Berlin Heidelberg New York
ISBN 3-540-66419-X 1st Edition Springer-Verlag Berlin Heidelberg New York

This work is subject to copyright. All rights are reserved, whether the whole or part of the material is concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microfilms or in any other way, and storage in data banks. Duplication of this publication or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 1965, in its current version, and permission for use must always be obtained from Springer-Verlag. Violations are liable for prosecution under the German Copyright Law.

Springer-Verlag Berlin Heidelberg New York, a member of BertelsmannSpringer Science+Business Media GmbH. http://www.springer.de

© Springer-Verlag Berlin Heidelberg 2000, 2003
Printed in Italy

Typesetting: Camera-ready copy from the author using a Springer TeX macro package
Cover design: design & production GmbH, Heidelberg
Printed on acid-free paper
SPIN: 10884210 41/3142LK - 5 4 3 2 1 0

To Diane and Sylvie
and
To Rachel, Margot, Olivier, Serge, Thomas, Vincent and Martin

Preface to the Second Edition

We have made small changes throughout the book, including the exercises, and we have tried to correct if not all, then at least most of the typos. We wish to thank the many colleagues and students who have commented constructively on the book since its publication two years ago, and in particular Professors Valentin Petrov, Esko Valkeila, Volker Priebe, and Frank Knight.

Jean Jacod, Paris
Philip Protter, Ithaca
March, 2002

Preface to the First Edition

We present here a one semester course on Probability Theory. We also treat measure theory and Lebesgue integration, concentrating on those aspects which are especially germane to the study of Probability Theory. The book is intended to fill a current need: there are mathematically sophisticated students and researchers (especially in Engineering, Economics, and Statistics) who need a proper grounding in Probability in order to pursue their primary interests. Many Probability texts available today are celebrations of Probability Theory, containing treatments of fascinating topics to be sure, but nevertheless they make it difficult to construct a lean one semester course that covers (what we believe) are the essential topics. Chapters 1-23 provide such a course. We have indulged ourselves a bit by including Chapters 24-28 which are highly optional,
but which may prove useful to Economists and Electrical Engineers. “‘\This book had its origins in a course the second author gave in Perugia. Italy in 1997; he used the samizdat “notes” of the first author. long used for courses at the University of Paris VI, augmenting them as needed, The result has been further tested at courses given at Purdue University. We thank the indulgence and patience of the students both in Perugia and in West Lafayette. We also thank our editor Catriona Byrne, as well as Nick Bingham for many superb suggestions, an anonymous referee for the same. and Judy Mitchell for her extraordinary typing skills. Jean Jacod, Paris Philip Protter, West Lafayette Table of Contents 1. Introduction .. 1 2. Axioms of Probability ...........0. 0.0 ccc eee cee eee eee eee 7 3. Conditional Probability and Independence. . 15 4, Probabilities on a Finite or Countable Space.............. 21 5. Random Variables on a Countable Space ... 27 6. Construction of a Probability Measure.............-.....5 35 7. Construction of a Probability Measure on R... 39 8. Random Variables ..........6 6.00. c cece teen ete tenes 47 9. Integration with Respect to a Probability Measure ....... 51 10. Independent Random Variables...................0000e0 ee 65 11. Probability Distributions on R............... 00. cece eee 77 12. Probability Distributions on R® ............66. 2... e eee eee 87 13. Characteristic Functions .......... 0.00. .0 00 eee eeeeee eee ee 103 14. Properties of Characteristic Functions ............-......+ 111 15. Sums of Independent Random Variables .................- 117 16. Gaussian Random Variables (The Normal and the Multivariate Normal Distributions) ... - 125 17. Convergence of Random Variables ................+0..5005 141 18. Weak Convergence .. x Table of Contents 19. Weak Convergence and Characteristic Functions... 20. The Laws of Large Numbers................0 00 cee e eves 173 21. The Central Limit Theorem ...................0.. eee eee 181 22. L? and Hilbert Spaces ...................... 022 e cece eee 189 23. Conditional Expectation ................ 0.0000 eee eee 197 24. Martingales....... 0.0... ccc cence etnies 211 25. Supermartingales and Submartingales ............-...--++ 219 26. Martingale Inequalities ............. 0.5.06 cece eee eee 223 27. Martingale Convergence Theorems ............---....++5+ 229 28. The Radon-Nikodym Theorem.................6-.00e eee 243, References ..... 6.6.66. cece tenet t tee tee eens 249 1. Introduction Almost everyone these days is familiar with the concept of Probability. Each day we are told the probability that it will rain the next day; frequently we discuss the probabilities of winning a lottery or surviving the crash of an air- plane. The insurance industry calculates (for example) the probability that. a man or woman will live past his or her eightieth birthday, given he or she is 22 years old and applying for life insurance. Probability is used in business too: for example, when deciding to build a waiting area in a restaurant, one wants to calculate the probability of needing space for more than n people each day; a bank wants to calculate the probability a loan will be repaid; a manufacturer wants to calculate the probable demand for his product in the future. In medicine a doctor needs to calculate the probability of success of various alternative remedies; drug companies calculate the probability of harmful side effects of drugs. 
An example that has recently achieved spec- tacular success is the use of Probability in Economics, and in particular in Stochastic Finance Theory. Here interest rates and security prices (such as stocks, bonds, currency exchanges) are modelled as varying randomly over time but subject to specific probability laws; one is then able to provide in- surance products (for example) to investors by using these models. One could go on with such a list. Probability theory is ubiquitous in modern society and in science. Probability theory is a reasonably old subject. Published references on games of chance (i.e., gambling) date to J. Cardan (1501-1576) with his book De Ludo Alae {4]. Probability also appears in the work of Kepler (1571-1630) and of Galileo (1564-1642). However historians seem to agree that the subject really began with the work of Pascal (1623-1662) and of Fermat (1601-1665). The two exchanged letters solving gambling “paradoxes” posed to them by the aristocrat de Méré. Later the Dutch mathematician Christian Huygens (1629-1695) wrote an influential book [13] elaborating on the ideas of Pascal and Fermat. Finally in 1685 it was Jacques Bernoulli (1654-1705) who pro- posed such interesting probability problems (in the “Journal des Scavans”) (see also [3]) that it was necessary to develop a serious theory to answer them. After the work of J. Bernoulli and his contemporary A. De Moivre (1667-1754) [6], many renowned mathematicians of the day worked on prob- ability problems, including Daniel Bernoulli (1700-1782), Euler (1707-1803), 2 1. Introduction Gauss (1777-1855), and Laplace (1749-1827). For a nice history of Probabil- ity before 1827 (the year of the death of Laplace) one can consult [21]. In the twentieth century it was Kolmogorov (1903-1987) who saw the connection between the ideas of Borel and Lebesgue and probability theory and he gave probability theory its rigorous measure theory basis. After the fundamental work of Kolmogorov. the French mathematician Paul Lévy (1886-1971) set the tone for modern Probability with his seminal work on Stochastic Pro- cesses as well as characteristic functions and limit theorems, We think of Probability Theory as a mathematical model of chance. or random events. The idea is to start with a few basic principles about how the laws of chance behave. These should be sufficiently simple that one can believe them readily to correspond to nature. Once these few principles are accepted, we then deduce a mathematical theory to guide us in more com- plicated situations. This is the goal of this book. We now describe the approach of this book. First we cover the bare essen- tials of discrete probability in order to establish the basic ideas concerning probability measures and conditional probability. We next consider proba- bilities on countable spaces, where it is easy and intuitive to fix the ideas. We then extend the ideas to general measures and of course probability mea- sures on the real numbers. This represents Chapters 2-7. Random variables are handled analogously: first on countable spaces and then in general. In- tegration is established as the expectation of random variables, and later the connection to Lebesgue integration is clarified. This brings us through Chapter 12. Chapters 13 through 21 are devoted to the study of limit theorems, the central feature of classical probability and statistics. We give a detailed treat- ment of Gaussian random variables and transformations of random variables. as well as weak convergence. 
Conditional expectation is not presented via the Radon-Nikodym theo- rem and the Hahn-Jordan decomposition. but rather we use Hilbert Space projections. This allows a rapid approach to the theory, To this end we cover the necessities of Hilbert space theory in Chapter 22: we nevertheless extend the concept of conditional expectation beyond the Hilbert space setting to include integrable randoin variables. This is done in Chapter 23. Last, in Chapters 24-28 we give a beginning taste of martingales, with an applica- tion to the Radon—-Nikodym Theorem. These last five chapters are not really needed for a course on the “essentials of probability”. We include them how- ever because many sophisticated applications of probability use martingales; also martingales serve as a nice introduction to the subject of stochastic pro- cesses. We have written the book independent of the exercises. That is, the im- portant material is in the text itself and not in the exercises. The exercises provide an opportunity to absorb the material by working with the subject. Starred exercises are suspected to be harder than the others. 1. Introduction 3 We wish to acknowledge that Allan Gut’s book [11] was useful in providing exercises, and part of our treatment of martingales was influenced by the delightful introduction to the book of Richard Bass [1}. No probability background is assumed. The reader should have a good knowledge of (advanced) calculus. some linear algebra, and also “mathemat- ical sophistication”. Random Experiments Random experiments are experiments whose output cannot be surely pre- dicted in advance. But when one repeats the same experiment a large number of times one can observe some “regularity” in the average output. A typical example is the toss of a coin: one cannot predict the result of a single toss, but if we toss the coin many times we get an average of about 50% of “heads” if the coin is fair. The theory of probability aims towards a mathematical theory which describes such phenomena. This theory contains three main ingredients: a) The state space: this is the set of all possible outcomes of the experiment, and it is usually denoted by 92. Examples: 1) A toss of a coin: 2 = {h.t}. 2) Two successive tosses of a coin: 2 = {hh.tt-ht,th}. 3) A toss of two dice: 2 = {(i,j):11 be a sequence of rationals decreasing to a and (bp )n>1 be a sequence of rationals increasing strictly to b. Then (a,b) = U%4 (an, Dn] = Una ((— 08, bn] 1 (90, an]®) Therefore C C o(D), whence o(C) C o(D). However since each element of D is a closed set, it is also a Borel set, and therefore o(D) is contained in the Borel sets B. Thus we have B=o(C)Co(D) cB, and hence o(D) = B. Qo On the state space 2 the family of all events will always be a o-algebra A: the axioms (1), (2) and (3) correspond to the “logical” operations described in Chapter 1, while Axiom (4) is necessary for mathematical reasons. The probability itself is described below: Definition 2.3. A probability measure defined on a a-algebra A of 2 is a function P : A— [0,1] that satisfies: 1. P(Q)=1 2. For every countable sequence (An )n>1 of elements of A, pairwise disjoint (that is, A, NAm = whenever n 4m), one has P(U® An) = So P(An)- n=1 Axiom (2) above is called countable additivity; the number P(A) is called the probability of the event A. In Definition 2.3 one might imagine a more naive condition than (2). namely: A,BeA, ANB=0 + P(AUB)=P(A)+P(B). (2.1) 2. 
Axioms of Probability 9 This property is called additivity (or “finite additivity”) and, by an elemen- tary induction , it implies that for every finite A;,...Am of pairwise disjoint events A; € A we have P(UnL1An) = D> P(An)- n=1 Theorem 2.2. If P is a probability measure on (92..A), then: (i) We have P(0) = (ii) P is additive. Proof. If in Axiom (2) we take A, = @ for all n, we see that the number a = P(@) is equal to an infinite sum of itself; since 0 < a < 1, this is possible only if a = 0, and we have (i). For (ii) it suffices to apply Axiom (2) with A; = A and Aj = Band Ag = Ay =... = 0, plus the fact that P(@) = 0, to obtain (2.1). o Conversely, countable additivity is not implied by additivity. In fact, in spite of its intuitive appeal, additivity is not enough to handle the mathe- matical problems of the theory, even in such a simple example as tossing a coin, as we shall see later. The next theorem (Theorem 2.3) shows exactly what is extra when we assume countable additivity instead of just finite additivity. Before stating this theorem, and to see that the last four conditions in it are meaningful, let us mention the following immediate consequence of Definition 2.3: A,CeA, ACC = P(A)< P(C) (take B = A°NC, hence AN B = @ and AU B = C, and apply (2.1)). Theorem 2.3. Let A be a o-algebra. Suppose that P : A — [0,1] satisfies (1) and is additive. Then the following are equivalent: (i) Aaiom (2) of Definition (2.3). (ii) If An € A and Ay, | 0, then P(An) | 0. (iii) If An € A and An | A, then P(An) | P(A). (iv) If An € A and An t 2, then P(An) 11. (v) If An € A and An | A, then P(An) t P(A). Proof. The notation A, | A means that Aj4) C An, each n, and N22, An = A. The notation A, f A means that Ay C Any; and U2 )An = Note that if A, | A, then AS 1 A‘, and by the finite additivity axiom P(AS) =1- P(Ay). Therefore (i) is equivalent to (iv) and similarly (iii) is equivalent to (v). Moreover by choosing A to be 2 we have that (v) implies (iv). Suppose now that we have (iv). Let A, € A with A, 7 A. Set B, A, UA®. Then B,, increases to (2, hence P(B,) increases to 1. Since A, C A we have A, A° = 0, whence P(A, U A‘) = P(A,) + P(A®). Thus 10 2. Axioms of Probability 1 = lim P(B,) = lim {P(An) + P(A}. whence lity: P(A,) = 1 — P(A‘) = P(A). and we have (v). It remains to show that (i) is equivalent to (v). Suppose we have (v). Let An € A be pairwise disjoint: that is. ifn # m. then A,QAyn = 0. Define B, = UrepenAp and B = UX.;A,. Then by the definition of a Probability Measure we have P(B,) = Soy P(Ap) which increases with n to D*, P(An). and also P(B,,) increases to P(B) by (v). We deduce lim,— P(B,) = P(B) and we haye x P(B) = P(UL1 An) = SO P(An) n=1 and thus we have (i). Finally assume we have (i). and we wish to establish (v). Let A, € A. with A, increasing to A, We construct a new sequence as follows: B, =A. Bz = Ag\ Ay = A2 (Aj). By = An\ Ant. Then UX, B, = A and the events (B,),>1 are pairwise disjoint. Therefore by (i) we have n P(A) = lim SY P(B,). p=l But also 37’_, P(Bp) = P(An). whence we deduce lim, P(An) = P(A) and we have (vy). Oo If A€ 2%. we define the indicator function by lifweA. Lae) = {0 ifw¢ A. We often do not explicitly write the w. and just write 1,4. We can say that A, € A converges to A (we write A, — A) if limp se La, (w) = la(w) for all w € 2. Note that if the sequence A, in- creases (resp. decreases) to A. then it also tends to A in the above sense. Theorem 2.4. Let P be a probability measure, and let An be a sequence of events in A which converges to A. 
Then A€ A and linn P(An) = P(A). Proof. Let us define \ lim sup Ap = Ny Um>n Am nC oi liminf A, = US; Amon Am- 150 2, Axioms of Probability 1 Since A is a o-algebra. we have limsup,_.., An € A and lim inf,.4, An € A (see Exercise 2.4). By hypothesis A, converges to A. which means lim, la, = 1a. all w. This is equivalent to saying that A = limsup, An = lim infty. An. Therefore A € A. Now let Bh = Om>nAm and Cy = Un>nAm- Then B, increases to A and C;, decreases to A, thus lima sx P(Bn) = limn—oc P(C,) = P(A), by Theorem 2.3. However B, C An C Cy. therefore P(Bn) < P(An) < P(C),). so lim, +5 P(An) = P(A) as well. q 12 2. Axioms of Probability Exercises for Chapter 2 2.1 Let 2 be a finite set. Show that the set of all subsets of 2, 2%, is also finite and that it is a o-algebra. 2.2 Let (Ga)aca be an arbitrary family of c-algebras defined on an abstract space 2. Show that H = NacaGa is also a o-algebra. 2.3 Let (A,)n>1 be a sequence of sets. Show that (De Morgan’s Laws) a) (Um An)® = Me An b) (M1 An)® = Urea An 2.4 Let A be a o-algebra and (A,)n>1 a sequence of events in A. Show that liminf A, €.A; limsupA, €A; and liminf A, C limsup An. n—00 noo nav00 n30 2.5 Let (An)no1 be a sequence of sets. Show that limsup 1a, ~ liminf 14, = 1 fimsup, An\limint, An} n—00 n-00 (where A\ B = A Be whenever BC A). 2.6 Let A be a o-algebra of subsets of (2 and let B € A. Show that F = {AN B: A€é A} is a o-algebra of subsets of B. Is it still true when B is a subset of §2 that does not belong to A ? 2.7 Let f be a function mapping 2 to another space E with a o-algebra €. Let A = {Ac 2: there exists B € € with A = f~'(B)}. Show that A is a o-algebra on 22. 2.8 Let f : R > R be a continuous function, and let A = {A C R: there exists B € B with A= f~1(B)} where B are the Borel subsets of the range space R. Show that A C B, the Borel subsets of the domain space R. For problems 2.9-2.15 we assume a fixed abstract space 2, a o-algebra A, and a Probability P defined on (Q,.A). The sets A, B, A;, etc... always belong to A. 2.9 For A,B € A with AN B = 0, show P(AUB) = P(A) + P(B). 2.10 For A, B € A, show P(AU B) = P(A) + P(B) ~ P(ANB). 2.11 For A € A, show P(A) = 1- P(A®). 2.12 For A,B € A, show P(ANB°) = P(A) — P(ANB). Exercises 13 2.13 Let Ay.....4 A, be given events. Show that P (UL, Ay) = SO P(Ad — SO P(A Aj) + SE P(A;N.A; Ag) = (=D P(ALN AD... An) i SPA) — PAY), i=l i 0. The conditional probability of A given B is P(A| B) = P(ANB)/P(B). Theorem 3.2. Suppose P(B) > 0. (a) A and B are independent if and only if P(A| B) = P(A). (b) The operation A — P(A| B) from A — [0,1] defines a new probability measure on A, called the “conditional probability measure given B”. Proof. We have already established (a) in the discussion preceding the the- orem. For (b), define Q(A) = P(A | B), with B fixed. We must show Q satisfies (1) and (2) of Definition 2.3, But P(QNB) P(B) _ Q(2) = P(2| B) = ER = BB = Therefore, Q satisfies (1). As for (2), note that if (Ap)n>1 is a sequence of elements of A which are pairwise disjoint, then PUUR An) OB) — P(UR (ALO B)) Q (Urdu) = P (Uf Ay | B) = Se 0) and also the sequence (A, 7 B)n>1 is pairwise disjoint as well; thus -> ee Yo P(An |B) = 3 QAn). i n=1 3. Conditional Probability and Independence 17 The next theorem connects independence with conditional probability for a finite number of events. Theorem 3.3. [f A,,..., A, € A and if P(A, N...N An-1) > 0, then P(A) MN... An) = P(A;)P(Ag | Ay) P(A3 | A, Ag) + P(Ap | APO... An-2). Proof. We use induction. 
For n = 2, the theorem is simply Definition 3.2. Suppose the theorem holds for n— 1 events. Let B= A, M...9 An—y. Then by Definition 3.2 P(BNA,) = P(A, | B)P(B); next we replace P(B) by its value given in the inductive hypothesis: P(B) = P(Ay)P(Ap | At)... P(An—-1 | Ar... Ana), and we get the result. a A collection of events (E,) is called a partition of 92 if E, € A, each n, they are pairwise disjoint, P(E,,) > 0, each n, and U,E, = 22. Theorem 3.4 (Partition Equation). Let (E,)n>, be a finite or countable partition of 2. Then if AG A, P(A) = YO P(A| En)P(En). Proof. Note that A=AN2= AN (UnEn) = Un(AN E,). Since the E,, are pairwise disjoint so also are (AN En)n>1, hence P(A) = P(Un(AN En) = 3) P(AN En) = ¥° P(A | En)P(En)- n n o Theorem 3.5 (Bayes’ Theorem). Let (E,) be a finite or countable parti- tion of 2, and suppose P(A) > 0. Then P(A| En)P(En) Sn P(A| Em)P( Em)” Proof. By Theorem 3.4 we have that the denominator YO PUA | Em) P(Em) = P(A). P(En | A) = Therefore the formula becomes P(A| E,)P(En) _— P(AN En) _ Pay Bay Pn | AD. Oo Bayes’ theorem is quite simple but it has profound consequences both in Probability and Statistics. See, for example, Exercise 3.6. 18 3. Conditional Probability and Independence Exercises for Chapter 3 In all exercises the probability space is fixed. and A. B, An, etc... are events. 3.1 Show that if AN B = 0. then A and B cannot be independent unless P(A) =0 or P(B) = 0. 3.2 Let P(C) > 0. Show that P(AUB | C) = P(A| C)+P(B | C)—P(AnB | C). 3.3 Suppose P(C) > 0 and Aj..... A, are all pairwise disjoint. Show that n P(UR,Ai |C) = 9 P(A: | C). 3.4 Let P(B) > 0. Show that P(ANB) = P(A| B)P(B). 3.5 Let 0 < P(B) <1 and A be any event. Show P(A) = P(A| B)P(B) + P(A| B°)P(B’). 3.6 Donated blood is screened for AIDS. Suppose the test has 99% accuracy, and that one in ten thousand people in your age group are HIV positive. The test has a 5% false positive rating. as well. Suppose the test screens you as positive. What is the probability you have AIDS? Is it 99%? (Hint: 99% refers to P (test positive|you have AIDS). You want to find P (you have AIDS|test is positive). 3.7 Let (An)n>i € A and (By)n>1 € Aand A, — A (see before Theorem 2.4 for the definition of A, — A) and B, — B, with P(B) > 0 and P(B,) > 0. all n. Show that a) limp P(An |B) = P(A| B). b) limp x P(A | Bp) = P(A| B). ¢) lima +x P(An | Ba) = P(A| B). 3.8 Suppose we model tossing a coin with two outcomes, H and T, repre- senting Heads and Tails. Let P(H) = P(T) = 3. Suppose now we toss two such coins, so that the sample space of outcoines {2 consists of four points: HH, HT, TH, TT. We assume that the tosses are independent. a) Find the conditional probability that both coins show a head given that the first shows a head (answer: 3). b) Find the conditional probability that both coins show heads given that at least one of them is a head (answer: 4). 3.9 Suppose A, B, C are independent events and P(AN B) ¥ 0. Show P(C| ANB) = P(C). Exercises 19 3.10 A box has r red and b black balls. A ball is chosen at random from the box (so that each ball is equally likely to be chosen). and then a second ball is drawn at random from the remaining balls in the box. Find the probabilities that é 2 nr) a) Both balls are red [Ans.: ==] b) The first ball is red and the second js black [Ans. apts 3.11 (Polya’s Urn) An urn contains r red balls and b blue balls. A ball is chosen at random from the urn. its color is noted, and it is returned together with d more balls of the same color. This is repeated indefinitely. 
What is the probability that a) The second ball drawn is blue? [Ans. its oF b) The first ball drawn js blue given that the second ball drawn is blue? s.: _btd [Ans.: 45 3.12 Consider the framework of Exercise 3.11. Let B,, denote the event that the nth ball drawn is blue. Show that P(B,) = P(B,) for all n > 1. 3.13 Consider the framework of Exercise 3.11. Find the probability that the first. ball is blue given that the n subsequent drawn balls are all blue. Find the limit of this probability as n tends to oo. [Ans.: -2#24); limit is 1| 3.14 An insurance company insures an equal number of male and female drivers. In any given year the probability that a male driver has an accident involving a claim is a, independently of other years. The analogous prob- ability for females is 9. Assume the insurance company selects a driver at random. a) What is the probability the selected driver will make a claim this year? 5: ate Ans.: 93" b) What is the probability the selected driver makes a claim in two consec- utive years? [Ans | 3.15 Consider the framework of Exercise 3.14 and let A;, Ag be the events that a randomly chosen driver makes a claim in each of the first and second years, respectively. Show that P(Az | Ai) > P(A;). [Ans. P(Ag | Ai) — P(At) = se 2a+3) 3.16 Consider the framework of Exercise 3.14 and find the probability that a claimant js female. [Ans.: =25] 3.17 Let Aj, Ao.....An be independent events. Show that the probability that none of the Ay...., An occur is less than or equal to exp(— 2”_, P(A,))- 20 3. Conditional Probability and Independence 3.18 Let A, B be events with P(A) > 0. Show P(ANB| AUB) < P(ANB| A). 4. Probabilities on a Finite or Countable Space For Chapter 4, we assume 2 is finite or countable, and we take the o-algebra A = 2 (the class of all subsets of 2). Theorem 4.1. (a) A probability on the finite or countable set 2 is charac- terized by its values on the atoms: p., = P({w}), w € 2. (b) Let (pu)wee be a family of real numbers indexed by the finite or count- able set 2. Then there exists a unique probability P such that P({w}) = pw if and only if p> 0 and Yeo Pw = 1. When £2 is countably infinite, >, p.. is the sum of an infinite number of terms which a priori are not ordered: although it is possible to enumerate the points of §2, such an enumeration is in fact arbitrary. So we do not have a proper series, but rather a “summable family”. In the Appendix to this chapter we gather some useful facts on summable families. Proof. Let A € A; then A = Usea{w}, a finite or countable union of pairwise disjoint singletons. If P is a probability, countable additivity yields P(A) = P (Useatw}) = YO P({w}) = YO vw. wea weA Therefore we have (a). For (b). note that if P({w}) = p,, then by definition p,, > 0, and also 1 = P(2) = P Useo{w}) = Yo Pw) = Yo rw. wen wen For the converse, if the p, satisfy p, > 0 and )>¢¢ Py = 1, then we define a probability P by P(A) = Yo,¢4 Pw, With the convention that an “empty” sum equals 0. Then P(@) = 0 and P(22) = cq Pw = 1. For countable additivity, it is trivial when 2 is finite; when 22 is countable it follows from the fact that one has the following associativity: 7je7 Duea, Pu = Lweuies A, Po if the Aj are pairwise disjoint. o Suppose first that §2 is finite. Any family of nonnegative terms summing up to 1 gives an example of a probability on (2. But among all these examples the following is particularly important: 22 4. Probabilities on a Finite or Countable Space Definition 4.1. 
A probability P on the finite set 2 is called uniform if p, = P({w}) does not depend on x. In this case. it is immediate that = #4 #Q) Then computing the probability of any event A amounts to counting the number of points in A. On a given finite set 2 there is one and only one uniform probability. We now give two examples which are important for applications. P(A) a) The Hypergeometric distribution. An urn contains N white balls and AM black balls. One draws n balls without replacement, so n < N + Mf. One gets X white balls and n — X black balls. One is looking for the probability that X = 2, where « is an arbitrary fixed integer. Since we draw the balls without replacement, we can as well suppose that the n balls are drawn at once. So it becomes natural to consider that an outcome is a subset with n elements of the set {1.2..... N+M} of all N+.M balls (which can be assumed to numbered from 1 to N +1). That is, (2 is the family of all subsets with n points, and the total number of possible outcomes is #(2) = (wE™) = wees: recall that for p and q two integers with p 0 is the probability P defined on N by 7 ast, nl 2. The Geometric distribution of parameter a € [0,1) is the probability defined on N by Pn Pn = (1—aja”, n=0,1,2.3,.... 24 4. Probabilities on a Finite or Countable Space Note that in the Binomial model if n is large. then while in theory Cra —p)"~/ is known exactly, in practice it can be hard to compute. (Often it is beyond the capacities of quite powerful hand calculators. for ex- ample.) If nis large and p is small, however (as is often the case), there is an alternative method which we now describe. Suppose p changes with n; call it p,. Suppose further lim, .5< NPyp = A. One can show (see Exercise 4.1) that n i (1 —p,)?-F =e Pe im, ("Joos (L= pn)" =e i and thus one can easily approximate a Binomial probability (in this case) with a Poisson. Appendix: Some useful result on series In this Appendix we give a summary, mostly without proofs, of some useful result on series and summable families: these are primarily useful for studying probabilities on countable state spaces. These results (with proofs) can be found in most texts on Calculus (for example, see Chapter 10 of [18]). First we establish some conventions. Quite often one is led to perform calculations involving +oc (written more simply as 0c) or —oc. For these calculations to make sense we always use the following conventions: +oot+00 = +00, —oo-00 =—00, atoo=+00, a-co=-oo ifaeR, 0x0=0, a€|0,co] + axc=+00, a€[-0,0[ = ax w=-x. Let up; be a sequence of numbers, and consider the “partial sums” S, = ty +... + Un. S1: The series >, un is called convergent if S, converges to a finite limit S, also denoted by S$ = 37, un (the “sum” of the series). S2: The series So, un is called absolutely convergent if the series >, |tn| converges. S3: If un, > 0 for all n, the sequence S;,, is increasing, hence always con- verges to a limit S € [0, oo]. We still write $ = }>,, un, although the series converges in the sense of (S1) if and only if S < oc. The summands uw, can even take their values in [0, oc] provided we use the conventions above concerning addition with oo. In general the convergence of a series depends on the order in which the terms are enumerated. There are however two important cases where the ordering of the terms has no influence, and one speaks rather of “summable families” instead of “series” in these cases, which are $4 and S5 below: 4. 
Probabilities on a Finite or Countable Space 25 S4: When the u, are reals and the series is absolutely convergent one can modify the order in which the terms are taken without changing the absolute convergence, nor the sum of the series. $5: When uw, € [0.00] for all n, the sum }>,, un (which is finite or infinite: cf. (S3) above) does not change if the order is changed. S6: When up € [0,00], or when the series is absolutely convergent, we have the following associativity property: let (A;)icy be a partition of N*, with J = {1,2,...,.N} for some integer N, or J = N*. For each i € J we set vi = Daca, Un: if A; is finite this is an ordinary sum, otherwise v; is itself the sum of a series. Then we have 37, un = Dye, Ui (this latter sum is again the sum of a series if J = N*). 26 4. Probabilities on a Finite or Countable Space Exercises for Chapter 4 4.1 (Poisson Approximation to the Binomial) Let P be a Binomial proba- bility with probability of success p and number of trials n. Let A = pn. Show that P(k successes) k n —k BOG) CS DYOa) Let n — oc and let p change so that \ remains constant. Conclude that for small p and large n, AK oy P(k successes) © where \ = pn. {Note: In general for this approximation technique to be good one needs n large, p small. and also \ = np to be of moderate size — for example A < 20.] 4.2 (Poisson Approximation to the Binomial, continued) In the setting of Exercise 4.1, let py = P({k}) and q, = 1— px. Show that the g, are the probabilities of singletons for a Binomial distribution B(1 — p,n). Deduce a Poisson approximation of the Binomial when n is large and p is close to 1. 4.3 We consider the setting of the hypergeometric distribution, except that we have m colors and N; balls of color i. Set N = Ny+...+Nmm.and call X; the number of balls of color i drawn among n balls. Of course X;+...+ Xm =n. Show that CpG) P(X, = 21,00 Xm=em)=4 CE) 5 eee 0 otherwise. 5. Random Variables on a Countable Space In Chapter 5 we again assume {2 is countable and A= 2°. A random vari- able X in this case is defined to be a function from 92 into a set T. A random variable represents an unknown quantity (hence the term variable) that varies not as a variable in an algebraic relation (such as 2? —9 = 0). but rather varies with the outcome of a random event. Before the random event. we know which values X could possibly assume, but we do not know which one it will take until the random event happens. This is analogous to algebra when we know that ¢ can take on a priori any real value, but we do not know which one (or ones) it will take on until we solve the equation x? — 9 = 0 (for example). Note that even if the state space (or range space) T is not countable, the image T’ of 2 under X (that is, all points {i} in T for which there exists an w € 2 such that X(w) = 7) is either finite or countably infinite. We can then define the distribution of X (also called the law of X) on the range space T’ of X by P*(A) = P({w: X(w) € A}) = P(X71(A)) = P(X € A). That this formula defines a Probability measure on T' (with the o-algebra 27 of all subsets of T’) is evident. Since T’ is at most countable. this probability is completely determined by the following numbers: PP=PX=f)= YO pe. {uiX(w)=j} Sometimes, the family (px :j € T’) is also called the distribution (or the law) of X. We have of course Px(A) = 0j<4 7}. If P* has a known distribution, for example Poisson. then we say that X is a Poisson random variable. Definition 5.1. Let X be a real-valued random variable on a countable space 92. 
The expectation of X, denoted E{X}, is defined to be F{X}= Xp. provided this sum makes sense: this is the case when is finite; this is also the case when 92 is countable, when the series is absolutely convergent or 28 5. Random Variables on a Countable Space X > 0 always (in the latter case. the above sum and hence E{X} as well may take the value +9c). This definition can be motivated as follows: If one repeats an experiment n times, and one records the values X;, X2,...,X, of X corresponding to the n outcomes, then the empirical mean 4(X1+...+Xn) is Duco X(w) fn({e}), where f,({w}) denotes the frequency of appearance of the singleton {w}. Since f,({w}) “converges” to P({w}), it follows (at least when 92 is finite) that the empirical mean converges to the expectation E{X } as defined above. Define L' to be the space of real valued random variables on (2,4, P) which have a finite expectation. The following facts follow easily: (i) C? is a vector space, and the expectation operator F is linear, (ii) the expectation operator FE is positive: if X € £' and X > 0, then E{X} > 0. More generally if X,Y € £' and X < Y then E{X} < E{Y}. (iii) C* contains all bounded random variables. If X = a, then E{X} =a. (iv) If X € £}, its expectation depends only on its distribution and, if T’ is the range of X, E{X} = So §P(X = 5). (5.1) jet’ (v) If X = 1, is the indicator function of an event A, then E{X} = P(A). We observe that if )>,,(X(w))?p. is absolutely convergent, then SX pos YM Xeps+ SY Xe). @ [X(w)|a} < FCO} for alla>0. Proof. Since X is an r.v. so also is Y = A(X); let A={¥~*((a,20))} = (ws W(X(w)) > a} = (A(X) = a}. Then A(X) > ala, hence E{h(X)} > E{ala} = aE{1a} = aP(A) and we have the result. o 5. Random Variables on a Countable Space 29 Corollary 5.1 (Markov’s Inequality). PAx| 2 a} < HUD Proof. Take h(«) = |2| in Theorem 5.1. o Definition 5.2. Let X be a real-valued random variable with X? in £1. The Variance of X is defined to be o? =o% = E{(X — E(X))?}. The standard deviation of X, ax, is the nonnegative square root of the vari- ance. The primary use of the standard deviation is to report statistics in the correct (and meaningful) units. An example of the problem units can pose is as follows: let X denote the number of children in a randomly chosen family. Then the units of the vari- ance will be “square children”, whereas the units for the standard deviation ox will be simply “children”. If E{X} represents the expected, or average, value of X (often called the mean), then E{|X — E(X)|} = E{|X — p|} where p = E{X}, represents the average difference from the mean, and is a measure of how “spread out” the values of X are. Indeed, it measures how the values vary from the mean. The variance is the average squared distance from the mean. This has the effect of diminishing small deviations from the mean and enlarging big ones. However the variance is usually easier to compute than is £{|X — |}, and often it has a simpler expression. (See for example Exercise 5.11.) The variance too can be thought of as a measure of variability of the random variable X. Corollary 5.2 (Chebyshev’s Inequality). [f X? is in L', then we have a) P{|X| >a} < Ett for a>0. (b) prt amas for ado. Proof. Both inequalities are known as Chebyshev’s inequality. For part (a), take h(x) = 2* and then by Theorem 5.1 P{\X| >a} = P{A(X) > a2} < roy. For part (b), let ¥ = |X — E{X}]. Then P{|X — E{X}| > a} = P{Y > a} = P{¥? > a} < Pont = 7 Corollary 5.2 is also known as the Bienaymé-Chebyshev inequality. 30 5. 
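As a quick numerical sanity check (an added illustration, not part of the original text), the Markov and Chebyshev bounds can be compared with empirical frequencies from simulated data; the Poisson parameter, the sample size and the threshold a used below are arbitrary choices.

```python
import numpy as np

rng = np.random.default_rng(0)
lam, n, a = 3.0, 100_000, 6.0            # arbitrary illustration parameters
x = rng.poisson(lam, size=n)             # X ~ Poisson(lam), so E{X} = Var(X) = lam

# Markov: P(|X| >= a) <= E{|X|} / a  (here X >= 0)
print(np.mean(x >= a), "<=", x.mean() / a)

# Chebyshev: P(|X - E{X}| >= a) <= Var(X) / a**2
print(np.mean(np.abs(x - x.mean()) >= a), "<=", x.var() / a**2)
```

Both empirical frequencies stay below the corresponding bounds, as the corollaries guarantee; the bounds themselves are typically far from tight.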
Random Variables on a Countable Space Examples: 1) X is Poisson with paraineter A. Then X: 2 — N (the natural numbers), and i P(X EA) = PPX =)=Yo ran jEA jEA The expectation of X is x ay ee A j=0 j=0 7 jt ay Gate 2) X has the Bernoulli distribution if X takes on only two values: 0 and 1. X corresponds to an experiment with only two outcomes, usually called “success” and “failure”. Usually {X = 1} corresponds to “success”. Also it is customary to call P({X = 1}) = p and P({X = 0}) =q=1-p Note E{X} =1P(X =1) + 0P(X =0) = Lp+0.q=p. 3) X has the Binomial distribution if P* is the Binomial probability. That is, for a given and fixed n, X can take on the values {0,1,2...., nj. P({X = k}) = (eka —pyr-*, where 0

0. The constant c is such that ¢ 7°, jr = 1. The function 1 (s)= Soa ooh k=1 is known as the Riemann zeta function, and it is extensively tabulated. = 1, Thus c= Wer: and 32 5. Random Variables on a Countable Space 1 . 1 PRED = Tay The mean is easily calculated in terms of the Riemann zeta function: 4S apyxe ye Si BUR} DPX i) Gary LRA _ 62) C(a+1) 7) Ifthe state space EF of a random variable X has only a finite number of points, say n, and each point is equally likely, then X is said to have a uniform distribution. In the case where 1,2....,n, 1 . P(X =j)=7, J then X has the Discrete Uniform distribution with parameter n. Using that D2, i= M4EY, we have PO}= Dare an = Lead! = Hee) K a Exercises 33 Exercises for Chapter 5 5.1 Let g : [0.00) — [0, 00) be strictly increasing and nonnegative. Show that Eg Xt 9(@) 5.2 Let h: R = [0,a] be a nonnegative (bounded) function. Show that for Oa)< for a> 0. P{R(X) > a} > 5.3 Show that 0} = E{X?} — E{X}, assuming both expectations exist. 5.4 Show that E{X}? < H{X?} always, assuming both expectations exist. 5.5 Show that 0} = H{X(X —1)}+px —pX, where px = E{X}, assuming all expectations exist. 5.6 Let X be Binomial B(p,n). For what value of j is P(X = j) the greatest’? (Hint: Calculate Pe) [Ans.: [(n + 1)p], where [2] denotes integer part of «.] 5.7 Let X be Binomial B(p,n). Find the probability X is even. [Ans.: }(1+ (1 = 2p)"),] 5.8 Let X, be Binomial B(p,,n) with A = np, being constant. Let A, {X,, > 1}, and let Y be Poisson (\). Show that limo. P(Xn = j | An) PY=j|Y>1). 5.9 Let X be Poisson (X). What value of j maximizes P(X = j)? [Ans.: [A].] (Hint: See Exercise 5.6.) 5.10 Let X be Poisson (A). For fixed j > 0, what value of \ maximizes P(X = jy? [Ans.: j.] 5.11 Let X be Poisson (A) with A a positive integer. Show E{|X — Al} = 2Me* 2 oar? and that 6% =A. 5.12* Let X be Binomial B(p,n). Show that for A > 0 and e > 0, P(X —np > ne) < E{exp(A(X — np — ne))}. 5.13 Let X,, be Binomial B(p,n) with p > 0 fixed. Show that for any fixed b> 0, P(X, 0 fixed. and a > 0. Show that x (2 >a) < vol?) vO min { VoD). avi} and also that P(|X — np| < ne) tends to 1 for all ¢ > 0. —?P 5.15 * Let X be a Binomial a where n = 2m. Let a(m, k) = = AY p(X =m+h). Show that limm—.<(a(m, k))™ = e7 5.16 Let X be Geometric. Show that for i,j > 0, P(X >itj|X>i)=P(X > Jj). 5.17 Let X be Geometric (p). Show & {ty} = eat - pr. 5.18 A coin is tossed independently and repeatedly with the probability of heads equal to p. a) What is the probability of only heads in the first n tosses? b) What is the probability of obtaining the first tail at the nt® toss? c) What is the expected number of tosses required to obtain the first tail? {Ans.: 45. —P 5.19 Show that for a sequence of events (A,)n>1. 20 oo E {= la, \ =o (An), n=1 n=l where oc is a possible value for each side of the equation. 5.20 Suppose X takes all its values in N (= {0,1,2.3,...}). Show that x B{X} = YO P(X > n). n=0 5.21 Let X be Poisson (\). Show for r = 2.3, 4,..., E{X(X —1)...(X —r+ 1} =r". 5.22 Let X be Geometric (p). Show for r = 2,3, 4,.... rip” E{X(X-1). (Xar+Dh= Ge. 6. Construction of a Probability Measure Here we no longer assume 2 is countable. We assume given 2 and a o- algebra A Cc 2°. (Q,.A) is called a measurable space. We want to construct probability measures on A. When {2 is finite or countable we have already seen this is simple to do. 
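On a finite or countable Ω this construction can also be carried out directly on a computer; the sketch below is an added illustration (the choice Ω = {0, 1, 2, ...} with Poisson point masses is arbitrary) that builds P(A) as the sum of the weights p_ω over ω in a finite event A.

```python
from math import exp, factorial

lam = 3.0                                  # arbitrary choice of point masses p_w

def p(w):
    """Point mass p_w = P({w}) on Omega = {0, 1, 2, ...}; here Poisson(lam) weights."""
    return exp(-lam) * lam**w / factorial(w)

def prob(event):
    """P(A) = sum of p_w over w in A, for a finite event A."""
    return sum(p(w) for w in event)

print(prob({0, 1, 2}))                     # probability of a small event
print(prob(range(50)))                     # essentially P(Omega) = 1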
When 2 is uncountable, the same technique does not work: indeed, a “typical” probability P will have P({w}) = 0 for all w, and thus the family of all numbers P({w}) for w € (2 does not characterize the probability P in general. It turns out in many “concrete” situations — in particular in the next chapter — that it is often relatively simple to construct _a “probability” on an algebra which generates the g-algebra A. and the problem at hand is then to extend this probability to the o-algebra itself. So, let us suppose A is the g-algebra generated by an algebra Ao, and let us further suppose we are given a probability P on the algebra Ag: that is. a function P : Ag > [0.1] satisfying 1. P(Q)=1. 2. (Countable Additivity) for any sequence (A,,) of elements of Ap, pairwise disjoint, and such that U,An € Ao, we have P(UnAn) = D>, P(An). It might seem natural to use for A the set of all subsets of 2, as we did in the case where {2 was countable. We do not do so for the following reason, illustrated by an example: suppose {2 = {0, 1], and let us define a set function P on intervals of the form P((a.b]) = 6 — a, where 0 1 with An Am = @ for n #m; then one can prove that no such P exists! The collection of sets 2!!! js simply too big for this to work. Borel realized that we can however do this on a smaller collection of sets, namely the smallest o-algebra containing intervals of the form (a, }]. This is the import of the next theorem: Theorem 6.1. Each probability P defined on the algebra Ag has a unique extension (also called P) on A. 36 6. Construction of a Probability Measure We will show only the uniqueness. For the existence on can consult any standard text on measure theory: for example [16] or [23]. First we need to establish a very useful theorem. Definition 6.1. A class C of subsets of 2 is closed under finite intersections if for when Aj,....An €C, then Ay AQN...N An EC as well (n arbitrary but finite). A class C is closed under increasing limits if wherever Ay C Ay C Ag C -CAn C... is a sequence of events in C, then UR, An € ¢ as well. A class C is closed under differences if whenever A,B EC with AC B, then B\ AEC. Theorem 6.2 (Monotone Class Theorem). Let C be a class of subsets of 22, closed under finite intersections and containing Q. Let B be the smallest class containing C which is closed under increasing limits and by difference. Then B = (C). Proof. First note that the intersection of classes of sets closed under increasing limits and differences is again a class of that type. So, by taking the intersec- tion of all such classes. there always exists a smallest class containing C which is closed_under increasing limit: differences. For each set B, denote Bp to be the collection of sets A such that A € B and AMB € B. Given the properties of B, one easily checks that Bg is closed under increasing limits and by difference. Let B € C; for each C € C one has BNC €C Cc Band C € B, thus C € Bg. Hence C C Bg C B. Therefore B = Bz, by the properties of B and of Bz. Now let B € B. For each C € C, we have B € Bo, and because of the preceding, BMC € B, hence C € Bg, whence C C Bg CB, hence B = Bz. Since B = Bg for all B € B, we conclude B is closed by finite intersec- tions. Furthermore 2 € B, and B is closed by difference, hence also under complementation. Since B is closed by increasing limits as well, we conclude B is a o-algebra, and it is clearly the smallest such containing C. 
o The proof of the uniqueness in Theorem 6.1 is an immediate consequence of Corollary 6.1 below, itself a consequence of the Monotone Class Theorem. Corollary 6.1. Let P and Q be two probabilities defined on A, and suppose P and Q agree on a class C C A which is closed under finite intersections. If a(C) =A, we have P=Q. ‘ Proof. 2.€ A because A is a o-algebra, and since P(2) = Q(22) = 1 because they are both probabilities, we can assume without loss of generality that 2.0 C. Let B= {A € A: P(A) = Q(A)}. By the definition of a Probability measure and Theorem 2.3, B is closed by difference and by increasing limits. ' B\ A denotes BN AP 6. Construction of a Probability Measure 37 Also B contains C by hypothesis. Therefore since o(C) = A, we have B= A by the Monotone Class Theorem (Theorem 6.2). a There is a version of Theorem 6.2 for functions. We will not have need of it in this book, but it is a useful theorem to know in general so we state it here without proof. For a proof the reader can consult [19, p. 365]. Let M be a class of functions mapping a given space 92 into R. We let o(M) denote the smallest o-algebra on §2 that makes all of the functions in M measurable: o(M) = {f -1(A);A € B(R); f € M}. Theorem 6.3 (Monotone Class Theorem). Lei M be a class of bounded functions mapping Q into R. Suppose M is closed under multiplication: f.g € M implies fg € M. Let A = o(M). Let H be a vector space of functions with H containing M. Suppose H contains the constant func- tions and is such that whenever (fn)n>1 is a sequence in H such that O B (note also that M92, (a.b + 4) = (a,b), so By C B and thus B = o(Bo)). The relation (7.1) implies that P((@y)) = FY) — F@), and if A € Bg is of the form A= Uicicn(ei, yi] with ys < tina, then P(A) = Dy cjen{F (ys) — Flei)}- If Q is another probability measure such that F(a) = Q((—00, a), 40 7. Construction of a Probability Measure on R then the preceding shows that P = Q on Bo. Theorem 6.1 then implies that P =Q on all of B, so they are the same Probability measure. Oo The significance of Theorem 7.1 is that we know, in principle. the complete probability measure P if we know its distribution function F’ : that is, we can in principle determine from F the probability P(A) for any given Borel set A. (Determining these probabilities in practice is another matter.) It is thus important to characterize all functions F which are distribution functions, and also to construct them easily. (Recall that a function F is right continuous if limy)» F(y) = F(x), for all « € R.) Theorem 7.2. A function F is the distribution function of a (unique) prob- ability on (R,B) if and only if one has: (i) F is non-decreasing; (ii) F is right continuous; (iii) Him, 0 F(x) = 0 and limy_.4., F(z) = 1. Proof. Assume that F is a distribution function. If y > 2, then (—90.2] C (—00, yj], so P((—o0, «]) < P((—oc, y]) and thus F(x) < F(y). Thus we have (i). Next let z, decrease to x. Then N3,(—00, tn] = (—00, 2], and the sequence of events {(—90,2,];n > 1} is a decreasing sequence. Therefore P(N) (00, fn]) = limps P((—26,tn]) = P((—0,2]) by Theorem 2.3, and we have (ii), Similarly, Theorem 2.3 gives us (iii) as well Next we assume that we have (i), (ii), and (iii) and we wish to show F is a distribution function. In accordance with (iii), let us set F(—oo) = 0 and F (+00) = 1. As in the proof of Theorem 7.1, let Bo be the set of finite disjoint unions of intervals of the form (z,y], with —o0 < x < y < +00. 
Define a set function P, P : By — (0, 1] as follows: for A =Urcien(in yi) with y; 0. By hypothesis (iii) there exists a a z such that F(—z) < € and 1— F(z) < «. For each n,i there exists a? € (a7, y?] such that F(a?) — F(x!) < sr, by (ii) (right continuity). Set By = Ureick, {a7 uF] 1(-2,2]}; Bn = UmenBr- Note that B/, € Bo and BY, C Ay, hence By € By and By C An. Furthermore, An\Bn C Umen(4m\Biy), hence P(An) ~ P(Bn) S P((-2,21°) + S3 P((Am\Br) 1 (-2 4) m= no ky < P((-z,2]°) + 32 YS P((2?, a?) m=1i=1 n kn < F(-2) +1-F(2)+ YO SO{F(a?) — F(e?)} < Be. (7.2) mal i=l Furthermore observe that B, C A, (where By, is the closure of Bn), hence ne_,B, = @ by hypothesis. Also B, S [-2, z], hence each B, is a compact: set. Tt is is a property of compact spaces! (known as “The Finite Intersection Property”) that for closed sets Fg, Nges Fg # 0 if and only if NgecFs #0 for all finite subcollections C’ of B. Since in our case N°2,B, = 0, by the Finite Intersection Property we must have that there exists an m such that B, = ¢ for all n > m. Therefore By, = ¢ for all n > m, hence P(Bn) = 0 for all n > m. Finally then P(An) = P(An) — P(Bn) $ 3e by (7.2), for all n > m. Since € was arbitrary, we have P(A,) | 0. (Observe that this rather lengthy proof would become almost trivial if the sequence k,, above were bounded; but although A, decreases to the empty set, it is not usually true). qa Corollary 7.1. Let F be the distribution function of the probability P on R. Denoting by F(x—) the left limit of F at x (which exists since F is nonde- creasing), for all x < y we have 1 For a definition of a compact space and the Finite intersection Property one can consult (for example) [12, p.81]. 42 7. Construction of a Probability Measure on R. (i) P((a.y] = Fly) ~ F(@). (ii) P({w.y]) = F(y) — F(a-), (ii) P([e.y)) = F(y-) - F(@-), Gv) P((x.y)) = Fy) ~ F(). (v) Pz}) = F(x) — F(e-), and in particular P({x}) =0 for all x if and only the function F is continu- ous. Proof. (i) has already been shown. For (ii) we write P(e— 2.) = Fy) Fe-4) by (i). The left side converges to F(y) — F(«—) as n — oc by definition of the left limit of F; the right side converges to P((z.y]) by Theorem 2.3 because the sequence of intervals (x — 4.y] decreases to [x.y]. The claims (iii), (iv) and (v) are proved similarly. oO Examples. We first consider two general examples: 1. If f is positive and Riemann-integrable and f™.. f («)dx = 1, the function F(z) = Poe f(y)dy is a distribution function of a probability on R; the function f is called its density. (It is not true that each distribution function admits a density, as the following example shows). 2. Let a € R. A “point mass” probability on R (also known as “Dirac measure”) is one that satisfies lifa€ A, P(A)= {5 otherwise. Its distribution function is Oifa 0 and f*. f(«)dz = 1, which the reader can check is indeed the case for examples 3-10. We abuse language a bit by referring to the density f alone as the distribution, since it does indeed determine uniquely the distribution. 7. Construction of a Probability Measure on R 43 ifasa0. 4 fe) = ifr <0. is called 7 Exponential distribution with parameter 3 > 0. The exponential distribution is often used to model the lifetime of objects whose decay has “no memory”; that is, if X is exponential, then the probability of an object lasting t more units of time given it has lasted s units already, is the same as the probability of a new object lasting ¢ units of time. 
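The memoryless property can be checked directly from the distribution function F(x) = 1 - e^{-βx}: P(X > s + t | X > s) should equal P(X > t). The short sketch below is an added illustration; the values of β, s and t are arbitrary.

```python
import math

beta, s, t = 0.5, 2.0, 3.0                # arbitrary illustration parameters

def survival(x):
    """P(X > x) = 1 - F(x) for an Exponential(beta) random variable."""
    return math.exp(-beta * x) if x >= 0 else 1.0

# P(X > s + t | X > s) = P(X > s + t) / P(X > s), since {X > s + t} is contained in {X > s}
print(survival(s + t) / survival(s))      # equals exp(-beta * t)
print(survival(t))                        # the same number
```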
The lifetimes of light bulbs (for example) are often modeled this way: thus if one believes the model it is pointless to replace a working light bulb with a new one. This memoryless property characterizes the exponential distribution: see Exercises 9.20 and 9.21. ge _ : 5. f(@) = ra 7 1 3. fa) = {ra xz <0, is called the Gamma distribution with parameters a, 3 (0 < a < co and 0 < 8 <0; TI denotes the gamma function) ? The Gamma distribution arises in various applications. One example is in reliability theory: if one has a part in a machine with an exponen- tial (3) lifetime. one can build in reliability by including n — 1 back-up components. When a component fails. a back-up is used. The result- ing lifetime then has a Gamma distribution with parameters (n. 3). (See Exercise 15.17 in this regard.) The Gamma distribution also has a rela- tionship to the Poisson distribution (see Exercise 9.22) as well as to the chi square distribution (see Example 6 in Chapter 15). The chi square distribution is important in Statistics: See the Remark at the end of Chapter 11. ana 1e(52)" if 2 > 0, 6. f(z) = ifx <0. is called i Weibull distribution with parameters a, 3 (0~ a* te *da, a > 0; it follows from the definition that (a) = (a ~ 1)! for a EN, and P(3) = vm 44 7. Construction of a Probability Measure on R- known as the Gaussian distribution. Standard notation for the Normal with parameters j: and 0? is N(y.07). We discuss the Normal Distribution at length in Chapters 16 and 21; it is certainly the most important distribution in probability and it is central to much of the subject of Statistics. 8. Let gyu,o2(t) = Tee ee, the normal density. Then 1 : f(a) = 7 9u.02(log x) ifz>0, : 0 ifx <0, is called the Lognormal distribution with parameters 1, ¢?(—00

i be any sequence of pairwise disjoint events and P a proba- bility. Show that lim, P(An) = 0. 7.2* Let (As)sen be a family of pairwise disjoint events. Show that if P(Ag) > 0. each 3 € B, then B must be countable. 7.3. Show that the maximum of the Gamma density occurs at 7 = ss for a>, Be 7.4. Show that the maximum of the Weibull density occurs at 2 = 3(*54)«, fora>1. 7.5. Show that the maximum of the Normal density occurs at 2 = p. 7.6 Show that the maximum of the Lognormal density occurs at 2 = e#e~?". 7.7 Show that the maximum of the double exponential density occurs at waa 7.8 Show that the Gamma and Weibull distributions both include the Ex- ponential as a special case by taking a = 1. 7.9’ Show that the uniform. normal, double exponential, and Cauchy densities are all symmetric about their midpoints. 7.10 A distribution is called unimodal if the density has exactly one absolute maximum. Show that the normal, exponential, double exponential, Cauchy, Gamma, Weibull, and Lognormal are unimodal. 7.11 Let P(A) = Lr la(z)f(x)dx for a nonnegative function f with Jo F(w)dx = 1. Let A = {zo}, a singleton (that is, the set A consists of one single point on the real line). Show that A is a Borel set and also a null set (that is, P(A) = 0). 7.12 Let P be as given in Exercise 7.11. Let B be a set with countable cardinality (that is, the number of points in B can be infinite, but only countably infinite). Show that B is a null set for P. 7.13 Let P and B be as given in Exercise 7.12. Suppose A is an event with P(A) = 3. Show that P(AU B) = 3 as well. 7.14 Let Aj,....An.... be a sequence of null sets. Show that B = UZ, A; is also a null set. 7.15 Let X be a r.v. defined on a countable Probability space. Suppose E{|X|} = 0. Show that X = 0 except possibly on a null set. Is it possible to conclude, in general, that X = 0 everywhere (i.e.. for all w)? [Ans.: No] 46 7. Construction of a Probability Measure on R. 7.16* Let F be a distribution function. Show that in general F can have an infinite number of jump discontinuities. but that there can be at most countably many. 7.17 Suppose a distribution function F is given by 1 , 1 1 F(a) = Fljo,coy(@) + 5 Urey (2) + 3G V2.00)(2)- Let P be given by P((-00,2]) = F(a). Then find the probabilities of the following events: a) A=(-3.3 b) B=(-4,2 ce) C=(§.3) d) D= (0,2) e) E= (3.00) 7.18 Suppose a function F is given by 1 F() = Viger i=1 Show that it is the distribution function of a probability on R. Let us define P by P((—oc,c]) = F(x). Find the probabilities of the following events: a) A=[l, 0) b) B=[5-00) c) C= {0} d) D=(0,3) e) E=(-cx.0) f) G= (0,00) 8. Random Variables In Chapter 5 we considered random variables defined on a countable prob- ability space ({2..A,P). We now wish to consider an arbitrary abstract space. countable or not. If X maps 2 into a state space (F.F), then what we will often want to compute is the probability that X takes its val- ues in a given subset of the state space. We take these subsets to be ele- anents of the g-algebra F of subsets of F. Thus, we will want to compute P({w: X(w) € A}) = P(X € A) = P(X71(A)), which are three equivalent ways to write the same quantity. The third is enlightening: in order to com- pute P(X~1(A)), we need X~1(A) to be an element of A, the g-algebra on 2 on which P is defined. This motivates the following definition. Definition 8.1. (a) Let (E,€) and (F,F) be two measurable spaces. A func- tion X : E— F is called measurable (relative to € and F) if X*(A) cE, for all A €F. 
(One also writes X~\(F) c €.) (b) When (E,€) = (9,A), a measurable function X is called a random variable (r.v.). (c) When F = R, we usually take F to be the Borel o-algebra B of R. We will do this henceforth without special mention. Theorem 8.1. Let C be a class of subsets of F such that o(C) = F. In order for a function X : E > F to be measurable (w.r.t. the o-algebras E and F ), it is necessary and sufficient that X~1(C) c €. Proof. The necessity is clear, and we show sufficiency. That is, suppose that X71(C) € € for all C € C. We need to show X~1(A) € € for all A € F. First note that X~1(UnAn) = UnX71(An), X7(AnAn) = On X7 (An), and X71(A°) = (X71(A))*. Define B = {A € F:X~1(A) € €}. Then C c B, and since X~! commutes with countable intersections, countable unions, and complements, we have that B is also a o-algebra. Therefore B > o(C), and also F > B, and since F = o(C) we conclude F = B, and thus X~1(F) ¢ o(X-*(C)) CE. aq We have seen that a probability measure P on R is characterized by the quantities P((—20,a]). Thus the distribution measure P* on R of a random variable X should be characterized by P*((—oo, a]) = P(X < a) and what is perhaps surprisingly nice is that being a random variable is 48 8. Random Variables further characterized only by events of the form {w:X(w) < a} = {X < a}. Indeed, what this amounts to is that a function is measurable — and hence a random variable — if and only if its distribution function is defined. Corollary 8.1. Let (F,F) = (R,B) and let (E.€) be an arbitrary measur- able space. Let X, Xp be real-valued functions on E. a) X is measurable if and only if {X < a} = {w: X(w) < a} = X71((—oe. a]) € €, for each a; or iff {X , Xm. We have just seen each Y;, is measurable, and we have also seen that inf, Y;, is therefore measur- able; hence lim sup,,_.., Xn is measurable. Analogously lim inf, 5. Xn = sup, infm>n Xm is measurable. If lim, X, = X, then X = limsup,_.., Xp = liminfpoo Xn (be- cause the limit exists by hypothesis). Since lim sup, _,., Xn is measurable and equal to X, we conclude X is measurable as well. (c aq Theorem 8.2. Let X be measurable from (E.€) into (F,F). and Y mea- surable from (F,F) into (G,G); then Y o X is measurable from (E,.€) into (G,g). Proof. Let A € G. Then (Y o X)~1(A) = X~1(Y¥~1(A)). Since Y is measur- able, B = Y~1(A) € F. Since X is measurable, X~1(B) € €. q A topological space is an abstract space with a collection of open sets;! the collection of open sets is called the topology of the space. An abstract definition of a continuous function is as follows: given two topological spaces (E,U) and (F,V) (where U are the open sets of E and V are the open sets of 2 A “collection of open sets” is a collection of sets such that any union of sets in the collection is also in the collection, and any finite intersection of open sets in the collection is also in the collection. 8. Random Variables 49 F), then a continuous function f:E — F is a function such that f~!(A) EU for each A € YV. (This is written concisely as f~1(V) C U.) The Borel o- algebra of a topological space (£,U4) is B = o(U). (The open sets do not form a o-algebra by themselves: they are not closed under complements or under countable intersections.) Theorem 8.3. Let (E,U) and (F,V) be two topological spaces, and let E, F be their Borel g-algebras. Every continuous function X from E into F is then measurable (also called “Borel”). Proof. Since F = o(V), by Theorem 6.4 it suffices to show that X~1(V) c E. 
But for O € V, we know X~1(Q) is open and therefore in €, as € being the Borel g-algebra, it contains the class U of open sets of EF. a Recall that for a subset A of E, the indicator function 14(x) is defined to be . A lifreA, ta)= {jie ea Thus the function 14(z), usually written 14 with the argument x being im- plicit, “indicates” whether or not a given z is in A. (Sometimes the function 1, is known as the “characteristic function of A” and it is also written x4; this terminology and notation is somewhat out of date.) Theorem 8.4. Let (F.F) = (R,B), and (E,€) be any measurable space. a) An indicator 14 on Eis measurable if and only if A € E. b) If Xy,...,Xp are real-valued measurable functions on (H,E), and if f is Borel on R”, then [(X1,-..,Xn) is measurable. c) If X.Y are measurable, so also are X +¥, XY, XV (ashort-hand for max(X,Y)), X AY (a short-hand for min(X,Y)), and X/Y (if ¥ #0). Proof. (a) If BC R, we have if O¢ B, 1¢B ym _)A if 0¢B,16B Gay"(B)= 9 ae if 0B. 1¢B E if0EB,1EB The result follows. (b) The Borel o-algebra B” on R” is generated by the quadrants Tli a; P(A,). ‘ (9.2) i=l (This is also written [ X(w)P(dw) and even more simply f XdP.) A little algebra shows that B{X} does not depend on the particular rep- resentation (9.1) chosen for X. Let X,Y be two simple random variables and 3 a real number. We clearly can write both X and Y in the form (9.1), with the same subsets A; which form a partition of 2, and with numbers a; for X and b; for Y. Then 8X and X +/Y are again in the form (9.1) with the same A; and with the respective numbers 3a; and a; +b;. Thus E{9X} = GE{X} and E{X+Y} = E{X}+ E{Y}; that is expectation is linear on the vector space of all simple r.v.’s. If further X < Y we have a; < 0; for all i, and thus E{X} < E{Y}. Next we define expectation for positive random variables. For X positive (by this, we assume that X may take all values in [0. oo], including +oc: this innocuous extension is necessary for the coherence of some of our further results), let 52 9. Integration with Respect to a Probability Measure E{X} =sup(B{Y}: Y a simple rv. withO 0, but we can have E{X} = oc. even when X is never equal to +0, Finally let X be an arbitrary r.v. Let X* = max(X,0) and X~ = —min(X,0). Then X = X* — X~, and X*, X~ are positive r.v.’s. Note that |X| = X++4X-. Definition 9.2. (a) A r.v. X has a finite expectation (is “integrable”) if both E{X*} and E{X~} are finite. In this case, its expectation is the number E{X} = E{X+}— B{X-}, (9.4) also written [ X(w)dP(w) or f XdP. (If X >0 then X- =0 and Xt =X and, since obviously E{0} = 0, this definition coincides with (9.3)). We write L to denote the set of all integrable random variables. (Some- times we write L1(2,A, P) to remove any possible ambiguity.) (b) A rv, X admits an expectation if E{X*+} and E{X~} are not both equal to +00. Then the expectation of X is still given by (9.4), with the conventions +00 + a = +00 and —00 + a = —oo when a € R. (If X > 0 this definition again coincides with (9.3); note that if X admits an expectation, then E{X} € [—oo, +00], and X is integrable if and only if its expectation is finite.) Remark 9.1. When £2 is finite or countable we have thus two different def- initions for the expectation of a r.v. X, the one above and the one given in Chapter 5. In fact these two definitions coincide: it is enough to verify this for a simple r.v. X, and in this case the formulas (5.1) and (9.2) are identical. 
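Before turning to the main properties of expectation, here is a minimal numerical sketch (not part of the text) of formula (9.2) and of Remark 9.1: for a simple random variable on a finite Omega, the pointwise sum of Chapter 5 and the partition formula (9.2) return the same number. The sample space, the probabilities, and the random variable X below are hypothetical choices made only for this illustration.

    # Minimal sketch of (9.2) and Remark 9.1; Omega, P and X are hypothetical.
    from fractions import Fraction as F

    omega = ["w1", "w2", "w3", "w4"]
    prob  = {"w1": F(1, 2), "w2": F(1, 4), "w3": F(1, 8), "w4": F(1, 8)}
    X     = {"w1": 3, "w2": 3, "w3": -1, "w4": 5}   # a simple r.v. (finitely many values)

    # Chapter 5 definition: E{X} = sum over omega of X(w) P({w}).
    e_pointwise = sum(X[w] * prob[w] for w in omega)

    # Formula (9.2): write X = sum_i a_i 1_{A_i} with A_i = {X = a_i} a
    # partition of Omega, and compute E{X} = sum_i a_i P(A_i).
    e_simple = sum(a * sum(prob[w] for w in omega if X[w] == a)
                   for a in set(X.values()))

    assert e_pointwise == e_simple   # the two definitions coincide (Remark 9.1)
    print(e_pointwise)               # 11/4

Running the same computation with any other finite Omega, probabilities, and simple X gives the same agreement, consistent with the fact that (9.2) does not depend on the representation (9.1) chosen.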
The next theorem contains the most important properties of the expec- tation operator, The proofs of (d), (e) and (f) are considered hard and could be skipped. Theorem 9.1. (a) L! is a vector space, and expectation is a linear map on L', and it is also positive (ie. X > 0 + E{X} > 0). If further 0 Y as. (Y € £}), alln, we have E {lim inf, .5. X;,} < liminf, ... E{X,}. In particular if X, > 0 a.s. all n, then E{lim infy Xn} < lim infy x E{Xn}. (f) (Lebesgue’s dominated convergence theorem): If the r.v.'s Xp converge as. to X and if |Xn| E{X}. The a.s. equality between random variables is clearly an equivalence rela- tion, and two equivalent (i.e. almost surely equal) random variables have the same expectation: thus one can define a space L! by considering “C! modulo this equivalence relation”. In other words, an element of L? is an equivalence class, that is a collection of all r.v. in £! which are pairwise a.s. equal. In view of (c) above, one may speak of the “expectation” of this equivalence class (which is the expectation of any one element belonging to this class). Since further the addition of random variables or the product of a rv. by a constant preserve a.s. equality, the set L! is also a vector space. Therefore we commit the (innocuous) abuse of identifying a r.v. with its equivalence class, and commonly write X € L! instead of X € £}, If 1 < p < 0x, we define L? to be the space of r.v.’s such that |X|? € £}, L® is defined analogously to L}, That is, L? is £? modulo the equivalence relation “almost surely”. Put more simply, two elements of £? that are a.s. equal are considered to be representatives of one element of L?. We will use in this book only the spaces L! and L? (that is p = 1 or 2). Before proceeding to the proof of Theorem 9.1 itself, we show two auxiliary results. Result 1. For every positive r.v. X there exists a sequence (Xp)n>1 of pos- itive simple r.v.’s which increases toward X as n increases to infinity. An example of such a sequence is given by Xg(w) = [REM EAI" < X (we) < (412 and OS bn" — 1, AMO) = 1 XW) Er (9.5) Result 2. If X is a positive r.v., and if (Xp)n>1 is any sequence of positive simple r.v.’s increasing to X, then E{X,} increases to E{X}. To see this, observe first that the sequence E{X,,} increases to a limit a, which satisfies a < E{X} by (9.3). To obtain that indeed a = E{X}, and in view of (9.3) again, it is clearly enough to prove that if Y is a simple r.v. such that 0 < Y < X, then E{Y} < a. The variable Y takes on m different values, say a,...,@m, and set Aj, = {Y = ax}. Choose € € (0,1]. The rv. Yne = (1—8)¥1ta-ev X,}. Furthermore it is obvious that Y;,-< < X;, hence using (9.2), we obtain 54 9. Integration with Respect to a Probability Measure E{¥ne} = (1—@) So axP (Anns) S E{Xn}. (9.6) k=1 Now recall that Y < lim, X,,. hence (1 — €)¥ < lim, X, as soon as Y > 0. hence clearly Aj.n,z — A, as n — oo. An application of Theorem 2.4 yields P(Ag.n.c) > P(Ag). hence taking the limit in (9.6) gives (l= 2) S7 ag P(Ay) = (1-2) E{Y} 0, Y > 0. Let A = {w: X(w) # Y(w)} = {X AY}. Then P(A) = 0. Also, FAY} = F{Y1a + Ylach} = E{Y1a} + E{¥ 1a} = BLY 14} + {X14}. Let Y, be simple and Y, increase to Y. Then Y,14 are simple and Y,14 increase to Y1,4 too. Since Y,, is simple it is bounded, say by N. Then 0 < E{Ynla} < B{N14} = NP(A) = 0. Therefore E{Y14} = 0. Analogously, E{X1,} = 0. Finally we have PY} = PAV 14} + E{X1 ac} = 04 B{X1 gc} = B{X 1 ac} + E{X 14} = E{X}. We conclude by noting that if Y = X a.s., then also Y* = Xt as. 
and Y~ = X~ a.s., and (c) follows. (d) For each fixed n choose an increasing sequence Y,,,. k = 1,2,3,... of positive simple r.v.’s increasing to X;,, (Result 1), and set 9. Integration with Respect to a Probability Measure 55, Zp = Max Ynip- nsk Then (Z,),>1 is a non-decreasing sequence of positive simple r.v.’s. and thus it has a limit Z = limg_.x Z,. Also Yn S Ze S Xp SX as. fornsk which implies that Xn 0. X, € £}, and Eflim inf, ss. Xn }< lim infy 00 E{Xp, } if and only if Elim inf, +o X,} < liminf,,... E{X,}, because lim inf X,, = (lim inf X,,) — n= 00 nao Therefore without loss of generality we assume X,, > 0 a.s., each n. Set Y;, = infy>, X,. Then Y, are also random variables and form a non- decreasing sequence. Moreover lim Y, = lim inf X,,. n= no Since Xp > Yn, we have E{Xp} > E{¥q}, whence liminf E{X,} > lim B(Y} = B{ lim, ¥,} = Ef{liminf X,} by the Monotone Convergence Theorem (part (d) of this theorem). (f) Set U = liminf,.x X, and V = limsup,_... Xn- By hypothesis U=V =X as. We also have |X,| < Y as., hence |X| < Y as well, hence X, and X are integrable. On the one hand X, > —Y as. and —Y € L}, so Fatou’s lemma (e) yields 56 9. Integration with Respect to a Probability Measure E{U} < liminf B{X,}. me We also have —X,, > —Y a.s. and —V = lim inf, ... —Xn, so another appli- cation of Fatou’s lemma yields ~B{V} = B{-V} 2 liminf E{—Xn} = —limsup E{Xn}. Putting together these two inequalities and applying (c) yields E{X} = E{U} < liminf E{X,} < limsup E{Xn} S E{V} = E{X}- This completes the proof. oO A useful consequence of Lebesgue’s Dominated Convergence Theorem (Theorem 9.1(f)) is the next result which allows us to interchange summa- tion and expectation. Since an infinite series is a limit of partial sums and an expectation is also a limit, the interchange of expectation and summation amounts to changing the order of taking two limits. Theorem 9.2. Let X,, be a sequence of random variables. (a) If the Xp.’s are all positive, then {Sx} = Soe Gh (9.7) n=l both sides being simultaneously finite or infinite. (b) If TO, E{|Xn|} < 00, then °°, Xp converges a.s. and the sum of this series is integrable and moreover (9.7) holds. Proof. Let Sn = 7p |Xn| and Ty = S3p_y Xz. Then PLS} = e{y pal} = eX}. k=l k=l and the sequence S,, clearly increases to the limit $ = 0%, |Xq| (which may be finite for some values of w and infinite for others). Therefore by the Monotone Convergence Theorem (Theorem 9.1(d)) we have: {5} = im B{5,} = 2 E(|Xil} < oe. k=l If all X,,’s are positive, then S,, = T, and this proves (a). If the X,’s are not necessarily positive, but 0%; H{|Xn|} < oc, we deduce also that H{S} < oo. 9. Integration with Respect to a Probability Measure 57 Now. for every € > 0 we have 1ys.} < €S. hence P(S = 00) = E{lis—coy} S EE {S}. Then E{S} < oo and since the choice of ¢ is arbitrary, we have that P(S = 2c) = 0: we deduce that }77_, X; is an absolutely convergent series a.s. and its sum, say T, is the limit of the sequence T,,. Moreover [Th] < Sn <8 and § is in L}. Hence by the Dominated Convergence Theorem (Theo- rem 9.1(f)) we have that ofS xi) = P{ lim Ta} = E{T}, k=1 J which is (9.7). a Recall that L1 and L? are the sets of equivalence classes of integrable (resp. square-integrable) random variables for the a.s. equivalence relation. Theorem 9.3. a) If X,Y € L?, we have XY € L} and the Cauchy-Schwarz inequality: EAXY}| < VERE}, b) We have L? CL}, and if X € L, then E{X}? < E{X?}; c) The space L? is a linear space, i.e. if X,Y € L? and a,3 € R, then aX + BY € L? 
(we will see in Chapter 22 that in fact L? is a Hilbert space). Proof. (a) We have |XY| < X?/24+¥?/2, hence X,Y € L? implies XY € L. For every z € R we have 0< E{(aX +Y)*} = 2? E{X*} + 2c B{XY} + E{Y?}. (9.8) The discriminant of the quadratic equation in z given in (9.8) is and since the equation is always nonnegative, {XY} ~ B{X?}B{Y?} <0, which gives the Cauchy-Schwarz inequality. (b) Let X € L?. Since X = X-1 and since the function equal to 1 identically obviously belongs to L? with E{1?} = 1, the claim follows readily from (a). (c) Let X.Y € L*, Then for constants a, 3, (aX + BY)? < a®X?/24+ G’Y?/2 is integrable and aX + GY € L* and L? is a vector space. Oo 58 9. Integration with Respect to a Probability Measure If X € L?. the variance of X. written 9?(X) or 0%. is Var(X) = 0?(X) = E{(X — E{X})?}. (Note that X € L? + X € L’. so E{X} exists.) Let p = E{X}. Then Var(X) = E((X ~ p)?} = F(X?) ~ 2 BX} + 1? = E{X*}— 22 + p? = E{X*}—,2. Thus we have as well the trivial but nonetheless very useful equality: o?(X) = E{X?}— E{X}. Theorem 9.4 (Chebyshev’s Inequality). P{|X|> a} < FAAS a Proof. Since a71,)x)>a} < X?. we have pl < E{X?}. or a2 P(|X| > a) < B{X?}: and dividing by a? gives the result. oO Chebyshev's inequality is also known as the Bienaymé-Chebyshev inequal- ity, and often is written equivalently as PIX — E(X}| >a} <2 oe) The next theorem is useful; both Theorem 9.5 and Corollary 9.1 we call the Expectation Rule. as they are vital tools for calculating expectations. It shows in particular that the expectation of a r.v. depends only on its distribution. Theorem 9.5 (Expectation Rule). Let X be a r.v. on (2,A.P), with values in (E.€), and distribution PX. Let h:(E,€) > (R, B) be measurable. a) We have h(X) € £1(2,A, P) if and only if h € L1(E.E, P*). b) If either h is positive, or if it satisfies the equivalent conditions in (a), we have: E{h(X)} = [rcyr* an. (9.9) Proof. Recall that the distribution measure P* is defined by P*(B) = P(X~1(B)), Therefore 9. Integration with Respect to a Probability Measure 59 E{1p(X)} = P(X71(B)) = PX(B) = / Lp(2)P* (dz). Thus if h is simple, (9.9) holds by the above and linearity. If h is positive, let h, be simple, positive and increase to h, Then E{h(X)} = E{ lim h,,(X)} n° = lim E{hn(X)} noe lim. / hy (x) P* (dx) / im hy (x) P* (dx) = [royP* aa) t where we have used the Monotone Convergence Theorem twice. This proves (b) when h is positive, and applied to |h| it also proves (a) (recalling that a t.v. belongs to C? if and only if the expectation of its absolute value is finite). If h is not positive. we write h = h* — h~ and deduce the result by subtraction, oO The next result can be proved as a consequence of Theorem 9.5. but we prove it in Chapter 11 (Corollary 11.1) so we omit its proof here. Corollary 9.1 (Expectation Rule). Suppose X is a random variable that has a density f. (That is, F(w) = P(X <2) and F(z) = f?_, f(u)du,—c0 < at <0.) If E{|h(X)|} < 0c or if h is positive, then E{h(X)} = f h(x) f(x)de. Examples: 1. Let X be exponential with parameter a. Then O° E{h(X)} = [ h(x)ae~°* da. ‘0 In particular, if h(x) = x, we have oe 1 E{X}= [ ane" dr = —, lo a by integration by parts. Thus the mean of an exponential random variable is 1/a. 2. Let X be normal (or Gaussian) with parameters (1,07). Then E{X} = p4, since i 1 E{X\= (ep)? /207 gy = [ager 60 9. Integration with Respect to a Probability Measure To see this, let y= 2 ~ pw; then « = y+ py, and x 1 2 (29? E{X}= wea {X} [sete y 2° md 2/992 1 2 992 = ye VO dy + [ eV 20" dy. I. 
Vino” a 0 for all z > 0 and >2z > gf for x > 1. That E{X~} = 00 is proved similarly. Exercises 61 Exercises for Chapters 8 and 9 Let X : (92..A) > (R, B) be ar-v. Let F={A:A=X7}(B), some B € B} = X71(B). Show that X is measurable as a function from (2, F) to (R, B). 9.2 * Let (2.4, P) be a probability space, and let F and G be two o-algebras on §2. Suppose F C A and G C A (we say in this case that F and G are sub a-algebras of A). The o-algebras F and G are independent if for any A € F, any B € G. P(AN B) = P(A)P(B). Suppose F and G are independent, and ar.v. X is measurable from both (2, F) to (R, B) and from (2, G) to (R, B). Show that X is a.s. constant; that is, P(X = c) = 1 for some constant c. 9.3* Given (2,4, P), let AU = {AUN:A € AN € NV}, where MV’ are the null sets (as in Theorem 6.4). Suppose X = Y a.s. where X and Y are two real-valued functions on 2. Show that X: ((2,A") — (R, B) is measurable if and only if Y: (2,.A4’) — (R, B) is measurable. 9.4* Let X € £! on (9,.A, P) and let A, be a sequence of events such that limp oc P(An) = 0. Show that lim, E{X14,,} = 0. (Caution: We are not assuming that lim, X14, = 0 a.s.) 9.5* Given (2,A, P), suppose X is a r.v. with X > 0 as. and E{X} = 1. Define Q : A > R by Q(A) = E{X14}. Show that Q defines a probability measure on ({2,.A). 9.6 For Q as in Exercise 9.5, show that if P(A) = 0, then Q(A) = 0. Give an example that shows that Q(A) = 0 does not in general imply P(A) = 0. 9.7 * For Q as in Exercise 9.5, suppose also P(X > 0) = 1. Let Eg denote expectation with respect to Q. Show that Eg{Y} = Ep{Y X}. 9.8 Let Q be as in Exercise 9.5, and suppose that P(X > 0) = 1. (a) Show that + is integrable for Q. (b) Define R:A > R by R(A) = Ee{ probability measure P of Exercise 9. 9.9 Let Q be as in Exercise 9.8. Show that Q(A) = 0 implies P(A) = 0 (compare with Exercise 9.6). La}. Show that R is exactly the (Hint: Use Exercise 9.7.) 9.10\Let X be uniform over (a,b). Show that E{X} = 244. 9.11 Let X be an integrable r.v. with density f(c). and let p = E{X}. Show that. oo Var(X) = 0?(X) = / (a — pw)? f(a)de. 62 9. Integration with Respect to a Probability Measure 9.12 Let X be uniform over (a.}). Show that 02(X) = 452, 9.18 Let X be Cauchy with density 1... Show that o2(X) is not mF (@—ay) defined, and E{X?} = oc. 9.14 The beta function is B(r, s) = eee. where I’ is the gamma function. Equivalently 1 B(r.s) = [ rots dt (r>0.s>0). oO X is said to have a beta distribution if the density f of its distribution measure is a1 -2)s 7) if0<2r <1, f(a) = B(r,s) —s 0 ife 1. Show that for X having a beta distribution with parameters (r,s) (r > 0.8 > 0), then B(r+k,s)_ P(r+hI(r +) E{x* =e { Bir. s) I(r)P(r+s+h)? for k > 0. Deduce that > r EX} = r+s" o(X)= i (+ 92r+s+1) The beta distribution is a rich family of distributions on the interval [0. 1). It is often used to model random proportions. 9.15 Let X have a lognormal distribution with parameters (1,07). Show that E(XT} = ert dere and deduce that E{X} = e437 and o% = e2#+ (e? —1). (Hint: E{X"} = I a” f(a)dx where f is the lognormal density; make the change of variables y = log(«) — p to obtain E{x" [. 1 (rutry—y? 207) g ) = e . noo VOR 4 9.16 The gamma distribution is often simplified to a one parameter distribu- tion. A r.v. X is said to have the standard gamma distribution with parameter @ if the density of its distribution measure is given by goerle-® fie)=) Tay 12° 0 ifa <0. Exercises 63 That is. 3 = 1. (Recall P'(a) = f° t°-1e~tdt.) Show that for X standard gainma with parameter a. 
then = Pet Gs), FO =F ke Deduce that X has mean @ and also variance a. 9.17 * Let X be a nonnegative r.v. with mean jz and variance o”, both finite. Show that for any b > 0, P{X > p+bo} < — (Hini: Consider the function g(x) = tease and that E{((X — p)b+ o)?} = 0%(b? + 1).) 9.18 Let X be ar.v. with mean and variance o?, both finite. Show that P{u-do 4. (Note that this is interesting only for d > 1.) 9.19 Let X be normal (or Gaussian) with parameters = 0 and o? = 1. Show that P(X > 2) < te e7 }*", for x > 0. 9.20 Let X be an exponential r.v.. Show that P{X > s+t| X > s} P{X > t} for s > 0, i > 0. This is known as the “memoryless property” o} the exponential. g, 9.21 * Let X be ar-.y. with the property that P{X > s+t| X > s} = P{X > t}. Show that if h(t) = P{X > t}, then h satisfies Cauchy's equation: h(s +t) = h(s)h(t) (s >0.t> 0) and show that X is exponentially distributed (Hint: use the fact that h is continuous from the right, so Cauchy’s equation can be solved). 9.22 Let a be an integer and suppose X has distribution Gamma (a, 3). Show that P(X < xz) = P(Y > a), where Y is Poisson with parameter = «§. (Hint: Recall (a) = (a— 1)! and write down P(X < 2), and then use integration by parts with u = £°—? and du = e7~‘/9dt.) 9.23 The Hazard Rate of a nonnegative random variable X is defined by <3 > hx(t) = lim Pitt) 20 € when the limit exists. The hazard rate can be thought of as the probability that an object does not survive an infinitesimal amount of time after time tf. The memoryless property of the exponential gives rise to a constant rate. A Weibull random variable can be used as well to model lifetimes. Show that: 64 9. Integration with Respect to a Probability Measure a) If X is exponential (\), then its hazard rate is hx (t) = A; b) If X is Weibull (a, 3), then its hazard rate is hx(t) = aGt0}. 9.24 A positive random variable X has the logistic distribution if its distri- bution function is given by 1 F(a) = P(X $2) = aa (a >0), for parameters (j1, 3), 3 > 0. a) Show that if 1 = 0 and 9 = 1, then a density for X is given by en? f= Gy b) Show that if X has a logistic distribution with parameters (1, 3), then X has a hazard rate and it is given by hx(t) = (G) F(t). 10. Independent Random Variables Recall that two events A and B are independent if knowledge that B has occurred does not change the probability that A will occur: that is, P(A | B) = P(A). This of course is algebraically equivalent to the statement P(AN B) = P(A)P(B). The latter expression generalizes easily to a finite number of events: Ay,..., A, are independent if P(N;=7A;) = Tes P(Aj), for every subset J of {1,...,n} (see Definition 3.1). For two random variables X and Y to be independent we want knowledge of Y to leave unchanged the probabilities that X will take on certain values, which roughly speaking means that the events {X € A} and {Y € B} are independent for any choice of A and B in the o-algebras of the state space of X and Y. This is more easily expressed in terms of the o-algebras generated by X and Y: Recall that if X:(2,A) — (B,€), then X~1(€) is a sub o- algebra of A, called the o-algebra generated by X. Definition 10.1. a) Sub o-algebras (A;)icr of A, are independent if for every finite subset J of I, and all A; € Aj, one has P (Nic sAi) = T] P(Ad- ied b) Random variables (X;)ier, with values in (Ej, &), are independent if the generated o-algebras X;1(E;) are independent. We will next, for notational simplicity, consider only pairs (X,Y) of ran- dom variables. 
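Since the definition will be used below mainly for pairs, here is a small simulation sketch (not part of the text) of Definition 10.1 in the form of condition (a) of Theorem 10.1 below: for an independent pair the empirical value of P(X in A, Y in B) is close to the product P(X in A)P(Y in B), while for a dependent pair it is not. The distributions, the events A and B, and the sample size are arbitrary illustrative choices.

    # Simulation sketch of the product rule P(X in A, Y in B) = P(X in A)P(Y in B).
    # All concrete choices (laws, events A and B, sample size) are hypothetical.
    import random

    random.seed(0)
    N = 200_000

    def estimate(pair_sampler):
        # Returns (empirical P(X in A, Y in B), empirical P(X in A) * P(Y in B))
        # with A = [0, 1] and B = [0, 1/2].
        both = in_a = in_b = 0
        for _ in range(N):
            x, y = pair_sampler()
            a = 0.0 <= x <= 1.0
            b = 0.0 <= y <= 0.5
            both += a and b
            in_a += a
            in_b += b
        return both / N, (in_a / N) * (in_b / N)

    def independent_pair():
        return random.gauss(0, 1), random.random()   # X ~ N(0,1), Y ~ U(0,1), independent

    def dependent_pair():
        x = random.gauss(0, 1)
        return x, x * x                              # Y = X^2 is a function of X

    print(estimate(independent_pair))   # the two estimates nearly agree
    print(estimate(dependent_pair))     # the two estimates differ markedly

The simulation of course proves nothing; it merely illustrates, on one pair of events, the equality that the definition requires for all events.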
However the results extend without difficulty to finite families of r.v.’s. Note that X and Y are not required to take values in the same space: X can take its values in (E,€) and Y in (F,F). Theorem 10.1. In order for X andY to be independent, it is necessary and sufficient to have any one of the following conditions holding: a) P(X © AY € B) = P(X € A)P(Y €B) forall ACE, BEF; b) P(X € AY € B) = P(X € A)P(Y €B) for all ACC, B ED, where C and D are respectively classes of sets stable under finite intersections which generate € and F; 66 10. Independent Random Variables c) f(X) and g(Y) are independent for each pair (f.g) of measurable func- tions; d) ELf(X)g(¥)} = ECLFOQ}E(G(Y)} for each pair (f.9) of functions bounded measurable. or positive measurable. e) Let E and F be metric a and let E. F be their Borel o-algebras. Then E{ f(X)g(¥)} = E{f(X)}E{g(¥)} for each pair (f.g) of bounded, continuous functions. Proof. (a) This is a restatement of the definition. since X~1(€) is exactly all events of the form {X € A}. for A € E. (a)=>(b): This is trivial since C C € and DCF. (a)=>(b): This is evident. (b)=>(a): The collection of sets A € € that verifies P(X € A,Y € B) = P(X € A)P(Y € B) for a given B € D is closed under increasing limits and by difference and it contains the class C by hypothesis, and this class C is closed by intersection. So the Monotone Class Theorem 6.2 yields that this collection is in fact € itself. In other words. Assumption (b) is satisfied with C = €. Then analogously by fixing A € € and letting J = {B € F:P(X € A,Y € B) = P(X € A)P(Y € B)}. we have J D o(D) and thus J = F. (c)=>(a): We need only to take f(x) = x and g(y) = y. (a)=+(c): Given f and g, note that AX) ME) = XP ME) CXTME). Also. g(¥)71(F) C Y~}(F), and since X~ (é), and Y~1(F) are independent. the two sub o-algebras f(X)~1(E) and g(Y)~1(F) will also be. (d)=+(a): Take f(x) = ta 2) and g(y) = 1p(y)- (a)=>(d): We have (d) holds for indicator functions. and thus for simple functions (ie.. f(@) = Thala, (x)) by linearity. If f and g are positive. let f, and g, be simple positive functions increasing to f and g respectively. Observe that the products fn(X)gn(Y) increase to f(X)g(Y). Then BUS (X)g(W)} = B (him, fa X)gu(V)} = fim, BE fal Xou(¥)} = Jaw EC fal X) ECan ¥)} = BUO}ELAY)} by the monotone convergence theorem. This gives the result when f and g are positive. When f and g are bounded we write f = f* — f~ and g = gt -g7 and we conclude by linearity. (d)(e): This is evident. (d)=>(b): It is enough to prove (b) when C and D are the classes of all closed sets of E and F (these classes are stable by intersection). Let for example A be a closed subset of EF. If d(a, A) denotes the distance between the point x and the set A. then f,(%) = min(1,nd(z. A)) is continuous. it satisfies 0 < f,, < 1, and the sequence (f,,) decreases to the indicator function 1,4. Similarly with B a closed subset of F we associate continuous functions 10. Independent Random Variables 67 Qn decreasing to 1g and having 0 < g, < 1. Then it suffices to reproduce the proof of the implication (a)=>(d). substituting the monotone convergence theorem for the dominated convergence theorem. o Example: Let E and F be finite or countable. For the couple (X,Y) let PAY = P(X =i.Y = j) = P({w:X(w) =iand Y = j}) =P((X=iN(¥ =5)}. Then X and Y are independent if and only if PXY = PX PY. as a conse- quence of Theorem 10.1. We present more examples in Chapter 12. We now wish to discuss “jointly measurable” functions. In general. 
if € and F are each o-algebras on spaces E and F respectively. then the Cartesian product Ex F = {AC ExF:A= AxI,A€ €andT € F} is notao-algebra on E x F., Consequently we write o(€ x F) to denote the smallest o-algebra on E x F generated by € x F. Such a construct is common, and we give it a special notation: €F =o(€xF). Theorem 10.2. Let f be measurable: (E x F.E & F) — (R,R). For each x € E (resp. y € F), the “section” y > f(a,y) (resp. « > f(x,y)) is an F-measurable (resp. E-measurable) function. Note: The converse to Theorem 10.2 is false in general. Proof. First assume f is of the form f(x,y) = 1c(«.y). for C € € & F. Let H = {C € E@F:y > 1c(x.y) is F-measurable for each fixed « € E}. Then H is a o-algebra and H contains € x F, hence o(€ x F) C H. But by construction H C o(€ x F), so we have H = € & F. Thus we have the result for indicators and hence also for simple functions by linearity. If f is positive, let fy be simple functions increasing to f. Then gn(y) = fn(a,y) for « fixed is F-measurable for each n, and since g(y) = lim gn(y) = fla,y). and since the limit of measurable functions is measurable. we have the result for f. Finally if f is arbitrary, take f = f+ — f7, and since the result holds for f+ and f~, it holds as well for f because the difference of two measurable functions is measurable. o Theorem 10.3 (Tonelli-Fubini). Let P and Q be two probabilities on (E,€) and (F,F) respectively. a) Define R(A x B) = P(A)Q(B), for A€ € and B € F. Then R extends uniquely to a probability on (Ex FE 2F), written PQ. 68 10. Independent Random Variables b) For each function f that is E © F-measurable, positive, or integrable with respect to P ®Q, the function « — f f(x,y)Q(dy) is E-measurable, the function y > f f(x,y)P(dx) is F-measurable and [rersa= [{ fe. necan} Prac) -/ { / sa.) Pda) b aca. Proof. (a) Let C € € & F, and let us write C(x) = {y:(x,y) € C}. If C = Ax B, we have in this case C(x) = B if « € A and C(x) = @ otherwise. hence: RC) = P@Q(C) = P(A)Q(B) = [ Paxatcc). Let H = {C € E@F:4 > Q[C(x)] is €-measurable}. Then H is closed under increasing limit and differences, while € x F CH C E@F, whence H = E®F by the monotone class theorem. For each C € H = €®F, we can now define (since Q[C(x)] is measurable and positive) RC) = f Pldz)Q{CWw)). We need to show R is a probability measure. We have R(2) = RE x F) =f. P(dz)Q[F] = 1. E Let C, € EF be pairwise disjoint and set C = U7{1Cn. Then since the C;,() also are pairwise disjoint and since Q is a probability measure, Q(C(2)] = S32) Q[Cn(x)]. Apply Theorem 9.2 to the probability measure P and to the functions f,(x) = Q[Cn(2)], to obtain YH) =O f ae = [Xo soar n=l = [ Panaicte)) = RC). Thus R is a probability measure. The uniqueness of R follows from Corol- lary 6.1. (b) Note that we have already established part (b) in our proof of (a) for functions f of the form f(x,y) = 1e(a,y). C € € ® F. The result follows for positive simple functions by linearity. If f is positive, € ® F-measurable, let fn be simple functions increasing to f. Then 10. Independent Random Variables 69 En(f) = jim Betfa) = im. f { f tatenQtau)} Peas) But « > f fn(z,y)Q(dy) are functions that increase to x > f f(x,y)Q(dy), hence by the monotone convergence theorem = [ {im txenaran} Pea, and again by monotone convergence = ff sm tue naan} Peder = f { f 102,nean} Pee An analogous argument gives = [{f renrcas)} aan. Finally for general f it suffices to take f = f+ — f~ and the result follows. oO Corollary 10.1. Let X and Y be two r.v. on (2,A,P), with values in (E. 
€) and (F,F) respectively. The pair Z = (X.Y) is a r.v. with values in (E x F,E QF), and the r.v.’s X,Y are independent if and only if the distribution P‘\XY) of the couple (X,Y) equals the product PX @ PY of the distributions of X andY. Proof. Since Z~1(A x B) = X—1(A) NY~1(B) belongs to A as soon as A € € and B € F, the measurability of Z follows from the definition of the product o-algebra € ® F and from Theorem 8.1. X and Y are independent iff for all A € € and B € F, we have P((X,Y) € Ax B) = P(X € A)P(Y € B), or equivalently . P®Y)(4 x B) = PX(A)PY(B). This is equivalent to saying that P(*-¥0(A x B) = (P* @ PY)(A x B) for all Ax B € EF), which by the uniqueness in Fubini’s theorem is in turn equivalent to the fact that PY) = P¥ @ PY on €@F. Oo We digress slightly to discuss the construction of a model with independent random variables. Let 4 be a probability measure on (E,€). It is easy to . with values in E, whose distribution measure is ju: simply take 2 = E; A= €; P = p; and let X be the identity: X(«) = ¢. Slightly more complicated is the construction of two independent random variables, X and Y, with values in (E£,€), (F,F), and given distribution 70 10. Independent Random Variables measures 1 and v. We can do this as follows: take Q = Ex F: A= ESF: P=p&v, and X(«.y) =x: Y(x,y) = y. where (t.y) € Ex F. Significantly more complicated. but very important for applications. is to construct an infinite sequence of independent random variables of given distributions. Specifically, for each n let X,, be defined on (Q,.A,Pn). and let us set co Q=T] 2, — (countable Cartesian product) n=l A=QAn nal where ®92,A, denotes the smallest o-algebra on 2 generated by all sets of the form Ay x Ag x... x Ag X Qpa1 X Oppo X..., ALE A k=1,2.3,.... That is, A is the smallest o-algebra generated by finite Cartesian products of sets from the coordinate o-algebras. The next theorem is from general measure theory, and can be considered a (non trivial) extension of Fubini’s theorem. We state it without proof. Theorem 10.4. Given (n.An,Pn) probability spaces and 2 = TTp_y Qn: A= 2%1An, then there exists a probability P on (2,A), and it is unique, such that k P(Ay x Ap x... Ap X Qh X Qe -.) = Tp A(40 i=1 for allk = 1.2,... and A; € Aj. For X;, defined on (@,,An,Pn) as in Theorem 10.4, let X, denote its natural extension to 2 as follows: for w € 2, let w = with w; € 2. each i. Then Xn(w) = Xn(wn): Corollary 10.2. Let X, be defined on (Qn.An. Px), each n, and let Xn be its natural extension to (2,A.P) as given above. Then (Xn )no1 are all independent. and the law of X, on (Q..A,P) is identical to the law of Xp on (2n,An: Pr): Proof. We have XZ(Bp) = 2X 00. Qn x XTBn) X Qnar x Qnae X + and by Theorem 10.4 we have for k = 1.2,...: 10. Independent Random Variables 71 P (AK, Xq"(Bn)) = P (XP"(Br) x... Xp 1 (Br) X Qear x...) k = [[ PX, € B,), n=1 and the result follows. oO Next we wish to discuss some significant properties of independence. Let A, be a sequence of events in A. We define: limsup Ay = R21 (Um>n4m) = lim (UmanAm)- noo This event can be interpreted probabilistically as: limsup A, = “A, occurs infinitely often”, n—00 which means that A, occurs for an infinite number of n. This is often abbre- viated “i.o.”. and thus we have: lim sup An = {An i.0.}. n Theorem 10.5 (Borel-Cantelli). Let A, be a sequence of events in (2,A, P). a) If 0, P(An) < x. then P(Ay i.0.) = 0. b) If P(A, io.) 
= 0 and if the A,’s are mutually independent, then So P(An) < o- Note: An alternative statement to (b) is: if 4, are mutually independent events, and if 7°, P(An) = oc, then P(A, i.o.) = 1. Hence for mutually independent events An, and since the sum )>,, P(A,) has to be either finite or infinite, the event { A, i.o.} has probability either 0 or 1; this is a particular case of the so-called zero-one law to be seen below. Proof. (a) Let an = P(An) = E{1a,}- By Theorem 9.2(b) 3%, an < 90 implies {7 14, < oc as. On the other hand, 7*, 14,(w) = oo if and only if w € limsup,.,. An. Thus we have (a). (b) Suppose now the A,’s are mutually independent. Then P(limsup A,) = lim lim P (US,_,, Am) n—s0 nc k00 lim lim (1~P (Nk,_,, AS) im=n Am noc k—90 a k eae Tes ( He- Pn) m=n by independence; 72 10. Independent Random Variables k =1> dy, fm, TEC ~ on) men where a, = P(Am). By hypothesis P(lim sup, An) = 0, 80 limy-soo limp oe [Then (1 — am) = 1. Therefore by taking logarithms we have k lim lim > log(1 ~ am) = 0, m0 k—+100 m=n or tim, YF log(1 — am) = 0, man which means that 57,,, log(1—a,) is a convergent series. Since |log(1—a)| > @ for 0< @ <1, we have that 3°, am is convergent as well. Qo Let now X,, be r.v.’s all defined on (§2,A, P). Define the o-algebras By = 0(Xn) Cr = 0 (Up>nBp) Coo = WE Cn Coo is called the tail o-algebra. Theorem 10.6 (Zero-one law). Let X,, be independent r.v.’s, all defined on (2,A,P), and let Co. be the corresponding tail c-algebra. If C € Coo, then P(C)=0or1. Proof. Let Dn = o(UpenBy). By the hypothesis, C,, and D,, are independent, hence if A €C,, BE Dy, then P(ANB) = P(A)P(B). (10.1) If A € Co. we hence have (10.1) for all B € UD,, hence also for all B€ D= a(UD,), by the Monotone Class Theorem (Theorem 6.2). However Coo CD, whence we have (10.1) for B = A € Co, which implies P(A) = P(A)P(A) = P(A)?, hence P(A) = 0 or 1. a Consequences: 1. {w: limn sco Xn(w) exists} € Coo, therefore X,, either converges a.s. or it diverges a.s. 2. Each r.v. which is C,, measurable is a.s. constant. In particular, lim sup X,,, lim inf X,,, a n=00 lim suy 1 X, li int | X, sup — > : m inf — > oo a ma? n—0o 1 ? Pen psn are all a.s. constant. (Recall we are still assuming that X, is a sequence of independent r.v.’s) Exercises 73 Exercises for Chapter 10 10.1 Let f = (ft. f2):@ 2 Ex F. Show that f:(2,A) > (Ex FE @F) is measurable if and only if fy is measurable from ({2,4) to (E,€) and fy is measurable from (§2..A) to (F,F). 10.2 Let R? = R x R, and let B? be the Borel sets of R?, while B denotes the Borel sets of R. Show that B? = B® B. 10.3 Let 2 = [0,1], A be the Borel sets of (0, 1], and let P(A) = f La(a)dr for A € A. Let X(x) = x. Show that X has the uniform distribution. 10.4 Let 2 =Rand A = B. Let P be given by P(A) = Te fla(ejer® Pde. Let X(x) = 2. Show that X has anormal distribution with parameters = 0 and o? = 1. 10.5 Construct an example to show that E{XY} = E{X}E{Y} does not imply in general that X and Y are independent r.v.’s (we assume X,Y and XY are all in L1). 10.6 Let X,Y be independent random variables taking values in N with : , . P(X=)=PY=)=5 G=L2Q..). Find the following probabilities: a) P(min(X,Y) X) [Ans Dys0 xacy] d) P(X divides Y) [Ans.: 3] e) P(X > kY) for a given positive integer k [Ans.: 54-4] 10.7 Let X,Y be independent geometric random variables with parameters Nand yw. Let Z = min(X,Y). Show Z is geometric and find its parame- ter. [Ans: Ay..] 10.8 Let X.Y € L?. Define the covariance of X and Y as Cov(X,Y) = B{(X ~ w(¥ —)} where E{X} =p and E{Y} = v. 
Show that Cov(X,Y) = E{XY} - pw and show further that X and Y independent implies Cov(X,.Y) =0. 10.9 Let X,Y € L’. If X and Y are independent, show that XY € L’. Give an example to show XY need not be in L! in general (i.e., if X and Y are not independent). 74 10. Independent Random Variables 10.10 * Let n be a prime number greater than 2: and let X.Y be independent and uniformly distributed on {0.1..... n— 1}. (That is. P(X =i) = P(Y i) = 2. for i = 0.1.....n ~1.) For each r. 0 < r < n~1. define Z, = X 4r¥ (mod n). a) Show that the rv.’s {Z,:0 limsup P(An)- m0 10.13 A sequence of r.v.’s X,, X2,... is said to be completely convergent to X if ce Yo PUK, — X| > 2) 0. n=1 Show that if the sequence X,, is independent then complete convergence is equivalent to convergence a.s. 10.14 Let y,v be two finite measures on (E.€). (F, F), respectively. i.e. they satisfy all axioms of probability measures except that u(E) and v(F) are positive reals, but not necessarily equal to 1. Let A= pSv on (Ex F.E& F) be defined by A(A x B) = p(A)v(B) for Cartesian products A x B (A € €, BeéF). a) Show that \ extends to a finite measure defined on € @ F: b) Let f : Ex F — R be measurable. Prove Fubini’s Theorem: if f is A-integrable, then « > f f(x.y)v(dy) and y > f f(x. y)u(dz) are re- spectively € and ¥ measurable. and moreover [tars [fe dudenay = ff te. vetasyutan). (Hint: Use Theorem 10.3.) 10.15 * A measure r is called o-finite on (G.G) if there exists a sequence of sets (G;)j1. G; € G, such that UX,G; = G and r(G;) < oe, each j. Show that if 4,7 are assumed to be o-finite and assuming that \ = ps & v exists. then a) A= @v is o-finite: and Exercises 7 b) (Fubini’s Theorem): If f:E x F — R is measurable and ,-integrable. then x > f f(x,y)u(dy) and y — J f(x. y)u(dx) are respectively € and F measurable, and moreover faq [f tenudewmay = ff 12nv(autaey. (Hint: Use Exercise 10.14 on sets Bj x Fy, where (Ej) < 2 and v(Fy) < x.) 10.16 * Toss a coin with P(Heads)= p repeatedly. Let A, be the event that k or more consecutive heads occurs amongst the tosses numbered 2*.2* + 1,..., 24+? — 1, Show that P(A, io.) = 1 if p > } and P(A, io.) = 0 if iy. 17 Let Xo. Xi, X2,. - be independent random variables with P(X, = =P(X,=-l)= “all n. Let Z, = HfL Xi. Show that 2), Z2, Z3,... are ecg 10.18 Let X,Y be independent and suppose P(X + Y =a) = 1, where a is a constant. Show that both X and Y are constant random variables. 11. Probability Distributions on R We have already seen that a probability measure P on (R,B) (with B the Borel sets of R) is characterized by its distribution function F(a) = P((-00,]). We now wish to use the tools we have developed to study Lebesgue measure on R. Definition 11.1. Lebesgue measure is a set function m:B — [0,00] that satisfies (i) (countable additivity) if A,, Az. A3.-.. are pairwise disjoint Borel sets, then m (US, Aj) = So mA) i=l (ii) if a,b R, a < b, then m((a,))) Theorem 11.1. Lebesgue measure is unique. Proof. Fix a Ry which clearly has P(R) = 1. Further if Aj. Ag...., Am... are all pairwise disjoint, then P (UE A) = | Fla) lyx,a,pla)de -/ (Svea) de et since the A; are pairwise disjoint: => [ foatnae = Pay i=l i=l by using Theorem 9.2. Therefore we have countable additivity and P is a true probability measure on (R,B). Taking A = (—oo, 2] in (11.4) yields P(-se.2)) = [Fad that is P admits the density f. We now show that P determines f up to a set of Lebesgue measure zero. Suppose f’ is another density for P. 
Then f’ will also satisfy (11.4) (to see this, define P’ by (11.4) with f’ and observe that both P and P’ have the same distribution function, implying that P = P’). Therefore, if we choose e >Oand set A= {a: f(x) +e < f'(x)} and if m(A) > 0, then P(A) +em(a) = [ (F(a) + e)talarde sf F(@)1alede = PLA), a contradiction. We conclude m({f +¢ < f’}) = 0. Since {f +e < f’} increases to {f < f’} as € decreases to 0, we obtain that m({f’ < f}) = 0. Analogously, m({f’ > f}) = 0, hence f’ = f almost everywhere (dm). [Almost everywhere” means except on a set of measure zero; for probability measures we say “almost surely” instead of “almost everywhere” .] oO 80 11. Probability Distributions on R Remark 11.1. Since the density f and the distribution function F satisfy F(x) = f*,. f(y)dy, one is tempted to conclude that F is differentiable, with derivative equal to f. This is true at each point x where f is continuous. One can show — and this is a difficult result due to Lebesgue — that F is differentiable dm-almost everywhere regardless of the nature of f. But this result is an almost everywhere result, and it is not true in general for all x. However in most “concrete” examples, when the density exists it turns out that F’ is piecewise differentiable: in this case one may take f = F’ (the derivative of F’) wherever it exists, and f = 0 elsewhere. Corollary 11.1 (Expectation Rule). Let X be an R-valued r.v. with den- sity f. Let g be a positive Borel measurable function. Then g is integrable (resp. admits an integral) with respect to P*, the distribution measure of X, if and only if the product fg is integrable (resp. admits an integral) with respect to Lebesgue measure, and in this case we have E{g(X)} = [oPXac) = [o@sarae. (115) Proof. The equality (11.5) holds for indicator functions by Theorem 11.3, because it reduces to (11.4). Therefore (11.5) holds for simple functions by linearity. For g nonnegative, let gp, be simple functions increasing to g. Then (11.5) holds by the monotone convergence theorem. For general g, let g = g* —g7, and the result follows by taking differences. Qo We presented examples of densities in Chapter 7. Note that all the exam- ples were continuous or piecewise continuous, while here we seem concerned with Borel measurable densities. Most practical examples of r.v.’s in Statis- tics turn out to have relatively smooth densities, but when we perform simple operations on random variables with nice densities (such as taking a condi- tional expectation), we quickly have need for a much more general theory that includes Borel measurable densities. Let X be a r.v. with density f. Suppose Y = g(X) for some g. Can we express the density of Y (if it exists) in terms of f? We can indeed in some “good” cases. We begin with a simple result: Theorem 11.4. Let X have density fx and let g be a Borel measurable function. Let ¥ = g(X). Then Fy =P 0. Then 11. Probability Distributions on R. 81 1 Fy(u) =P (-$1o«(X) exp(—Ay)) _ fl-e for y >0 ~ lO otherwise. Therefore (cf. Remark 11.1): d Ae ify > 0 ho) = FW) = {3 oe and we see that Y is exponential with parameter \. Caution: The preceding example is deceptively simple because g was injective, or one to one. The general result is given below: Corollary 11.2. Let X have a continuous density fx. Let g:R — R be continuously differentiable with a non-vanishing derivative (hence g is strictly monotone). Let h(y) = g7'(y) be the inverse function (also continuously differentiable). Then Y = 9(X) has the density fy(y) = fx(hy))|h(y)- Proof. 
Suppose g is increasing. Let Fy(y) = P(Y < y). Then Fy(y) = P(g(X) sy) = P(h(g(X)) < hy), since fh is monotone increasing because g is. Then the above gives A(y) = P(X Shy) = Fx(hyy) =f sle)ae. It is a standard result from calculus (see, e.g., [18, p.259]) that if a function g is injective (one-to-one), differentiable, and such that its derivative is never zero, then h = g~} is also differentiable and h'(z) = Ray: Therefore Fy(y) is differentiable and d qiw= F(A(y))n'(y) = F(ACy)|A(Y)|- If g is decreasing the same argument yields afew) = f(ACy)) (hy) = FAG) IAC) |- 82 11. Probability Distributions on R Corollary 11.3. Let X have a continuous density fx. Let g:R > R be piecewise strictly monotone and continuously differentiable: that is. there exist intervals Ty. 12 I,, which partition R such that g is strictly monotone and continuously differentiable on the interior of each I. For each i. g: I; +R is invertible on g(I;) and let hy be the inverse function. Let Y = g(X) and let A= {y:y = g(z).x € R}, the range of g. Then the density fy of Y exists and is given by fly) = Xo fr (hily)HAi(y)|1ay)- Remark: The proof. similar to the proof of the previous corollary, is left to the reader. Our method uses the continuity of fx. but the result holds when fx is simply measurable. Example: Let X be normal with parameters = 0; 0? = 1. Let Y = X?, Then in this case g(a) = x2, which is neither monotone nor injective. Take I, = (0,00) and Ip = (-0.0). Then g is injective and strictly monotone on Ty and Jy. and I; U Ip = R. g(I) = (0.00) and g(Iz) = (0,00). Then hy : (0.00) + R by hi(y) = Vy and hg : [0.90) > R by holy) = — Vy. mcnl = |g for i=1.2. na Therefore by Corollary 11.3, 1 1 1 1 Tee ag ew? 30 OTE Tm aye 79) 1 1 - Ta Vi loon) (y)- The random variable Y is called a x? random variable with one degree of freedom. (This is pronounced “chi square” .) The preceding example is sufficiently simple that it can also be derived “by hand”, without using Corollary 11.3. Indeed, Fy(y) = PUY < y) = P(X? 0) 11. Probability Distributions on R. 83 Similarly. ovi 2 =Fx(-V) = -[ edn, oC us whence d 1 -1 —(-F(- = ee VF ay! (-vy)) Vin yg 1 ay) pty 0): Van 2 (y>0)+ and adding yields the same result as we obtained using the Corollary. Remark: The chi square distribution plays an important role in Statistics. Let p be an integer. Then a random variable X with density 1 = pP/2—-le-F = Topper e 2, 0 0. Show that: fry) = wa S> (fax (haly)) + fx (Fi(y)) Yeaey() — for appropriate functions h; and k;. 11.8* Let X be uniform on (—7,7) and let Y = atan(X), a > 0. Find fy(y)- [Ans: fr(y) = aie : 11.9* Let X have a density, and let Y = ce * 1x50), (a >0,c> 0). Find fy-(y) in terms of fy. [Ans: fi-(y) = @# Ge BEM 1, 9(y),] 11.10 A density f is called symmetric if f(—x) = f(a), for all x. (That is, f is an even function.) A random variable X is symmetric if X and —X both have the same distribution. Suppose X has a density f. Show that X is sym- metric if and only if it has a density f which is symmetric. In this case, does it admit also a nonsymmetric density? [Ans.: Yes, just modify f on a non-empty set of Lebesgue measure zero in R4]. [Note: Examples of symmetric densi- ties are the uniform on (—a.a); the normal with parameters (0,07); double exponential with parameters (0,3); the Cauchy with parameters (0, 3). Exercises 85 11.11 Let X be positive with a density f. Let Y = x4; and find the density for Y. 11.12 Let X be normal with parameters (1,07). Show that Y = e* has a lognormal distribution. 11.13 Let X be ar-y. 
with distribution function F that is continuous. Show that Y = F(X) is uniform. 11.14 Let F be a distribution function that is continuous and is such that the inverse function F'~! exists. Let U be uniform on (0,1). Show that X = F-"(U) has distribution function F. 11.15 * Let F be a continuous distribution function and let U be uniform on (0,1). Define G(u) = inf{x : F(x) > u}. Show that G(U) has distribution function F. 11.16 Let Y = —}In(U), where U is uniform on (0,1). Show that Y is exponential with parameter \ by inverting the distribution function of the exponential. (Hint: If U is uniform on (0, 1) then so also is 1—U.) This gives a method to simulate exponential random variables. 12. Probability Distributions on R” In Chapter 11 we considered the simple case of distributions on (R, B). The case of distributions on (R”, B”) for n = 2,3, ... is both analogous and more complicated. [B” denotes the Borel sets of R”.] First let us note that by essentially the same proof as used in Theorem 2.1, we have that B” is generated by “quadrants” of the form Ttce-a: a,€Q: i=l note that BE BS... B = B": that is, B” is also the smallest o-algebra generated by the n-fold Cartesian product of B, the Borel sets on R. The n-dimensional distribution function of a probability measure on (R”, B”) is defined to be: P(a1,---54n) =P (I~) . i=1 It is more subtle to try to characterize P by using F for n > 2 than it is for n = 1, and consequently distribution functions are rarely used for n > 2. We have also seen that the density of a probability measure on R, when it exists. is a very convenient tool. Contrary to distribution functions, this notion of a density function extends easily and is exactly as convenient on R” as it is on R (but, as is the case for n = 1, it does not always exist). Definition 12.1. The Lebesgue measure m, on (R”,B") is defined on Cartesian product sets Ay x Az x... An by n (14) = TL. all A; € B, (12.1) aT ; where m is the one dimensional Lebesgue measure defined on (R,B). As in Theorem 10.3, one can extend the measure defined in (12.1) for Cartesian product sets uniquely to a measure m, on (R",B"), and m, will still have countable additivity. This measure m, is Lebesgue measure, and it is char- acterized also by the following seemingly weaker condition than (12.1): 88 12. Probability Distributions on R” a n Mn (I) =T]i-ai). all -x ajajei; > 0, for all (a),...,an) € R". 92 12. Probability Distributions on R" Proof. The symmetry is clear. since Cov(X;. Xj) = Cov(X;.X;) trivially. A simple calculation shows that n (Sex). a1 > aja ;Ci; and since variances are always nonnegative, we are done. Oo Theorem 12.5. Let X be an R”-valued r.v. with covariance matrix C. Let A be anm xn matrix and set Y = AX. Then Y is an R™-valued r.v. and its covariance matrix is C' = ACA*, where A* denotes A transpose. Proof. The proof is a simple calculation. oO We now turn our attention to functions of R"-valued random variables. We address the following problem: let g:R” — R” be Borel. Given X = (Xi... Xn) with density f, what is the density of Y = g(X) in terms of f, and to begin with, does it exist at all? We will need the following theorem from advanced calculus (see for example [22, p.83]). Let us recall first that if g is a differentiable function from an open set G in R® into R”, its Jacobian matric Jy(x) at point x € G is Jg(x) = 2(x) (that is, Jg(a)iy = HE (e), where g = (91, 92,---;9n))- The Jacobian of g at point x is the determinant of the matrix J,(x). 
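As a concrete sketch (not part of the text), one can approximate the Jacobian matrix of a specific smooth map by finite differences and compare its determinant with the closed form. The map chosen below, the polar-coordinate map g(r, theta) = (r cos theta, r sin theta), whose Jacobian determinant is r, and the difference step h are illustrative choices only.

    # Finite-difference sketch of the Jacobian matrix J_g and its determinant.
    # The map g (polar coordinates) and the step h are hypothetical choices.
    import math

    def g(r, theta):
        return (r * math.cos(theta), r * math.sin(theta))

    def jacobian(f, x, y, h=1e-6):
        # 2 x 2 matrix of partial derivatives of f = (f1, f2) at (x, y),
        # approximated by central differences.
        fxp, fxm = f(x + h, y), f(x - h, y)
        fyp, fym = f(x, y + h), f(x, y - h)
        return [[(fxp[0] - fxm[0]) / (2 * h), (fyp[0] - fym[0]) / (2 * h)],
                [(fxp[1] - fxm[1]) / (2 * h), (fyp[1] - fym[1]) / (2 * h)]]

    r, theta = 2.0, 0.7
    J = jacobian(g, r, theta)
    det = J[0][0] * J[1][1] - J[0][1] * J[1][0]
    print(det, r)   # the determinant is (approximately) r

In this example the Jacobian determinant equals r, hence is non-zero away from the origin; the general statements that follow apply at any point where the Jacobian of g does not vanish.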
If this Jacobian is not zero, then g is invertible on a neighborhood of x, and the Jabobian of the inverse g7} at point y = g(x) is the inverse of the Jacobian of g at x. Theorem 12.6 (Jacobi’s Transformation Formula). Lei G be an open set in R® and let g:G — R” be continuously differentiable.’ Suppose g is injective (one to one) on G and its Jacobian never vanishes. Then for f measurable and such that the product flgig) is positive or integrable with respect to Lebesgue measure, [tote [lacey aer(sg(e))\te 9(G) G where by g(G) we mean: g(G) = {y ER”: there exists x € G with g(x) = y}. The next theorem is simply an application of Theorem 12.6 to the density functions of random variables. Theorem 12.7. Let X = (X1,...,Xn) have joint density f. Let g:R” > R” be continuously differentiable and injective, with non-vanishing Jacobian. Then Y = g(X) has density 1 A function g is continuously differentiable if it is differentiable and also its deriva- tive is continuous. 12. Probability Distributions on R™ 93 fry) = Lx(g7"(y))| det Jg-2(y)) ify is in the range of g WM= 0 otherwise. Proof. We denote by G the range of g, that is G = {y € R”: there exists xc € R” with y = g(x)}. The properties of g imply that G is an open set and that the inverse function g~! is well defined on G and continuously differentiable with non-vanishing Jacobian. Let B € B”, and A = g-1(B). We have P(X €A)= [telus =f, fx(ode g7*(B) = f xo *@aer -:(2)|de, by Theorem 12.6 applied with g~!. But we also have P(Y € B) = P(X € A), hence povea)= f fru. Since B € B” is arbitrary we conclude fx(o-4(@))] det J-a(2))] = f(a), a.e., whence the result. a In analogy to Corollary 11.3 of Chapter 11, we can also treat a case where g is not injective but nevertheless smooth. Corollary 12.1. Let S € B” be partitioned into disjoint subsets So, Si,- Sm such that U™ 9S; = S, and such that Mp(So) = 0 and that for each i 1,...,m, g: S$; + R” is injective (one to one) and continuously differentiable with non-vanishing Jacobian. Let Y = g(X), where X is an R”-valued r.v. with values in S and with density fx. Then Y has a density given by m fry) = Yo fx Fw) det J,-1(y)| i=1 where gy! denotes the inverse map g;':9(S:) + S; and J,-1 is its corre- sponding Jacobian matrix. Examples: 1. Let X,Y be independent normal r.v.’s, each with parameters p = 0, o? = 1. Let us calculate the joint distribution of (U,V) = (X+Y,X-Y). Here 94 12. Probability Distributions on R” g(a. y) = (@+y.a—y) = (ur). riwo= (GS). The Jacobian in this simple case does not depend on (u,v) (that is, it is constant), and is and Jg-1 (u,v) = and det Jg-1 = Therefore fey) (uv) = for —co 0. This example shows also that the ratio of two independent normals with mean 0 is a Cauchy rv. (a =O and 8 =1). Exercises 99 Exercises for Chapter 12 12.1 Show that . SP ah wy?) [ / ea? dady = 2n0°. moe Jae and therefore that page tH) 20? is a true density. (Hint: Use polar coor- dinates.) 12.2 Suppose a joint density f(x.y)(x.y) factors: fix.vy(z,y) = g(x)h(y). Find fx(a) and fy(y). 12.3 Let (X,Y) have joint density 1 fry) = 2ro\ 02 exp ( 1 {Sop _ r(x = wi)(y — na) + y = Ha)? }) : ~ 20 =r?) o 0102 o3 Find fx=2(y). [Ans Seo OP aaay — pz — “22a —pn)}?),] 12.4 Let px,y denote the correlation coefficient for (X,Y). Let a >0,¢>0 and b € R. Show that PaX+b.cY¥ +b = PXY+ (This is useful since it shows that p is independent of the scale of measurement for X and Y.) 12.5 If a 4 0, show that a la’ so that if Y = aX +bis an affine non-constant function of X, then px,y = +1. 
PX,aX $b = 12.6 Let X,Y have finite variances and let 2-(4)¥-(82)8 Show that 0% = 1— p\y. and deduce that if pxyy = +1, then Y is a non-constant affine function of X. 12.7* (Gut (1995), p. 27.) Let (X,Y) be uniform on the unit ball: that is. 1 —ifer+y <1 fixyy(ay) = 4 7 0 ifa?+y? >1. Find the distribution of R = VX? + Y?. (Hint: Introduce an auxiliary r.v. S = Arctan (%).) [Ans: fr(r) = 2r1(0.1)(r).] 100 12. Probability Distributions on R” 12.8 Let (X.Y) have density f(x,y). Find the density of Z = X + Y. (Hint: Find the joint density of (Z,W) first where W = Y.) [Ans: fz(z) = Jo foxy) (2 — w, w) dw] 12.9 Let X be normal with 4 = 0 and o? < oo, and let © be uniform on (0, x): that is f(@) 216.2) (0)- Assume X and @ are independent. Find the distribution of Z = X + acos(@). (This is useful in electrical engineering.) 1 (2—acos w)2/202 [Ans: fz(z) = mel ea 008 w)"/207 day] 12.10 Let X and Y be independent and suppose Z = g(X) and W = h(Y), with g and h both injective and differentiable. Find a formula for fz.w(z.w), the joint density of (Z,W). 12.11 Let (X,Y) be independent normals, both with means » = 0 and variances 0”. Let x Z= VX? +Y? and W = Arctan (>), > Show that Z has a Rayleigh distribution, that W is uniform on (—$, $), and that Z and W are independent. © -t e'*) is bounded in modulus by 1, the continuity follows from Lebesgue’s dominated convergence theorem (Theorem 9.1(f)). Qo Actually one can show that ji is uniformly continuous, but we do not need such a result here. Theorem 13.2. Let X be an R” valued random variable and suppose E{|X|™} < oc for some integer m. Then the characteristic function yx of X has continuous partial derivatives up to order m, and a" " B{Xj, Xj, e8%) —————— 9 \= i" BD we XG elite 7 day. Oe ex(u) = i BEX; X56} dm Proof. We prove an equivalent formulation stated in terms of Fourier trans- forms of probability measures. (To see the equivalence. simply take p to be PX, the distribution measure on R” of X as in (13.2).) Let y be a probability measure on R® and assume f(2) = |z|" is integrable: 13. Characteristic Functions 105 f |)" pda) < 00. Re Then we wish to show that /i(u) is m-times continuously differentiable and an fi Tee ee =i" fay, ..23,08 ula), We give the proof only for the case m = 1. The general case can be established analogously by recurrence. In order to prove that fe exists at point wu, it is enough to prove that for every sequence of reals tp ‘ending to 0, and with v = (v1,..., Un) being the unit vector in 3” in the “direction j (ie. with coordinates v, = 0 for k 4 j and v; = 1), then the sequence - 5, (ilu + tye) — aw} = few u(dr). (13.3) converges to a limit independent of the sequence t,, and in this case this limit. equals pe (u)- The sequence of functions 2 + cay (where x; is the jth coordinate of s € R”) converges pointwise to ix; by differentiation; moreover S2I2|, eltety 1 | and [2aluae) <0 by hypothesis. Therefore by Lebesgue’s dominated convergence theorem (Theorem 9.1(f)) we have that (13.3) converges to if aye (dz), Therefore [oie sul. (13.4) The proof that the partial derivative in (13.4) above is continuous is exactly the same as that of Theorem 13.1. oO An immediate application of the above is to use characteristic functions to calculate the moments of random variables. (The kth moment of a rv. X is E{X*}.) For the first two moments (by far the most important) we note that for X real valued (by Theorem 13.2): E{X} = ig, (0) if E{|X|} < 00 (13.5) E{X?} = —p(0) if B{X?} < 00. (13.6) 106 13. Characteristic Functions Examples: 1. 
Bernoulli (p): If X is Bernoulli with parameter p. then ex(u) = Efe} = 69(1 — p) +ep =[pe™ 41 —p 2. Binomial B(p,n): If X is Binomial with parameters n. p. then ex(u) = Efe} => (Jena ~ py =| (pe™ + 1p)" } j=0 We could also have noted that n x=S°Y;, j=l where Yj,..., ¥, are independent and Bernoulli (p). Then =E {ti | = I Efe} j= px(u) = Efe%} = fe Lies jal by the independence of the Y;’s; : gy, (u) = (pe + 1—p)”. 1 a 3. Poisson (A): ex(u) = B(e%} = eX =1) k=0 oo k oo -> tuk AY =) _ yn e™ = e me = k=0 ° =e dere =| MED 4. Uniform on (—a,a): a aT ee = pfeexy 1} iurde = . ex(u) = Be} = Je de Qaiu using that e* = cosz + isin z, and that cos(a) = cos(—a), this equals 2i sin au sinau 2aiu au 13. Characteristic Functions 107 5, The Normal (u = 0:0? = 1): Calculating the characteristic function of the normal is a bit hard. It can be done via contour integrals and the residue theorein (using the theory of complex variables), or by analytic continuation (see Exercise 17 of Chapter 14); we present here a perhaps non-intuitive method that has the virtue of being elementary: 1 2 . = ft Le Pde ex (u) J Jia cos ur oe Pac es |S UE 2? Lap vin , Vir Qn Since sin ua e~*’/? is an odd and integrable function. we have that ee 2 / sinure* (de =0. and thus x yx(u) = l/l cosun ec” /2dx, = By Theorem 13.2 we cau differentiate both sides with respect to u to obtain: 1 se ee (u) = el +6; and exponentiating gives yx(u) = eh, Since yx (0) = 1, we deduce e© = 1, whence ex 108 13. Characteristic Functions Theorem 13.3. Let X be an R”-valued random variable anda € R™. Let A be anm xn matrix. Then par ax(u) =e px(A*u), for allu€ R™, where A* denotes A transpose. Proof. One easily verifies that eiluatAX) and then taking expectations of both sides gives the result. Oo Examples (continued) 6. The Normal (1,07): Let X be N(u,07). Then one easily checks (see Exercise 14.18) that Y = xu is Normal (0,1). Alternatively, X can be written X = y+ oY, where Y is N(0,1). Then using Theorem 13.3 and example 5 we have iup—u?o? /2 yx =e 7. The Exponential (A): Let X be Exponential with parameter A. Then 0 ex(u)= f ec Neda, 0 A formal calculation gives oe -[ Ae M8 dg = 0 os A= tu but this is not mathematically rigorous. It can be justified by, for example, a contour integral using complex analysis. Another method is as follows: It is easy to check that the functions eae M(-Acos(ur) + usin(ua)). cn *(—u cos(u) — Asin(ux)) Mw , have derivatives \e-** cos(ux) and Ae~** sin(ux) respectively. Thus oo Xd oc [ re cos(ur)da = —se“"(—Acos(ur) + usin(uz))| 0 ww 0 oc r oo [ Ae sin(ua)de = — se **(—ucos(ur) — Asin(ux)) fo Yu 0 Hence we get 13. Characteristic Functions 109 8. The Gamma (a, 3): One can show using contour integration and the residue theorem in the theory of complex variables that if X is Gamma (a. 3) then px(u) = oa One can also calculate the characteristic function of a Gamma random variable without resorting to contour integration: see Exercise 14.19. 14. Properties of Characteristic Functions We have seen several examples on how to calculate a characteristic function when given a random variable. Equivalently we have seen examples of how to calculate the Fourier transforms of probability measures. For such transforms to be useful, we need to know that knowledge of the transform characterizes the distribution that gives rise to it. The proof of the next theorem uses the Stone-Weierstrass theorem and thus is a bit advanced for this book. Nevertheless we include the proof for the sake of completeness. Theorem 14.1 (Uniqueness Theorem). 
The Fourier transform ji of a probability measure on R” characterizes pu: that is, if two probabilities on R" admit the same Fourier transorm, they are equal. Proof. Let 1 ~ le? 20? Sot) = aan le? /20? | and Flow) = mere, Then f(o.2) is the density of X = (Xj,...,Xn), where X; is N(0,07) for each j (1 < j < n). By Example 6 of Chapter 13 and the Tonelli-Fubini Theorem, we have n 4 1 — . f(o.ajel" da -{ eat He) day... de [se ve . oon 1 q Ih 2 1 = P (gph tte) eh 20? dz. I, avin ; Therefore Loaf ue Hou) = Gaaapl («. “= ") 1 i tate = arose |, Slon)e da, 112 14. Properties of Characteristic Functions Next suppose that jr and jz2 are two probability measures on R” with the same Fourier transforms fi; = fiz = ji. Then / f(o,u— v)yn(du) = f anya {| loa Par} pala = [ So.2) Geeamak (S) eae. (the reader will check that one can apply Fubini’s theorem here), and the exact same equalities hold for 42. We conclude that [ oeatae) = fe nalae) for all g € H, where 1 is the vector space generated by all functions of the form u— f(o,u — v). We then can apply the Stone-Weierstrass theorem? to conclude that H is dense in Co under uniform convergence. where Co is the set of functions “vanishing at 00”: that is, Co consists of all continuous functions on R” such that limy.y—< |f(x)| = 0. We then obtain that J, aenntas) =f aCeynatae) for all g € Co. Since the indicator function of an open set can be written as the increasing limit of functions in Co, the Monotone Convergence Theorem (Theorem 9.1(d)) then gives (A) = w2(A), all open sets AC R”. Finally the Monotone Class Theorem (Theorem 6.2) gives (A) = f2(A) — for all Borel sets A CR”, which means ji = 2. o Corollary 14.1. Let X = (X,...,Xp) be an R”-valued random variable. Then the real-valued r.v.’s (Xj)i and therefore by Theorem 14.1 we have UX = UX, @UX, @--- OMX, which is equivalent to independence. Oo R is not enough for the r.v.’s X; to be independent. Caution: In the above, having ¢x(u,u,-...u) =[T%; ex, (w) for all uw € 114 14, Properties of Characteristic Functions Exercises for Chapters 13 and 14 Note: The first three exercises require the use of contour integration and the residue theorem from complex analysis. These problems are giyen the symbol “2” 14.17 Let f(a) = a Cauchy density, for a r.v. X. Show that sae ex(u) =e", by integrating around a semicircle with diameter [—R. R] on the real axis to the left. from the point (~,0) to the point (2.0) over the real axis. 14.24 Let X be a gamma r.v. with parameters (a.3). Show using contour integration that yx(w) = <:. [Hint: Use the contour for 0 < ¢ < don the real axis, go from (d, ot back to (c,0). then descend vertically to the line y = —%S and descend southeast along the line, and then ascend vertically to (d, 0). ° 14.3* Let X be N(0.1) (i... normal with « = 0 and o? = 1), and show that ¢x(u) = e7“/? by contour integration. [Hint: use the contour from (R,0) to (—R,0) on the real axis; then descend vertically to (—R.—iu): then proceed horizontally to (R, —iu). and then ascend vertically back to the real axis.] 14.4 * Suppose E{|X|?} < oc and E{X} = 0. Show that Var(X) = 0? < 0, and that 1 yx(u) =1- gue + 0(u?) as u 0. [Recall that a function g is o(t) if limy.o %@! = 0,] 14.5 Let X = (Xj,...,X,) be an R” valued r.y. Show that a) yx(u,0.0.....0) = px, (u) (we R) b) yx(u.u,t....-U) =x; 4..4x,(U) (we R) 14.6 Let Z denote the complex conjugate of z. That is, if z = a+ ib then z= a-— ib (a,b € R). Show that for X ar.v., ex (u) = ex(-u). 14.7 Let X be a r.v. 
Show that yx(u) is a real-valued function (as opposed to a complex-valued function) if and only if X has a symmetric distribution. (That is, PX = P-*. where P* is the distribution measure of X.) [Hint: Use Exercise 14.6, Theorem 13.3, and Theorem 14.1.] 14.8 Show that if X and Y are iid. then Z = X — Y has a symmetric distribution. Exercises 115 14.9 Let X)..... X,, be independent. each with mean 0. and each with finite third moments. Show that e| (x) } = LA. (Hint: Use characteristic functions.) 14.10 Let j1.....j1n be probability measures. Suppose A; > 0 (1 < j < n) and 0}_1 Aj = 1. Let v = 3}, Ajj. Show that v is a probability measure, too. and that H(u) = 7 A5Ay(u)- j=l 14,11 Let X have the double exponential (or Laplace) distribution with a=0,8=1: fx(2) = sor -o 0 and a@ in the domain of definition of Tx. 116 14. Properties of Characteristic Functions 14.14 Let X be lognormal with parameters (j:.07). Find the Mellin trans- form (c.f. Exercise 14.13) Tx (0). Use this and the Peden that Tx(k) = E{X*} to calculate the ath moments of the lognormal distribution for k=... 14.15 Let X be N(0.1). Show that E{X?°+1} = 0 and E{X} = S— = (2n—-1)(2n —3)...3+1 14.16 * Let X be N(0,1). Let M(s) = Efee*} = [. ae (« - 3”) da. 00 VOR 2 Show that M(s) = e**/2, (Hint: Complete the square in the integrand.) 14.17 * Substitute s = iu in Exercise 14.16 to obtain the characteristic func- tion of the Normal yx (u) = e~"’/?; justify that one can do this by the theory of analytic continuation of functions of a complex variable. 14.18 Let X be N(u,02). Show that ¥ = *=# is N(0,1). 14.19* (Feller [9]) Let X be a Gamma r.v. with parameters (a, 3). One can calculate its characteristic function without using contour integration. Assume 3 = 1 and expand e‘* in a power series. Then show eu” © -egnto~ P(n+) 0, yn Fai ae Pf ant ‘ae = > Tray u) and show this is a binomial series which sums to Ta 15. Sums of Independent Random Variables Many of the important uses of Probability Theory flow from the study of sums of independent random variables. A simple example is from Statistics: if we perform an experiment repeatedly and independently, then the “average value” is given by F = ta X;, where Xj; represents the outcome of the j*® experiment. The r.v. F is then called an estimator for the mean j: of each of the X;. Statistical theory studies when (and how) % converges to ps as n tends to oo. Even once we show that 7 tends to pz as n tends to oo, we also need to know how large n should be in order to be reasonably sure that T is close to the true value 4 (which is, in general, unknown). There are other, more sophisticated questions that arise as well: what is the probability distribution of ©? If we cannot infer the exact distribution of Z, can we approximate it? How large need n be so that our approximation is sufficiently accurate? If we have prior information about 4, how do we use that to improve upon our estimator T? Even to begin to answer some of these fundamentally important questions we need to study sums of independent random variables. Theorem 15.1. Let X,Y be two R-valued independent random variables. The distribution measure uz of Z = X +Y is the convolution product of the probability measures x and py, defined by wx ny A)= ff 1ale+ vux(de)ur (dy. (15.1) Proof. Since X and Y are independent, we know that the joint distribution of (X,Y) is zx ® py. Therefore E{g(X.¥)} = [ [ocx arn (ay. and in particular, using g(x,y) = f(@ +y): E{f(X+Y)}= / f(a t y)ux (dz)py (dy). 
(15.2)

for any Borel function f on ℝ for which the integrals exist. It suffices to take f(x) = 1_A(x). □

Remark 15.1. Formula (15.2) above shows that for f : ℝ → ℝ Borel measurable and Z = X + Y with X and Y independent:

E{f(Z)} = ∫ f(z) (μ_X ∗ μ_Y)(dz) = ∫∫ f(x + y) μ_X(dx) μ_Y(dy).

Theorem 15.2. Let X, Y be independent real valued random variables, with Z = X + Y. Then the characteristic function φ_Z is the product of φ_X and φ_Y; that is,

φ_Z(u) = φ_X(u) φ_Y(u).

Proof. Let f(x) = e^{iux} and use formula (15.2). □

Caution: If Z = X + Y, the property that φ_Z(u) = φ_X(u)φ_Y(u) for all u ∈ ℝ is not enough to ensure that X and Y are independent.

Theorem 15.3. Let X, Y be independent real valued random variables and let Z = X + Y.
a) If X has a density f_X, then Z has a density f_Z and moreover

f_Z(z) = ∫ f_X(z − y) μ_Y(dy).

b) If in addition Y has a density f_Y, then

f_Z(z) = ∫ f_X(z − y) f_Y(y) dy = ∫ f_X(x) f_Y(z − x) dx.

Proof. (b): Suppose (a) is true. Then

f_Z(z) = ∫ f_X(z − y) μ_Y(dy).

However μ_Y(dy) = f_Y(y) dy, and we have the first equality. Interchanging the roles of X and Y gives the second equality.
(a): By Theorem 15.1 we have

μ_Z(A) = ∫∫ 1_A(x + y) μ_X(dx) μ_Y(dy) = ∫ {∫ 1_A(x + y) f_X(x) dx} μ_Y(dy).

Next let z = x + y; dz = dx:

= ∫ {∫ 1_A(z) f_X(z − y) dz} μ_Y(dy),

and applying the Tonelli-Fubini theorem:

= ∫_A {∫ f_X(z − y) μ_Y(dy)} dz.

Since A was arbitrary we have the result for all Borel sets A, which proves the theorem. □

The next theorem is trivial but surprisingly useful.

Theorem 15.4. Let X, Y be independent real valued random variables that are square integrable (that is, E{X²} < ∞ and E{Y²} < ∞). Then

σ²_{X+Y} = σ²_X + σ²_Y.

Proof. Since X and Y are independent we have E{XY} = E{X}E{Y}, and

σ²_{X+Y} = E{X²} + 2E{XY} + E{Y²} − (E{X} + E{Y})² = σ²_X + σ²_Y. □

Examples:

1. Let X₁, ..., X_n be i.i.d. Bernoulli (p). Then Y = Σ_{j=1}^n X_j is Binomial B(p, n). We have seen

E{Y} = E{Σ_{j=1}^n X_j} = Σ_{j=1}^n E{X_j} = Σ_{j=1}^n p = np.

Note that σ²_{X_j} = E{X_j²} − E{X_j}² = p − p² = p(1 − p). Therefore by Theorem 15.4,

σ²_Y = Σ_{j=1}^n σ²_{X_j} = np(1 − p).

Note that the above method of computing the variance is preferable to explicit use of the distribution of Y, which would give rise to the following calculation:

σ²_Y = Σ_{j=0}^n (j − np)² \binom{n}{j} p^j (1 − p)^{n−j}.

2. Let X be Poisson (λ) and Y be Poisson (μ), and X and Y are independent. Then Z = X + Y is also Poisson (λ + μ). Indeed, φ_Z = φ_X φ_Y implies

φ_Z(u) = e^{λ(e^{iu}−1)} e^{μ(e^{iu}−1)} = e^{(λ+μ)(e^{iu}−1)},

which is the characteristic function of a Poisson (λ + μ). Therefore Z is Poisson by the uniqueness of characteristic functions (Theorem 14.1).

3. Suppose X is Binomial B(p, n) and Y is Binomial B(p, m). (X and Y have the same p.) Let Z = X + Y. Then φ_Z = φ_X φ_Y, hence

φ_Z(u) = (pe^{iu} + (1 − p))^n (pe^{iu} + (1 − p))^m = (pe^{iu} + 1 − p)^{n+m},

which is the characteristic function of a Binomial B(p, m + n); hence Z is Binomial B(p, m + n) by Theorem 14.1. We did not really need characteristic functions for this result: simply note that

X = Σ_{j=1}^n U_j and Y = Σ_{j=1}^m V_j,

and thus Z = Σ_{j=1}^n U_j + Σ_{j=1}^m V_j, where the U_j and V_j are all i.i.d. Bernoulli (p). Hence Z = Σ_{j=1}^{m+n} W_j, where the W_j are i.i.d. Bernoulli (p). (The first n W_j's are the U_j's; the next m W_j's are the V_j's.)

4. Suppose X is normal N(μ, σ²) and Y is also normal N(ν, τ²), and X and Y are independent. Then Z = X + Y is normal N(μ + ν, σ² + τ²). Indeed, φ_Z = φ_X φ_Y implies

φ_Z(u) = e^{iuμ − u²σ²/2} e^{iuν − u²τ²/2} = e^{iu(μ+ν) − u²(σ²+τ²)/2},

which is the characteristic function of a normal N(μ + ν, σ² + τ²), and we again use Theorem 14.1. (A quick numerical check of this example is sketched below.)
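The convolution identity of Theorem 15.3(b) and the conclusion of Example 4 are easy to check numerically. The following Python fragment is a minimal sketch and is not part of the original text: the parameter values, the integration grid, and the evaluation points z are arbitrary illustrative choices. It approximates the integral ∫ f_X(z − y) f_Y(y) dy by a Riemann sum and compares it with the N(μ + ν, σ² + τ²) density predicted by Example 4.

import numpy as np

def normal_pdf(x, mu, sigma2):
    """Density of N(mu, sigma2); sigma2 is the variance."""
    return np.exp(-(x - mu) ** 2 / (2.0 * sigma2)) / np.sqrt(2.0 * np.pi * sigma2)

# Illustrative parameters (my choice, not from the text): X ~ N(1, 4), Y ~ N(-0.5, 2.25).
mu, sig2 = 1.0, 4.0
nu, tau2 = -0.5, 2.25

# Riemann-sum approximation of the convolution f_Z(z) = integral of f_X(z - y) f_Y(y) dy,
# which is Theorem 15.3(b).
dy = 0.001
y = np.arange(-30.0, 30.0, dy)

for z in [-2.0, 0.0, 0.5, 3.0]:
    fZ_conv = np.sum(normal_pdf(z - y, mu, sig2) * normal_pdf(y, nu, tau2)) * dy
    fZ_normal = normal_pdf(z, mu + nu, sig2 + tau2)  # Example 4: Z ~ N(mu+nu, sig2+tau2)
    print(f"z = {z:5.1f}: convolution = {fZ_conv:.6f}, N(mu+nu, sig2+tau2) density = {fZ_normal:.6f}")

The two printed values should agree to several decimal places, which is exactly the content of Theorem 15.3(b) combined with Example 4.

5.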
Let X be the Gamma (a, 3) and Y be Gamma (6,3) and suppose X and Y are independent. Then if Z = X + Y, ez = yxgy, and therefore Bord — tu)? (f whence Z has the characteristic function of a Gamma (a +6, 3), and thus by Theorem 14.1, Z is a Gamma (a + 6, 8). . In Chapter 11 we defined the chi square distribution with p degrees of freedom (denoted x3), and we observed that if X is x7, then x = 7? in distribution, where Z is N(0,1). We also noted that if X is Xpe then X is Gamma (§, 2)- Therefore let X be Xps and let Z1,...,Zp be iid. N(0,1). If Y = 1 Z?, by Example 5 we have that since each Z? is Gatzina G4 , then | Y is Gamma (§, 5) which is x2. We conclude that if X is x3, then X= hy 4? in pan where Z, are iid. N(0,1). Exercises 121 Exercises for Chapter 15 15.1 Let Xy,...,X, be independent random variables, and assume E{X;} = wand 0?(X)) = 0? E{Sn}P(N =n). n=0 (Hint: Show first that E{Sy}=E {> Sxtoxn} = So ES livony n=0 n=0 and justify the second equality above.) 15.6 Suppose E{N} < oc and E{|X;|} < 2. Show that E{Sy} = E{N}E{X;}. (Hint: Use Exercise 15.5.) 15.7 Suppose B{N} < oc and E{|X;|} < oc. Show that sy(u) = Ef(ex,(w))*}- (Hint: Show first ES sy (u) = > Efe" 1 yeny }) n=1 15.8 Solve Exercise 15.6 using Exercise 15.7. (Hint: Recall that E{Z} = ip (0), for a rv. Z in L.) 15.9 Let X.Y be real valued and independent. Suppose X and X + Y have the same distribution. Show that Y is a constant r.v. equal to 0 a.s. 15.10 Let f,g map R to Rx, such that [. f(v)dz < oc and [. g(a)dx 0 we have ¢(u) #0 for all u with |u| < a. Let W(u) = 3%) for |u| < a. and show w(u) = {w(u/2")}*"; then show this tends to 1 as n — oo. (See Exercise 14.4.) Deduce that y(t) = {<2(t/2")}4" and let n — >.) Exercises 123 15.13 Let Xy.Xo..... Xp be iid. Normal N(u, 07). Let # = + O%_, X} and let Yj = X; — x. Find the joint characteristic function of (7.%;..... Yn). Let S? = 1 07, Y?. Deduce that T and S? are independent. 15.14 Show that |1 — e'*|? = 2(1 —cosx) < x? for all z € R. Use this to show that |1 —¢x(u)| < E{|uX|}. 15.15 Let A = (—2. 2]. Show that 12 J Busta) < Fea (t- Re ex(u)}. (Hint: 1— cos > 0 and 1~cosx > 3x? — j-2%, all x € R; also if 2 = a+ ib. then Re z = a. where a.b,€ R.) 15.16 If y is a characteristic function, show that |y|? is one too, (Hint: Let X,Y be iid. and consider Z = X — Y.) 15.17 Let X,.....Xq be independent exponential random variables with parameter ,3 > 0. Show that Y = >, Xi is Gamma (a. 3). 16. Gaussian Random Variables (The Normal and the Multivariate Normal Distributions) Let us recall that a Normal random variable with parameters (y,07), where “© Rand o” > 0, is a random variable whose density is given by: f(a) = eee ne -w 0 this comes from Example 13.6, and when o? = 0 this is trivial. Let us recall also that when £(X) = N(j1,0”), then E{X}=p, — Var(X)=07. (16.3) At first glance it might seem strange to call a distribution with such an odd appearing density “normal”. The reason for this dates back to the early 18*” century, when the first versions of the Central Limit Theorem appeared in books by Jacob Bernoulli (1713) and A. de Moivre (1718). These early versions of the Central Limit Theorem were expanded upon by P. Laplace and especially C. F. Gauss. Indeed because of the fundamental work of Gauss normal random variables are often called Gaussian random variables, and the former 10 Deutsche Mark note in Germany has a picture of Gauss on it and a graph of the function f given in (16.1), which is known as the Gaussian density. 
(This use of Probability Theory on currency, perhaps inspired by the extensive use of probability in Finance, has disappeared now that the Mark has been replaced with the Euro.) The Gaussian version of the Central Limit 126 16. Gaussian Random Variables Theorem can be loosely interpreted as saying that sums of i.i.d. random vari- ables are approximately Gaussian. This is quite profound. since one needs to know almost nothing about the actual distributions one is summing to conclude the sum is approximately Gaussian. Finally we note that later Paul Lévy did much work to find minimal hypotheses for the Central Limit theo- rem to hold, It is this family of theorems that is central to much of Statistical Theory: it allows one to assume a precise Gaussian structure from minimal hypotheses. It is the “central” nature of this theorem in Statistics that gives it its name. and which in turn makes Gaussian random variables both impor- tant and ubiquitous. hence normal. We treat the Central Limit Theorem in Chapter 21: here we lay the groundwork by studying the Gaussian random variables that will arise as the limiting distributions, For a real-valued random variable X the definition £(X) = N(u,07) is clear: it is a r.v. X with a density given by (16.1) if 0? > 0. and it is X = pw if 0? = 0. For an R”-valued r.v. the definition is more subtle; the reason is that we are actually describing the class of random variables that can arise as limits in the Central Limit Theorem, and this class is more complicated in R® when n > 2. Definition 16.1. An R”-valued random variable X = (Xj...., Xn) is Gaussian (or Multivariate Normal) if every linear combination SY" _, a;X; has a (one-dimensional) Normal distribution (possibly degenerate; for evam- ple it has the distribution N(0,0) when a; = 0, all j). Characteristic functions are of help when studying Gaussian random vari- ables. Theorem 16.1. X is an R”-valued Gaussian random variable if and only if its characteristic function has the form x(t) = exp titan) ~ 5, Qu} (16.4) where 1. € R” and Q is an nxn symmetric nonnegative semi-definite matric. Q is then the covariance matrix of X and yu is the mean of X, that is p1, E{X;} for all j. Proof (Sufficiency): Suppose we have (16.4). Let Y= Me a;X; = (a, X) j=l be a linear combination of the components of X. We need to show Y is (univariate) normal. But then for v € R: v2 yy (v) = px(va) = exp {iva n) ~ ae Ga) 16. Gaussian Random Variables 127 and by equation (16.2). yy(v) is the characteristic function of a normal N((a.p). (a. Qa)), and thus by Theorem 14.1 we have Y is normal. (Necessity): Suppose X is Gaussian, and let Y = ¥0ajX; = (a. X) j=l be a linear combination of the components of X. Let Q = Cov(X) be the covariance matrix of X. Then E{Y} = (a.u) where p= (f1,...-Hn) and E{Xj} = pi, 1 < i iujny— 5 > Uj; j=l cx gilup)—4(uQu) where j1 = (j11,+.+;4tn) and Q is the diagonal matrix Since yx(u) is of the form (16.4), we know that X is multivariate normal. The converse of Example 1 is also true: Corollary 16.1. Let X be an R”-valued Gaussian random variable. The components X; are independent if and only if the covariance matrix Q of X is diagonal. Proof. Example 1 shows the necessity. Suppose then we know Q is diagonal, ie. By Equation (16.4) of Theorem 16.1 it follows that x factors: ex(u) = [] ex, (us). jel where px,(utj) = exp {iusu, - pot} . Corollary 14.1 then gives that the X; are independent, and they are each normal (N(5.03)) by Equation (16.2). 
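Before continuing, here is a small simulation sketch, not part of the original text, illustrating Theorem 16.1 and Example 1. A two-dimensional Gaussian vector is built as an affine transformation of independent standard normals, so that its covariance matrix is Q = AA*, and its empirical mean, covariance, and characteristic function are compared with formula (16.4). The matrix A, the mean μ, the sample size, and the test points u are illustrative choices.

import numpy as np

rng = np.random.default_rng(0)

# Illustrative construction (my choice, not from the text): each sample is x = mu + A z with
# z ~ N(0, I_2), so X is Gaussian with mean mu and covariance Q = A A^T (cf. Definition 16.1).
mu = np.array([1.0, -2.0])
A = np.array([[2.0, 0.0],
              [1.0, 0.5]])
Q = A @ A.T

n = 200_000
Z = rng.standard_normal((n, 2))
X = mu + Z @ A.T

# The empirical mean and covariance should be close to mu and Q.
print("empirical mean:", X.mean(axis=0))
print("empirical covariance:\n", np.cov(X, rowvar=False))

# Formula (16.4): phi_X(u) = exp(i<u, mu> - <u, Qu>/2); compare with the empirical
# characteristic function (1/n) * sum_k exp(i<u, X_k>).
for u in [np.array([0.3, -0.7]), np.array([1.0, 0.4])]:
    phi_emp = np.mean(np.exp(1j * (X @ u)))
    phi_thm = np.exp(1j * np.dot(u, mu) - 0.5 * np.dot(u, Q @ u))
    print("u =", u, " |empirical - theoretical| =", abs(phi_emp - phi_thm))

Running the same experiment with a diagonal Q illustrates Corollary 16.1: the empirical correlation of the two components is then close to zero.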
Qo The next theorem shows that all Gaussian random variables (i-e., Multi- variate Normal random variables) arise as linear transformations of vectors of independent univariate normals. (Recall that we use the terms normal and Gaussian interchangeably.) 16. Gaussian Random Variables 129 Theorem 16.2. Let X be an R"-valued Gaussian random variable with mean vector 4. Then there exist independent Normal random variables Y,, w Yn with L(¥j)=NOAj), AV20, (Si 0. Some of the 1; can sometimes take the value zero. In this case we have Y; = 0. Thus the number of independent normal random variables required in Theorem 16.2 can be strictly less in number than the number of non-trivial components in the Gaussian r.v. X. Proof of Theorem 16.2: Since Q is a covariance matrix it is symmetric. non- negative semi-definite and there always exists an orthogonal matrix A such that Q = AAA*, where A is a diagonal matrix with all entries nonnegative. (Recall that an orthogonal matrix is a matrix where the rows (or columns), considered as vectors, are orthonormal: that is they have length (or norm) one and the scalar product of any two of them is zero (i.e., they are orthog- onal).) A* means the transpose of the matrix A. Since A is orthogonal, then A* is also the inverse of A. We set Y=A'(X—p) where py = E{X;} for X; the j*® component of X. Since X is Gaussian by hypothesis, we have that Y must be Gaussian too, since any linear combina- tion of the components of Y is also a linear combination of the components of X and therefore univariate normal. Moreover the covariance matrix of Y is A*QA = A, the sought after diagonal matrix. Since X = ju + AY (because A*—! = A), we have proved the theorem. Oo Corollary 16.2. An R”-valued Gaussian random variable X has a density on R” if and only if the covariance matriz Q is non-degenerate (that is, there does not exist a vector a € R” such that Qa = 0, or equivalently that det(Q) # 0). Proof. By Theorem 16.2 we know there exist n independent normals ¥;,..-, Yn of laws N(0,A;), (l 0, for all j (1 < j < n), because det(Q) = det(A) = [J Ai. Since \j > 0 and L(¥;) = N(0,A;), we know that Y has a density given by es, ol fy(y) = fy(y) I Jie and since X = + AY, we deduce from Theorem 12.7 that X has the density 130 16. Gaussian Random Variables 1 IX) = 5 dea Next suppose Q is degenerate: that is. det(Q) = 0. Then there exists an a €R”.a#¢ Osuch that Qa = 0 (that is, the kernel of the linear transforma- tion represented by Q is non-trivial). The random variable Z = (a. X) has a variance equal to (a, Qa) = 0. so it is a.s. equal to its mean (a. 4). Therefore P(X € H) = 1. where H is an affine hyperplane orthogonal to the vector a and containing the vector j, that is H = {x € R”: (x — pt. a) = O}.) Since the dimension of H is n — 1, the n-dimensional Lebesgue measure of H is zero. If X were to have a density, we would need to have the property eT Een Q ap) (16.5) 1=P(X €H) =| f(a)da =| P(t... tn)dry... day. (16.6) A A However J talereecte)der dan =0 because H is a hyperplane (see Exercise 16.1), hence (16.6) cannot hold: whence X cannot have a density. oO Comment: Corollary 16.2 shows that when n > 2 there exist normal (Gaus- sian) non constant random variables without densities (when n = 1 a normal variable is either constant or with a density). Moreover since (as we shall see in Chapter 21) these random variables arise as limits in the Central Limit Theorem. they are important and cannot be ignored. 
Thus while it is tempt- ing to define Gaussian random variables as (for example) random variables having densities of the form given in (16.5), such a definition would not cover some important cases. An elementary but important property of R"-valued Gaussian random variables is as follows: Theorem 16.3. Let X be an R"-valued Gaussian random variable, and let Y be an R™-valued Gaussian r.v. If X and Y are independent then Z = (X.Y) is an R"+™-valued Gaussian r.v. Proof. We have pz(u) = ex(w)yy(v) where w=(w.v); weR", veEeR™; since X and ¥ are independent. By Theorem 16.1 ga(u) = exp {iw u®) - 3. Q*w) } exp {i W) - 5h ar} = exp {iqese).(u®.n) - pau}. 16. Gaussian Random Variables 131 where ox “0 a= (4 or): Again using Theorem 16.1 this is the characteristic function of a Gaussian ny. Oo We say that two random variables X.Y are uncorrelated if Cov (X,Y) = 0. Since Cov (X.Y) = E{XY} ~ E{X}E{Y}, this is equivalent to saying that E{XY} = E{X}E{Y}. This of course is true if X and Y are independent (Theorem 12.3) and thus X.Y independent implies that X,Y are uncorrelated. The converse is false is general. However it is true for the Multivariate Normal (or Gaussian) case, a surprising and useful fact. Theorem 16.4. Let X be an R”-valued Gaussian random variable. Two components X; and Xx of X are independent if and only if they are uncor- related. Proof. We have already seen the necessity. Conversely, suppose that X; and Xx are uncorrelated. We can consider the two dimensional random vector Y = (1, Y2), with Yj = X; and Yy = X,. Clearly Y is a bivariate normal vector, and since Cov(¥;, Y2) = 0 by hypothesis we have that the covariance matrix of Y is diagonal, and the theorem reduces to Corollary 16.1. oO A standard model used in science, engineering, and the social sciences is that of simple linear regression.’ Here one has random variables ¥;, 1 < i < n, of the form Yizatsu+e, (1 0, where = 1 37, ai. One can treat these models quite generally (see Chapter 12 of [5] for example), but we will limit our attention to the most important case: where the “errors” are normal. Indeed, let the random variables (€;)1 0: Z=YV(iy\a}- Then Z is also N(0,1) (see Exercise 16.2). but ¥ + Z = 2Y1py\ 2\a|) = 0 and Y + Z is not as. equal to a constant. Therefore X = (Y, Z) is an R?-valued r.v. which is not Gaussian, even though its two components are each normal (or Gaussian). It is worth emphasizing that the Multivariate Normal has several special properties not shared in general with other distributions. We have seen that 16. Gaussian Random Variables 135, . Components are independent if and only if they are uncorrelated: . We have that the components are univariate normal: thus the components. belong to the same distribution family as the vector random variable: 3. A Gaussian X with an N(u,Q) distribution with Q invertible can be linearly transformed into an N(0./) r.v. (Exercise 16.6); that is, linear transformations do not change the distribution family: 4, The density exists if and only if the covariance matrix is nondegenerate, giving a simple criterion for the existence of a density: and finally 5. We have that the conditional distributions of Multivariate Normal dis- tributions are also normal (Exercise 16.10). ne These six properties show a remarkable stability inherent in the Multi- variate Normal. There are many more special features of the normal that we do not go into here. It is interesting to note that the normal distribution does not really exist in nature. 
It arises via a limiting procedure (the Central Limit Theorem). and thus it is an approximation of reality, and often it is an excellent approx- imation. When one says, for example, that the heights of twenty year old men in the United States are normally distributed with mean y and variance a”, one actually means that the heights are approximately so distributed. Indeed, if the heights were in fact normally distributed, there would be a strictly positive probability of finding men that were of negative height and also of finding men taller than the Sears Tower in Chicago. Such results are of course nonsense. However these positive probabilities are so small as to be equal to zero to many decimal places, and since zero is the true probability of such events we do not have a contradiction to the result that the normal distribution is indeed an excellent approximation to the true distribution of men, which is itself not precisely known. 136 16. Gaussian Random Variables Exercises for Chapter 16 16.1 Let ae R”, a #0, and » € R”. Let H be the hyperplane in R” given by H = {« ER”: (w—p,a) = 0}. Show that m,(H) = 0 where m, is n-dimensional Lebesgue measure, and deduce that [ feoae= [of ston. for any Borel function f on R”. fn)la( n)day...dtn =0 16.2 Let L(Y) = N(0,1), and let a> 0. Let _f Y if|y|a. Show that £(Z) = N(0,1) as well. 16.3 Let X be N(0,1) and let Z be independent of X with P(Z = 1) = P(Z=-1) =3. Let Y = ZX. Show L(Y) = N(0,1), but that (X,Y) is not Gaussian (i.e., not Multivariate Normal). 16.4 Let (X,Y) be Gaussian with mean (ux, jy) and covariance matrix Q and det(Q) > 0. Let p be the correlation coefficient Cov (X,Y) © \/War(Xy vary) Show that if -1 < p < 1 the density of (X,Y) exists and is equal to: foe (t0) = sex { so (Ge) (YB Y. Qroxoy /1— p? i 2(1 — p*) ox Be em } oxoy oy . Show that: if p = —1 or p = 1, then the density of (X,Y) does not exist. 16.5 Let p be in between —1 and 1, and 5, «7 (j = 1,2) be given. Construct X,, X2 Normals with means j11, 12; variances 0, o3; and correlation p. (Hint: Let Yi, Y2 be iid. N(0,1) and set U; = Y; and Uz = p¥, + \/1 ~p? Yo. Then let X; = mj + 0j¥; (j = 1,2).) Exercises 137 16.6 Suppose X is Gaussian N(w.Q) on R”, with det(Q) > 0. Show that there exists a matrix B such that Y = B(X —) has the N(0, J) distribution, where / is the n x n identity matrix. (Special Note: This shows that any Gaus- sian r.v. with non-degenerate covariance matrix can be linearly transformed into a standard normal.) 16.7 Let X be Gaussian and let n Y= 0 a;X;, j=l X,,). Show that Y is univariate N(u,07) where w= Soa E{X}} j=l where X = (Xy and ° *= Yo eva) 420 1ja,Cov (Xj, Xp). ak 16.8 Let (X.Y) be bivariate normal N(y,Q), where o-( o% poner) poxoy oF and p is the correlation coefficient (|p| < 1), (det(Q) > 0). Then (X,Y) has a density f and show that its conditional density fy=e(y) is the density of a univariate normal with mean py + p2¢(a — jx) and variance o¥-(1 — p?). (cf. Theorem 12.2.) 16.9 Let X be N(1u,Q) with pe = (1,1) and Q = ¢ 2): Find the conditional distribution of Y = X, + X2 given Z = X, — Xp =0. Answer: fz-o(y) = aR {4 lly = }. 16.10 Let £(X) = N(u.Q) with det(Q) > 0. Show that the conditional distributions of any number of coordinates of X, knowing the others, are also multivariate normal (cf. Theorem 12.2). [This Exercise generalizes Ex- ercise 16.8.] 16.11 (Gut, 1995). Let (X,Y) have joint density fixsy(a.y) = cexp{-(1+2°)(1+y")}. 00 < 2,y < 00, where c is chosen so that f is a density. 
Show that f is not the density of a bivariate normal but that fx=,(y) and fy=,(z) are each normal densities. (This shows that the converse of Exercise 16.10 does not hold.) 138 16. Gaussian Random Variables 16.12 Let (X.Y) be Bivariate Normal with correlation coefficient p and mean (0,0). Show that if |p| <1, then Z = # is Cauchy with parameters a = pz and 3 = 2*y/1~—p’. (Note: This result was already established in Example 12.5 when X and Y were independent.) We conclude that the ratio of two centered Bivariate Normals is a Cauchy r.v. 16.13 * Let (X.Y) be bivariate normal with mean 0 and correlation coeffi- cient p. Let 3 be such that cos3=p (00,.¥ > 0} = P{X <0. 0.¥ <0} = P{X <0 >0}=4-5. 16.15 * Let (X,Y) be bivariate normal with density Fow(0.y) = 1 orm (4 ats) (OY Sroxoy V1 : Show that: a) E{XY} = poxoy b) E{X?y?} = E{X?}B{¥?} + (BL XY}? c) E{|XY|} = 2&*2¥ (cosa + asin a) where a is given by sina = p (—3 < a < $) (cf. Exercise 16.13). 16.16 Let (X.Y) be bivariate normal with correlation p and 0% = o3-. Show that X and Y — pX are independent. 16.17 Let X be N(u,@) with det(Q) > 0, with X R”-valued. Show that (X= p)"Q7U(X~p) is x7(n). Exercises 139 16.18 Let Xj.....2 X,, be iid. V(0. 07). and let ig 1< 7 ek —z)? =5 us and S$? = = S0(X; 7)’. j=) j=. Recall from Exercise 15.13 that Z and S? are independent. Show that n n SOX? = OK; — 2)? +o? j=) j=) and deduce that (n — 1)S?/o? has a y2_, distribution and that nZ?/o? has a x3 distribution. 16.19 Let e)..... En be iid. N(0.07) and suppose ¥; = a + 3: +e,1< i 3) repeatedly, and {X,, = 1} corresponds to heads on the nt toss and {X;, = 0} corresponds to tails on the nt toss. In the “long run”, we would expect the proportion of heads to be p; this would justify our model that claims the probability of heads is p. Mathematically we would want tim Xi (w) +... + Xnl ne n =p forallwe 2. This simply does not happen! For example let wo = {T.T,T,...}, the se- quence of all tails. For this wo, ig, im, = Se Xj (wo) = 0. jar More generally we have the event A = {w: only a finite number of heads occur}. Then _ 1s fi 2G) =0 for allwe A. We readily admit that the event A is very unlikely to occur. Indeed, we can show (Exercise 17.13) that P(A) = 0. In fact, what we will eventually show (see the Strong Law of Large Numbers [Chapter 20]) is that 142 17. Convergence of Random Variables P stim 137X,w)= =1 wv: tim . jw) =p} =1. j= This type of convergence of random variables. where we do not have conver- gence for all w but do have convergence for almost all w (i.e.. the set of w where we do have convergence has probability one). is what typically arises. Caveat: In this chapter we will assume that all random variables are defined on a given, fixed probability space (§2..A,P) and takes values in R or R". We also denote by |z| the Euclidean norm of « € R”. Definition 17.1. We say that a sequence of random variables (Xp)n>1 con- verges almost surely to a random variable X if N= {w: lim Xp(w) 4 X(w)} has P(N) =0. n= Recall that the set N is called a null set, or a negligible set. Note that Nes A= {ws lim X,(w) =X(w)} and then P(A) =1. na We usually abbreviate almost sure convergence by writing lim X, =X as. n0 We have given an example of almost sure convergence from coin tossing pre- ceding this definition. Just as we defined almost sure convergence because it naturally occurs when “pointwise convergence” (for all “points”) fails, we need to introduce two more types of convergence. These next two types of convergence also arise naturally when a.s. 
convergence fails, and they are also useful as tools to help to show that a.s. convergence holds. Definition 17.2. A sequence of random variables (X,)n>1 converges in L? to X (wherel < p <2) if |Xnl, |X| are in L? and: lim E{|X, —X nt ?} =0. Alternatively one says X;, converges to X in pth mean. and one writes x,2X. The most important cases for convergence in pth mean are when p = 1 and when p = 2. When p = 1 and all r.v.’s are one-dimensional, we have 17. Convergence of Random Variables 143 \E{Xq — X}| < BE|X, — |} and |E{/X,|} — EYX|} < BCX, — XI} because {|| — |yl| < ja — yle Hence X, 5X implies B{X,}— E{X} and E{|Xq|} > E{|X]}. (171) Similarly. when X,, “ X for p € (1,00), we have that F{|X,,|?} converges to E{|X|P}: see Exercise 17.14 for the case p = 2. Definition 17.3. A sequence of random variables (X,)n>, converges in probability to X if for any ¢ > 0 we have slim, P(e LX ~ X(u)|>e})=0. This is also written lim P(\|X, —X|>e)=0, n—00 and denoted XxX, 2X. Using the epsilon-delta definition of a limit, one could alternatively say that X, tends to X in probability if for any ¢ > 0, any 6 > 0, there exists N = N(6) such that P(\X,—X|>e) <6 for alln > N. Before we establish the relationships between the different types of con- vergence, we give a surprisingly useful small result which characterizes con- vergence in probability. Theorem 17.1. X;, 2x if and only if . |\Xn=X| | _ lim e{ BAL =0. Proof. There is no loss of generality by taking X = 0. Thus we want to show X, © 0 if and only if lim, 0 {225} = 0. First suppose that X, 4 0. Then for any ¢ > 0, limn—oo P(|Xn| > €) = 0. Note that [Xn [Xn —< lax Vyixnise} S Uixalsey +E T+ [Xa ~ Tr] Xn) tiXeteet FEM iXaised S ixalpey + Therefore BL lV < efigyysey} te = PUXal > 6) be 1+ [Xn] J ~ . Taking limits yields 144 17. Convergence of Random Variables . |Xnl Ei —" hb sme {Ba ° Since € > 0 is fixed, we conclude limn 4. P(|X»| > €) = 0. Qa Remark: What this theorem says is that X, 2X PE {f((X, — X|)} 0 for the function f(x) = a the same equivalence holds for any function f on R which is bounded, strictly increasing on [0,00), continuous, and with f(0) = 0. For example we have Xp, % X iff E{|X,—X|A1} > 0 and also iff H{arctan(|X;, — X|)} > 0. A careful examination of the proof shows that The next theorem shows that convergence in probability is the weakest of the three types of convergence (a.s., L?, and probability). Theorem 17.2. Let (Xn)n>1 be a sequence of random variables. a) If X, 5X, then X, 5X. b) If X, “3X, then X, 2X. Proof. (a) Recall that for an event A, P(A) = E{1,4}, where 14 is the indi- cator function of the event A. Therefore, PA|Xn — X| >e} = E {1 yx,.~x1>e}}- Note that. XaexP > lon the event {|X, — X| > e}, hence |Xp—X < 0 always, we can simply drop the indicator function to get: ? ‘The notation iff is a standard notation shorthand for “if and only if” 17. Convergence of Random Variables 145 0), which gives the result. (b) Since R= XT < 1 always, we have X,-X X, im EB 4 ~~ sth = dim, {a a}- {im Pe a} FXO} =0 by Lebegue’s Dominated Convergence Theorem (9.1(f)). We then apply The- orem 17.1. o The converse to Theorem 17.2 is not true; nevertheless we have two partial converses. The most delicate one concerns the relation with a.s. convergence, and goes as follows: Theorem 17.3. Suppose Xp, 4. X. Then there exists a subsequence n, such that limp oc Xn, = X almost surely. Proof. Since X, % X we have that lim, .o H{%s=XL} = 0 by The- +1Xn—X] orem 17.1. Choose a subsequence n;, such that Bee =*t} < sr. 
Then a ny — XI [Xn =X1) Soka PA PAR Sa} < 90 and by Theorem 9.2 we have that PS rex ¢) = ap, and as soon as an — 0 we deduce that Xn % 0 (that is, X,, tends to 0 in probability). More precisely, let X;,,; be the indicator of the interval [4,4], 1 X and also that |X,| 0 we have {IX]>¥ +e} ¢ |X] >[Xn| +e} < {|X| — |Xn] > e} c {|X —X,] >e}. hence P(|X|>Y¥ +e) < P(\X—X,| >). and since this is true for each n, we have P(\X|>Y +e) < lim P(|X — X,|> <2) ne by hypothesis. This is true for each ¢ > 0, hence 1 P(\X|>Y)< lim P(\X|>Y+—) mide m from which we get |X| < Y a.s. Therefore X € L? too. Suppose now that X;, does not converge to X in L?. There is a subse- quence (n,) such that E{|X,, — X|?} > e for all k. and for some ¢ > 0. The subsequence X,,,, trivially converges to X in probability. so by Theorem 17.3 it admits a further subsequence Xnx, which converges a.s. to X. Now, the r.v.’s X,, —X tend a.s. to 0 as j — oo, while staying smaller than 2Y. so by Lebesgue’s Dominated Convergence we get that E{|X;,, — X|P} > 0, which contradicts the property that FE {|X,,— X|?} > ¢ for all k: hence we are done. oO The next theorem is elementary but also quite useful to keep in mind. Theorem 17.5. Let f be a continuous function. a) Iflimp tx Xn =X as. then limy sx f(Xn) = f(X) as. b) IfXq 2X. then f(X,) ® F(X). 17, Convergence of Random Variables 147 = {w : lim,-x Xn(w) # X(w)}. Then P(N) = 0 by . then Proof. (a) Let N hypothesis. If w ¢ » slim, f(Xn(e)) = f (Bim Xue) = #K(). where the first equality is by the continuity of f. Since this is true for any w & N, and P(N) =0. we have the almost sure conyergence. (b) For each k > 0, let us set: {|f(Xn) — F(X)| > e} C {| f(Xn) — F(X] > es |X| SREY {]X] > BH. (17.2) Since f is continuous. it is uniformly continuous on any bounded interval. Therefore for our ¢ given, there exists a 6 > 0 such that |f(x) — f(y)| < e if | — y| <6 for ¢ and y in [—k.k]. This means that {|f(Xn) — F(X)| > |X] Sk} C {Xn — X] > 6 |X| Sh} C {|Xn—X| > 5}. Combining this with (17.2) gives {1f(Xn) = F(X) > ef C {|Xn — X| > OF U{|X) > Fh. (17.3) Using simple subadditivity (P(AUB) < P(A) + P(B)) we obtain from (17.3): PAF (Xn) — FX) > e} S P(\Xn — X| > 6) + PUA] > k). However {|X| > k} tends to the empty set as k increases to oc so limp oo P(|X| > k) = 0. Therefore for y > 0 we choose k so large that P(\|X| >k) < +. Once k is fixed, we obtain the 6 of (17.3), and therefore lim P (|f(Xn) — f(X)| > 2) < lim P(|X,-X|>6)+y7=7%. n=00 n—00 Since 7 > 0 was arbitrary. we deduce the result. o 148 17. Convergence of Random Variables Exercises for Chapter 17 17.1 Let X,,; be as given in Example 2. Let Zp,j =n?Xp,j. Let Ym, be the sequence obtained by ordering the Z,,; as was done in Example 2. Show that. Ym tends to 0 in probability but that (Yn)m>, does not tend to 0 in L?, although each Y,, belongs to L?. 17.2 Show that Theorem 17.5(b) is false in general if f is not assumed to be continuous. (Hint: Take f(z) = 149}(x) and the X,,’s tending to 0 in probability.) 17.3 Let Xp be iid. random variables with P(X, = 1) = 4 and P(X; = —1) = 3. Show that i nu ja converges to 0 in probability. (Hint: Let S, = Sx"_, Xj, and use Chebyshev’s inequality on P{|S;,| > ne}.) 17.4 Let X, and S,, be as in Exercise 17.3. Show that ae Sn2 converges to zero a.s. (Hint: Show that 5%, P{7s|S,2| > e} < co and use the Borel- Cantelli Theorem.) 17.5 * Suppose |X| < Y a.s., each n,n = 1,2,3.. Show that sup, [Xn] 1 have finite variances and zero means (i.e., Var(X,) = o%, < oc and E{X,} = 0, all n). Suppose limy 0%, = 0. 
Show Xp converges to 0 in L? and in probability. 17.10 Let X; be iid. with finite variances and zero means. Let S, = Sh, X). Show that +S, tends to 0 in both L? and in probability. Exercises 149 17.11* Suppose limn—. Xn = X a.s. and |X| <0o as, Let Y = sup, |Xn|- Show that Y <0 as. 17.12 * Suppose limn so Xn = X as. Let ¥ = sup, [Xp — X|. Show ¥ < 96 a.s. (see Exercise 17.11). and define a new probability measure Q by 1 1 1 Q(A) = te{usry}. where e= B {5}. Show that X,, tends to X in L) under the probability measure Q. 17.13 Let A be the event described in Example 1. Show that P(A) = 0. (Hint: Let A, = { Heads on n* toss }. Show that S7*, P(An) = oc and use the Borel-Cantelli Theorem (Theo- rem 10.5.) ) 17.14 Let X,, and X be real-valued r.v.’s in L?, and suppose that X,, tends to X in L?. Show that E{X?} tends to E{X?} (Hint: use that |z? — y?| = (x — y)? + 2\y||z — y| and the Cauchy-Schwarz inequality). 17.15 * (Another Dominated Convergence Theorem.) Let (Xn)n>1 be random variables with X, 2 X (limp oo Xn = X in probability). Suppose |X, (w)| < C for a constant C’ > 0 and all w. Show that lim, —o. E{|Xpn—X|} = 0. (Hint: First show that P(|X| < C) = 1.) 18. Weak Convergence In Chapter 17 we considered four types of convergence of random variables: pointwise everywhere. pointwise almost surely, convergence in p*” mean (L? convergence), and convergence in probability. While all but the first differ from types of convergence seen in elementary Calculus courses, they are nev- ertheless squarely in the analysis tradition, and they can be thought of as variants of standard pointwise convergence. While these types of convergence are natural and useful in probability, there is yet another notion of conver- gence which is profoundly different from the four we have already seen. This convergence, known as weak convergence, is fundamental to the study of Prob- ability and Statistics. As its name implies, it is a weak type of convergence. The weaker the requirements for convergence, the easier it is for a sequence of random variables to have a limit. What is unusual about weak conver- gence. however, is that the actual values of the random variables themselves are not important! It is simply the probabilities that they will assume those values that matter. That is, it is the probability distributions of the random variables that will be converging, and not the values of the random variables themselves. It is this difference that makes weak convergence a convergence of a different type than pointwise and its variants, Since we will be dealing with the convergence of distributions of random variables, we begin by considering probability measures on R4@, some d > 1. Definition 18.1. Let jin and jy be probability measures on R4 (d> 1). The Sequence [in converges weakly to p if f f()n(da) converges to f f(x)u(dr) for each f which is real-valued, continuous and bounded on R¢. At first glance this definition may look like it has a typographical error: one is used to considering sim, f futriutae) = [ feyu(a; but here f remains fixed and it is indeed yz that varies. Note also that we do not consider all bounded Borel measurable functions f, but only the subset that are bounded and continuous. Definition 18.2. Let (Xn)nzi, X be R¢-valued random variables. We say X,, converges in distribution to X (or equivalently X,, converges in law to X ) 15218, Weak Convergence if the distribution measures P*" converge weakly to PX. We write Xp, 2x : or equivalently Xn 4x. Theorem 18.1. 
Let (X_n)_{n≥1}, X be ℝ^d-valued random variables. Then X_n converges in distribution to X if and only if

lim_{n→∞} E{f(X_n)} = E{f(X)}

for all continuous, bounded functions f on ℝ^d.

Proof. This is just a combination of Definitions 18.1 and 18.2, once we observe that

E{f(X_n)} = ∫ f(x) P^{X_n}(dx),  E{f(X)} = ∫ f(x) P^X(dx). □

It is important to emphasize that if X_n converges in distribution to X, there is no requirement that (X_n)_{n≥1} and X be defined on the same probability space (Ω, A, P)! Indeed in Statistics, for example, it happens that a sequence (X_n)_{n≥1} all defined on one space will converge in distribution to a r.v. X that cannot exist on the same space the (X_n)_{n≥1} were defined on! Thus the notion of weak convergence permits random variables to converge in ways that would otherwise be fundamentally impossible.

In order to have almost sure or L^p convergence, or convergence in probability, one always needs that (X_n)_{n≥1}, X are all defined on the same space. Thus a priori convergence in distribution is not comparable to the other kinds of convergence. Nevertheless, if by good fortune (or by construction) all of the (X_n)_{n≥1} and X are defined on the same probability space, then we can compare the types of convergence.

Theorem 18.2. Let (X_n)_{n≥1}, X all be defined on a given and fixed probability space (Ω, A, P). If X_n converges to X in probability, then X_n converges to X in distribution as well.

Proof. Let f be bounded and continuous on ℝ^d. Then by Theorem 17.5 we know that f(X_n) converges to f(X) in probability too. Since f is bounded, f(X_n) converges to f(X) in L^1 by Theorem 17.4. Therefore lim_{n→∞} E{f(X_n)} = E{f(X)} by (17.1), and Theorem 18.1 gives the result. □

There is a (very) partial converse to Theorem 18.2.

Theorem 18.3. Let (X_n)_{n≥1}, X be defined on a given fixed probability space (Ω, A, P). If X_n converges to X in distribution, and if X is a r.v. equal a.s. to a constant, then X_n converges to X in probability as well.

Proof. Suppose that X is a.s. equal to the constant a. The function f(x) = |x − a|/(1 + |x − a|) is bounded and continuous; therefore lim_{n→∞} E{|X_n − a|/(1 + |X_n − a|)} = 0, and hence X_n converges to a in probability by Theorem 17.1.

It is tempting to think that convergence in distribution implies the following: that if X_n converges in distribution to X then P(X_n ∈ A) converges to P(X ∈ A) for all Borel sets A. This is almost never true. We do have P(X_n ∈ A) converges to P(X ∈ A) for some sets A, but these sets are quite special. This is related to the convergence (in the real valued case) of distribution functions: indeed, if the X_n are real valued and X_n converges in distribution to X, then if F_n(x) = P(X_n ≤ x) were to converge to F(x) = P(X ≤ x), we would need to have P(X_n ∈ (−∞, x]) converge to P(X ∈ (−∞, x]) for all x ∈ ℝ, and even this is not always true!

Let us henceforth assume that (X_n)_{n≥1}, X are real valued random variables and that (F_n)_{n≥1}, F are their respective distribution functions. The next theorem is rather difficult and can be skipped. We note that it is much simpler if we assume that F, the distribution function of the limiting random variable X, is itself continuous. This suffices for many applications, but we include a proof of Theorem 18.4 for completeness. For this theorem, recall that the distribution function F of a r.v. is nondecreasing and right-continuous, and so it has left limits everywhere, that is lim_{y→x, y<x} F(y) = F(x−) exists for all x (see Exercise 18.4).

Theorem 18.4. Let (X_n)_{n≥1}, X be real valued random variables.
a) If X_n converges in distribution to X, then lim_{n→∞} F_n(x) = F(x) for all x in the dense subset of ℝ given by D = {x : F(x−) = F(x)}.
(Fa(z) = P(Xn < 2); D is sometimes called the set of continuity points of F’.) b) Suppose lim, Fn(x) = F(x) for all x in a dense subset of R. Then x, 2x. Proof of (a): Assume X, ™ X. Let D = {2x: F(a—) = F(a)}. Then Disa dense subset of R. since its complement (the set of discontinuities of F) is at. most countably infinite (see Exercises 18.4 and 18.5), and the complement of a countable set is always dense in R. Let us fix ¢ € R. For each integer p > 1 let us introduce the following bounded, continuous functions: 1 ify 1. Note further that EX9p(Xn)} < Fr(w) < Et fo(Xn)} and hence E{gp(X)} < lim inf F(a) < lim sup F(x) < E{f,(X)}. each p > 1. n00 no ~ (18.1) Now limp fp(y) = l-oe,aj(y) and limp sc gp(y) = 1(-20,2)(y), hence by Lebesgue’s dominated convergence theorem (Theorem 9.1(f)) we have that jim, FA fp X)} = B{1(-cc.ai(X)} =P(X F(a). Proof of (b): Now we suppose that limps. F,(«) = F(x) for all c € A, where A is a dense subset of R. Let f be a bounded, continuous function on R and take ¢ > 0. Let r,s be in A such that P(X ¢ (r.s]) =1~ F(s) + FQ”) Se. (Such r and s exist. since F(a) decreases to 0 as a decreases to —oc. and increases to 1 as x increases to +00, and since J is dense). Since F,, converges to F on A by hypothesis, there exists an N, such that for n >). P(Xn € (r,8]) = Since [r, s] is a closed (compact) interval and f is continuous, we know f is uniformly continuous on [r. s]; hence there exists a finite number of points r=ro <<... <1rk = such that — Fi,(s) + Fy(r) < 2e. (18.3) 18. Weak Convergence 155 {f@)-f@pise if nasesry. and each of the r; are in A, 1 < j < k. (That we may choose r; in A follows from the fact that A is dense.) Next we set. k 9) = SOF )Ur,1-ryi(@) (18.4) and by the preceding we have |f(x) ~ g(x)| < ¢ on (r.s]. Therefore if a = sup, |f(x)|, we obtain |E{f(Xn)} — E{g(Xn)}] S OP(Xn ¢ (1,8) +e (18.5) and the same holds for X in place of X,,. Using the definition (18.4) for g, observe that k E{g(Xn)} = Yo (ry) (Fal) ~ Fa(rp—1)} and analogously ke E{g(X)} = S005) {F (19) — Fa) }- j=l Since all the rj’s are in A, we have limy—ce Fn(ry) = F(rj) for each j. Since there are only a finite number of rj’s, we know there exists an Nz such that for n > No, |E{g(Xn)} — E{g(X)}| <. (18.6) Let us now combine (18.5) for X, and X and (18.6): ifm > max(N), No). then |E{P(Xn)} — FLAX} S/E{f(Xn)} — Elg(Xn)}) + |El9(Xn)} — BL9(X)}] + |E{g(X)} — EXf(X)}I < (aP(Xn ¢ (r.8]) +2) +e + (aP(X ¢ (r,5]) +2) S ae +e) +e + (ae +e) S 8ae + 32. Since ¢ was arbitrary, we conclude that lim,.. E{f(Xn)} = E{f(X)} for all bounded, continuous f; hence by Theorem 18.1 we have the result, 0 156 18. Weak Convergence Examples: 1. Suppose that ({n)n>1 is a sequence of probability measures on R that are all point masses (or, Dirac measures ): that is. for each n there exists a point a» such that jin({a@,}) = 1 and Hn ({an}*) = Un(R \ {an }) = 0. Then yz, converges weakly to a limit y if and only if a, converges to a point a; and in this case p is point mass at a. [Special note:: “point mass” probability measures are usually written €, or 6, in the literature, which is used to denote point mass of size one at the point a.} Note that F,(2) = lo,.cc)(#), and therefore limn—. Fy(t) = F(x) on a dense subset easily implies that F must be of the form 1jq,.)(), where a = limp oc On+ 2. Let 1 0 if eS-7 1 1 Fre) stat if —~. nm Then tim, F(t) = F(x) = 10.<)(2) for all x except x = 0; ths the set D of Theorem 18.4 is D = R \ {0}. Thus if £(X,,) is given by F,,, then we have X;, 2, X. 
where X is constant and equal to 0 a.s. (£(X) is given by F.) What we have shown is that a sequence of uniform random variables (Xn)n>1, with X, uniform on 1). converge weakly to 0 (ie., the constant random variable equal ). 3. Let (Xn)n>i1, X be random variables with densities f,(«), f(x). Then the distribution function a F(x) -/ f(ujdu 00 is continuous; thus F(c—) = F(a) on all of R. Suppose f,() < g(a), all nand x, and f*. g(x)dx < 00, and limp. fn(w) = f(a) almost every- where. Then F,(x) converges to F(a) by Lebesgue’s dominated conver- gence theorem and thus X;, 2x. Note that alternatively in this example we have that tim, / h(w)P** (da) = lim. / h(t) fr(a)de = / h(a) lim f(e)dx = [ ro\fteyar = [rar (aa) 18. Weak Convergence 157 for any bounded continuous function h by Lebesgue’s dominated conver- gence theorem, and we have another proof that X,, 2. X. This proof works also in the multi-dimensional case, and we see that a slightly stronger form of convergence than weak convergence takes place here: we need h above to be bounded and measurable, but the continuity is superfluous. The previous example has the following extension, which might look a bit surprising: we can interchange limits and integrals in a case where the sequence of functions is not dominated by a single integrable function; this is due to the fact that all functions f, and f below have integrals equal to 1. Theorem 18.5. Let (Xn)n>i, X be r.v.’s with values in R4, with densities dn, f. If the sequence fn converges pointwise (or even almost everywhere) to f, then X, 2X. Proof. Let h be a bounded measurable function on R¢, and a = sup, {h(«)|. Put Ay (a) = h(v) +a and ho(x) = a—h(x). These two functions are positive, and thus so are hifn and he fn, all n. Since further for i = 1,2 the sequence hi fn converges almost everywhere to h;f, we can apply Fatou’s Lemma (see Theorem 9.1) to obtain E{hi(X)} = | feonceyar < tinint f fa(a)hs(e)de = lim inf £{h,(X,)}- aoe mae (18.7) Observe that E{h(Xn)} = B{hy(Xn)}—@ and E{h(Xn)} = a— E{ho(Xn)}, and the same equalities hold with X in place of Xp. Since liminf(z,) = —limsup(—2,), it follows from (18.7) applied successively to i= 1 and i that E{h(X)} < liminf, 5 E{h(Xn)}, E{h(X)} > limsup,, . E{h(Xn)}- Hence E{h(X,)} converges to E{h(X)}, and the theorem is proved. o The next theorem is a version of what is known as “Helly’s selection principle”. It is a difficult theorem, but we will need it to establish the relation between weak convergence and convergence of characteristic functions. The condition (18.8), that the measures can be made arbitrarily small, uniformly in n, on the complement of a compact set, is often called tightness. Theorem 18.6. Let (iin)n>1 be a sequence of probability measures on R. and suppose lim_ sup fn (/-m, m]*) = 0. (18.8) moo Then there exists a subsequence nx such that (jn, )k>1 converge weakly. 158 18, Weak Convergence Proof. Let F(t) = pn((—2¢. a]). Note that for each « € R. 0 < F(a) <1 for all n, thus (F,(©))nz1 is a bounded sequence of real numbers. Hence by the Bolzano-Weierstrass theorem there always exists a subsequence nx such that (Fn, (@))e>1 converges. (Of course the subsequence nx a priori depends on the point 2). We need to construct a limit in a countable fashion, so we restrict our attention to the rational numbers in R (denoted Q). Let ri.r2.....17; be an enumeration of the rationals. For ri, there exists a subsequence nj, of n such that the limit exists. 
We set: Gr) lim Fy, (r2)- koo6 For r2, there exists a sub-subsequence n2.4 such that the limit exists. Again. set: G(r2) lim Fr, ,(72). koe That is, n2,, is a subsequence of n1~. We continue this way: for rj, let nj4 be a subsequence of nj—1.x such that the limit exists. Again, set: Gry) = fim Fry (0))- We then form just one subsequence by taking ny := nx,4. Thus for rj, we have GOr3) = im, Fas(), since nz is a subsequence of nj, once k > j. Next we set: F(a) = inf G(y). (18.9) weQ yor Since the function G defined on Q is non-decreasing, so also is the function F given in (18.9), and it is right continuous by construction. Let ¢ > 0. By hypothesis there exists an m such that Hr{[-m,m]®) <€ for all n simultaneously. Therefore F,(t) Seif <—m, and F(z) >1—e ife>m; therefore we have the same for G, and finally F(a) Se if 2<-—m F(a)>1-c if a>m. (18.10) Since 0 < F <1, F is right continuous and non-decreasing. property (18.10) gives that F’ is a true distribution function, corresponding to a probability measure js on R. 18. Weak Convergence 159 Finally, suppose x is such that F(z—) = F(x). For ¢ > 0. there exist y.2€Q with y m}) = 0. (18.12) moe A useful observation is that in order to show weak convergence, one does not have to check that f f dun converges to f f dy for all bounded, continuous f. but only for a well chosen subset of them. We state the next result in terms of the convergence of random variables. Theorem 18.7. Let (X;,)n>1 be a sequence of random‘variables (R. or R4- valued). Then X, 2X if and only if littn ss. E{g(Xn)} = E{g(X)} for all bounded Lipschitz continuous functions 9. Proof. A function g is Lipschitz continuous if there exists a constant k such that lg(x) — g(y)| < kil — yll, all x,y. Note that necessity is trivial. so we show sufficiency. We need to show lim, E{f(Xn)} = E{F(X)} for all bounded, continuous functions f. Let f be bounded continuous, and let a = sup, |f(a)|. Suppose there exist Lipschitz continuous functions g;, with —a< gi S giz < f, and limi. gi(x) = f(x). Then Tim inf E{f(Xn)} = limint F{ge(Xn)} = Eg}, for each fixed 7. But the Monotone Convergence Theorem applied to g;(X)+a and f(X) + a implies 160 18. Weak Convergence Jim B{gi(X)} = EXD}. Therefore lim inf E{f(Xn)} 2 B{F(X)}. (18.13) Next, exactly the same argument applied to —f gives lim sup E{f(Xp)} < EC F(X}, (18.14) and combining (18.13) and (18.14) gives dim EUS(Xa)} = BUX): It remains then only to construct the functions g;. We need to find a sequence of Lipschitz functions {j1, j2....} such that sup, jx(x) = f(x) and jx(x) > —a; then we can take g;(«) = max{j,(x),...,Ji(«)}, and we will be done. By replacing f(x) by f(z) = f(x) +a if necessary, without loss of gener- ality we can assume the bounded function f(x) is positive for all «. For each Borel set A define a function representing distance from A by da(2) = inf {|r yllsy © A}. Then for rationals r > 0 and integers m, define Ino (@) = 7A (Md tyegeyyery()) « Note that |da(z)—da(y)] < Ja —yll for any set A. hence |jin.r()—Jm.r(y)| S ml|x — y||, and so jm,r is Lipschitz continuous. Moreover jy.,(@) 0. Choose a positive rational r such that. f(@)-e r for all y in a neighborhood of a. Therefore deysp(yyer}(@) > 0, hence jmr(z) = 7 > fle) —e. for m sufficiently large. Since the rationals and integers are countable, the collection {im rim € N,r € Q,} is countable. If {j;};>, represents an enumeration, we have seen that sup, j:(x) > f(x). Since j; < f, each i, we have sup, j;(2) = f(a) and we are done. a Corollary 18.1. 
Let (Xp)n>1 be a sequence of random variables (R or R4 valued). Then X, © X if and only if limp oo E{g(Xn)} = E{g(X)} for all bounded uniformly continuous functions g. Proof. If g is Lipschitz then it is uniformly continuous, so Theorem 18.7 gives the result. oO 18. Weak Convergence 161 Remark 18.2. In Theorem 18.7 we reduced the test class of functions for R or R¢ valued randoin variables to converge weakly: we reduced it from bounded continuous functions to bounded Lipschitz continuous functions. One may ask if it can be further reduced. It can in fact be further reduced to C* functions with compact support. See Exercises 18.19-18.22 in this regard, where the solutions to the exercises show that X,, converges to X in distribution if and only if E{f(X;,)} converges to E{ f(X)} for all bounded, C™ functions f. A consequence of Theorem 18.7 is Slutsky’s Theorem, which is useful in Statistics. Theorem 18.8 (Slutsky’s Theorem). Let (Xp)n>1 and (Yn)n>1 be two sequences of R? valued random variables, with Xp 2X and ||Xn - Yn|| 3 0 in probability. Then Y, 2 X. Proof. By Theorem 18.7 it suffices to show lim, E{f(Y,)} = E{f(X)} for all Lipschitz continuous, bounded f. Let then f be Lipschitz continuous. We have | f(x) — f(y)| < kl|x — y|| for some real k, and |f()| < @ for some real a. Then we have dim, PEF(Xn) = F¥a)H Stim, BC f(Xn) — F%e) Lh S he + lim EX (Xn) — £0) x.-vat 20} But limp soo E{|f (Xn) — f(¥n)|1 xn -¥al)>e}} < line 2aP {|| Xn = Yall > e} = 0, and since € > 0 is arbitrary we deduce that limn—x |E{f (Xn) - f(Yn)}| = 0. Therefore lim E(f(%)} = lim EU(%)} = BUD, and the theorem is proved. o We end this section with a consideration of the weak convergence of ran- dom variables that take on at most a countable number of values (e.g., the binomial, the Poisson, the hypergeometric, etc.). Since the state space is countable, we can assume that every function is continuous: this amounts to endowing the state space with the discrete topology (Caution: if the state space, say /, is naturally contained in R for example, then this discrete topology is induced by the usual topology on R. only when the minimum of |x—y| for x,y € EA[—m, m] is bounded away from 0 for all m > 0, like when E=N or E = Z, where Z denotes the integer). The next theorem gives a simple characterization of weak convergence in this case, and it is comparable to Theorem 18.5. Theorem 18.9. Let X,. X be random variables with at most countably many values. Then X;, > X if and only if 162 18. Weak Convergence lim P(Xn = J) = P(X = Jf) n= for each j in the state space of (Xn)nz1. X. Proof. First suppose Xp 2 X. Then lim E{f(Xn)} = ELF(X)} nox for every bounded, continuous function f (Theorem 18.1). Since all functions are continuous, choose f(x) = 14;}(a) and we obtain the result. Next, suppose limn oo P(Xn = j) = P(X = j) for all j in the state space E. Let f be a bounded function with a = sup, |f(j)|. Take € > 0. Since Mex =j=1 Jee is a convergent series, there must exist a finite subset A of FE such that Soe(X sj) 21-e jeA also for n large enough we have as well: Yo P(X, =f) > 1-28. jea Note that EAS(X)} = SO PPX = 3). jeE so we have, for n large enough: ELF} ~ Dye fO)P(X =H) See (18.15) |ELF%)} — Dye FUP (Xn = A)| < 2a. Finally we note that since A is finite we have dim SO PG) Pn =) = VO PV) P(X = 9)- (18.16) jEA jeA Thus from (18.15) and (18.16) we deduce Tim sup [4 F(Xn)} — BLE} S Bae, Since € was arbitrary, we have slim, E(f(Xn)} = BU} for each bounded (and a fortiori continuous) function f. 
Thus we have X;, % X by Theorem 18.1. oO 18. Weak Convergence 163 Examples: 4. If 4 denotes the Poisson distribution with parameter A, then ay nr(9) =e ‘ae and thus if Ay > A, we have j1,,,(J) > Ha(J) for each j = 1.2,3,... and by Theorem 18.9 we have that z,,, converges weakly to py. 5. If 4p denotes the Binomial B(p,n) distribution and if py — p, as in Example 4 and by Theorem 18.9 we have that ip, converges weakly to Mp 6. Let Ln.p denote the Binomial B(p,n). Consider the sequence jin,» Where limp oo npn = > 0. Then as in Exercise 4.1 we have rood = (8) FC) CO’) for 0 1), then X, 2 X. 18.2 Let a € R%. Show by constructing it that there exists a continuous function f : R¢ — R such that 0 < f(x) < 1 for all x € R*: f(a) = 0; and f(a) = 1 if |w—a| > ¢ for a given e > 0. (Hint: First solve this exercise when d=1 and then mimic your construction for d > 2.) 18.3 Let X be a real valued random variable with distribution function F’. Show that F(«—-) = F(«) if and only if P(X = 2) =0. 18.4* Let 9: RR. 0< g(a) < 1, g nondecreasing, and suppose g is right continuous (that is, limy—.,y>2 9(y) = g(x) for all «). Show that g has left limits everywhere (that is, limy—2,y<2 9(y) = g(x—) exists for all x) and that the set A = {x : g(e—) g(z)} is at most countably infinite. (Hint: First show there are only a finite number of points x such that g(x) — g(w—) > ¢; then let k tend to oc). 18.5 * Let F be the distribution function of a real valued random variable. Let D = {a : F(a—) = F(x)} (notation of Exercise 18.4). Show that D is dense in R. (Hint: Use Exercise 18.4 to show that the complement of D is at most countably infinite.) 18.6 Let (X;,)n>1 be a sequence of real valued random variables with £(X,) uniform on [—n,n]. In what sense(s) do X,, converge to a random variable X? [Answer: None.] 18.7 Let fn(x) be densities on R and suppose limnsoo fn(t) = €7*1(es¢)- If fn is the density for a random variable X,, each n, what can be said about the convergence of X;, as n tends to oo? [Answer: X 2 X, where X is exponential with parameter 1.]| 18.8 Let (Xn)nzi be iid. Cauchy with a = 0 and @ = 1. Let ¥, = Sate d Ny Show that Y, converges in distribution and find the limit. Does Y, converge in probability as well? 18.9 Let (Xn)nz1 be a sequence of random variables and suppose sup, E{(X")*} < co. Let pm be the distribution measure of X,,. Show that the sequence jn is tight (Hint: use Chebyshev’s inequality). 18.10 * Let X,, X and Y be real-valued r.v.’s, all defined on the same space (92,A, P). Assume that lim, EC f(Xn)o(¥)} = EL F(X) 9(¥)} whenever f and g are bounded, and f is continuous, and g is Borel. Show that the sequence (X,Y) converges in law to (X,Y). If furthermore X = h(Y) for some Borel function h, show that X, 5 X. Exercises 165 18.11 Let 444 denote the Pareto (or Zeta) distribution with parameter a. Let Qn — @ > 0 and show that jig,, tends weakly to pa. 18.12 Let yz. denote the Geometric distribution of parameter a. Let an > @ > 0, and show that ji, tends weakly to fla. 18.13 Let p(x.5.n) be a Hypergeometric distribution, and let N go to oo in such a way that p = 4 remains constant. The parameter n is held fixed Show as N tends to 00 as described above that j(v,5,n) converges weakly to the Binomial distribution B(p,n). 18.14 (Slutsky’s Theorem.) Let X,, converge in distribution to X and let Y,, converge in probability to a constant c. Show that (a) X,¥_ > eX (in distribution) and (b) ¥« 3, * (in distribution), (c ¥ 0) 18.15 Let (Xn)n2i- (Yx)nz1 all be defined on the same probability space. 
Suppose X,, 3 X and Y, converges in probability to 0. Show that X;, + Yn converges in distribution to X. 18.16 Suppose real valued (Xp)n>1 have distribution functions F,,, and that X,, 2X. Let p > 0 and show that for every positive N, N N [ |clPF(dz) < lim sup f |2\? Fy, (dx) < 00. -N noo JN 18.17 * Let real valued (X;,)n>1 have distribution functions F,,, and X have distribution function F’. Suppose for some r > 0, co lim / |F,(e) — F(2)|"dr = 0. ims Show that X,, Bx (Hint: Suppose there exists a continuity point y of F such that limp—oc Fn(y) # F(y). Then there exists ¢ > 0 and a subsequence (nx )uz1 such that |Fy,(y) — F(y)| > €. all k. Show then |Fn, (x) ~F(a)| > § for either x € [y,,y) or x € (y, ya] for appropriate yi, y2. Use this to derive a contradiction.) 18.18 * Suppose a sequence (Fp) n>1 of distribution functions on R converges to a continuous distribution function F on R. Show that the convergence is uniform in x (—oo < x < 00). (Hint: Begin by showing there exist points @),..+;0m such that F(a) < €, F(#j;41) — F(2;) < €, and 1— F(a) N, |F,(a;) — F(a;)| 1. X, Y by R-valued random variables, all on the same space. and suppose that X,, + ¢Y converges in distribution to X + 0Y for each fixed o > 0. Show that X,, converges to X in distribution. (Hint: Use Exercise 18.19.) 18.21 (Pollard [17]) Let X and Y be independent r.v.’s on the same space. with values in R and assume Y is N(0.1). Let f be bounded continuous. Show that BUX +0Y)} = EUfo(X)} 1 ios ~hle—al2/o? r) = —— ae BAI da, fo) Vino I. fe) Show that f, is bounded and C* where 18.22 Let (Xn)no1, X be R-valued random variables. Show that X;, con- verges to X in distribution if and only if E{f(X,,)} converges to E{ f(X)} for all bounded C* functions f. (Hint: Use Exercises 18.20 and 18.21.) 19. Weak Convergence and Characteristic Functions Weak convergence is at the heart of much of probability and statistics. Limit theorems provide much of the justification of statistics, and they also have a myriad of other applications. There is an intimate relationship between weak conyergence and characteristic functions. and it is indeed this relationship (provided by the next theorem) that makes characteristic functions so useful in the study of probability and statistics. Theorem 19.1 (Lévy’s Continuity Theorem), Let (jin)nz1 be @ se- quence of probability measures on R4, and let ({in)n>1 denote their Fourier transforms, or characteristic functions. a) If tn converges weakly to a probability measure ju, then jfin(u) > fi(u) for allu€ R¢; b) If fin(u) converges to a function f(u) for all u € R4, and if in addition f is continuous at 0, then there exists a probability 4 on R4 such that f(u) = fi(u), and py converges weakly to p. Proof. (a) Suppose ji, converges weakly to yz. Since e'“* is continuous and bounded in modulus, fins) = fe pa(de) converges to atu) = fe u(ar) by weak convergence (the function « + e'“* is complex-valued, but we can consider separately the real-valued part cos(u«) and the imaginary part sin(ur), which are both bounded and continuous). (b) Although we state the theorem for R%, we will give the proof only for d= 1. Suppose that limp. fin(w) = f(w) exists for all u. We begin by showing tightness (cf Theorem 18.6) of the sequence of probability measures Hn. Using Fubini’s theorem (Theorem 10.3 or more precisely Exercise 10.14) we have: [ fin(u)du = [. {f. onde) du 168 19. Weak Convergence and Characteristic Functions =f {ei and using that e"“* = cos(ur) + isin(uz), 2e a 7 [. {f- cos(uar) + isin(ur)au} bald). 
Since sin(ur) is an odd function, the imaginary integral is zero over the symmetric interval (—a,@), and thus: = 9 - / 2 sin(ae)in(de) noe © Since f°, ldu = 2a, we have 1 a an(u)du=2— [> 2s a al — fn (u))du = — [sialon nk 2) ff, sin(ax) =f 0-=@) pin(de) Now since 2(1 — S82) > 1 if Ju] > 2 and 2(1 — $22) > 0 always. the above is oo > [rae (aryn(de) 00 = fi 2ye.a/ai-(@)nn(ae) -2 2]° =tn( |=] }- Let G= 2 and we have the useful estimate: gps wn (B AVS F [a Aa(w))au (19.1) ~2/8 Let € > 0. Since by hypothesis f is continuous at 0, there exists a > 0 such that |1 — f(u)| < e/4 if jul < 2/a. (This is because fi,(0) = 1 for all n, whence limy-.90 fin(0) = f(0) = 1 as well.) Therefore, 2/0 a € € ofa (19.2) Since fi, (u) are characteristic functions, |fin(w)| <1, so by Lebesgue’s domi- nated convergence theorem (Theorem 9.1 (f)) we have a 2 fo sl, (= sed! < 2 fa 2 fo tim [ (= Antwan = f (1 = f(u))du. nee J 2a 2/a 19. Weak Convergence and Characteristic Functions 169 Therefore there exists an N such that n > N implies fo 2/o 0 ~ fatwa fo (1 — f(u))du € ~2/ a whence $ [74% (1 — fin(u))du < ¢. We next apply (19.1) to conclude pn ([—0, a]®) < ¢, for alln > N. There are only a finite number of n before N, and for each n < N, there exists an a, such that [in([—An,@n]°) < €. Let a = max(ay,..., Q@pi@). Then we have fin([-a,q]®) <, forall n. (19.3) The inequality (19.3) above means that for the sequence (f,)n>1, for any € > 0 there exists an a € R such that sup,, n([—a,a]°) < €. Therefore we have shown: lim sup sup fin ([—m, m]*) = 0 moc on for any fixed m € R. We have established tightness for the sequence {jin}n>1. We can next apply Theorem 18.6 to obtain a subsequence (ng)x>1 Such that pn, converges weakly to jz as k tends to oo. By part (a) of this theorem, dim Any (w) = fi(u) for all u, hence f(u) = f(u), and f is the Fourier transform of a probability measure. It remains to show that the sequence (jin),>1 itself (and not just (tn, 421) converges weakly to y. We show this by the method of contradiction. Let F,,, F be distribution functions of jz, and ys. That is, F(t) = un((—o0,4)); Fle) = (20,1). Let D be the set of continuity points of F’: that is, D ={x: F(a) = F(2)} Suppose that j,, does not converge weakly to yz, then by Theorem 18.4 there must exist at least one point « € D and a subsequence (nx)g>1 such that limy—.s0 Fn, (x) exists (by taking a further subsequence if necessary) and moreover limy— oo Fn, (2) = 3 # F(a). Next by Theorem 18.6 there also exists a subsequence of the subsequence (nx) (that is, a sub-subsequence (nx; )j21), such that (Hnx, )jz1 converges weakly to a limit v as j tends to oo. Exactly as we have argued, however, we get lim flax (w) =0(u), joe 170 19. Weak Convergence and Characteristic Functions and since lim fi, (u) = f(u), we conclude 0(u) = f(u). But we have seen that f(u) = fi(u). Therefore by Theorem 14.1 we must have ps = v. Finally. pn,, converging to v = y implies (by Theorem 18.4) that lim; Fre, (2) = F(x). since « is in D, the continuity set of 1, by hypothesis. But lim. Fry, (@) = 3 # F(x), and we have a contradiction. Oo Remark 19.1. Actually more is true in Theorem 19.1a than we proved: one can show that if 7, converges weakly to a probability measure jy on R4. then ji, converges to ji uniformly on compact subsets of R* Example. Let (X;,)n>1 be a sequence of Poisson random variables with parameter A, = n. Then if Zn = Fike =n). Zn 2 Z, where L(Z) =N(0,1). To see this, we have B{el%n} = Bfem(ornyh = iii {oto} = ewiuvAen(et"/ 71) by Example 13.3. 
Continuing and using a Taylor expansion for e*. we have the above equals = ea Might shat) = wi ting ut /2- AE _ er em bam) where h(u,n) stays bounded in n for each wand hence limo “4 = 0, Therefore, lim yz, (u) =e" /?, n="00 and since e~“’/? is the characteristic function of a N(0,1). (Example 13.5), we have that Z,, converges weakly to Z by Theorem 19.1 b. Exercises 171 Exercises for Chapter 19 19.1 Let (Xn)n>1 be N(jin. 02) random variables. Suppose jin > js € Rand o? — 0? > 0. Show that X, 3 X. where £(X) = N(u.0?) 19.2 Let (Xj)n>1 be N(#n,02) random variables. Suppose that X;, 2x for some random: variable X. Show that the, eae Hn and 0? have limits uw € Rand o? > 0, and that X is N(u.0") (Hint: ox, and yx being the characteristic functions of X,, and X, write PX = = cite SFE for some ys € R and o? > 0). and use Lévy’s Theorem to obtain that ¢x(u) =e * 19.3 Let (Xn)nz1. (Ya)nzi be sequences with X, and Yq defined on the same space for each n. Suppose X,, 2 X and ¥, 2 Y, and assume X;, and Y,, are independent for all n and that X and Y are independent. Show that XntY¥n BX+Y, 20. The Laws of Large Numbers One of the fundamental results of Probability Theory is the Strong Law of Large Numbers. It helps to justify our intuitive notions of what probability actually is (Example 1), and it has many direct applications, such as (for example) Monte Carlo estimation theory (see Example 2). Let (X;)n>1 be a sequence of random variables defined on the same prob- ability space and let Sp = 7}, Xj. A theorem that states that +5, con- verges in some sense is a law of large numbers. There are many such results; for example L? ergodic theorems or the Birkhoff ergodic theorem, considered when the measure space is actually a probability space, are examples of laws of large numbers, (See Theorem 20.3, for example). The convergence can be in probability, in L’, or almost sure. When the convergence is almost sure, we call it a strong law of large numbers. Theorem 20.1 (Strong Law of Large Numbers). Let (X;,)n>1 be inde- pendent and identically distributed (i.i.d.) and defined on the same space. Let M=E{Xj} and 0? =a, <0. Let Sp = YT}, Xj. Then tim 22 = jim tx, =p as. and in L?. noe n noe n Remark 20.1, We write 1.0? instead of yj, 0%, , since all the (Xj)j>1 have the same distribution and therefore the same mean and variance. Note also that lim,,..0 °* = y in probability, since L? and a.s. convergence both imply convergence in probability. It is easy to prove limp..oo 8 = yz in probability using Chebyshev’s inequality, and this is often called the Weak Law of Large Numbers. Since it is a corollary of the Strong Law given here, we do not include its proof. The proof of Theorem 20.1 is also simpler if we assume only X; € L* (all j), and it is often presented this way in textbooks. A stronger result, where the X,,’s are integrable but not necessarily square-integrable is stated in Theorem 20.2 and proved in Chapter 27. Proof of Theorem 20.1: First let us note that without loss of generality we can assume ys = E{X;} = 0. Indeed if 4 4 0, then we can replace X; with 174 20. The Laws of Large Numbers Z; = Xj — pw. We obtain limy—.. + D521 Zj = 0 and therefore _ lie a 1 - lim nS ~ ph) = Jim, (4d) -p=0 no from which we deduce the result. We henceforth assume js = 0. Recall S$, = 37’, X; and let Y, = Be, : Then E{Y,} = 5. B{Xj;} = 0. Moreover E{¥2} = Fs Dicjen E{X;X,}. However if j # k then * B{X)Xe} = E{Xj}E( Xe} = 0 since X; and X;, are assumed to be independent. 
Therefore n EAY2} = YO E{X}} (20.1) jal “ 1 > 2 2 j=) ° ne (no) o n and hence lim E{Y?} =0. Since Y, converges to 0 in L? we know there is a subsequence converging to 0 a.s. However we want to conclude the original sequence converges a.s To do this we find a subsequence converging a.s., and then treat the terms in between successive terms of the subsequence. Since E{Y?} = £, let us choose the subsequence n?: then ce oo 42 24 VaR} = OS *~_, Y2, < oc a.s., and hence the tail of this convergent series converges to 0; we conclude lim Y,2=0 as (20.2) n=36 Next let n € N. Let p(n) be the integer such that p(n)? Sn < (p(n) +1). Then 2 , n 1 y, — 2M y ye = - bX j=p(n)? +1 20. The Laws of Large Numbers 175, and as we saw in (20.1): 2 2 _ 2 ef ( 5 we) ‘uo \ ae An) o < 2Pin) +1 °, ~ ne? < 2yn+t a < 33 n because p(n) < /n. Now we apply the same argument as before. We have Sel (4 ey v0?) ‘Vey. “F< n=l Thus by Theorem 9.2 again, we have 2 2 2 > (% - pie) Yoon) 1 are in L’. An elegant way to prove Theorem 20.2 is to use the backwards martingale convergence theorem (see, e.g., Theorem 27.5). Let (§2..A, P) be a probability space, and let T : 2 — 92 be one to one (i.e. injective) such that T(A) C A (ie., T maps measurable sets to measurable sets) and if A € A, then P(T(A)) = P(A) (ie., T is measure preserving). Let T?(w) = T(T(w)) and define analogously powers of T. A set A is invariant under T if 14(w) = 1a(T(w)). 176 20, The Laws of Large Numbers Theorem 20.3 (Ergodic Strong Law of Large Numbers). Let T be a one-to-one measure preserving transformation of 2 onto itself. Assume the only T-invariant sets are sets of probability 0 or 1. If X € L* then im = SI X(T(w)) = B{X} gq a.s. and in L}. Theorem 20.3 is a consequence of the Birkhoff ergodic theorem; its advan- tage is that it replaces the hypothesis of independence with one of ergodicity. It is also called the strong law of large numbers for stationary sequences of random variables. Example 1: In Example 17.1 we let (X,)j>1 be a sequence of i.i.d. Bernoulli random variables, with P(X; = 1) = p and P(X; =0) = q =1—p (all j). Then Sp = 3>7_, X; is the number of “successes” in n trials, and +S, is the percentage of successes. The Strong Law of Large Numbers (Theorem 20.1) now tells us that tim 52 = p(X} =pas. (20.3) noo n This gives, essentially, a justification to our claim that the probability of success is p. Thus in some sense this helps to justify the original axioms of probability we presented in Section 2, since we are finally able to deduce the intuitively pleasing result (20.3) from our original axioms. Example 2: This is a simple example of a technique known as Monte Carlo approximations. (The etymology of the name is from the city of Monte Carlo of the Principality of Monaco, located in southern France. Gambling has long been legal there, and the name is a tribute to Monaco’s celebration of the “laws of chance” through the operation of elegant gambling casinos.) Suppose f is a measurable function on [0,1], and f |f(«)|dx < oc. Often we cannot obtain a closed form expression for a = i f(x)dz and we need to estimate it. If we let (U;);>1 be a sequence of independent uniform random variables on [0,1]. and we call I, = 1a f(U;), then by Theorem 20.2 we have . im Fe) =LUO)} = [soar 5 a.s. and in L?. Thus if we were to simulate the sequence (U;);>1 on a computer (using a random number generator to simulate uniform eu, variables, which is standard), we would get an approximation of Jot a)dx for large n. This is just one method to estimate to f(x)dz. 
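To make the Monte Carlo recipe concrete, here is a minimal Python sketch of the procedure just described. It is only an illustration, not part of the formal development: the integrand, the sample sizes, and the name mc_integrate are arbitrary choices made for the example, and the expectation is approximated by a sample average exactly as the Strong Law suggests.

import numpy as np

rng = np.random.default_rng(12345)  # arbitrary seed, fixed only for reproducibility

def mc_integrate(f, n):
    # Simulate n i.i.d. uniform random variables U_1, ..., U_n on [0,1].
    u = rng.uniform(0.0, 1.0, size=n)
    # By the Strong Law of Large Numbers, the sample mean of f(U_j)
    # converges a.s. to E{f(U_1)}, which is the integral of f over [0,1].
    return f(u).mean()

f = lambda x: np.exp(x ** 2)  # a test integrand with no closed-form antiderivative

for n in (10**2, 10**4, 10**6):
    print(n, mc_integrate(f, n))
# The printed estimates settle near the true value, roughly 1.4627.

This simulated average is precisely the estimate described above,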
and it is a not the best one except in the case where one wants to estimate a high dimensional 20. The Laws of Large Numbers 177 integral: that is, if one wants to estimate fi, f(x)dx for d large. The exact same ideas apply. Example 3: ({7. p.120]) Let 2 be a circle of radius r = 34. Let A be the Borel sets of the circle and let P be the Lebesgue measure on the circle (One can identify here the circle with the interval 0, 1)). Let a be irrational and T be rotation of 92 through @ radians about the center of the circle. Then one can verify that T is injective, measure preserving, and that the invariant sets all have probability zero or one (this is where the irrationality of a comes in). Therefore by Theorem 20.3 we have ig y . , dim, 5, Xe + fa) -[ X(«)de for any X € L’ defined on 2. for P-almost all x. 178 20. The Laws of Large Numbers Exercises for Chapter 20 20.1* (A Weak Law of Large Numbers). Let (X;) be a sequence of random variables such that sup; E{X}} = ¢ < 2 and E{X;X;} = 0 if j # k. Let Sn = Cha x}. a) Show that P(/+,| >) < = fore > 0: b) limy—oo 2.$n = 0 in L? and in probability. (Note: The usual i.i.d. assumptions have been considerably weakened here.) 20.2 Let (¥j)j>1 be a sequence of independent Binomial random variables, all defined on the same probability space, and with law B(p,1). Let X, = Den Yj. Show that X; is B(p, j) and that “2 converges a.s. to p. 20.3 Let (X;)j21 be iid. with X; in L?. Let ¥; = e*». Show that L a) converges to a constant @ a.s. [Answer: a = € 20.4 Let (Xj)j1 be iid. with X, in L? and B{X,} = pw. Let (¥j)j21 be also iid. with Y; in L! and E{Y;} = v ¢ 0.Show that FAX | x=" as. 1 SY ~ > 7 j5a 20.5 Let (Xj)jz1 be iid, with Xj in L* and suppose Je 77(X) — ») converges in distribution to a random variable Z, Show that n 1 lim — > Xj=v as. nN j=l (Hini: If Z, = Re Dh (X; — v), prove first that yaen converges in distri- bution to 0). 20.6 Let (X)j1 be iid. with X, in L?, Show that lim L yx = E{X?} as. nevcen 4 20.7 Let (X})jx1 be iid. N(1,3) random variables. Show that lim 22 13 +. noo X? + XP? Exercises 179 20.8 Let (Xj)j> be iid. with mean jy: and variance o?. Show that lim 1x pao as, ta Ke 8. 20.9 Let (Xj)j>1 be iid. integer valued random variables with E{|X;|} < oc. Let Sp = S07 Xj. (Sn)nza iscalled a random walk on the integers. Show that if #(X;) > 0 then lim S, = 00. as. noo 21. The Central Limit Theorem The Central Limit Theorem is one of the most impressive achievements of probability theory. From a simple description requiring minimal hypothe- ses, we are able to deduce precise results. The Central Limit Theorem thus serves as the basis for much of Statistical Theory. The idea is simple: let and o? = Var(X;) (all 7). The key observation is that absolutely nothing (except a finite variance) is assumed about the distribution of the random variables (X;)j>1. Therefore, if one can assume that a random variable in question is the sum of many iid. random variables with finite variances, that one can infer that the random variable’s distribution is approximately Gaussian. Next one can use data and do Statistical Tests to estimate js and o*, and then one knows essentially everything! Theorem 21.1 (Central Limit Theorem). Let (Xj;)j;>1 be iid. with E{X5} = w and Var(X;) = 0? (all j) with 0 < 0? < 00. Let Sn = Wh, Xj. Let Y, = Sage Then Yp, converges in distribution to Y, where L(Y) = N(O,1). Observe that if 0? = 0 above. then Xj = pr as. for all j, hence Sa = yw as. Proof. Let yj be the characteristic function of X, ~y. 
Since the (X;)j>1 are iid., ¢; does not depend on j and we write y. Let Y, = Sarge. Since the X; are independent, by Theorem 15.2 Py, (u) = Pa (X)— py (4) (21.1) yet _ u = PSO C5-0) \oVn 182 21, The Central Limit Theorem Next note that E{Xj;—p} = 0 and E{(Xj—p)?} = 0?. hence by Theorem 13.2 we know that y has two continuous derivatives and moreover o(u) = iB {(X; -1) yen, (uw) = -E{(X; wre}, Therefore y’(0) = 0 and 9” (0) = —o?. If we expand y in a Taylor expansion about u = 0, we get. 24,2 g(u) =1+0- S + u2h(u) (21.2) where h(w) +0 as u > 0 (because y” is continuous). Recall from (21.1): ont = (o(st9))" = er log o( wa) awa = erlos(- $F +a), where here “log” denotes the principal value of the complex valued logarithm. Taking limits as n tends to oo and using (for example) L’H6pital’s rule gives that > lim yy, (u) =e" ?: n30 Lévy's Continuity Theorem (Theorem 19.1) then implies that Y, converges in law to Z, where yz(u) = e~“”/?; but then we know that £(Z) = N(0, 1). using Example 13.5 and the fact that characteristic functions characterize distributions (Theorem 14.1). a Let us now discuss the relationship between laws of large numbers and the central limit theorem. Let (X});>1 be iid. with finite variances. and let 1 = E{X)}. Then by the Strong Law of Large Numbers, Sn «72 lim 2 = as. and in L?, (21.3) nose n where S, = Via X;. Thus we know the limit is , but a natural question is: How large must n be so that we are sufficiently close to 4? If we rewrite (21.8) as lim =0 as. and in L?, (21.4) no Sn al then what we wish to know is called a rate of convergence. We could ask, for example, does there exist an a € R, a 4 0. such that lim n® Sx | <0 as. (c #0)? 21. The Central Limit Theorem 183, In fact. no such @ exists. Indeed, one cannot have no (Se — #2) convergent to a non-zero constant or to a non-zero random variable a.s., or even in probability. However by the central limit theorem we know that if a = 3, Vni(S# — 1) converges in distribution to the normal distribution N(0,0?), In this sense. the rate of convergence of the strong law of large numbers is \/7i. One can weaken slightly the hypotheses of Theorem 21.1. Indeed with essentially the same proof. one can show: Theorem 21.2 (Central Limit Theorem). Let (X;);>1 be independent but not necessarily identically distributed. Let E{X;} = 0 (all j), and let 2 52. o} = 0%, . Assume sup E{|X;|***} < 00 , some e>0, i o 2 > 05 = 00 Then where £(Z) =.N(0,1) and where convergence is in distribution. While Theorem 21.1 is, in some sense, the “classical” Central Limit The- orem, Theorem 21.2 shows it is possible to change the hypotheses and get similar results. As a consequence there are in fact many different central limit theorems, all similar in that they give sufficient conditions for properly nor- malized sums of random variables to converge in distribution to a normally distributed random variable. Indeed, martingale theory allows us to weaken the hypotheses of Theorem 21.2 substantially. See Theorem 27.7. We note that one can also weaken the independence assumption to one of “asymptotic independence” via what is known as mixing conditions, but this is more difficult. Finally, we note that Theorem 21.1 has a d-dimensional version which again has essentially the same proof. Theorem 21.3 (Central Limit Theorem). Let (Xj)j>1 be iid. Rt- valued random variables. Let the (vector) wy = E{X;}, and let Q denote the covariance matrix: Q = (4k,e)i1 be iad. with P(X; = 1) = p and P(X; = ae =qH p. Then S, = Dyn? 3 is Binomial (C(S,) = B(p,n)). 
We have p = E{X;} =p and o? = 0% = pq = p(1 ~ p). By the a Law of Large Numbers we have . Sn lim — =pas. nse 1 and by the Central Limit Theorem (Theorem 21.1) we have (with con- vergence being in distribution): Sy, — np Dy vnp(l where £(Z) = N(0.1)- 2. Suppose (Xj) j>) are i.i.d. random variables. all in L?, and with (common) distribution function F. We assume F is unknown and we would like to estimate it. We give here a standard technique to do just that. Let Y3(x) = 11x, <2} Note that Y; are i.id. and in L?. Next define 1 Fra(e) = > we) for x fixed. = The function F;,(z) defined on R is called the empirical distribution func- tion (it should indeed be written as F;,(r,w), since it depends on w!). By the Strong Law of Large numbers we have Jim Fa(a) = im > Sve) = HMw}. jel However, EAM (2) } = BLA ex, , be iid. and suppose E{|X;P} < co. Let Gye) = P(SBE 1 be independent, double exponential with parameter 1 (that is, the common density is }e~!"!, -o0 < @ < 90). Show that sim. va (BE) =z j where £(Z) = N(0, 3), and where convergence is in distribution. (Hint: Use Slutsky’s theorem (Exercise 18.14).) 21.3 Construct a sequence of random variables (X,);>1, independent, such that lim;—... X; = 1 in probability, and E{X?} > j. Let Y be independent of the sequence (X;);>,, and L(Y) = N(0,1). Let Z; = YX;, j > 1. Show that a) E{Z;}=0 b) limj...00 9%, = 00 c) lim;-..0 Z; = Z in distribution, where £(Z) = N(0,1). (Hint: To construct Xj, let (92;,A;, P;) be ([0, 1], B[0, 1], m(ds)). where m is Lebesgue measure on [0, 1]. Let x; (J+ lors) + Layjai&). and take the infinite product as in Theorem 10.4. To prove (c) use Slutsky’s theorem (Exercise 18.14)). (Note that the hypotheses of the central limit the- orems presented here are not satisfied; of course, the theorems give sufficient conditions, not necessary ones.) 21.4 (Durrett, [8]). Let (Xj)j>1 be iid. with E{X,} = 1 and 63, =o% € (0, 20). Show that 2 (VB - va) Zz, with £(Z) = N(O, 1). (tine Sa _ (8a + Vn) = eS. va).) Exercises 187 21.5 Let (X;) be iid. Poisson random variables with parameter \ = 1. Let Sn = DF. Xj. Show that limy. Sa = Z, where £(Z) = N(0,1). va 21.6 Let Y* be a Poisson random variable with parameter \ > 0. Show that lim Atoc where £(Z) = N(0,1) and convergence is in distribution. (Hint: Use Exer- cise 21.5 and compare Y* with Sj; and Sj,)4.., where [A] denotes the largest integer less than or equal to A.) 21.7 Show that lim e~ n=390 (Hint: Use Exercise 21.5.) 21.8 Let (X;)j>1 be iid. with E{X;} = 0 and o%, = 0? < x. Let S, = Yo", Xj. Show that 2. does not converge in probability. 21.9* Let (X;)jz1 be iid. with E{X;} = 0 and 0%, = 0? < ox. Let Sn = S2"_, X}. Show that Sul] _ 2 slim, n{ Sal} - /2o. (Hint: Let £(Z) = N (0,07) and calculate E{|Z]|}.) 21.10 (Gut, [I1]). Let (X;)js1 be iid. with the uniform distribution on (1,1). Let mn Lia Xs Ven KF + Uj AF Show that //nY,, converges. (Answer: where £(Z) = .N(0,3).) Yn = VnY,, converges in distribution to Z 21.11 Let (X;);> be independent and let Xj have the uniform distribution on (~j.9)- a) Show that in distribution where £(Z) = N(0. 3) (Hint: Show that the characteristic function of Xj is yx, (u) = “222; compute gs, (u). then yg, jn2/2(U), and prove that the limit is e~“’/18 by using Ware aint HGnt)) 188 21. The Central Limit Theorem b) Show that lim n-+50 in distribution, where £(Z) = N(0,1). (Note: This is not a particular case of Theorem 21.2). 21.12 * Let X € L? and suppose X has the same distribution as wt 2). where Y. 
Z are independent and X, Y, Z all have the same distribution. Show that X is N(0.0?) with 0? < oo. (Hint: Show by iteration that X has the same law as I> 7, X; with (X;) iid., for n= 2") 22. L? and Hilbert Spaces We suppose given a probability space (92, F, P). Let L? denote all (equiva- lence classes for a.s. equality of) random variables X such that E{X?} < oo. We henceforth identify all random variables X,Y in L? that are equal as. and consider them to be representatives of the same random variable. This has the consequence that if E{X?} = 0. we can conclude that X = 0 (and not only X =0a.s.). We can define an inner product in L? as follows: for X,Y in L?, define (X,Y) = B{XY}. Note that |E{XY}| < E{X2}2E{Y?}2 < oc by the Cauchy-Schwarz in- equality. We have seen in Theorem 9.3 that L? is a linear space: if X,Y are both in L?, and a, 3 are constants, then aX + 3Y is in L? as well. We further note that the inner product is linear in each component: For example (aX + BY, Z) =a(X,Z) + BlY,Z). Finally, observe that (X,X) > 0, and (X,X) = 0 if and only if X =0 as. since X = 0 a.s. implies X = 0 by our convention of identifying almost surely equal random variables. This leads us to define a norm for L? as follows: IX) = (XX)? = BOPP, We then have ||X'|| = 0 implies X = 0 (recall that in L?, X = 0 is the same as X = 0a.s.)., and by bilinearity and the Cauchy-Schwarz inequality we get |X +Y|P = B{X?} + 2B{XY} + B(Y?} < XI? + 2X4 IYI + YIP = (A+ Y ID, and thus we obtain Minkowski’s inequality: |X +¥] < |X +1¥ 1, so that our norm satisfies the triangle inequality and is a true norm. We have shown the following: . 190 22. L? and Hilbert Spaces Theorem 22.1. L? is a normed linear space with an inner product (-,:). 1 Moreover one has || «|| = (-.+)?. We next want to show that L? is a complete normed linear space; that is, if X, is a sequence of random variables that is Cauchy under |] - ||, then there exists a limit in L? (recall that X,, is Cauchy if |X» —Xym|| + 0 when both m and n tend to infinity: every convergent sequence is Cauchy). Theorem 22.2 is sometimes known as the Riesz—Fischer Theorem. Theorem 22.2. L? is complete. Proof. Let Xn be a Cauchy sequence in L?. That is, for any ¢ > 0. there exists N such that n,m > N implies ||X, — Xm|] < ©. Choose a sequence of epsilons of the form 3+. Then we have a subsequence (Xp,.),>1 such that Xan — Xnvaall S ge Define 2 Ya = 37 [Xn, — Xnpail- By the triangle inequality we have 2 n Eye} < (= WXnp - Xveal) <1. at Let Y = limp—oo Yn which exists because Y,(w) is a nondecreasing sequence, each w (a.s.). Since E{¥,2} < 1 each n, by the Monotone Convergence Theo- rem (Theorem 9.1(d)) E{Y¥?} < 1 as well. Therefore Y < oo a.s., and hence the sequence Xn, + 37°, (Xnp.: ~ Xn,) converges absolutely a.s. Since it is a telescoping series we conclude X’,, (w) converges toward a limit X(w) as p — oo, and moreover |X (w)| < |Xn,(w)| + ¥Y(w). Since X,, and Y are in L?, so also X € L?. Next, note that m X-X,, = lim Zp = lim, YE (Xnger — Xnq): g=P Since |Zp,| < Y for each p,m, by Lebesgue’s dominated convergence theorem (Theorem 9.1(f)) we have : - . ' . - - 1 IX = Xopll = lim 2%) < tim Y> [Xng —Xagll S$ Sy q=p and we conclude lim,—s ||X ~ Xn,|| = 0. Therefore X,, converges to X in L’, 22, L? and Hilbert Spaces 191 Finally, Xn — X|| <||Xn — Xp!) + ||Xn, — XI). Hence letting n and p go to infinity, we deduce that X,, tends to X in L?. oO Definition 22.1. A Hilbert space H is a complete normed linear space with an inner product satisfying (x,x)? = |la||, allz €H. 
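As a quick numerical sanity check of the inner product and norm just introduced, one can verify the Cauchy-Schwarz and triangle inequalities on simulated data. The Python sketch below is only illustrative: the two random variables are arbitrary choices, and the expectations are approximated by sample averages in the spirit of Chapter 20.

import numpy as np

rng = np.random.default_rng(2023)   # arbitrary seed
n = 10**6                           # Monte Carlo sample size

# Two square-integrable (and dependent) random variables, chosen arbitrarily.
X = rng.standard_normal(n)
Y = X + rng.uniform(-1.0, 1.0, size=n)

inner = (X * Y).mean()                    # approximates (X, Y) = E{XY}
norm = lambda Z: np.sqrt((Z * Z).mean())  # approximates ||Z|| = E{Z^2}^(1/2)

print(abs(inner) <= norm(X) * norm(Y))    # Cauchy-Schwarz inequality: True
print(norm(X + Y) <= norm(X) + norm(Y))   # Minkowski (triangle) inequality: True

Both comparisons print True; indeed both inequalities hold exactly for the empirical (sample) versions of the expectations, whatever the simulated data.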
We now have established: Theorem 22.3. L? is a Hilbert space. Henceforth we will describe results for Hilbert spaces; of course these results apply as well for L?. From now on H will denote a Hilbert space with norm || - || and inner product (-,-), while a an 3 below always denote real numbers. Definition 22.2. Two vectors x and y in H are orthogonal if (x,y) =0. A vector « is orthogonal to a set of vectors I’ if (x,y) = 0 for every y EL. Observe that if (,y) = 0 then || + y||? = ||a||? + |ly||?; this is a Hilbert space version of the Pythagorean theorem. Theorem 22.4 (Continuity of the inner product). [fz, — x andy, > y in H, then (tn, Yn) > (a,y) in R (and thus also |lxn|| — |\zI). Proof. The Cauchy-Schwarz inequality implies (xr, y) < |Jz| |Iy|], hence (a, 9) = (ny Ym)| = |(@ = Gms Yn) + (@ = Fn Y — Yn) + (ne Y = Yn)| S [I — 2n[l[lgynll + le — ell ly = Yall + Mare ly Yall. Note that sup,, ||yn|| < 20 and sup, ||a,|] < 00, since x, and y, are both convergent sequences in H (for example, ||<;,|| < ||a,—<||+||z|] and ||z|] < co and ||, — £|| > 0). Thus the right side of the above inequality tends to 0 as n tends to ov. o Definition 22.3. A subset L of H is called a subspace if it is linear (that is, x,y €L implies ax+ By €L) and if it is closed (that is, if (tn)n>1 converges tox inL, thenx €L). Theorem 22.5. Let I be a set of vectors. Let I+ denote all vectors orthog- onal to all vectors in I’. Then I'+ is a subspace of H. Proof. First note that [+ is a linear space, even if I’ is not. Indeed, if x,y € I+, then (2, z) = 0 and (y,z) = 0, for each z € I’. Therefore (aa + By, 2) = a(x, z) + By, z) = 0, and az + 3y € I+ also. It follows from Theorem 22.4 that I+ is closed. O 192 22. L? and Hilbert Spaces Definition 22.4. For a subspace £ of H, let d(x, £) = inf{||x — yll:y € L} denote the distance from x to L. Note that if £ is a subspace, then x € CL iff d(x,£) = 0 (recall that a linear subspace of a closed space is always closed). Theorem 22.6. Let L be a subspace of H; « €H. There is a unique vector y €L such that || ~yl| = d(2, L). Proof. If « € £, then y = «. If is not in L, let yn € £ such that limy—o. |]e— Yn\| = d(a, £2). We want to show that (yn)n>1 is Cauchy in H. Note first that lly — Youll]? = |e — Yall? + [le — yn ll? — 2(@ — Ys B= Yn)- (22.1) We use the inequality jp — Inti < lit inl 4 le uel to conclude that lim, lo - I < ata. ), hence 4 tim, ze torte =d(x,L), since d(x, £) is an infimum and un tun € L because CL is a subspace. We now have ( = lim {lle — youll? + [lee — yall? + 200 ~ Yn — Yn) } /4 m d(x, Ly? = im, |e _ 00 Yn + Yn 2 and therefore lim_(@ — ym,@— Yn) = d(x, £L)°. (22.2) n,m—oo If we now combine (22.1) and (22.2) we see that (Yn)n>1 is Cauchy. Therefore lim yn = y exists and is in £, since L is closed. Moreover d(x,£) = ||x — y||, by the continuity of the distance function. It remains to show the uniqueness of y. Suppose z were another such vector in £. Then the sequence Wen =Y Want = 4, is again a Cauchy sequence in £ by the previous argument, and hence it converges to a unique limit; whence y = z. o 22. L? and Hilbert Spaces 193 We now consider the important concept of projections. We fix our Hilbert space and our (closed. linear) subspace CL. The projection of a vector x in HH onto £ consists of taking the (unique) y € L which is closest to 2. We let IT denote this projection operator. The next theorem gives useful properties of IT. Theorem 22.7. The projection operator IT of H onto a subspace L satisfies the following three properties: (i) IT is idempotent: IT? 
= IT: (ii) Ha =a force €£; Mx =0 fora elt; (iii) For every x € H, x — Ix is orthogonal to L. Proof. (i) follows immediately from the definition of projection. (ii) If # € CL, then d(x.) = 0, and since x is closest to x (||¢ — z|| = 0). Ix = x. Moreover if x € L+. then |]r—yl|? = (e—y,z—y) = |lel|? + lly? for y € £, and thus y = 0 minimizes d(a,C); hence Ha = 0. (iii) We first note that, for y € £: le — Hall? < je - (Ta + y)IP = lla — Hal? + lly? — 2(@ — Hay). and therefore 2(e — Ha,y) < |\y|?. Since y € £ was arbitrary and since C is linear we can replace y with ay, any a € R,. to obtain 2x — Ix, ay) < |lay||?. and dividing by a gives Q(x — Hx. y) < ally||?: we let a tend to zero to conclude (a —ITa,y) < 0. Analogously we obtain (x — ITx.y) > 0 by considering negative a. Thus « — /7z is orthogonal toL. Qo Corollary 22.1. Let IT be the projection operator of H onto a subspace L. Then x = IIx + (x — IT) is a unique representation of x as the sum of a vector in L and one in L+. Such a representation exists. Moreover x — Ix is the projection of x onto L+: and (£L+)*+ = L. 194 22. L? and Hilbert Spaces Proof. The existence of such a representation is shown in Theorem 22.7(iii). As for uniqueness. let « = y+z be another such representation. Then y—ITz = 2 — (x ~ ITz) is a vector simultaneously in £ and L~: therefore it must be 0 (because it is orthogonal to itself). and we have uniqueness. Next observe that £ Cc (£+)+. Indeed, if z € £L and y € £+ then (x,y) = 0, soz € (£+)+. On the other hand if x € (£+)+, then c= y+z2 with y EL and z € £+. But z must be 0. since otherwise we have (x. z) = (y, z) + (z.2). and (y,z) =Osince y € £ and z € £; and also (x, z) = 0 since z € £* and a € (£+)+. Thus (z,z) = 0, hence z = 0. Therefore ¢ = y, with y € L. hence weLand (LttcL. qd Corollary 22.2. Let IT be the projection operator H onto a subspace L. Then (i) (a. y) = (a, ITy), (ii) HT is a linear operator: (ax + By) = allx + BITy. Proof. (i) By Corollary 22.1 we write uniquely: T= 2, +22, a € Livy EL, y=nty. ye Liy ele. Then (Hx, y) = (ar.y) = (a1, yr + y2) = (1.91). since (z1, yz) = 0. Continuing in reverse for y, and using (x2, y,) = 0: = (a + 22,41) = (2, yn) = (e, Hy). (ii) Again using the unique decomposition of Corollary 22.1, we have: ax + By = (aa, + Gyr) + (ax2 + Fy), hence Maz + By) = ar, + By, = allx + BITy. Qo We end this treatment with a converse that says, in essence, that if an operator behaves like a projection then it is a projection. Theorem 22.8. Let T map H onto a subspace L. Suppose that x — Tx is orthogonal to £ for alla € H. Then T = I, the projection operator onto the subspace L. Proof. We can write « = Tx +(a— Tx), with Tz € £ and («—Tx) € L+. By Corollary 22.1 to Theorem 22.7, T'z must be the projection of x onto L. O Exercises 195 Exercises for Chapter 22 22.1 Using that (a — b)? > 0, prove that (a + b)? < 2a? + 267. 22.2 Let «,y € H. a Hilbert space, with (7.y) = 0. Prove the Pythagorean Theorem: || + y||? = |a||? + |lylP?- 22.3 Show that R” is a Hilbert space with an inner product given by the “dot product” if 7=(ay...., ap) and Y= (y1....,Yn), then (7, ¥) = TL, aiys- 22.4 Let £ be a linear subspace of H and IT projection onto £. Show that ITy is the unique element of £ such that (ITy,z) = (y, 2), for all z € £. 23. Conditional Expectation Let X and Y be two random variables with Y taking values in R with X taking on only countably many values. 
It often arises that we know already the value of X and want to calculate the expected value of Y taking into account the knowledge of X, That is. suppose we know that the event {X = j} for some value j has occurred. The expectation of Y may change given this knowledge. Indeed, if Q(A) = P(A|X = j), it makes more sense to calculate Eg{Y} than it does to calculate Ep{Y} (Ep{-} denotes expectation with respect to the Probability measure R.) Definition 23.1. Let X have values {1}, £2,..., Ln,...$ and Y be a random variable. Then if P(X = x;) > 0 the conditional expectation of Y given {X = 2;} is defined to be EY |X = 23} = EQfY}, where Q is the probability given by Q(A) = P(A|X = a,), provided E{|Y|} < x. Theorem 23.1. In the previous setting, and if further Y is countably valued with values {yy yo, .... Yn,.-} and if P(X = a3) > 0, then ELY|X = 23} = > mw P(Y = ya|X = 23), k=1 provided the series is absolutely convergent. Proof. « EAY|X = 23} = EQ (¥} = > yw QY = yx) = Yo yeP(Y = yulX = 25). xsl k=l Oo Next, still with X having at most a countable number of values, we wish to define the conditional expectation of any real valued r.v. Y given knowledge of the random variable X, rather than given only the event {X = 2;}. To this effect we consider the function 198 23. Conditional Expectation _ f E{Y|X = 05} if P(X = aj) >0 f(z) = {ae arbitrary value if P(X =2,;)=0. (23.1) Definition 23.2. Let X be countably valued and let Y be a real valued ran- dom variable, The conditional expectation of Y given X is defined to be E(YIX} = F(X), where f is given by (23.1) provided f is well defined (that is, Y is integrable with respect to the probability measure Q; defined by by Q;(A) = P(A|X = x), for all j such that P(X = x;) > 0), Remark 23.1. The above definition does not really define E{Y|X} every- where, but only almost everywhere since it is arbitrary on each set {X = x} such that P(X = x) = 0; this will be a distinctive feature of the conditional expectation for more general r.v, X’s as defined below. Example: Let X be a Poisson random variable with parameter A. When X =n, we have that each one of the n outcomes has a probability of success p, independently of the others. Let S denote the total number of successes. Let us find E{S|X} and E{X|S}. We first compute E{S|X = n}. If X =n, then $ is binomial with param- eters n and p, and E{S|X =n} = pn. Thus E{S|X} = pX. To compute E{X|S}, we need to compute E{X|S = k}: to do this we first compute P(X = n|S = k): P(S = h|X =n)P(X =n) P(X =njS=k)= P=) _ ") pk(L =p)" = Lime (PL pyr-® (Gar) > - moe eT mP)A for n > k. Thus, B(XIS =H} = Ya ~O-PIA = k+(1—p)d, n>k hence, E{X|S}=S+(1~p)d Finally, one can check directly that E{S} = E{E{S|X}}: also this follows from Theorem 23.3 below. Therefore, we also have that E{S} = pE{X} = pr. 23. Conditional Expectation 199 Next we wish to consider the general case: that is, we wish to treat E{Y|X} where X is no longer assumed to take only countably many val- ues. The preceding approach does not work, because the events {X = z} in general have probability zero. Nevertheless we found in the countable case that E{Y|X} = f(X) for a function f, and it is this idea that extends to the general case. with the aid of the next theorem. Let us recall a definition already given in Chapter 10: Definition 23.3. Let X:(2,A) > (R",B”) be measurable. The o-algebra generated by X is o(X) = X~}(B") (it is a o-algebra: see the proof of Theorem 8.1), which is also given by o(X) = {AC 2: X~\(B) =A, for some B € B"}. Theorem 23.2. 
Let X be an R” valued random variable and let Y be an R-valued random variable. Y is measurable with respect to o(X) if and only if there exists a Borel measurable function f on R” such that Y = f(X). Proof. Suppose such a function f exists. Let B € B. Then Y~'(B) = X~*(f-1(B)). But A = f-*(B) € B”, whence X~*(A) € o(X) (alterna- tively, see Theorem 8.2). Next suppose Y~!(B) € o(X), for each B € B. Suppose first Y = Yoh ala, for some k < oc, with the a;’s all distinct and the A,’s pair- wise disjoint. Then A; € o(X), hence there exists B; € B” such that A; = X71(B)). Let f(x) = T, ailp,(2), and we have Y = f(X), with f Borel measurable: so the result is proved for every simple r.v. Y which is o(X)-measurable. If Y is next assumed only positive, it can be written Y = limn—oo Yn, where Y,, are simple and non-decreasing in n. (See for ex- ample such a construction in Chapter 9.) Each Y,, is o(X) measurable and also Yn = fn(X) as we have just seen. Set f(x) = limsup, x, fn(). Then y lim Yn = lim f,(X). n° But (lim sup fn)(X) = lim sup(f,(X)). n90 n and since lim sup, 45 fn(«) is Borel measurable, we are done. For general Y, we can write Y = Y* — Y~, and we are reduced to the preceding case. Qo In what follows, let (92,A, P) be a fixed and given probability space, and let X : 2 R”. The space £?(2,.A, P) is the space of all random variables ¥ such that E{¥?} < oo. If we identify all random variables that are equal a.s., we get the space L?((2..A, P). We can define an inner product (or “scalar product”) by (Y,Z) = E{Y Z}. 200 23. Conditional Expectation Then L?(2..A,P) is a Hilbert space, as we saw in Chapter 22. Since o(X) C A, the set L?(2.0(X), P) is also a Hilbert space, and it is a (closed) Hilbert subspace of L*(,A,P). (Note that L?(2,o(X),P) has the same inner product as does L?(,.A, P).) Definition 23.4. Let Y € L?(Q, A, P). Then the conditional expectation of Y given X is the unique element Y in L?(2,0(X), P) such that E{YZ} = E{YZ} for all Z € L?(2,0(X), P). (23.2) We write EAY |X} for the conditional expectation of Y given X, namely Y. Note that Y is simply the Hilbert space projection of Y on the closed lin- ear subspace L?(2,0(X), P) of L?(2,A, P): this is a consequence of Corol- lary 22.1 (or Exercise 23.4), and thus the conditional expectation does exist. Observe that since E{Y|X} is o(X) measurable, by Theorem 23.2 there exists a Borel measurable f such that E{Y|X} = f(X). Therefore (23.2) is equivalent to ELF (X)g(X)} = ELV o(X)} (23.3) for each Borel g such that g(X) € £2. Next let us replace o(X) with simply a o-algebra G with G € A. Then L?(2,G, P) is a sub-Hilbert space of L?((2,.A, P), and we can make an anal- ogous definition: Definition 23.5. Let Y € L?(2,A,P) and let G be a sub o-algebra of A. Then the conditional expectation of Y given G is the unique element E{Y |G} of L?(2,G,P) such that E{YZ} = E{E{Y|g}2Z} (23.4) for all Z € L?(2,G.P). Important Note: The conditional expectation is an element of L?, that is an “equivalence class” of random variables. Thus any statement like E{Y|G} > Oor E{Y|G} = Z, etc... should be understood with an implicit “al- most surely” qualifier, or equivalently as such: there is a “version” of E{Y|G} that is positive, or equal to Z, etc... Theorem 23.3. Let Y € L?(2,A,P) andG be a sub o-algebra of A. a) If Y > 0 then E{Y|G} > 0; b) IfG = 0(X) for some random variable X , there exists a Borel measurable function f such that E{Y|G} = f(X); c) E{E{Y|9}} = E{Y}; d) The map ¥Y > E{Y|G} is linear. 23. Conditional Expectation 201 Proof. 
Property (b) we proved immediately preceding the theorem. For (c) we need only to apply (23.4) with Z = 1. Property (d) follows from (23.4) as well: if U,V are in L?. then E{(U + aV)Z} = E{UZ} + aE {VZ} = E{E{U|G}Z} + aE{E{V|G}Z} = EX(E{U|G} + aE {V|G})2}, and thus E{U + aV|G} = E{U|G} + aE{V|G} by uniqueness (alternatively, as said before, E{Y|G} is the projection of Y on the subspace L(2,9, P), and projections have been shown to be linear in Corollary 22.2). Finally for (a) we again use (23.4) and take Z to be 1, z¢yjg}<0}, assuming Y >0as. Then E{YZ} > 0 since both Y and Z are nonnegative, but EL ELY|G}2} = P{E{Y |G} 1 popvigy 0. This violates (23.3), so we conclude P({E{Y|G} < 0}) = 0. o Remark 23.2. As one can see from Theorem 23.3, the key property of conditional expectation is the property (23.4); our only use of Hilbert space projection was to show that the conditional expectation exists, We now wish to extend the conditional expectation of Definition 23.4 to random variables in LZ’, not just random variables in L?. Here the technique of Hilbert space projection is no longer available to us. Once again let £*(2,.A, P) be the space of all Z* random variables; we identify all random variables that are equal a.s. and we get the (Banach) space L*(92,A, P). Analogously, let L+({2,A, P) be all nonnegative random variables, again identifying all a.s. equal random variables. We allow random variables to assume the value +00. Lemma 23.1. Let Y € L*(92,A,P) and let G be a sub o-algebra of A. There exists a unique element E{Y|G} of L*+(92,G,P) such that E{YX} = E{E{Y|G}X} (23.5) for all X in L*(2,G,P) and this conditional expectation agrees with the one in Definition 23.5 if further Y € L?(2,A,P). Moreover, if 0< Y < Y’, then E{Y 9} < E(Y'IG}. (236) Proof. If Y is in L?(2,A,P) and positive, we define E{Y|G} as in Defini- tion 23.5. If X in L+(2,G,P) then X,, = X An is square-integrable. Hence the Monotone Convergence Theorem (applied twice) and (23.5) yield E{YX} = lim E{YX,} n = lim E{E{Y|G}Xp} n = E(E(Y|G}X} (23.7) and (23.5) holds for all positive X. 202 23, Conditional Expectation Let now Y be in L+(92. A. P). Each Yn = Y Am is bounded and hence in L?. and by Theorem 23.3, conditional expectation on L? is a positive operator. so E{Y A m|G} is increasing: therefore the following limit exists and we can set E(Y|G}= lim E{¥n/G}.- (23.8) If X € L*+(2.G, P), we apply the Monotone Convergence Theorem several times as well as (23.8)to deduce that: E{YX} = lim B{¥,.X} m =E {lim EY mIG}X} = E{E{y|G}X}. Furthermore if Y < Y’ we have YAm < Y'Am for allm, hence E{Y Am|G} < E{Y' A m|G} as well by Theorem 23,.3(a). Therefore (23.6) holds. It remains to establish the uniqueness of E{Y |G} as defined above. Let U and V be two versions of E{Y|G} and let A, = {U < V < n} and suppose P(A,) > 0. Note that An € G. We then have E{Y14,} = B{U1,,} = B{V1a,}. since E{Y 14} = E{E{Y|G}1,4} for all A €G by (23.7). Further, 0 < U1,, < Via, 0 implies that the rv. V14, and U1y,,, are not a.s. equal: we deduce that E{U1,4} < E{V1i,}. whence a contradiction. Therefore P(An) = 0 for all n, and since {U > V} = Un>1 An we get P{U < V}) = 0; analogously P({V > U}) = 0. and we have uniqueness. a Theorem 23.4. Let Y € L}(2,A,P) and let G be a sub o-algebra of A. There exists a unique element E{Y|G} of L*(2,G,P) such that E{YX} = E{E{Y|G}X} (23.9) for all bounded G-measurable X and this conditional expectation agrees with the one in Definition 23.5 (resp. Lemma 23.1) when further Y € L?(Q..A, P) (resp. 
Y > 0), and satisfies a) If Y >0 then E{Y|G} > 0; b) The map Y — E{Y|G} is linear. Proof. Since Y is in L}, we can write y=yt-y7 where Y* = max(Y,0) and Y~ =— min(Y,0): moreover Y* and Y~ are also in L'(2,G,P). Next set E{Y|G} = E{Y*|G} — E{y~ |g}. 23. Conditional Expectation 203 This formula makes sense: indeed the r.v. Y+ and Y~, hence E{Y*+|G} and E{Y~|G} as well by Theorem 23.3(c), are integrable, hence a.s. finite. That E{Y |G} satisfies (23.9) follows from Lemma 23.1. For uniqueness, let U.V be two versions of E{Y |G}, and let A= {U < V}. Then A € G, so 1, is bounded and G-measurable. Then E{¥14} = E{E{Y|G}14} = E{U14} = E{V 14}. But if P(A) > 0, then E{U1,} < E{V1,4}. which is a contradiction. So P(A) = 0 and analogously P({V < U}) = 0 as well. The final statements are trivial consequences of the previous definition of E{Y|G} and of Lemma 23.1 and Theorem 23.3. o Example: Let (X, Z) be real-valued random variables having a joint density f(a, 2). Let g be a bounded function and let Y =4(Z). We wish to compute E{Y|X} = E{g(Z)|X}. Recall that X has density fy given by fx(a)= f f2.2)de and we defined in Chapter 12 (see Theorem 12.2) a conditional density for Z given X = 2 by: f(@2) fra) = Fan whenever f(z) 4 0. Next consider na) = f a(e)fxae(ede. We then have, for any bounded Borel function A(.x): E(M XIX) = f MeayA(eVfx(ayde = ff aerteon(e)de ho) fx(v)ae = f(a,2) ) . -/ 92) Fag Medd (ade de = ff may s(e.2)ae ar = Ejg(Z)k(X)} = E{YR(X)}. Therefore by (23.9) we have that BLY|X} = h(X). This gives us an explicit way to calculate conditional expectations in the case when we have densities. 204 23. Conditional Expectation Theorem 23.5. Let Y be a positive or integrable r.v. on (2, FP). Let G be a sub o-algebra. Then E{Y|G} = Y if and only if Y is G-measurable. Proof. This is trivial from the definition of conditional expectation. oO Theorem 23.6. Let Y € L'(2,A,P) and suppose X and Y are indepen- dent. Then BLY|X} = E(Y}. Proof. Let g be bounded Borel. Then E{Yg(X)} = E{Y}E{g(X)} by in- dependence. Thus taking f(x) = E{Y} for all « (the constant function) in Theorem 23.2, we have the result by (23.9). Oo Theorem 23.7. Let X,Y be random variables on (92,A,P), let G be a sub a-algebra of A, and suppose that X is G-measurable. In the two following cases: a) the variables X, ¥ and XY are integrable, b) the variables X and Y are positive, we have E{XY|G} = XE{y|g}. Proof. Assume first (b). For any G-measurable positive r.v. Z we have E{XYZ} = E{XZE{Y|9}} by (23.5). Since XE{Y|G} is also G-measurable, we deduce the result by another application of the characterization (23.5). In case (a), we observe that X*Y*, X~Y+, XTY~ and X~Y~ are all integrable and positive. Then E{X*+Y*|G} = X+E{Y+|G} by what pre- cedes, and similarly for the other three products, and all these quantities are finite. It remains to apply the linearity of the conditional expectation and the property XY = X*+Y*+ + X-Y~ —X+Y~ -X-Y*>. Qo Let us note the important observation that the principal convergence theorems also hold for conditional expectations (we choose to emphasize be- low the fact that all statements about conditional expectations are “almost sure”): Theorem 23.8. Let (Yn)n>1 be a sequence of r.v.’s on (2,.A,P) and let G be a sub o-algebra of A. a) (Monotone Convergence.) If Y, > 0, n> 1, and Y,, increases to Y a.s., then lim E{Y,|G} = E{Y|G} as. n—00 23. Conditional Expectation 205 b) (Fatou’s Lemma.) If Yn > 0. n> 1, then Elim inf ¥,|G} 1) for some Z € L(2, A. P), then lim E{¥alG} = EAI} as. Proof. 
a) By (23.6) we have B{¥n+1|G} > E{Yn|G} a.s., each n; hence U = limy—ce E{¥nlG} exists as. Then for all positive and G-measurable rv. X we have: E{UX} = lim E{E{Yn|G}X} = Jim E{¥nX} by (23.5): and = lim E{YX} noo by the usual monotone convergence theorem. Thus U = E{Y|G}, again by (23.5). The proofs of (b) and (c) are analogous in a similar vein to the proofs of Fatou’s lemma and the Dominated Convergence Theorem without condition- ing. oO We end with three useful inequalities. Theorem 23.9 (Jensen’s Inequality). Let y:R — R be convex, and let X and ¢(X) be integrable random variables. For any o-algebra G, po B{X|G} < Efe(X)|G}. Proof. A result in real analysis is that if ¢ : R — R is convex, then y(2) = sup, (@n@ + by) for a countable collection of real numbers (dn, bn). Then EfanX + bn|G} = anE{X|G} + bn. But E{anX + bnlG} < E{y(X)|G}. hence an E{X|G} + bn < Efie(X)|G}, al n. Taking the supremum in n, we get the result. Note that y() = 2? is of course convex, and thus as a consequence of Jensen's inequality we have (E{X|9})? < E{X*|9}. An important consequence of Jensen’s inequality is Hélder’s inequality for random variables. 206 23. Conditional Expectation Theorem 23.10 (Hélder’s Inequality). Let X.Y be random variables with E{|X|P} < oc, B{Y|*} < x. where p > 1, and 5 + > = 1. Then |E{XY}] < E{IXY|} < BUX }PE(IX|)} 4. (Hence if X € L? and Y € L? with p,q as above, then the product XY belongs to L*). Proof. Without loss of generality we can assume X > 0, Y > Oand E{X?} > 0, since E{X?} = 0 implies XP = Oa.s., thus X = Oa.s. and there is nothing to prove. Let C = E{X?} < oo. Define a new probability measure Q by 1 A) = SE{1aX"}. QA) = GEMAX?} Next define Z = 34+1,x 0}. Since v(x) = |x|? is convex, Jensen's inequality (Theorem 23.9) yields (Eq{Z})! < E{Z%}. Thus. ~ C4 Xp-) y )\t - (H{s25}) y \¢ =to{(xi) } 1 y \fy -26((s)") 1 1 = hefrrg ger}. and q= =2; while (p— 1)q = p, hence 1 1 a G__ NP ze fy oN } 1 = —E{y¢ oe}. Lega 1 Y : aeEIXY}! = awe {ge} Thus E{XY}2 < CT E{Y}, and taking q’” roots yields E{XY}< Cr Ey 8, Since aoa = ; and C = E{X?}. we have the result. ao 23. Conditional Expectation 207 Corollary 23.1 (Minkowski’s Inequality). Let X.Y be random variables and1< p< x with E{|X|?} < 2% and E{|Y|?} < oc. Then E{\X +Y|P}> < E{X?}> + B{V?}o. Proof. If p = 1 the result is trivial. We therefore asume that p > 1. We use Hélder’s inequality (Theorem 23.10). We have E{\X+Y/P}= E {|X| |X + yp} +E {lY| |X + yy} < EUXPP EX + [POs + BYP} ELLX + YO} , hence But (p— 1)q =p, and i =1- 1 P = (EUXP} + EUV IP}*) B{IX + ¥ [PP and we have the result. oO Minkowski’s inequality allows one to define a norm (satisfying a triangle inequality) on the space L? of equivalence classes (for the relation “equality a.s.”) of random variables with E{|X|?} < oc. Definition 23.6. For X in L?, define a norm by 4 IXllp = E{X? i}. Note that Minkowski’s inequality shows that L? is a bonafide normed lin- ear space. In fact it is even a complete normed linear space (called a “Banach space”). But for p # 2 it is not a Hilbert space: the norm is not associated with an inner product. 208 23. Conditional Expectation Exercises for Chapter 23 For Exercises 23.1-23.6, let Y be a positive or integrable random variable on the space (.2,.4, P) and G be a sub o-algebra of A. 23.1 Show |E{Y|9}| < E{|Y||9}- 23.2 Suppose H C G where H is a sub o-algebra of G. Show that E{E{Y|G}|H} = E{Y|H}. 23.3 Show that E{Y|Y} = Y as. 23.4 Show that if |Y| t}) =e7* for t > 0. Calculate E{Y | Y At}, where Y At = min(t, Y). 
23.10 (Chebyshev's inequality). Prove that for X € L? and a > 0, P(|X| > alg) < C2. 23.11 (Cauchy-Schwarz). For X,Y in L? show (E{XY|G})? < E{X*|G}E{Y?|G}. 23.12 Let X € L?. Show that E{(X — E{X|G})*} < B{(X — B{X})}. 23.13 Let p >1 andr > p. Show that L? D L’, for expectation with respect to a probability measure. Exercises 209 23.14* Let Z be defined on (92.F, P) with Z > 0 and E{Z} = 1. Define a new probability Q by Q(A) = E{14Z}. Let G be a sub o-algebra of F. and let U = E{Z|G}. Show that Eg{X|g} = 2%2181, for any bounded F-measurable random variable X. (Here Eg{X|G} denotes the conditional expectation of X relative to the probability measure Q.) 23.15 Show that the normed linear space L? is complete for each p, 1 < p < oc. (Hint: See the proof of Theorem 22.2.) 23.16 Let X € L1(92,F.P) and let G,H be sub o-algebras of F. Moreover let H be independent of o(o(X),@). Show that E{X|o(G, H)} = E{X|G}. 23.17 Let (X;)no1 be independent and in L! and let S, = 0%, X; and Gn = O(Sn,Sn4i,--.)- Show that E{X,|Gn} = E{Xi | Sn} and also E{X;|Gn} = E{X, | Sn} for 1 0, having the property that F, C Fnai CF, alln > 0. Definition 24.1. A sequence of random variables (Xp)n>0 is called a mar- tingale if () E{\Xn|} < oc, each n; Xn is Fr measurable, each nj (iii) E{Xp|Fin} = Xm a.s., each m1 be independent with E{|X,|} < oo and E{X,} = 0, all n. For n> 1 let F, = o{X,;k < n} and S, = op_, Xp. For n=O let Fo = {6,2} be the “trivial” o-algebra and Sp = 0. Then (S,)n>0 is an (Fn)n>o martingale, since 212 24. Martingales E{Sn|Fin} = E{ Sm + (Sn — Sm)|Fin} = Sm + E{Sn — Sm|Fin} n = Sin +e{ > alr} k=m41 n =Sm+ > E{Xe} k=m4+1 =Sn- When the variables X;, have p = E{X,} 4 0, then using X;, — pz instead of X,, above we obtain similarly that (Sp — ny)n>0 is an (Fn)n>o martingale. Example 24.2, Let Y be measurable with E{|Y|} < oc and define Xn = E{Y|Fn}- Then E{|X,|} < E{|Y|} < oc and for m < n, E{Xn| Fn} = E{E{Y Fa }lFin} = FAY |Fin} = Xn (see Exercises 23.1 and 23.2). Definition 24.2. A martingale X = (Xy)n>0 ts said to be closed by a ran- dom variable Y if E{|Y|} < 00 and X, = E{Y|Fy}, each n. Example 24.2 shows that any rv. Y € F with E{|Y|} < oo gives an example of a closed martingale by taking X, = E{Y|F,}, n > 0. An important property of martingales is that a martingale has constant expectation: Theorem 24.1. If (Xn)n>0 is a martingale, then n > E{X,} is constant. That is, E{X,} = E{X}, all n > 0. Proof. E{Xn} = E{E{Xn|Fo}} = E{Xo}- Qo The converse of Theorem 24.1 is not true, but there is a partial converse using stopping times (see Theorem 24.7). Definition 24.3. A random variable T: 2 = N = NU {+00} is called a stopping time if {T o{n:X, > 12} if X, >12 for some n€N ~ to otherwise. That is. P(e) = inf (rs Xq(w) 2 12} if X,,(w) > 12 for some integer n, and T(w) = +00 if not. Note that the event {w:T(w) < n} can be expressed as: n {Psn}=U{Xe2> Whe Fa k=0 because {X;, > 12} € Fi, C Fy if k < n. The term “stopping time” comes from gambling: a gambler can decide to stop playing at a random time (de- pending for example on previous gains or losses), but when he or she actually decides to stop, his or her decision is based upon the knowledge of what hap- pened before and at that time, and obviously not on future outcomes: the reader can check that this corresponds to Definition 24.3. Theorem 24.1 extends to bounded stopping times (a stopping time is T is bounded if there exists a constant c such that P{T < c} = 1). If T is a finite stopping time, we denote by Xr the r.v. 
Xp(w) = X7(.)(w); that is, it takes the value X, whenever T = n. Theorem 24.2. Let T be a stopping time bounded by c and let (Xn)n>o be a martingale. Then E{X7} = E{Xo}. Proof. We have X7p(w) = 7%) Xn(w)1 ¢rqwy=n}- Therefore, assuming with- out loss of generality that c is itself an integer, E{Xr} = £{¥ stiren} n=0 = P{S Xtiren} n=0 e =O E{Xnl eran) n=0 Since {T = n} = {T < n}\{T < n—1} we see {T =n} € Fy, and we obtain =O E{E{X Fa} preny} n=0 214 24. Martingales = YO E{Xelprany} n=0 =E po» tir} n=0 = E{X,} = E{Xo}. with the last equality by Theorem 24.1. Oo The o-algebra F,, can be thought of as representing observable events up to and including time n. We wish to create an analogous notion of observable events up to a stopping time 7. Definition 24.4. Let T be a stopping time. The stopping time o-algebra Fr is defined to be Fr ={AE€F: AN{T 1 are in Fr. then ES 2° (U 4) MT o be @ martingale and let S,T be stopping times bounded by a constant c, with S < T a.s. Then E{Xr|Fs} = Xs as. Proof. First |X| < Y2%_9|Xnl is integrable (without loss of generality we can assume again that c is an integer), as well as Xs, and further Xg is F5- measurable by the previous theorem. So it remains to prove that E{X7Z} = E{XsZ} for every bounded Fs-measurable r.v. Z. By a standard argument it is even enough to prove that if A € Fs then E{Xrla} = E{Xsla} (if this holds, then E{ XZ} = E{XgZ} holds for simple Z by linearity, then for all F5-measurable and bounded Z by Lebesgue’s Dominated Convergence Theorem). So let A € Fs. Define a new random time R by R(w) = S(w)La(w) + T(w)Lac(w)- Then R is a stopping time also: indeed, {Ren} =AN{S 0 be a sequence of random variables with Xp, being Fy measurable, each n. Suppose E{\|X,|} < oo for each n, and E{Xr} = E{Xo} for all bounded stopping times T. Then X is a martin- gale. Proof. Let 0 << m o, With Fm C Fp form < n. 24.1 If T =n. show that Fr = Fy. 24.2 Show that $A T' = min(S,T) is a stopping time. 24.3 Show that S$ VT = max(S,7’) is a stopping time. 24.4 Show that 9 +7 is a stopping time. 24.5 Show that aT is a stopping time for a > 1, @ integer. 24.6 Show that Faar C Fr C Fev. 24.7 Show that T is a stopping time if and only if {7’ = n} € F,, each n > 0. 24.8 Let A € Fp and define nana {% tos Show that 7, is another stopping time. 24.9 Show that T' is Fp—measurable. 24.10 Show that {S < T}, {S < T}, and {S = T} are all in Fg Fr. 24.11* Show that E{ELY |Fr}| Fs} = E{E{Y|Fs}\Fr} = E{Y|Fsar}- 24.12 Let M = (Mn)n>o be a martingale with M,, € L?, each n. Let S,T be bounded stopping times with S < T. Show that Ms, Mr, are both in L?, and show that E{(Mr — Ms)?|Fs} = E{M# — M3|Fs}, and that E{(Mr — Ms)?} = B{M?} — E{M3}. 24,13 Let y be convex and let M = (Mn)n>o be a martingale. Show that. n — E{(M,)} is a nondecreasing function. (Hint: Use Jensen’s inequality [Theorem 23.9].) 24.14 Let X,, be a sequence of random variables with E{X, | Fri} = 0 and X,, F,-measurable, each n. Let S, = Vreo Xz. Show that (Sp)n>o is a martingale for (Fp )n>0- 25. Supermartingales and Submartingales In Chapter 24 we defined a martingale via an equality for certain conditional expectations. If we replace that equality with an inequality we obtain super- martingales and submartingales. Once again (2, F, P) is a probability space that is assumed given and fixed, and (F,)n>1 is an increasing sequence of o-algebras. Definition 25.1. 
A sequence of random variables (Xn)n>o is called a sub- martingale (respectively a supermartingale) if (i) E{|Xp|} < cc, each n; (ii) Xp is Fy-measurable, each n; (iii) E{X,|Fin} = Xm as. (resp. < Xm a8.) eachm o is a martingale if and only if it is a submartingale and a supermartingale. Theorem 25.1. If (Mn,)n>o is a martingale, and if y is conver and y( Mn) is integrable for each n, then (~(Mn))n>0 is a submartingale. Proof. Let m 0 is a martingale then Xp = |M,|, n > 0, is a submartingale. Proof. p(x) = |x| is a convex, so apply Theorem 25.1. o Theorem 25.2. Let T be a stopping time bounded by C € N and let (Xn)n>0 be a submartingale. Then E{X7} < E{Xc}-. Proof. The proof is analogous to the proof of Theorem 24.2, so we omit it. Oo The next theorem shows a connection between submartingales and mar- tingales. 220 25. Supermartingales and Submartingales Theorem 25.3 (Doob Decomposition). Let X = (Xn)n>0 be a sub- martingale. There eaists a martingale M = (Mn)n>o and a process A = (An)nzo with Angi > An as. and Ani, being F,-measurable, each n > 0, such that Xn = Xp +Mn+An, with Mg = Ao = 0. Moreover such a decomposition is a.s. unique. Proof. Define Ag = 0 and 7 An = SO E{X, —Xp-1|Fe-} forn> 1. k= Since X is a submartingale we have E{X, — X;,—1|Fxe—1} > 0 each k, hence Agi: = Ag as., and also Ay; being Fy-measurable. Note also that E{Xn | Fri} — Xn-1 = B{Xn — Xn-1 | Fr-i} = An - Anas and hence E{Xn | Fr-1} ~ An = Xn-1 ~ An-13 but An € Fr), so E{Xn — An|Fn-1} = Xn-1 — An-1- (25.1) Letting M,, = X;, — Ay we have from (25.1) that M is a martingale and we have the existence of the decomposition. As for uniqueness, suppose Xn =Xot+Mnt+An, 020, Xn =XotIn+Cn, n20, are two such decompositions. Subtracting one from the other gives Ln — Mn = An - Cn. (25.2) Since An, Cy, are F,-, measurable, L, — M,, is F,~; measurable as well; therefore Ly — My = E{Ln ~ Mn\Fa-i} = Dut ~ Muy =An-1—Cn-a as. Continuing inductively we see that L, ~ M, = Lo — My = 0 as. since Lo = Mo = 0. We conclude that L, = M,, a.s., whence A, = C, as. and we have uniqueness. o Corollary 25.2. Let X = (Xn)nzo0 be a supermartingale. There exists a unique decomposition X,=Xo4+My—An, n>0 with Mo = Ao = 0, (Mn)n>0 @ martingale, and Ay being F,—,-measurable with Ay > Aj, as. 25. Supermartingales and Submartingales 221 Proof. Let Yn = —Xn. Then (¥n)n>0 is a submartingale. Let the Doob de- composition be Yn = Yo+In+Ch. and then X, = Xo — Ly, — Cy; set M, = —L, and Ay, = Cy, n > 0. oO 222 25. Supermartingales and Submartingales Exercises for Chapter 25 25.1 Show that X = (Xn)n>o is a submartingale if and only if Y, = —Xn, n > 0, is a supermartingale. 25.2 Show that if X = (X,)n>o0 is both a submartingale and a supermartin- gale, then X is a martingale. 25.3 Let X =(Xn)n>o be a submartingale with Doob decomposition X, = Xo +My, + An. Show that E{A,} < oo, each n < 0. 25.4 Let M =(Mp)no0 be aimartingale with My = 0 and suppose E{M?2} < oo. each n. Show that X,, = M?. n > 0. is a submartingale, and let X, = Ly, +An be its Doob decomposition. Show that E{M?2} = E{A,}. 25.5 Let M and A be as in Exercise 25.4. Show that A, — An_, = E{(Mn— Mn-1)?\Fn—-1}+ 25.6 Let X = (Xy)no0 be a submartingale. Show that if y is convex and nondecreasing on R and if y(X») is integrable for each n, then Y,, = (Xn) is also a submartingale. 25.7 Let X =(Xn)nz>o be an increasing sequence of integrable r.v.. each Xp being F,,-measurable. Show that X is a submartingale. 26. 
Martingale Inequalities One of the reasons martingales have become central to probability theory is that their structure gives rise to some powerful inequalities. Our presentation follows Bass [1]. Once again (92, F, P) is a probability space that is assumed given and fixed. and (Fp )n>0 is an increasing sequence of o-algebras. Let M = (Mn)n>0 be a sequence of integrable r.v.’s. each M, being F,-measurable. and let My = sup; @) = E{cuz>0}} S a In the martingale case we can replace MW, with only |M],,| on the right side. Theorem 26.1 (Doob’s First Martingale Inequality). Lei M = (Mp )nzo be a martingale or a positive submartingale. Then P(M; > a) < EUMal}, Proof. Let T = min{j : |M;| > a} (recall our convention that the minimum of an empty subset of N is +00). Since g(x) = |2| is convex and increasing on R,, we have that |M,| is a submartingale (by Theorem 25.1 if M is a martingale, or by Exercise 24.6 if M is a positive submartingale). The set {T a} and {My > a} are equal, hence P(M* >a) =P(T 0)a) < GEtiMron!Ler 0 be a random variable, p > 0, and E{X?} < x. Then 00 E{X?} | pr P(X > d)dd. 0 Proof. We have oc oc | pr?“ P(X > Add = PP B{lixsay}dd, 0 0 and by Fubini’s Theorem (see Exercise 10.15) oo x = e{[ PAP exsayaa} = eff peal = E{X?}. 0 0 Theorem 26.2 (Doob’s L? Martingale Inequalities). Let M = (Mn)n>0 be a martingale or a positive submartingale. let 1 < p < 00. There exists a constant c depending only on p such that E{(My)?} < cB{|Mn|?}- oO Proof. We give the proof in the martingale case. Since y(x) = |x| is convex we have |M,,| is a submartingale as in Theorem 26.1. Let X, = Mnlija,|>3)- For n fixed define Z,=E(XalF} OSA Sn Note that Z;, 0 < j < nis a martingale. Note further that My < Z} + 9, since \M;| = |E{M, | F;}| = E{Mn lM iani>g) + Mal iatnisg)F} = |E{Xp + Mnlimyicgy | Fi} SERIA +5 a = (Z| + z By Doob’s First Inequality (Theorem 26.1) we have P(Mt>a)< P(z > 5) 2 2 - < SB (\Znl} = ZE(Xal} 2 | = CEA Malltiaaingy}- 26. Martingale Inequalities 225 By Lemma 26.1 we have 2 Bag} = [pws Paty > yan oe <[ 2p? E{|Mn lV ejang yay }A , } and using Fubini’s theorem (see Exercise 10.15): 2|Mn| =E {f naan fo = 2p Pp os 7 Ei Mni?}- ao Note that we showed in the proof of Theorem 26.2 that the constant c< 2, With more work one can show that ¢? = 52;- Thus Theorem 26.2 could be restated as: Theorem 26.3 (Doob’s L? Martingale Inequalities). Let M = (Mn)nz0, be a martingale or a positive submartingale. Let 1 < p< oc. Then E((Mz?}* <2. EtiMaP}}, p-l or in the notation of L? norms: (Melly < allMnlp- Our last inequality of this section is used to prove the Martingale Conver- gence Theorem of Chapter 27. We introduce Doob’s notion of uperossings. Let (Xn)n>o be a submartingale, and let a < b. The number of upcrossings of an interval [a.b] is the number of times a process crosses from below a to above b at a later time. We can express this idea nicely using stopping times. Define Ty = 0, and inductively for j > 0: Sjyy =minfk > Tj): X_ Sa}, Tjgy =min{k > Sy41: X_ > b}, (26.1) with the usual convention that the minimum of the empty set is +00; with the dual convention that the maximum of the empty set is 0, we can then define U, = max{j : T; < n} (26.2) and U,, is the number of upcrossings of [a,b] before time n. 226 26. Martingale Inequalities Theorem 26.4 (Doob’s Upcrossing Inequality). Let (Xj)n>0 be @ sub- martingale, let a . 
we obtain: ® n Yn = Yaran + > (Yran = Ysuan) +90 (¥Suaian—Yran)- (26.3) i=1 i=) Each upcrossing of (X;,) between times 0 and n corresponds to an integer i such that S; < T; < n, with Ys, = 0 and Yr, = Yran > b—a, while Yr,an — Ys,an = 0 by construction for all i. Hence n Onan = Ya,an) > (b= @)Un- ia By virtue of (26.3) we get a (b= a)Un < Ym —Yoian = Yo (Yeician = Yrvan)s i and since Ys,an > 0, we obtain n (b= @)Un < Yn = 7 (¥5..:4n > Yvan) + i= Take expectations on both sides: since (Y,,) is a submartingale and the stop- ping times T; An and S,, An are bounded (by n) and T; An < Siz; An, we have E{Y¥s,.,an — Yruan} > 0 and thus (b= a)E{Un} < E{¥n}. Exercises 227 Exercises for Chapter 26 26.1 Let Y, € L? and suppose limp. E(Y2) = 0. Let (F,)e>0 be an increasing sequence of o-algebras and let Xf = E{Y,|Fi.}. Show that limp oo B{sup,(XP)?} = 0. 26.2 Let X,Y be nonnegative and satisfy aP(X >a)< E{Y1,x30}}- for all a > 0. Show that B{X?} < E{qX?"Y}, where L+}=1;p>1. 26.3 Let X.Y be as in Exercise 26.2 and suppose that ||X|j, < oo and IY |p < 00. Show that ||X||, < q|[Y ||). (Hint: Use Exercise 26.2 and Hilder’s inequality.) 26.4 Establish Exercise 26.3 without the assumption that || X'||, < o°. 26.5 * Use Exercise 26.3 to prove Theorem 26.3. sisi Gy ASSES he SRBVEEIT AF 27. Martingale Convergence Theorems In Chapter 17 we studied convergence theorems, but they were all of the type that one form of convergence, plus perhaps an extra condition, implies another type of convergence. What is unusual about martingale convergence theorems is that no type of convergence is assumed — only a certain structure = yet convergence is concluded. This makes martingale convergence theorems special in analysis; the only similar situation arises in ergodic theory. Theorem 27.1 (Martingale Convergence Theorem). Let (Xp)n>1 be a submartingale such that sup, E{Xx} < oo. Then limp Xn = X exists a.s. (and is finite a.s.). Moreover, X is in L’. |Warning: we do not assert here that X,, converges to X in L’; this is not true in general.) Proof. Let U;, be the number of upcrossings of (a, b] before time n, as defined in (26.2). Then U,, is non-decreasing hence U(a, b) = limp. U, exists. By the Monotone Convergence Theorem E{U(a,d)} = im E{Un} IA + sup £{(Xn—a)*} IA 7 (sup 2(x3} + lal) < rm < 00 for some constant ¢; ¢ < 00 by our hypotheses, and the first inequality above comes from Theorem 26.4 and the second one from (x — a)* < xt + |a| for all reals a, x. Since E{U(a,b)} < 00, we have P{U(a,b) < oo} = 1. Then X, upcrosses [a,b] only finitely often a.s., and if we let Ag» = {limsup Xp > b: Jim inf Xn liminf X,}, n n 230 27. Martingale Convergence Theorems and we conclude limp. Xp exists a.s. It is still possible that the limit is infinite however. Since X,, is a sub- martingale, E{X,,} > E{Xo}. hence BA\Xnl} = B{XH} + BEXS} = 2E{X$}- F(X} < 2E{Xt} — E{Xo}. er.) hence E{lim |X,.|} < lim inf E{|X,|} < 2sup E{X7} — E{Xo} < 2, noe by Fatou’s lemma and (27.1) combined with the hypothesis that sup,, E{X7} < oo. Thus X, converges a.s. to a finite limit X. Note that we have also showed that E{|X|} = F{lim, 2 |X;,|} < oo, hence X is in L}. Oo Corollary 27.1. If X, is a nonnegative supermartingale, or a martingale bounded above or bounded below, then limp. Xn = X exists a.s., and X € D. Proof. If X,, is a nonnegative supermartingale then (—X,,)n>y is a submartin- gale bounded above by 0 and we can apply Theorem 27.1. If (Xn)n>1 is a martingale bounded below, then X, > —c a.s., all n, for some constant c, with c > 0. 
Let Y, = X, +c, then Y,, is a nonnegative martingale and hence a nonnegative supermartingale, and we need only to apply the first part of this corollary. If (Xn)n>1 is a martingale bounded above, then (—Xp)n>1 is a martingale bounded below and again we are done. Oo Theorem 27.1 gives the a.s. convergence to a r.v. X, which is in L. But it does not give L! convergence of X, to X. To obtain that we need a slightly stronger hypothesis, and we need to introduce the concept of uniform integrability. Definition 27.1. A subset H of L’ is said to be a uniformly integrable col- lection of random variables if lim sup E{1qxj>0}|X|} = 0. c7 NCH * Next we present two sufficient conditions to ensure uniform integrability. Theorem 27.2. Let H be a class of random variables a) Ifsupyen E{|X|?} < 00 for some p > 1, then H is uniformly integrable. b) If there exists ar.v. Y such that |X| c > 0, then 2'~? < ¢'-?, and multiplying by 2? yields a < clPz?. Therefore we have E{X Lax} Se PE (IXP Lax} S Ga hence lime +2. Sup yen FLIX yx}p0)} S lime soo gr = 0. (b) Since |X| < Y as. for all X € H, we have IX{Laxj>cy S Y¥lgyoe}- But limesao Y1jy><} = 0 as; thus by Lebesgue’s dominated convergence theorem we have Jim, sup BUX xincy} S im HY lyse} = E{ lim Ylpysqj} =0. emo Oo For more results on uniform integrability we recommend [15, pp. 16-21]. We next give a strengthening of Theorem 27.1 for the martingale case. Theorem 27.3 (Martingale Convergence Theorem). a) Let (Mn)n>1 be a martingale and suppose (Mn)n>1 is a uniformly integrable collection of random variables. Then lim Mn = Moo evists a.s., n=00 M,, is in L’, and M,, converges to Mz. in L’. Moreover M, = E{Mox | Fn}. b) Conversely let Y € L’ and consider the martingale M, = E{Y|Fn}. Then (Mn)nz1 is a uniformly integrable collection of r.v.’s. In other words, with the terminology of Definition 24.2, the martingale (M,,) is closed if and only if it is uniformly integrable. Proof. a) Since (M;,)n>1 is uniformly integrable, for ¢ > 0 there exists c such that supy E{|Mn|1\a1,\>c) + < €- Therefore E{|Mnl} = E{|Mn|Leiatg\>ep} + E{IMnl (iatgiey} Sete Therefore (M,)n>1 is bounded in L}. Therefore sup, E{M;t} < oo and by Theorem 27.1 we have lim M,, = M., exists a.s. and Mx is in L}. n=50 To show M,, converges to M,, in L’, define 232 27. Martingale Convergence Theorems c if >a fe(a)=4 2 if jal 0 given: E {\fe(Mn) ~ Mul} < §. B {\fel Mac) ~ Mal} < 5. (27.3) all n; (27.2) Since lim M, = Mx as. we have limp oo fe(Mn) = fe(Mox), and so by Lebesgue’s Dominated Convergence Theorem (Theorem 9.1(f)) we have for n> N, N large enough: E{\fe(Mn) ~ fe(Moo)|} < (7.4) Therefore using (27.2), (27.3), and (27.4) we have E{|Mn — Mol} N. Hence My, —+ Moo in L?. It remains to show E{ Mx | Fn} = Mn. Let A € Fin and n > m. Then E{Mnla) = E{Mm la} by the martingale property. However, |E{M, la} — E{Moola}| < E{|Mn — Moola} S E{|Mn — Mocl} which tends to 0 as n tends to oo. Thus E{M,,1a} = E{M..1a} and hence E{Mx | Fn} = Mn as. b) We already know that (Mp)n>1) is a martingale. If c > 0 we have Mnlgiaai>cy = ELV 1gjatai>cy | Fr} because {|M,,| > c} € Fy. Hence for any d > 0 we get E{\Mn\lgmniscy} S BUY Wgataiseyt S PUY Lgyi>ay} + dP (|Mn| > ©) , d S BEY gvisay} + 7 EUMal}- (27.5) Take ¢ > 0. We choose d such that the first term in (27.5) is smaller than e/2, then c such that the second term in (27.5) is smaller than ¢/2: thus E{|\Mn}1yat,j>¢}} < € for all n, and we are done. aq 27. Martingale Convergence Theorems 233, The martingale property is that E{X,, | F,} =X; as. is natural to think of n. 
m as positive counting numbers (i.e., in N), as we did above. But we can also consider the index set -N: the negative integers. In this case if |m| > |n|, but m and n are negative integers, then m < n. Let U_{-n} denote the number of upcrossings of [a, b] between time -n and 0. Then U_{-n} is increasing as n increases, and let U(a, b) = lim_{n→∞} U_{-n}, which exists. By Monotone Convergence E{U(a, b)} = lim_{n→∞} E{U_{-n}}

0 and X7,, + X* a.s. yield =n E{X*} 1 be an iid. sequence with E{|X,|} < oc. Then Xt. t Xn lim 27-7" noc n — B{X} as. Proof. Let S, = X, +...+ Xn, and Fon = o( Sn. Snai+Snz2....). Then Fn C Fm ifn > m, and the process Mon = E{Xi|F-n} is a backwards martingale. Note that E{M_,} — E{X,}, each n. Also note that by symmetry for 1 1 be independent random vari- ables, E{Yn} =0, all n, and E{Y2} < 00 alln. Suppose ~~~, E{¥2} < 20. Let Sn = Sihay Yj. Then limn soo Sn = D2, Yj ewists a.s., and it is finite as. Proof, Let Fy = o(Yi,.-.,¥n), and note that E{Sni1—Sn | Fn} = E{¥aan | Fn} = E{Y¥n41} = 0. hence (S,)n>1 is an ¥,-martingale. Note further that sup, E{S#} < sup, (E{S2} +1) < O, E{¥2}+1 < co. Thus the result follows from the Martingale Convergence Theorem (Theorem 27.1). oO The Martingale Convergence Theorems proved so far (Theorems 27.1 and 27.4) are strong convergence theorems: all random variables are defined on the same space and converge strongly to random variables on the same space, almost surely and in L’. We now give a theorem for a class of martingales 27. Martingale Convergence Theorems 235, that do not satisfy the hypotheses of Theorem 27.1 and moreover do not have a strong convergence result. Nevertheless we can obtain a weak convergence result, where the martingale converges in distribution as n — oc. The limit is of course a normal distribution, and such a theorem is known as a martingale central limit theorem. The result below is stated in a way similar to the Central Limit Theorem for ii.d. variables X,, with their partial sums S,,: Condition (i) implies that (Sp) is a martingale, but on the other hand an arbitrary martingale (S,) is the sequence of partial sums associated with the random variables X,, = Sn — Sp. and these also satisfy (i). Theorem 27.7 (Martingale Central L' a sequence of random variables satisfying (i) E{Xn | Fra} (i) E{Xn | Fraps (iit) E{IXpP | Fra} ee , and so for n large enough we have 0 < 1— ¢ < 1. Therefore we reduce “the left side of (27.12) by multiplying by (1 — we yr-p for n large enough, to obtain WV? wes, wWVTP Fits, luls a Eset (yo BoM RV cK (om) Pf} (-g) feel sea (27.13) Finally we use telescoping (finite) sums to observe ye £ {ems} and thus by the triangle inequality and (27.13) we have (always for n > ©): a 3 iw ony (7 KluP tule jefe } (: a) ; but this is the characteristic function of an N(0.1) random variable (cf Example 13.5), and characteristic functions characterize distributions (Theorem 14.1), so we are done. oO Remark 27.1. If S,, is the martingale of Theorem 27.7, we know that strong martingale convergence cannot hold: indeed if we had lim, Sp = Sas. with $ in L, then we would have limy 3% = 0 as., and the weak convergence of Se to a normal random variable would not be possible. What makes it not possible to have the strong martingale convergence is the behavior of the conditional variances of the martingale increments X,, (hypothesis (ii) of Theorem 27.7). o We end our treatment of martingales with an example from analysis: this example illustrates the versatile applicability of martingales; we use the mar- tingale convergence theorem to prove a convergence result for approximation of functions. Example 27.1. ((10]) Let f be a function in L?(0, 1] for Lebesgue measure restricted to [0,1]. Martingale theory can provide insights into approxima- tions of f by orthogonal polynomials. Let us define the Rademacher functions on [0,1] as follows. We set Ro(a) =1,0 1, we set for0 m. (See Exercise 27.8.) 
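A short numerical sketch (ours, not the book's) can make the Rademacher functions concrete. It assumes the standard convention that R_n(x) is +1 or -1 according to whether the n-th binary digit of x is 0 or 1, i.e., R_n alternates sign on the dyadic intervals of length 2^{-n}; the helper name rademacher and the Monte Carlo check below are illustrative assumptions, not notation from the text. Sampling x uniformly on [0, 1] illustrates that each R_n takes the values +1 and -1 with probability 1/2 and that R_n and R_m are independent when n ≠ m (Exercise 27.7).

import numpy as np

rng = np.random.default_rng(0)

def rademacher(n, x):
    # Sign given by the n-th binary digit of x: +1 on [2k/2^n, (2k+1)/2^n),
    # -1 on [(2k+1)/2^n, (2k+2)/2^n).
    return 1 - 2 * (np.floor(x * 2**n).astype(int) % 2)

x = rng.uniform(0.0, 1.0, size=1_000_000)
r2, r5 = rademacher(2, x), rademacher(5, x)

print(r2.mean(), r5.mean())            # both near 0: each R_n is +1 or -1 with probability 1/2
print((r2 * r5).mean())                # near 0: R_2 and R_5 are uncorrelated
print(((r2 == 1) & (r5 == 1)).mean())  # near 1/4 = P(R_2 = 1) P(R_5 = 1)

Since the R_n are {+1, -1}-valued, zero correlation of a pair is equivalent to its independence, so the last two printed estimates check the same property; both are subject only to Monte Carlo error of order 10^{-3} at this sample size.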
Next we define the Haar functions as follows: Ho(«) = Ro(2), Ay (a) = Ry(x). For n > 2. letn =14+24...4+2°7? += 27"! —14A, where r > 2 and 1<\< 2°). Then V2-TR,,(a) for 2=2 . Let 1 n= [ H, (2) f(a)de, Sn(a. f) = > 0-H, (2). (27.16) r=0 Then limn oo Sn(t, f) = f(a) a.e. Moreover if S*(x. f) = sup, |Sn(2.f)I, then 1 pa [ .nyars (4) [ Uf (w)|Pdz. Proof. We first show that S,,(z, f) is a martingale. We have E{Sn+1(@.f) | Fu} = Sn(@.f) + Ef{onsiAng1(@) | Fn} = Sn(@,f) + Ong E{Hn+1(2) | Fn} = Sn(«, f) 27. Martingale Convergence Theorems 239 where we used (27.15). However more is true: Sn(w.f) = Eff | Fr}, (27.17) which is the key result. Indeed to prove (27.17) is where we need the coeffi- cients a, given in (27.16). (See Exercise 27.10.) Next we show S,,(2, f) satisfies sup, E{S,(#,f)*} < 00, for p > 1 (the hypothesis for the Martingale Convergence Theorem; Theorem 27.1). We actually show more thanks to Jensen’s inequality (Theorem 23.9): since y(u) = |u|? is convex for p > 1, we have that 1 | iSa(0. f) Pdr = EXECS | Fa}} S E{ELif? | Fah} = EXifl’} 1 -[ f(a) ida < cc, 0 and thus sup E{S,,(x, f)*} < sup E{|Sp (x, f)|P} n n < E{|f|P} < 20. We now have by Theorem 27.1 that lim S,,(x, f) = f(x) almost everywhere. noc and also by Doob’s L? martingale inequalities (Theorem 26.2) we have p \P B(s(s}s (25) Bsa) < (2) ein. [ “(s(@. fy)Pde < (AY [ " flayirae. We remark that results similar to Theorem 27.8 above hold for classical Fourier series, although they are harder to prove. or equivalently oO 240 27. Martingale Convergence Theorems Exercises for Chapter 27 27.1 (A martingale proof of Kolmogorov’s zero-one law.) Let X,, be inde- pendent random variables and let C,, be the corresponding tail o-algebra (as defined in Theorem 10.6). Let C € C... Show that E{1c|F,} = P(C). all n. where F,, = 0(X;;0 < j < n). Show further lim, ,, E{lc|Fn} = le as. and deduce that P(C) = 0 or 1. 27.2 A martingale X is bounded in L? if sup, E{X2} < cc. Let X be a martingale with X,, in L?, each n. Show that X is bounded in L? if and only if cc SO E{(Xn = Xna1)?} < co. n=1 (Hint: Recall Exercise 24,12.) 27.3 Let X bea martingale that is bounded in L?: show that sup, E{|Xal} < oo, and conclude that lim, X, = X a.s., with E{|X|} < oo. 27.4* Let X be a martingale bounded in L?. Show that lim, X, = X a.s. and in L?, That is, show that limy oo E{(X, —X)?} =0. 27.5 (Random Signs) Let (Xp)n21 be iid. with PX = P(X, = =1) = }. Let (an)nz1 be a sequence of real numbers, Show a ee aX, is a.s. convergent if 7%, a2 < oo. 27.6 Let X1, X2,... be i.id. nonnegative random variables with E{X,} = 1. Let Ry = Thea X;, and show that A, is a martingale. 27.7 Show that if n # m, then the Rademacher functions R, and R,, are independent for P = \ Lebesgue measure restricted to (0, 1]. 27.8 Let H, be the Haar functions, and suppose A € F, = o(Ho, Mh...., Hf,,). Show that [toss ote = A 27.9 Let f be in L?(0, 1]. Let Sy (2, f) be as defined in (27.16) and show that E{f | Fn} = Sp(2, f). (Hint: Show that [ tea = [ Sp(a, fda for A € Fr A A by using that the Haar functions are an orthonormal system; that is, 1 1 [ Ay(t)Am(2)dz = 0 ifn # mand [ H,,(x)?da = 1) 0 0 Exercises 241 27.10 Use Martingale Convergence to prove the following 0—1 law. Let (Fn) be an increasing sequence of o-algebras and G,, a decreasing sequence of o- algebras. with G, C o(U%,Fn). Suppose that F,, and G,, are independent. for each n. Show that if A €M2L,Gn, then P(A) =0or 1. 27.11 Let H be a subset of L!. 
Let G be defined on (0,00) and suppose G is positive, increasing, and lim sca] = 00. tox t Suppose further that sup yey E{G((X))} < oc. Show that 4 is uniformly integrable. (This extends Theorem 27.2(a).) 28. The Radon-Nikodym Theorem Let (2.F, P) be a probability space. Suppose a random variable X > 0 a.s. has the property E{X}= 1. Then if we define a set function Q on F by Q(A) = E{aX} (28.1) then it is easy to see that Q defines a new probability. Indeed Q() = E{loX} = E{X} =1 and if A;, Ao, A3,... are disjoint in F then Q (U 4) = E(us,ayX} t=1 -2 {Suna} i=1 =e naxy i=l = oA) i=) and we have countable additivity. The interchange of the expectation and the summation is justified by the Monotone Convergence Theorem (Theo- rem 9.1(d)). Let us consider two properties enjoyed by Q: (i) If P(A) = 0 then Q(A) = 0. This is true since Q(A) = E{1,X}, and then 1, is a.s. 0, and hence 1,X = Oas. (ii) For every ¢ > 0 there exists 6 > 0 such that if A € F and P(A) < 6, then QA) 0 there exists 6 > 0 such that if A €F and P(A) <6, then Q(A) 0. Set A = limsup,, 2: An- By Borel-Cantelli Lemma (Theorem 10.5) we have P(A) = 0. Fatou’s lemma has a symmetric version for limsups, which we established in passing during the proof of Theorem 9.1(f): this gives Q(A) = limsup Q(A,n) >. and we obtain a contradiction. o It is worth noting that conditions (i) and (ii) are actually equivalent. Indeed we showed (i) implies (ii) in Theorem 28.1; that (ii) implies (i) is simple: suppose we have (ii) and P(A) = 0. Then for any e > 0, P(A) < 6 and so P(A) < . Since € was arbitrary we must have Q(A) = 0. Definition 28.1. Let P,Q be two finite measures. We say Q is absolutely continuous with respect to P if whenever P(A) = 0 for A € F, then Q(A) = 0. We denote this Q << P. Examples: We have seen that for any r.v. X > 0 with E{X} = 1. we have Q(A) = E{1,X} gives a probability measure with Q < P. A naturally occurring example is Q(A) = P(A | A), where P(A) > 0, It is trivial to check that P(A) = 0 implies Q(A) = 0. Note that this example is also of the form Q(A) = E{1,X}, where X = Puy la: The Radon-Nikodym theorem characterizes all absolutely continuous probabilities. Indeed we see that if Q < P, then Q must be of the form (28.1). Thus our original class of examples is all that there is. We first state a simplified version of the theorem, for separable o-fields. Our proof follows that of P. A. Meyer [15]. Definition 28.2. A sub o-algebra G of F is separable if G = o(Ai,..., An,...), with A; © F, alli. That is, G is generated by a countable sequence of events. Theorem 28.2 (Radon-Nikodym). Lei (2,F,P) be a probability space with a separable o-algebra F. If Q is a finite measure on F and if P(A) =0 implies Q(A) = 0 for any such \ € F, then there exists a unique integrable positive random variable X such that Q(A) = E(1nX}. We write X = 43. Further X is unique almost surely: that is if X’ satisfies the same properties, then X' = X P-a.s. Proof. Since the result is obvious when Q = 0, we can indeed assume that Q(2) > 0. Then we can normalize Q by taking Q = ame, so we assume 28. The Radon-Nikodym Theorem 245 without loss that Q is a probability measure. Let Aj, A2,....4, be a count- able enumeration of sets in F such that F = o( Ay. A2,..., An-...). We define a filtration (Fn)n>1 by (At... An): There then exists a finite partition of {2 into sets Ani, An2,--.; Ant, Such that each element of F;, is the (finite) union of some of these events. Such events are called “atoms”. 
We define kn “2 Fa ale “) (28.2) with the convention that § = 0 (since Q < P the numerator is 0 whenever the denominator is 0 above). We wish to show the process (Xp )n>1 is in fact a martingale. Observe first that X,, is ¥,-measurable. Next, let m Pay deanna? 7 — QAAns) = » Plan) Ane A). We can write Now, since A € F;,, the set \ can be written as the union of some of the (dis- joint) partition sets A,;, that is A = UyerAn, for a subset JC {1,..., kn}. Therefore AM An; = Ani if i € J and AN Any = ¢ otherwise, and we now obtain [Xue >> Fgh Pas nA) = Vi QAns) = QA) ie where we have used again the fact that Q(A,,;) = 0 whenever P(An,;) = 0. Since A € Fm we get similarly [, XmdP = Q(A). Hence (28.1) holds, and further if we take \ = 92 then we get f X,dP = Q(2) = 1 < 00, so Xp, is P-integrable. Therefore (X,)n>1 is a martingale. 246 28. The Radon-Nikodym Theorem We also have that the martingale (X,,) is uniformly integrable. Indeed. we have / X,dP = Q(Xn > &); Xn>c} by Markov’s inequality P(X,) < Fides EU} Fol c Let ¢ > 0, and let 6 be associated with < as in Theorem 28.1 (since Q << P by hypothesis). If ¢ > 1/6 then we have P(X, > c) < 6, hence Q(Xn > c) Se, hence Je X, ce} 4ndP < ¢: therefore the sequence (X;,) is uniformly integrable, and by our second Martingale Convergence Theorem (Theorem 27.3) we have that there exists a r.v. X in L! such that limp—oo Xn = X as. and in L’ and moreover E{X | Fy} = Xn. Let now A € F, and define R(A) = E{1,X}. Then R agrees with Q on each Fn, since if A € Fy. R(A) = E{A,X} = E{1,X,} = Q(A). The Monotone Class Theorem (6.3) now implies that R= Q, since F=o(Fain>1). O Remark 28.1. We can use Theorem 28.2 to prove a more general Radon— Nikodym theorem, without the separability hypothesis. For a proof of Theo- rem 28.3 below, see (24, pp.147-149]. Theorem 28.3 (Radon-Nikodym). Let P be a probability on (Q,F) and let Q be a finite measure on (82, F). If Q< P then there exists a nonnegative rv. such that = E{1,X} for all A € F. Moreover X is P-unique a.s. We write X The Radon-Nikodym theorem is directly related to conditional expecta- tion. Suppose given ({2,F, P) and let G be a sub o-algebra of F. Then for any nonnegative r.v. X with E{X} < 00, Q(A) = E{X1,} for A in G defines a finite measure on (2,G), and P(A) = 0 implies QA) = 0. Thus % exists on the space (2,G), and we define Y = #2. then Y is G-measurable. Note further that if A € G, then E{Y 1p} = Q(A) = E{X1)}. Thus Y is a version of E{X | G}. In fact, it is possible to prove the Radon— Nikodym Theorem with a purely measure-theoretic proof, not using martin- gales. Then one can define the conditional expectation as above: this is an alternative way for constructing conditional expectation, which does not use Hilbert space theory. Finally note that if P is a probability on R having a density f, and since P(A) = f, f(a)dz, then P is absolutely continuous with respect to Lebesgue measure m on R (here m is a o-finite measure, but the Radon-Nikodym Theorem “works” also in this case), and we sometimes write f = a. Exercises 247 Exercises for Chapter 28 28.1 Suppose Q < P and P 0a.s. (dP). 28.2 Suppose Q ~ P. Let X = 9%. Show that + = gs. 28.3 Let jy: be a measure such that p = aan QP, for P,, probability mea- sures and an > 0. all n. Suppose Q, < Py, each n, and that v= 7, BnQn and 3, > 0, all n, Show that (A) = 0 implies »(A) = 0. 28.4 Let P,Q be two probabilities and let R = Pea Show that P< R. 28.5 Suppose Q ~ P. Give an example of a P martingale which is not a martingale for Q. 
Also give an example of a process which is a martingale for both P and Q simultaneously.

References

1. R. Bass (1995), Probabilistic Techniques in Analysis; Springer-Verlag; New York.
2. H. Bauer (1996), Probability Theory; Walter de Gruyter; Berlin.
3. J. Bernoulli (1713), Ars Conjectandi; Thurnisiorum; Basel (Switzerland).
4. G. Cardano (1961), Liber de ludo aleae (The Book on Games of Chance); Sidney Gould (Translator); Holt, Rinehart and Winston.
5. G. Casella and R. L. Berger (1990), Statistical Inference; Wadsworth; Belmont, CA.
6. A. De Moivre (1718), The Doctrine of Chances; or, A Method of Calculating the Probability of Events in Play; W. Pearson; London. Also in 1756, The Doctrine of Chances (Third Edition), reprinted in 1967; Chelsea; New York.
7. J. Doob (1994), Measure Theory; Springer-Verlag; New York.
8. R. Durrett (1991), Probability: Theory and Examples; Wadsworth and Brooks/Cole; Belmont, CA.
9. W. Feller (1971), An Introduction to Probability Theory and Its Applications (Volume II); John Wiley; New York.
10. A. Garsia (1970), Topics in Almost Everywhere Convergence; Markham; Chicago.
11. A. Gut (1995), An Intermediate Course in Probability; Springer-Verlag; New York.
12. N. B. Haaser and J. A. Sullivan (1991), Real Analysis; Dover; New York.
13. C. Huygens (1657); see Oeuvres Complètes de Christiaan Huygens (with a French translation, 1920); The Hague: Nijhoff.
14. A. N. Kolmogorov (1933), Grundbegriffe der Wahrscheinlichkeitsrechnung. English translation: Foundations of the Theory of Probability (1950), Nathan Morrison (Translator); Chelsea; New York.
15. P. A. Meyer (1966), Probability and Potentials; Blaisdell; Waltham, MA (USA).
16. J. Neveu (1975), Mathematical Foundations of the Calculus of Probabilities; Holden-Day; San Francisco.
17. D. Pollard (1984), Convergence of Stochastic Processes; Springer-Verlag; New York.
18. M. H. Protter and P. Protter (1988), Calculus with Analytic Geometry (Fourth Edition); Jones and Bartlett; Boston.
19. M. Sharpe (1988), General Theory of Markov Processes; Academic Press; New York.
20. G. F. Simmons (1963), Introduction to Topology and Modern Analysis; McGraw-Hill; New York.
21. S. M. Stigler (1986), The History of Statistics: The Measurement of Uncertainty before 1900; Harvard University Press; Cambridge, MA.
22. D. W. Stroock (1990), A Concise Introduction to the Theory of Integration; World Scientific; Singapore.
23. S. J. Taylor (1973), Introduction to Measure and Integration; Cambridge University Press; Cambridge (U.K.).
24. D. Williams (1991), Probability with Martingales; Cambridge University Press; Cambridge (U.K.).

Index

1_A Indicator function 10, 49
2^Ω Set of all subsets of Ω 3, 7
A* A transpose 92
A_n → A convergence of the sets A_n to A 10
B(r, s) beta function 62
E{X} Expectation of X 27, 51, 52
E{Y | G} Conditional expectation of Y given G 200
E_Q{X | G} Conditional expectation of X given G under Q 209
H_n n-th Haar function 238
J_g Jacobian matrix 92
L^1 := ℒ^1 modulo a.s. equal 53
L^p as a normed linear space 207
L^p := ℒ^p modulo a.s. equal 53
N(μ, Σ) 127
N(μ, σ²) Normal distribution with mean μ and variance σ² 125
P(A | B) 16
P ⊗ Q 67
P^X Distribution measure of X 4, 27
P^X ⊗ P^Y Product of P^X and P^Y 69
P^(X,Y) Distribution measure of the pair (X, Y) 69
Q

Inner product 189 252 Index $8 Radon-Nikodym derivative of Q with respect to P 244 N Natural numbers 30 Q RationalsinR 8 R Real numbers (R= (—00,+00)) 8 Z The integers 161 B Borel sets of R 8,39 B" Borel sets of R” 87 C* Functions with an arbitrarily large number of derivatives 161 Cos Tail o-algebra 72 E@F=a(ExF) 67 Fy Stopping time o-algebra 214 £) Random variables with finite expectation 28, 52 £1(2,A,P) L} on the space (2,4, P) 52 LP Random variables with finite p' moment 53 N Null sets 37 Cov(X,Y) covariance of X and Y 73, 91 a.e, almost everywhere 88 a.s. almost surely 37,52 additivity 9 — o-additivity 8,35 algebra 7,35 atoms 245 Bayes’ Theorem 17 Bernoulli distribution 184 — characteristic function of 106 Bernoulli, Jacob 125 Berry-Esseen 185 beta function, distribution 62 biased estimator 139 BienayméChebyshev inequality: see 30, 119, 176, Chebyshev 29, 58 binomial distribution 23, 26, 30, 119, 163, 184 ~ characteristic function 106 Bolzano-Weierstrass theorem 158 Bonferroni inequalities 13 Borel (o-algebra) 8 -onR 8,39 -onR” 8&7 Borel sets 8 Borel-Cantelli Theorem 71 Box-Muller (simulation of) 101 Cauchy sequence 190 Cauchy distribution 44, 60, 98 138 114 — and bivariate normal — characteristic function Cauchy's equation 63 Cauchy-Schwarz inequality 57, 208 Central Limit Theorem 181, 183 Chebyshev inequality 29. 58.208 chi square distribution 82. 83, 96. 120 closed under differences 36 closed under finite intersections 36 closed under increasing limits 36 closure of a martingale 212 Cobb-Douglas distribution 44 completely convergent 74 conditional density 89, 203 conditional expectation 197, 200 ~ defined as Hilbert space projection 200 ~ defined in L1 202 ~ defined in L? 200 conditional probability 16 continuously differentiable 92 convergence (of random variables) - almost sure 142 —inL? 142 - inp mean 142 — in distribution 152 - in probability 143 — pointwise 141 convolution of functions 122 convolution product (of probability measures) 117 correlation coefficient 91 countable additivity 8, 35,77 countable cartesian product 70 covariance 73, 91 — and independence 91 ~ matrix 91 De Morgan's laws 12 density function -~oR 42,78 — on R™ 88 Dirac (mass. measure) discrete uniform distribution distribution function 39, 50 —onR" 87 distribution of a random variable 4, 27,50 Doob decomposition 220 Doob’s L? martingale inequalities 224, 225 Doob’s first martingale inequality 42, 156 22, 32 223 Doob’s optional sampling theorem 215 Doob’s upcrossing inequality 226 double exponential distribution 44 ~ characteristic function of 115 empirical distribution function 184 equivalence class 53 estimator 117, 132 event 3,8 expectation 27. 51,52, 67 — of asimple random variable 51 expectation rule 58,59, 80 exponential distribution 43, 53, 59, 95 ~ characteristic function of 108 exponential distribution ~ and Gamma distribution 123 Fatou’s lemma 53, 205 finite additivity 9 Fourier transform 103, 111 Fubini’s theorem 67.75 ~ (see also Tonelli-Fubini theorem) 67 Galton-McAlister distribution 44 gamma distribution 43, 83, 96, 120 — and sums of exponentials 123 ~ characteristic function of 109 — relation to x? 96, 120 gamma function 43 Gauss C.F. 125 Gaussian distribution = see Normal distribution 44 geometric distribution 23, 31 Glivenko-Cantelli theorem 185 Gosset, W. 97 Hilder inequality 206 Haar system 238 hazard rate 43, 63 Helly’s selection principle 157 hypergeometric distribution 22, 26 iid. 
(independent identically dis- tributed) 100, 173 iff (if and only if) 144 image of P by X50 independent ~ o-algebras 65 — events 15 ~ infinite sequence of random variables 65, 69 ~ pairwise 15 Index 253 = random variables 65 ~ random variables and their densities 39 indicator function 10.49 infinitely often (i.0.) 71 inner product 189 Jacobi’s transformation formula 92 Jacobian matrix 92 Jensen’s inequality 205 y’s continuity theorem 167 Lévy, P. 126 Laplace distribution — see double exponential distribution 44 Laplace, P. 125 law 50 Lebesgue measure -oR 77 -onR” 87 Lebesgue’s dominated convergence theorem 53, 205 lim inf 10 lim sup 10 linear estimator 132 linear regresion 131 logistic distribution 64 lognormal distribution 44 marginal densities 89 Markov’s inequality 29 martingale 211 ~ backwards martingales 233 ~ central limit theorem 235 ~ convergence theorems 229-231, 233 convergence with uniform integrabil- ity 231 measurable function 47 = jointly 67 measure preserving map 175 Mellin transform 115 Minkowski’s inequality Moivre, A. de 125 monotone class theorem 36, 37 monotone convergence theorem 52, 204 Monte Carlo approximation 176 multivariate normal 126 189, 207 natural numbers 30 negative binomial distribution 31 negligible set 37, 142 254 Index normal distribution 44. 60. 82, 93. 95-97, 111, 120, 125. 181 ~ characteristic function 120. 126 ~ multivariate 126 ~ non-degenerate 129 ~ simulation of 101 — standard 125 normed linear space 189 — complete 190 null set 37,142 107. 108, order statistics 100 orthogonal matrix 129 orthogonal vectors 191 orthonormal vectors 129 pairwise disjoint 8 pairwise independent 15 Pareto distribution 31 partition equation 17 Pascal distribution 31 point mass probability 42.156 Poisson distribution 23. 26, 30, 43, 119, 163 and conditional expectation 198 — approximation to the binomial 24 ~ characteristic function 106, 119 — convergence to the normal 170 positive semidefinite matrix 91 predictor variable 131 probability measure 4,8 projection operator (Hilbert space) 193 projections (Hilbert space) 193 Pythagorean theorem 191 Rademacher functions 237 Radon-Nikodym theorem 244, 246 random signs 240 random variable 4, 27 random walk on the integers 179 Rayleigh distribution 98 regression 131 — residuals 139 ~ simple linear regression 131 Riemann zeta function 31 Riesz-Fischer theorem 190 right continuous function 40 simple random variable 51 singleton 21 Slutsky’s theorem 161 Stone-Weierstrass theorem 112 stopping time 212 — bounded stopping time 213 strong law of large numbers 173. 175 233 — ergodic strong law of large numbers 176 -- Kolmogorov strong law of large numbers 175, 234 Student's t-distribution 97 subadditivity 13 submartingale 219 subspace (Hilbert space) 191 supermartingale 219 symmetric ~ density 84.97 ~ distribution 114 ~ random variable 84 tail g-algebra 72 tail event zero-one law 72 tightness 157 Tonelli-Fubini theorem 67 topological space 48 triangular distribution 48 uncorrelated random variables 131 uniform distribution 43. 80. 176 ~ characteristic function 106 — on the ball 99 uniform integrability 230 unimodal 45 uniqueness theorem for characteristic functions 111 uperossings 225 variance 29.58 weak convergence (of probability measures) 151 weak law of large numbers 178 Weibull distribution 43 zero-one law 72, 240 zeta distribution ~ see Pareto distribution 31 Universitext Aksoy, A; Khamsi, M. 
