P&S Mod3&4
P&S Mod3&4
Firm 1 2 3 4 5 6 7 8 9 10
Advertisemen 11 13 14 16 16 15 15 14 13 13
t Exp. (Rs. In
Lakhs)
Sales Volume 50 50 55 60 65 65 65 60 60 50
(Rs. In Lakhs)
Interpretation: From the above calculation it is very clear that there is a high degree of
positive correlation i.e. r = 0.7866, between the two variables. i.e. Increase in
advertisement expenses leads to increased sales volume
2. Find the correlation coefficient between age and playing habits of the following
students using Karl Pearson’s coefficient of correlation method
Age 15 16 17 18 19 20
Number 250 200 150 120 100 80
of
students
Regular 200 150 90 48 30 12
Players
Interpretation: From the above calculation it is very clear that there is high degree of
negative correlation i.e. r = -0.9912, between the two variables of age and playing
habits. i.e. Playing habits among students decreases when their age increases.
3. Find Karl Pearson’s coefficient of correlation between capital employed and profit
obtained from the following data.
Capital 10 20 30 40 50 60 70 80 90 100
Employed
(Rs. In
Crore)
Profit (Rs. 2 4 8 5 10 15 14 20 22 50
In Crore)
Solution: Let us assume that capital employed is variable X and profit is variable Y.
4. A computer while calculating the correlation coefficient between the variable X and Y
obtained the following results: N = 30; ∑X = 120 ∑X2 = 600 ∑Y = 90 ∑Y2 = 250 ∑XY =
335 It was, however, later discovered at the time of checking that it had copied down
two pairs of observations as: (X, Y) : (8, 10) (12, 7)
While the correct values were: (X, Y) : (8, 12) (10, 8)
Obtain the correct value of the correlation coefficient between X and Y
Name of A B C D E F G H I
student
s
Internal 51 68 73 46 50 65 47 38 60
Exam
External 49 72 74 44 58 66 50 30 35
Exam
Interpretation: From the above calculation it is very clear that there is high degree of
positive correlation i.e. R = 0.7833, between two exams. It means there is a high degree of
positive correlation between the internal exam and external exam of the students.
X 82 68 75 61 68 73 85 68
Y 81 71 71 68 62 69 80 70
Solution: In the problem, we find there are repetitions of ranks. Value of X = 68 repeated 3
times and Value of Y = 71 repeated 2 times. Therefore, we need to compute the adjustment
factor to be added to the value of ∑ D 2.
4. The coefficient of rank correlation of marks obtained by 10 students in English and
Economics was found to be 0.5. It was later discovered that the difference in marks in
two subjects obtained by one of the student was wrongly taken as 3 instead of 7.
Find the correct coefficient of rank correlation.
7. For the variables X and Y the two lines of regression are given by 3x +2y –25 = 0 and 6x
+ y – 30 = 0.
(i) Identify the lines of regression of X on Y and Y on X.
(ii) Find the means of X and Y.
(iii) Find the correlation coefficient between X and Y.
Solution:
Let 3x +2y –25 = 0 (1)
6x + y – 30 = 0 (2)
8. For bivariate distribution, Mean of x = 65, Mean of y = 53 s.d of x = 4.7 s.d of y = 5.2,
Correlation Coeff = 0.78 Find two regression equations and estimate
i) The most probable value of y when x = 63
ii) The most probable value of x when y = 50
9. The regression equation of y on x is 𝑥 + 3𝑦 − 88 = 0 and that of regression equation of x
on y is 2𝑥 + 𝑦 − 71 = 0. Find
i) Mean values of x and y
ii) Coefficient of correlation
10. Given two regression equations 3𝑥 − 𝑦 − 25 = 0 and 2𝑥 − 3𝑦 + 30 = 0. Find
i) Mean values of x and y
ii) Coefficient of correlation
iii) s.d of x if s.d of y is 2
Multiple Regression:
Solution:
3. Find the multiple linear regression equation of X 1 on X 2 and X 3 from the data
relating to three variables given below:
X1 4 6 7 9 13 15
X2 15 12 8 6 4 8
X3 30 24 20 14 10 4
4.
σ 1=10 , σ 2=8 , σ 3 =5
X3 = annual income
i. Calculate the satisfaction level of a person who passed out at the age of 45
has living children, has an annual income of rupees 12,000 and has only
one social activity in a week
ii. Interpret the value a = −20
iii. Would a person be more satisfied with the additional income of Rs. 2000?
y x1 x2
140 60 22
155 62 25
159 67 24
179 70 20
192 71 15
200 72 14
212 75 14
215 78 11
7. The salary of a person in an organisation has to be regressed in terms of
experience (X1) and mistakes (X2). If it is given that the values Y =3.3; X 1= 2.7;
X 2= 13.7; S y = 2.1; S 1 = 1.5; S 2 = 2.6 and the zero order correlations : r y 1 = 0.5; r y 2
= −0.3; r 12 = −0.47; Find the multiple linear regression and interpret the results.
Interpretation:
X1 X2 Y X 12 X1 X2 X 22 X1Y X2Y
4 1 7 16 4 1 28 7
7 2 12 49 14 4 84 24
9 5 17 81 45 25 153 85
12 8 20 144 96 64 240 160
10. Find the multiple regression coefficient for the below data
5. The random variable X has binomial distribution B 40,0.2 ( ) . Determine each of the
following. a) P(X = 5) . b) P ( X < 13). c) P ( X >10 ) . d) P(8 < X<14 ) .
6. The probability of a customer ordering the colour of a particular model of new car in
silver is 0.2 . Find the probability that in next 30 random orders there will be …
7. The probability that Anna wakes up before her alarm rings is 0.4 .
a) Find the mean and variance of the number of times that Anna wakes up before her
alarm rings, in the next 7 mornings.
b) Determine the probability that in the next 7 mornings, Anna will wake up before
her alarm rings …
i. … at most once.
c) Calculate the probability that in the next 4 weeks Anna will wake up before her
alarm rings on exactly 7 mornings.
8. A box contains 50 coloured drawing. The proportion of red drawing pins in these
boxes is 3 out of 20 .
a) Find the mean and variance of the number of red drawing pins in these boxes.
9. Ama is a supermarket cashier. The probability that Ama will have to rescan a
shopping item because the barcode reader failed to “read” it, is 0.15. A shopping item
whose bar code is read on the first attempt is called a “first time item”. Ama scans 40
shopping items.
a) Determine the probability that Ama will have more than 31 but at most 37 “first
time items”. Bama is a less experienced supermarket cashier. The probability that
Bama will scan a shopping item on her first attempt is 0.7 . Bama scans 50 shopping
items.
b) Determine the probability that Bama will have at least 35 “first time items”.
10. A geologist is looking for fossils in rocks. In a certain area it has been established
over a long period of time that 10% of the rocks contain fossils. The geologist selects
twenty rocks from this area. a) State 2 conditions that must be apply in order for a
binomial model to be valid. Find the probability that in the geologist’s sample there
will be … b) … one rock containing fossils. c) … at least one rock containing fossils.
The geologist selects a new sample of n rocks. He wants to have at least a 95% chance
that his new sample will contain fossils. d) Determine the smallest value of n .
Poisson Distribution
1. In the manufacture of glassware, bubbles can occur in the glass which reduces the
status of the glassware to that of a ‘second’. If, on average, one in every 1000 items
produced has a bubble, calculate the probability that exactly six items in a batch of
three thousand are seconds.
2. A manufacturer produces light-bulbs that are packed into boxes of 100. If quality
control studies indicate that 0.5% of the light-bulbs produced are defective, what
percentage of the boxes will contain: (a) no defective? (b) 2 or more defectives?
3. Suppose it has been observed that, on average, 180 cars per hour pass a specified
point on a particular road in the morning rush hour. Due to impending roadworks it is
estimated that congestion will occur closer to the city centre if more than 5 cars pass
the point in any one minute. What is the probability of congestion occurring?
4. The mean number of bacteria per millilitre of a liquid is known to be 6. Find the
probability that in 1 ml of the liquid, there will be: (a) 0, (b) 1, (c) 2, (d) 3, (e) less
than 4, (f) 6 bacteria.
5. A factory uses tools of a particular type. From time to time failures in these tools
occur and they need to be replaced. The number of such failures in a day has a
Poisson distribution with mean 1.25. At the beginning of a particular day there are
five replacement tools in stock. A new delivery of replacements will arrive after four
days. If all five spares are used before the new delivery arrives then further
replacements cannot be made until the delivery arrives. Find (a) the probability that
three replacements are required over the next four days. (b) the expected number of
replacements actually made over the next four days.
6. Suppose vehicles arrive at a signalised road intersection at an average rate of 360 per
hour and the cycle of the traffic lights is set at 40 seconds. In what percentage of
cycles will the number of vehicles arriving be (a) exactly 5, (b) less than 5? If, after
the lights change to green, there is time to clear only 5 vehicles before the signal
changes to red again, what is the probability that waiting vehicles are not cleared in
one cycle?
7. A manufacturer sells a certain article in batches of 5000. By agreement with a
customer the following method of inspection is adopted: A sample of 100 items is
drawn at random from each batch and inspected. If the sample contains 4 or fewer
defective items, then the batch is accepted by the customer. If more than 4 defectives
are found, every item in the batch is inspected. If inspection costs are 75 p per
hundred articles, and the manufacturer normally produces 2% of defective articles,
find the average inspection costs per batch.
10. A shop sells a particular make of video recorder. (a) Assuming that the weekly
demand for the video recorder is a Poisson variable with mean 3, find the probability
that the shop sells (i) at least 3 in a week, (ii) at most 7 in a week, (iii) more than 20 in
a month (4 weeks). Stocks are replenished only at the beginning of each month. (b)
Find the minimum number that should be in stock at the beginning of a month so that
the shop can be at least 95% sure of being able to meet the demands during the month.
Normal Distribution
4. If Z ~ N( 0,1) , find a such that (a) P (Z < a) = 0.90 (b) P( Z > a) = 0.25
5. Eggs laid by a particular chicken are known to have lengths normally distributed,
with mean 6 cm and standard deviation 1.4 cm. What is the probability of: (a)
finding an egg bigger than 8 cm in length; (b) finding an egg smaller than 5 cm in
length?
6. If X ~ N(4,9) , find (a) P(X > 6) (b) P(X>1 )
7. The lifetimes of a certain brand of car tyres, in km, are Normally distributed with
a mean of 7500 . Find the standard deviation, if 5% of these tyres last less than
6000 km.
8. The lengths of pine needles, in cm, are Normally distributed. It is further given
that 11.51% of these pine needles are shorter than 6.2 cm and 3.59% are longer
than 9.5 cm. Find the mean and the standard deviation of the length of these pine
needles.
9. The weights of newly born kittens are Normally distributed. 4.95% of newly born
kittens are heavier than 122 grams and 10.56% are lighter than 93 grams. Find the
mean and the standard deviation of the weights of newly born kittens.
10. An airline operates between Manchester and Madrid. The flight time may be
modelled by a Normal distribution with mean of 85 minutes and standard
deviation 8 . In order to boost sales for the service, the airline decides to refund
the fares if a flight time exceeds the mean flight time by t minutes. The airline
does not want to refund more than 0.005 of the fares. Find the value of t , correct
to the nearest minute.
Gamma Distribution, Exponential Distribution, Weibull Distribution
1. Suppose that on an average 1 customer per minute arrive at a shop. What is the
probability that the shopkeeper will wait more than 5 minutes before (i) both
of the first two customers arrive, and (ii) the first customer arrive? Assume
that waiting times follows gamma distribution.
2. Let X∼N(2,4)X∼N(2,4) and Y=3−2XY=3−2X.
Find P(X>1)P(X>1).
Find P(−2<Y<1)P(−2<Y<1).
Find P(X>2|Y<1)P(X>2|Y<1).
5. car cooling systems are controlled by electrically driven fans. Assuming that
the lifetime T in hours of a particular make of fan can be modelled by an
exponential distribution with λ = 0.0003 find the proportion of fans which will
give at least 10000 hours service. If the fan is redesigned so that its lifetime
may be modelled by an exponential distribution with λ = 0.00035, would you
expect more fans or fewer to give at least 10000 hours service?
6. The time intervals between successive barges passing a certain point on a busy
waterway have an exponential distribution with mean 8 minutes. (a) Find the
probability that the time interval between two successive barges is less than 5
minutes. (b) Find a time interval t such that we can be 95% sure that the time
interval between two successive barges will be greater than t.
7.
8. The lifetime X (in hundreds of hours) of a certain type of vacuum
tube has a Weibull distribution with parameters α=2 and β=3. Compute
the following:
a. E(X) and V(X)
b. P(X≤5)
c. P(1.8≤X≤5)
d. P(X≥3).
9. Assume that the life of a packaged magnetic disk exposed to corrosive gases
has a Weibull distribution with α=300 hours and β=0.5.
Calculate the probability that
a. a disk lasts at least 600 hours,
b. a disk fails before 500 hours.
10. A monitor issues a warning signal when an action is needed as part of a
production process. The interval, 𝑋 hours, between successive signals follows
an exponential distribution with parameter 0.08. (i) Find the probability that
the interval between the next two signals is: a. Between 10 and 20 hours; b.
Less than two hours; c. Longer than 50 hours. (ii) State the mean and standard
deviation of the intervals between successive signals. (iii) Following a
warning signal, what is the longest time the production process could be left
unsupervised whilst ensuring the probability of missing the next signal is less
than 0.01?