Stats problems with solutions - Siddipet
Stats problems with solutions - Siddipet
60 70 76 74 65 62
64 65 77 73 76
80 90 64 64 63
78 68 65 70 87
Solution:
Mean (x̅ ) =
∑ of the observations
no . of observations
=
∑ xi
n
1491
= = 71
21
= 11th observation
= 70
Mode = 64 and 65
This is Bio-Modal case.
2. Find the mean weight of 470 infants born in a hospital in one year from the following data.
Solution:
I.e. ∑ fi and∑ fi × xi
MEAN (x̅ ) =
∑ fi × xi
∑ fi
MEAN (x̅ ) = 1543/470
n 1 x 1+ n 2 x 2
Combined mean =
n 1+ n 2
n 1 x 1+ n 2 x 2
Combined mean =
n 1+ n 2
=1600000/30
9.1 – 10.0 10
10.1 – 11.0 21
11.1 – 12.0 67
13.1 – 14.0 84
14.1 – 15.0 29
15.1 – 16.0 4
Solution:
Formula:
n
( – f ')i
Median= 2
Li+
fmed
step1. Calculating the total number of observations (N) and cumulative frequency
step2. Find the middle value observation I.e, N/2
step3. Find the frequency of the class interval previous to the median class interval. (f)
Step4. Finding median class interval. (fmed)
step5. Finding lower limit of the median class interval (Li)
step6. Finding length of the class interval (i).
Class interval Frequency Cumulative frequency
9.1 – 10.0 10 10
10.1 – 11.0 21 31
11.1 – 12.0 67 98
12.1 – 13.0 170 268
13.1 – 14.0 84 352
14.1 – 15.0 29 381
15.1 – 16.0 4 385
Total 385
Step7.
n
( – f ')i
Median= 2
Li+
fmed
= 12.1 + (94.5/170)
= 12.1+ 0.56
= 12.66 gm
Frequency 5 8 7 12 28 20 10 10
Solution:
10(28−12)
mode = 40+
2 X 28−12−20
= 40+6.666 = 46.67
6. If the mean and median of a moderately asymmetrical series are 26.8 and 27.9 respectively.
What would be its most probable mode?
Solution:
Solution:
i.e. Range
x̅ = 118
Xi |xi- x̅ |
115 3.2
117 1.2
121 3.2
120 2.2
118 0.2
Total 10
= 10/5
=2
= 2/118
=0.016
8. The distribution of age at the marriage of grooms with brides of age groups 15-39 is displayed.
Calculate the Mean deviation.
Age Group 15-19 19-23 23-27 27-31 31-35 35-39
(Years)
No of Grooms 8 59 47 23 4 4
Solution:
9. Find the mean respiratory rate per minute and its S.D, when in 9 cases the rate was found to
be 23, 22, 20, 24, 16, 17, 18, 19 and 21.
Solution:
formula for S.D.= sqr. of ( ∑(x- x̅ )2 /(ŋ-1)) ; If sample size is less than 30, divide it by n-1
x̅ =180 /9 = 20
step2. The deviation of each of the observation in the sample
Xi (xi – x̅ ) (xi - x̅ )2
23 3 9
22
2 4
20
0 0
24
4 16
16 -4 16
17
-3 9
18 -2 4
19 -1 1
21
1 1
Total 60
Step4: This sum ∑ (xi- x̅ )2 is divided by (n-1), If sample size is less than 30, divide it by n-1
∑ (xi- x̅ )2 = 60; n= 9
Solution:
Mean = Σxi / n
= 58/5
=11.6
th
(n+1)
Median = observation (since, n is odd)
2
= 3 rd observation
i.e. median = 11
= 11
Formula for S.D.= sqr. of ( ∑(xi- x̅ )2 /(ŋ-1)) ; If sample size is less than 30, divide it by n-1
Mean = ∑xi/n
x̅ =11.6
step2. The deviation of each of the observation in the sample from the mean, i.e. (xi- x) is
computed.
Step3. Squares of theses deviations are tabulated Sum of these squares i.e. ∑ (xi- x) 2 is
obtained
Xi (xi – x̅ ) (xi - x̅ )2
14 2.4 5.76
11
-0.6 0.36
11 -0.6 0.36
10 -1.6 2.56
12
0.4 0.16
Total 9.2
Step4: This sum ∑ (xi- x̅ )2 is divided by (n-1), If sample size is less than 30, divide it by n-1
∑ (xi- x̅ ) 2 =9.2; n= 5
= sqr. of (9.2/4)
= sqr. Of (2.3)
= 1.51
11. The calculate of S.D of intelligence Quotient (IQ) of 27 boys is given below as an example.
I.Q 0-20 20-40 40-60 60-80 80-100
Freq 3 4 3 4 13
.
Solution:
= 1750/27
x̅ = 64.81
= sqr. of (879.76)
= 29.66
12. The body surface area of five children are given. Calculate the mean, variance, standard deviation and
standard error.
1) Mean = ∑xi/n
= 961/5
x̅ =192.2
2) S.D
formula for S.D.= sqr. of ( ∑(xi - x̅ )2 /(ŋ-1)) ; If sample size is less than 30, divide it by n-1
Mean = ∑xi/n =1
step2. The deviation of each of the observation in the sample from the mean, i.e. (xi- x̅ ) is computed.
Step3. Squares of theses deviations are tabulate, Sum of these squares i.e. ∑ (xi - x̅ )2 is obtained
Xi (xi – x̅ ) (xi - x̅ )2
∑ (xi - x̅ )2 =13306.8; n= 5
= sqr. of (13306.8/4)
= sqr. Of (3326.7)
= 57.677
= (57.67)²= 3325.82
=57.67/√5
= 57.67/2.23
=25.86
13. Hb level of 10 individuals with mean=3, Median=89 and S.D.= 1.69. Describe the skewness of the data.
solution:
step1. skewness means frequency curve is not symmentrical about the Peak.
Skewness =3(mean-Median)/S.D.
by the problem:
mean=3
median= 89
standard deviation = 1.69
=3(3-89)/1.69
=-258/1.69
=-152
Hb 10 10 2.6
Diastolic BP 10 81 6.9
14. Following table represents average and standard deviation of Hb level and diastolic B.P of 10
individuals. Which one has more variation?
Solution:
step2. and also given that standard deviations of first and second group
=(2.6/10)x100
=26%
=(6.9/81)x100
=8.52%
15. In a series of boys the mean systolic blood pressure was 120 and S.D was 10.In the same series mean
height and S.D were 160 cm and 5 cm respectively. Find which character shows greater variation?
Solution:
step1. given mean of systolic blood pressure was 120 and mean height is 160cm
step2. and also given that standard deviations of first and second group
=(10/120)x100
=8.33%
=3.125%
conclusion:
since, Coefficient of variation in systolic blood pressure > Coefficient of variation in height.
16. The following figures give the crude birth rate per 1000 people in India from 1968 to 1980.
Calculate the C.V.
Crude Birth Rate(X):17.1, 16.5, 15.8, 15.2, 14.3
Solution:
Mean = ∑xi/n
= 78.9/5
x̅ = 15.78
= sqr. of (4.788/4)
= sqr. Of (1.197)
= 1.094
17. Find out which of the following is more consistent in scoring. Would you also accept him as a better
batsman? Why?
Solution:
Formula for C.V. = (σ/x) x 100
Mean = ∑xi/n
x̅ = 18.8
= sqr. of (812.8/4)
= sqr. Of (203.2)
= 14.25
Mean = ∑xi/n
= 82/5
x̅ = 16.4
= sqr. of (1049.2/4)
= sqr. Of (262.3)
= 16.19
Conclusion:
Coefficient of Variation in batsman A < Coefficient of Variation in batsman B
Hence, batsman A is better than batsman B.
18. From the following frequency distribution calculate the values of Quartiles.
i.e , Q1 ,Q2 & Q3
Marks in 0-10 10-20 20-30 30-40 40-50 50-60 60-70
Statistics
No. of 4 12 22 10 8 6 4
Students
Solution:
16.5−16
= 20 + X 10
22
= 20 + 0.22 =20.22
19. Calculate the quartile deviation and Median for the following data and interpret on the results.
Monthly 150-160 160-170 170-180 180-190 190-200
per capita
expenditure
classes (Rs.)
No. of 5 10 28 26 34
Families
Solution:
first quartile Q1= L+¿) × h
second quartile Q2= =L+¿) × h = Median
170-180 28 43
180-190 26 69
190-200 34 103
Total 103
Preparing table to find quartiles:
25.75−15
= 170 + X 10
28
= 170 + 3.83 =173.83
20. In a Community survey, estimate Sample Size when the prevalence rate of malnutrition
amongst children is about 40% with the allowable 5% error and 10% error.
Solution:
calculation:
p=40%
q=(100-40)=60%
allowable 5%
L=5% of 40%=2%
n= (4xpxq)/(L²)
L= 10% of 40%=4
n= (4xpxq)/L2
n= (4x40x60)/16
= 600
only 600 children are to be examine to estimate the malnutrition amongst children
21. The menstrual cycle of a woman follows normal distribution with 28 days of mean and 2 days
of Std. deviation. Assuming the menstrual cycle follows normal distribution, find what % of
woman will have menstrual cycle between 26 and 34 days.
Solution:
z = x- x̅ /S.D.
Z=26-28/2
= -1
Z = x- x̅ /S.D.
Z = 34-28/2
= 6/2
=3
= 0.3413+ 0.4987
= 0.84
Null hypothesis H₀: μ₁ = μ₂ (Anxiety Scores are same among Hypertensive &Normal)
Alternative hypothesis H1 : μ₁ ǂ μ₂ (Anxiety Scores are not same among Hypertensive &
Normal
Ӏ ZӀ = | X̄ ₁ - X̄ ₂| / √ ((σ₁²/n₁) + (σ₂²/n₂))
Inference:
23. A double blind controlled study was carried out with new drug (A)and standard drug (B)and the
results are given the drug (A)and (B)were administered to reduce blood pressure in patients suffering from
hypertension.
Can you infer that the new drug is more effective than the standard drug in reducing the B.P.?
B 12 2 50
Solution:
Null hypothesis HO: new drug and Std. are equally effective in reducing B.P.
Null hypothesis H1: new drug is more effective than Std. drug in reducing B.P.
x 1−x̅ ₂
√
3 3
| Z |= 61 2 6 2 2 = = = 5.89
+ √ 0.26 0.059
n1 n2
Inference:
Therefore, new drug is more effective than the standard drug in reducing blood pressure.
24. Mean Haemoglobin level of 20 children's was 10.6 gm/dl. Is it significantly from population
mean value of 11.0 gm/dl with a standard deviation of 0.15?
z
(Test at 5% level tab= 1.96)
Solution:
Null hypothesis (H0): (population mean) μ =11
Alternative hypothesis (H1): (population mean) μ ≠11 (two tailed )
Test statistic Z= x̅ - μ (σ /√n)
x̅ =10.6
μ = 11.0
σ = 0.15
n= 20
= |-0.4|/.033
Z=|-12.12|
IZI =12.12> 1.96, at 5% level of significance.
Hence, we reject the null hypothesis (H0)
25. The following table represents the study undertaken by the community medicine department to find
the association between hypertension and obesity.
Obesity Disease Total
Yes No
Yes 113 195 308
No 106 395 501
Total 219 590 809
Based on the data find out whether there is any association between obesity and hypertension
Note: Given the table value of×2=3.84at 5% level of significance for d.f.=1
Solution:
= 809×23652×23652 / 219×590×308×501
= 23.30
At 5% level of significance the calculated value ×2 is more than the table value.
26. In a cohort study of smoking and lung cancers, the observations are given in the table, find whether
There is any association between smoking and lung cancer.
Solution:
Inference:
Since, X2 observed > X2 0.01 ; X2 observed > X2 0.05 for DF=1, Reject Ho at both 1% & 5% level
of significance and infer that smoking is associated with lung cancer.
27. In a filarial survey, the no of people with or without filarial infestation in both sexes is
calculated find out whether there is any association between sex and filarial disease.
Solution:
Inference:
No association between sex and filarial disease.
28. A new vaccine on measles was tried in the community. Test the hypothesis whether the
vaccine is effective for measles or not
Measles vaccination Measles vaccination not Total
taken taken
Suffered 11 32 43
Solution:
Measles.
Measles.
2
2 N (ad −bc)
Χ =
(a +c)(b+d )(a+ b)(c+ d)
2
598×(11× 224−32× 331)
= = 18.91%
342× 256 × 43× 555
2 2
Χ Calculated is more than Χ table at both 1% and 5% level of significance.
Reject Ho and accept H1.
Inference:
Therefore there is an association between measles vaccination and suffering from measles.
1. The 1991 census population of India was 856 million and it was 1027 million in 2001 census.
Estimate the mid-year population in the year 2002 and 2010.
Solution:
Mid- year population in
6 6
61027 ×10 −826 ×10 1
P2002 = 1027 ×10 + ×1
10 3
= 1053.8 millions
Mid-year population in
6 6
1027 ×10 −826 ×10 1
P2010 = 1027 ×10 6 + ×9
10 3
= 1214.6 millions
2. The midyear population of a town was 2, 00,000 in 2000.The following vital rates were observed
during the year in that town.
Solution:
total no . of deaths
b) Crude death rate = ×1000
mid year population
= 7 x 2, 00,000 / 1000
=1400
Total no . of infant deaths
c) Infant mortality rate = ×1000
total no . of live births
= 312
= 312 – 125
= 187
3. The midyear population of the town was 4 lakhs in 2003. The following vital events were observed
During the same year. Calculate the vital rates.
Solution:
12800
= × 1000 = 32/1000 population
400000
total no . of deaths
b) Crude death rate = × 1000
Mid year population
4400
= = 11/1000 population
400000
c) Growth rate = CBR – CDR = 32-11 = (21÷1000) ×100=2.1٪
Total no . of infant deaths 1050 82
d) Infant mortality rate = ×1000= ×1000= live births
total no . of live births 12800 1000
384 30
= ×1000= live births
12800 1000
4. The 1991 census population of a town was 3, 20,000 and it was 3, 40,000 in 2001 census.
a) Total no. of live births = 12134
b) Total no. of deaths = 3467
c) Total no. of infants deaths = 960
d) Total no. of neonatal deaths = 360
Calculate all possible vital rates for the town in the year 2004.
Solution:
340000−320000 1
Mid-year population = 340000 + ×3 = 346667
10 3
12134 35
= × 1000= live births
346667 1000
total no . of deaths
b) Crude death rate = ×1000
mid year population
3467 10
= × 1000= live births
346667 1000
25
c) Growth rate = CBR- CDR = 35 – 10 = ×100=2.5 %
1000
79
¿ live births
1000
29
¿ live births
1000
f) Post neonatal mortality rate = IMR-NMR = 79-29 = 50/1000 live birth
5. The 1996 mid-population of a PHC area was 32000.the following vital events were recorded
1996 in that area. calculate all possible rates from 1996
a)total no of live births :864
b)total no of deaths:288
c)total no of infants deaths:63
d)total no of neonataldeaths:26
Solution:
864 27
= ×1000= live births
32000 1000
total no . of deaths
b) Crude death rate = ×1000
mid year population
288 9
= ×1000= live births
32000 1000
18
c) Growth rate = CBR- CDR = 27 – 9 = ×100=1.8 %
1000
30
¿ live births
1000
Solution:
= 26 x 2, 28,000 / 1000
= 5928
total no . of deaths
b) Crude death rate = ×1000
mid year population
= 10 x 2, 28,000 / 1000
=2280
= 343.8 ~= 344
= 12 x 5928/1000 = 63.5 ~= 66
= 344 – 66
= 278
Solution:
3200 21
= ×1000= live births
150000 1000
total no . of deaths
b) Crude death rate = ×1000
mid year population
1400 9
= ×1000= live births
150000 1000
12
c) Growth rate = CBR- CDR = 21 – 9 = ×100=1.2 %
1000
3
¿ live births
1000
8. In a population of 10,000 there were 200 cases of TB on 1stjan 2003. There were 40 new cases of TB
during the same year, out of which 10 died due to TB, 15 were cured and 5 migrated to another
community. Calculate the following.
240
Period prevalence = ×100=2.4 %
10000
210
Point prevalence = ×100=2.1 %
10000
40
Incidence of TB in 2003 = × 1000=4.8 per 1000 population
9800
10
Case fatality rate = ×100=4.1 %
240