Question Bank Research Methodology and Biostatistics BPT 402 1. Calculate Appropriate Measure of Skewness From The Following Data
Question Bank Research Methodology and Biostatistics BPT 402 1. Calculate Appropriate Measure of Skewness From The Following Data
Solution:
2. Calculate Karl Pearson’s Coefficient of skewness from the following data:
Solution:
3. The first four central moments of a distribution are 0, 2.5, 0.7 and 18.75.
Examine the skewness and kurtosis of the distribution.
Solution:
Solution:
The measure of dispersion which are expressed in terms of the original units of data
are termed as Absolute Measures. Such measures expressed in units of data are not
suitable for comparing the variability of the distributions or series expressed in
different units of measurement.
Solution:
6. The following table shows the grouped data, in classes, for the heights of 50
people.
height (in cm) - classes frequency
120 <- 130 2
130 <- 140 5
140 <- 150 25
150 <- 160 10
160 <- 170 8
Solution:
Let mi be the midpoint of the i th clss and fi be the corresponding frequency.
= 148.4
= √[ (2*(125-148.4)2+5*(135-148.4)2+25*(145-148.4)2+10*(155-
148.4)2+8*(165-148.4)2) /(50) ]
= 9.9
7. There are 3 urns each containing red and black marbles (see table below). You
draw one marble from Urn 1. If you draw a red marble from Urn 1, you make
your second draw from Urn 2. If you draw a black marble from Urn 1, you make
your second draw from Urn 3. What is the probability of drawing two marbles of
the same colour?
Solution:
8. Out of 7 consonants and 4 vowels, how many words of 3 consonants and 2 vowels
can be formed?
Solution:
9. In how many different ways can the letters of the word 'OPTICAL' be arranged
so that the vowels always come together?
Solution:
Solution:
11. Five randomly selected students took a math aptitude test before they began
their statistics course. What linear regression equation best predicts statistics
performance, based on math aptitude scores? If a student made an 80 on the
aptitude test, what grade would we expect her to make in statistics?
Solution:
Regression Equation
In the table below, the xi column shows scores on the aptitude test. Similarly, the
yi column shows statistics grades. The last two columns show deviations scores - the
difference between the student's score and the average score on each test. The last two
rows show sums and mean scores that we will use to conduct the regression analysis.
And for each student, we also need to compute the squares of the deviation scores (the
last two columns in the table below).
And finally, for each student, we need to compute the product of the deviation scores.
The regression equation is a linear equation of the form: ŷ = b 0 + b1x . To conduct a
regression analysis, we need to solve for b0 and b1. Computations are shown below.
Notice that all of our inputs for the regression analysis come from the above three
tables.
If a student made an 80 on the aptitude test, the estimated statistics grade (ŷ)
would be:
There are lot of sampling techniques which are grouped into two categories as
Probability Sampling
Probability Sampling
This Sampling technique uses randomization to make sure that every element of the
population gets an equal chance to be part of the selected sample. It’s alternatively known
as random sampling.
2. Stratified Sampling: This technique divides the elements of the population into small
subgroups (strata) based on the similarity in such a way that the elements within the
group are homogeneous and heterogeneous among the other subgroups formed. And
then the elements are randomly selected from each of these strata. We need to have
prior information about the population to create subgroups.
3. Cluster Sampling: Our entire population is divided into clusters or sections and then
the clusters are randomly selected. All the elements of the cluster are used for
sampling. Clusters are identified using details such as age, sex, location etc.
4. Multi-Stage Sampling: It is the combination of one or more methods described
above. Population is divided into multiple clusters and then these clusters are further
divided and grouped into various sub groups (strata) based on similarity. One or more
clusters can be randomly selected from each stratum. This process continues until the
cluster can’t be divided anymore.
Non-Probability Sampling
It does not rely on randomization. This technique is more reliant on the researcher’s ability to
select elements for a sample. Outcome of sampling might be biased and makes difficult for all
the elements of population to be part of the sample equally. This type of sampling is also
known as non-random sampling.
1. Convenience Sampling: Here the samples are selected based on the availability. This
method is used when the availability of sample is rare and also costly. So based on the
convenience samples are selected.
2. Purposive Sampling: This is based on the intention or the purpose of study. Only
those elements will be selected from the population which suits the best for the
purpose of our study.
3. Quota Sampling: This type of sampling depends of some pre-set standard. It selects
the representative sample from the population. Proportion of characteristics/ trait in
sample should be same as population. Elements are selected until exact proportions of
certain types of data is obtained or sufficient data in different categories is collected.
4. Referral /Snowball Sampling: This technique is used in the situations where the
population is completely unknown and rare. Therefore we will take the help from the
first element which we select for the population and ask him to recommend other
elements who will fit the description of the sample needed.
13. A doctor believes that the proportions of births in this country on each day of the
week are equal. A simple random sample of 700 births from a recent year is
selected, and the results are below. At a significance level of 0.01, is there enough
evidence to support the doctor’s claim? Draft Null and Alternate hypothesis to
state your findings.
Solution:
Degree of Freedom: k-1= 7-1=6
The null hypothesis H0: the population frequencies are equal to the expected
frequencies (to be calculated below).
The alternative hypothesis, Ha: the null hypothesis is false.
Theory Questions