2.1 Sampling Methods: Unit-Ii
2.1 Sampling Methods: Unit-Ii
RaviSankar Page: 1
I BSc Geography (English Medium) Mob.:9842007798 Dept.of Statistics
UNIT-II
Examples of population: -
population to refer not only to people. Finally, the population is set of all items/units that
have been chosen for the study.
A population is said to be infinite if it has infinite number of units. For example, the
number of stars in the sky, the number of people seeing the Television programmes etc.,
Census Method:
Information on population can be collected in two ways – census method and sample
method. In census method, every element of the population is included in the investigation.
For example, if we study the average annual income of the families of a particular village or
area, and if there are 1000 families in that area, we must study the income of all 1000 families.
In this method no family is left out, as each family is a unit.
1. The population of India includes the estimated population of entire Kachchh district,
Morvi, Maliya-Miyana and Wankaner talukas of Rajkot district, Jodiya taluka of
Jamanagar district of Gujarat State and entire Kinnaur district of Himachal Pradesh
where population enumeration of Census of India 2001 could not be conducted due to
natural calamity.
2. For working out density of India, the entire area and population of those portions of
Jammu and Kashmir which are under illegal occupation of Pakistan and China have not
been taken into account.
3. Figures shown against Population in the age-group 0-6 and Literates do not include the
figures of entire Kachchh district, Morvi, Maliya-Miyana and Wankaner talukas of
18BGE24A: Allied: Statistics-II UNIT-II Handled & Prepared by: Dr.S.RaviSankar Page: 4
I BSc Geography (English Medium) Mob.:9842007798 Dept.of Statistics
Rajkot district, Jodiya taluka of Jamanagar district and entire Kinnaur district of
Himachal Pradesh where population enumeration of Census of India 2001 could not be
conducted due to natural calamity.
4. Figures shown against Himachal Pradesh have been arrived at after including the
estimated figures of entire Kinnaur district of Himachal Pradesh where the population
enumeration of Census of India 2001 could not be conducted due to natural calamity.
5. Figures shown against Gujarat have been arrived at after including the estimated figures
of entire Kachchh district, Morvi, Maliya-Miyana and Wankaner talukas of Rajkot
district, Jodiya taluka of Jamnagar district of Gujarat State where the population
enumeration of Census of India 2001 could not be conducted due to natural calamity.
sampling unit by a number. Such a list or map is called sampling frame. A list of voters, a
list of house holders, a list of villages in a district, a list of farmers etc. are a few examples of
sampling frame.
Reasons for selecting a sample:
Sampling is inevitable in the following situations:
• Complete enumerations are practically impossible when the population is infinite.
• When the results are required in a short time.
• When the area of survey is wide.
• When resources for survey are limited particularly in respect of money and trained
persons.
• When the item or unit is destroyed under investigation.
Parameters and Statistics:
We can describe samples and populations by using measures such as the mean, median,
mode and standard deviation. When these terms describe the characteristics of a population,
they are called Parameters. When they describe the characteristics of a sample, they are called
Statistics. A parameter is a characteristic of a population and a statistic is a characteristic
of a sample. Since samples are subsets of population statistics provide estimates of the
parameters. That is, when the parameters are unknown, they are estimated from the
values of the statistics.
In general, we use Greek or capital letters for population parameters and lower
case letters to denote sample statistics.
- N, μ, σ are the standard symbols for the size, mean, S.D, of population.
- n, x bar, s are the standard symbol for the size, mean, S.D of sample respectively.
Principles of Sampling:
Samples have to provide good estimates for the Population Parameters. The following
principles tell us that the sample methods provide such good estimates.
ii. Principle of Inertia of large numbers: Other things being equal, as the sample size
increases, the results tend to be more accurate and reliable.
iii. Principle of Validity: This states that the sampling methods provide valid estimates
about the population parameters.
iv. Principle of Optimisation: This principle takes into account the desirability of
obtaining a sampling design which gives optimum results. This minimizes the risk
or loss of the sampling design. The foremost purpose of sampling is to gather
maximum information about the population under consideration at minimum cost,
time and human power. This is best achieved when the sample contains all the
properties of the population.
18BGE24A: Allied: Statistics-II UNIT-II Handled & Prepared by: Dr.S.RaviSankar Page: 6
I BSc Geography (English Medium) Mob.:9842007798 Dept.of Statistics
Non-Probability Sampling:
In this method, the population units may enter the sample more
than once. (ie.,) the units once selected is returned back to the
population before the next draw.
….. 999 are assigned. We may start at any place and may go on in any direction
such as column wise or row- wise in a random number table. But consecutive
numbers are to be used.
On the basis of the size of the population and the random number table
available with us, we proceed according to our convenience. If any random
number is greater than the population size N, then N can be subtracted from
the random number drawn. This can be repeatedly until the number is less than
N or equal to N.
Example 1:
In an area there are 500 families. Using the following extract from a
table of random numbers select a sample of 15 families to find out the standard
of living of those families in that area.
4652 3819 8431 2150 2352 2472 0043 3488
9031 7617 1220 4129 7148 1943 4890 1749
In the above random number table, we can start from any row or
column and read three digit numbers continuously row-wise or column wise.
Since some numbers are greater than 500, we subtract 500 from those
numbers and we rewrite the selected numbers as follows:
Population
Stratum ....
n1 n2 nL
A sample size being ni from ith stratum (i=1, 2,…,L) is independently taken by
Simple Random Sampling in such way that n1+n2+… +nL = n. A sample obtained using this
procedure is called a Stratified Random Sample.
To determine the sample size for each stratum, there are two methods:
i)Proportionate Allocation: In proportionate allocation, sample size is determined as
proportionate to stratum size. If the stratum size i s large, that stratum`will get more
representation in the sample. If the stratum size is small, that stratum will get less
representation in the sample.
The sample size for the ith stratum can be determined using
the formula ni = (n/N)* Ni . If we put i=1,2,3,... L , we get n1= (n/N)* N1 ,
ii) Optimum Allocation: The optimum allocation method uses variation in the stratum
and cost to determine the stratum sample size ni .
Example 1 : To study about the introduction of NEET exam, the opinions are collected from 3
schools. The strength of the schools are 2000, 2500 and 4000. It is fixed that the sample size
is 170. Calculate the sample size for each school?
Solution
Here N = 2000 + 2500 + 4000 = 8500 and n = 170 then n1 = n2 = n13 = ?
N1 = 2000, N2 = 2500, N3 = 4000
School 1: n1 = (n/N)× N1 = ( 170 / 8500 ) ×2000 = 40
School 2: n2 = (n/N)× N2 = ( 170 / 8500 ) × 2500 = 50
School 3: n3 = (n/N)× N3 = ( 170 / 8500 ) × 4000 = 80
Therefore, 40 students from school 1;
50 students from school 2 and
80 students from school 3
are to be selected using Simple Random Sampling to obtain the required stratified
random sample.
The main objective of stratification is to give a better cross-section of the population
for a higher degree of relative precision. The criteria used for stratification are States, age
and sex, academic ability, marital status etc,. In many practical situations, when it is difficult to
stratify with respect to the characteristic under study, administrative convenience may be
considered as the basis for stratification.
Merits
⚫ It provides a chance to study of all the sub-populations separately.
⚫ An optimum size of the sample can be determined with a given
cost, precision and reliability.
⚫ It is a more precise sample.
⚫ Representation of sub groups in the population.
⚫ Biases reduced and greater precise.
Limitations
⚫ There is a possibility of faulty stratification and hence the accuracy may be lost.
⚫ Proportionate stratification requires accurate information on the
proportion of population in each stratum.
18BGE24A: Allied: Statistics-II UNIT-II Handled & Prepared by: Dr.S.RaviSankar Page: 13
I BSc Geography (English Medium) Mob.:9842007798 Dept.of Statistics
… … … … … … … …
Thus, with selection (random) of the first unit, the whole sample is selected automatically.
As the first unit could have been any of the k units, the technique will generate ‘k’ systematic sample
units with equal probability. If N is not an integral multiple of n, then sizes of a few possible
systematic samples may vary by one unit.
Note:
This sample is also called quasi random sample, since, the first unit only is selected at
random and all the subsequent units are not selected randomly.
Merits:
⚫ This method is simple and convenient.
⚫ Less time consuming.
⚫ It can be used in infinite population.
Limitation:
⚫ Since, it is a quasi random sampling, the sample may not be a true
representative sample of the population under study.
18BGE24A: Allied: Statistics-II UNIT-II Handled & Prepared by: Dr.S.RaviSankar Page: 14
I BSc Geography (English Medium) Mob.:9842007798 Dept.of Statistics
Example 2
The first sampling interval consists of numbers 1 to 20. If the randomly selected number
(random starter) is 7, the systematic sample will consist units corresponding to positions (serial
numbers) : 7, 27, 47, 67, 87, 107, 127, 147, 167, 187.
Applications:
⚫ Systematic sampling is preferably used when the information is to be collected from trees
in a highway, houses in blocks, etc,.
⚫ This method is often used in industry, where an item is selected for testing from a
production line (say, every fifteenth item in the order of production) to ensure that
equipments are working satisfactorily.
⚫ This technique could also be used in a sample survey for interviewing people. A market
researcher might select every 10th person who enters a particular store, after selecting a
person at random as a random start.
Quota
Sampling
Merits:
⚫ Useful for pilot study.
⚫ Use the results that are easily available.
⚫ Processes of picking people in the most convenient and faster way to immediately
get their reactions to a certain hot and controversial topic.
⚫ Minimum time needed and minimum cost incurs.
18BGE24A: Allied: Statistics-II UNIT-II Handled & Prepared by: Dr.S.RaviSankar Page: 15
I BSc Geography (English Medium) Mob.:9842007798 Dept.of Statistics
Limitations:
⚫ High risk of selection bias.
⚫ May provide misleading information.
⚫ Not representative sample. Errors occur in the form of the
members of the population who are infrequent or non-users of that
location and who are not.
Note: The use of convenience sampling technique is discouraged by many
researchers dueto inability to generalize research findings, the relevance of bias
and high sampling error. Nevertheless, convenience sampling may be the only option
available in certain situations. For example, “it may be that a researcher intends to
study the ‘customer satisfaction of Jet Airways’ he uses the convenient sampling
because he has been able to negotiate access through available contacts”.
The investigator believes that in his opinion, some objects are the best representative of
the population than others. It involves “hand picking” of sampling units. That is the interviewer
uses his judgment in the selection of the sample that who should represent the population to
serve the investigator’s purpose. It is usually used when a limited number of individuals possess
the trait of interest. This type of sampling is also known as purposive sampling. This is useful
when selecting specific people, specific events, specific prices of data, etc. For example,
selecting members for a competition like quiz, oratorical contest to represent a school.
18BGE24A: Allied: Statistics-II UNIT-II Handled & Prepared by: Dr.S.RaviSankar Page: 16
I BSc Geography (English Medium) Mob.:9842007798 Dept.of Statistics
Merits:
⚫ Low expense.
⚫ Minimum time needed.
⚫ Easy
Limitations:
⚫ Highly subjective.
⚫ Generalization is not appropriate.
⚫ Certain members of the population will have a smaller chance or no
chance of selection compared to others.
⚫ This method does not give representative part of the population, since
favoritism is involved.
2.8.4 Quota Sampling
This is another non-probability sampling method. In this method, the
population is divided into different groups and the interviewer assign quotas to
each group. The selection of individuals from each group is based on the judgment
of the interviewer. This type of sampling is called quota sampling. Specified sizes of
number of certain types of people are included in the sample.
Merits:
⚫ The selection of the sample in this method is quick, easy and cheaper.
⚫ May control sample characteristics.
⚫ More chance of representative.
Limitations:
⚫ Selection bias.
⚫ The sample is not a true representative and statistical properties cannot
beapplied.
Example
A selection committee wants to compose a cricket team (11 players) for a test match.
Pace Wicket
Groups Spinners All-rounders Batsmen
bowlers keepers
Players in Players in Players in this Players in Players in
this category this category this this
category category category
Quota 2 3 3 2 1
Select the players using judgment sampling to fulfill the requirement
of the respective quota.