Elementary Statistics
Elementary Statistics
Level of Confidence
The key idea encompassed in the Central Limit Theorem is that when a population is repeatedly sampled, the
average value of the attribute obtained by those samples is equal to the true population value.
Measures the certainty of the estimate.
*Widely Used Confidence Level
90%, 95%, & 99%
Example: Using a 95%, what does it mean?
95 out of 100 samples will have the true population value within the range of precision specified.
Margin of Precision or Error
It is the range which the true value of the population is estimated to be. This range is often presented in
percentage form (e.g. ± 5%).
Measures the uncertainty of the estimate.
Example. If the researcher finds that 60% of the sample students adopted the synchronous mode of
classes with a ±5% margin of error. It only tells us that between 55% and 65% in the population have adopted
the modality.
Degree of Variability of attributes of Being Measured
Refers to the distribution of attributes in the population. The more heterogeneous a population, the larger the
sample size required to obtain a given level of precision. The less variable (more homogeneous) a population,
the smaller the sample size.
*Commonly used percent of variability is: 20%, 80%, 50%.
SAMPLE SIZE DETERMINATION FORMULA
SAMPLING
The process of choosing individual members or a portion of the population in order to draw statistical
conclusions and estimate the characteristics of the entire population.
TWO GENERAL METHODS OF SAMPLING
Probability sampling involves random selection, allowing you to make strong statistical inferences about the
whole group.
Non-probability sampling involves non-random selection based on convenience or other criteria, allowing you
to easily collect data.
Probability Sampling
1. Simple Random Sampling In a simple random sample, each member/subject of the target population has
equal chance of being selected or chosen to be part of the target number of samples. Your sampling frame
(Complete lists of individual or elements—depends on the subject of interest) should include the whole
population.
Common techniques: Lottery technique, Fish ball Technique
For example, in an organization of 500 employees, if the HR team decides on conducting team building
activities, it is highly likely that they would prefer picking chits out of a bowl. In this case, each of the 500
employees has an equal opportunity of being selected.
2. Systematic Random Sampling Using systematic random sampling, researchers select sample members from
a population at regular intervals. It involves the selection of a starting point for the sample and a repeatable
sample size at regular intervals.
For example, a researcher intends to collect a systematic sample of 500 people in a population of 5000.
He/she numbers each element of the population from 1-5000 and will choose every 10th individual to be a part
of the sample (Total population/ Sample Size = 5000/500 = 10).
3. Stratified Random Sampling is a technique in which the researcher divides the population into subgroups
(strata) that do not overlap but are representative of the full population. During sampling, it is possible to
organize these groups and then draw samples from each group independently.
Example: The company has 800 female employees and 200 male employees. You want to ensure that
the sample reflects the gender balance of the company, so you sort the population into two strata based on
gender. Then you use random sampling on each group, selecting 80 women and 20 men, which gives you a
representative sample of 100 people.
4. Cluster Random sampling also entails splitting the population into subgroups, but each subgroup should
have characteristics that are similar to those of the entire sample. Instead of randomly selecting individuals from
each subgroup, entire subgroups are selected.
Example: The company has offices in 10 cities across the country (all with roughly the same number of
employees in similar roles). You don’t have the capacity to travel to every office to collect your data, so you use
random sampling to select 3 offices – these are your clusters.
5. Multi-Stage Random Sampling Using combination of random sampling designs in order to obtain needed
samples based on a large population.
Non-Random Sampling
1. Convenience Sampling- This type of sampling, researchers prefer participants as per their own convenience.
The researcher selects the closest live persons as respondents. In convenience sampling, subjects who are
readily accessible or available to the researcher are selected.
2. Purposive Sampling- In this type of sampling, the researcher chooses the participants as per his/her own
judgment, keeping back in mind the purpose of the study. It uses the judgment of an expert in selecting cases or
it selects cases with a specific purpose in mind.
3. Quota Sampling- You select your sample according to some fixed quota. This type of sampling is somehow
related to stratified sampling. Make sure that the sample represents each group or stratum of the population.
Unlike the stratified sampling, the researcher in quota sampling method selects the subjects available
immediately fulfilling the criteria.
4. Snowball sampling- Also called "chain referral sampling,” in this method, the sample is actually collected in
various stages. This method is appropriate when the members of a special population are difficult to locate. It
begins by the collection of data from one or more contacts usually known to the person collecting the data. At
the end of the data collection process (e.g., questionnaire, survey, or interview), the data collector asks the
respondent to provide contact information for other potential respondents. These potential respondents are
contacted, interviewed and further asked to provide more contacts. This process goes on till the purpose of the
researcher is achieved.
Sample Scenario:
1. A researcher looking to analyze the characteristics of people belonging to different annual income divisions
will create strata (groups) according to the annual family income. E.g. – less than $20,000, $21,000 – $30,000,
$31,000 to $40,000, $41,000 to $50,000, etc. By doing this, the researcher concludes the characteristics of
people belonging to different income groups. Marketers can analyze which income groups to target and which
ones to eliminate to create a roadmap that would bear fruitful results.
Answer: Stratified Random Sampling
2. Company A distribute leaflets of upcoming events or promotion of their new product toothpaste – they do
that by standing at the mall entrance and giving out pamphlets randomly.
Answer: Convenience Sampling
3. All employees of the company are listed in alphabetical order. From the first 10 numbers, you randomly
select a starting point: number 6. From number 6 onwards, every 10th person on the list is selected (6, 16, 26,
36, and so on), and you end up with a sample of 100 people.
Answer: Systematic Random Sampling
4. The company has offices in 10 cities across the country (all with roughly the same number of employees in
similar roles). You don’t have the capacity to travel to every office to collect your data, so you use random
sampling to select 3 offices – these are your clusters.
Answer: Cluster Random Sampling
5. You are researching experiences of students with successful business in your school. Since there is no list of
all students with successful business in the school, you meet one person who agrees to participate in the
research, and she puts you in contact with other students with potentials to be included in the survey that she
knows in the area.
Answer: Snowball Random Sampling
Module 4: INTRODUCTION OF DATA COLLECTION AND METHODS / TECHNIQUES OF DATA
COLLECTION
DATA
➢ Data is various kinds of information formatted in a particular way.
➢ Is a collection of facts such as numbers, word, measurements, observation or just description of things.
DATA COLLECTION
➢ Is the process of gathering, measuring, and analyzing accurate data from a variety of relevant sources to find
answers to research problems, answer questions, evaluate outcomes, and forecast trends and probabilities.