What Is Statistics?: Item 2000 2010 Malaysia Population
What Is Statistics?: Item 2000 2010 Malaysia Population
INTRODUCTION
What is statistics?
Statistics is a group of method used to collect, analyze, present and interpret data and to make
decisions.
Example:
Item 2000 2010
Malaysia Population 23,494,900 28,908,795
Source: Jabatan Perangkaan Malaysia 2011
Types of statistics
Descriptive Statistics : Consists of methods for organizing, displaying, and describing data by
using tables, graphs and summary.
Example: The test scores of students enrolled in a statistics class. The data set usually large, thus
manage the data set by using frequency tables, drawing graphs or calculating measures (e.g.
averages).
Inferential Statistics: Consists of method that use sample results to help make decisions or
predictions about population.(As a “best guess” of a population value based on sample
information)
Example: Do you feel pressured by your boss to come to work?
Yes,
38%
No,
Not 61%
, 1%sure
Population versus Sample
Population is a collection of all individuals, objects, or measurements of interest. A survey that
includes every member of the population is called a census.
Sample is a portion or subset of the population selected for study.A sample selected must be
represents the characteristics of the population. The technique of collecting information from a
portion of the population is called sample survey
Types of data
Primary data Secondary data
Definition Data collected from primary sources Data collected from other sources
or from samples (raw data)
Example:
Variable Type of variable
Number of students in a class
Number of football players in a team
Height of a child
Monthly income of a manager
Gender
Ethnicity
Favorite reality TV shows
Level of measurement
Example:
Sampling
Nonprobability Probability
sampling sampling
techniques techniques
Multistage
sampling
Nonprobability Sampling Techniques
1. Convenience Sampling
This technique attempt to obtain a sample of convenient elements. The selection of
sampling units is left primarily to the interviewer. Often, respondents are selected because
they happen to be in the right place at the right time.
2. Judgemental Sampling
A form of convenience sampling in which the population elements are purposely selected
based on the judgement of the resecher.
3. Snowball Sampling
Snowball sampling method begins by identifying a small number of individual (at least
one) who meet the criteria for inclusion in a study. We then ask them to recommend others
they knowwho also meet the selected criteria. This technique is appropriate when we are
trying to get a group of specific respondents who are difficult to reach or find.
4. Quota Sampling
Quota sampling involves non-random selections of individuals based on some
predetermined (category). Before selections made, we must decide what characteristics
that we want to use as the base of the quota such as gender, age, religion, etc.
Example:
Suppose 5 students from a class of 40 students are selected as a sample. Use simple
random sampling to select the sample.
Lottery Method
Step 1 : Have a list of 40 student’s names.
Step 2 : Write each of the names/ numbers on a separate piece of paper.
Step 3 : Place all 40 names/ numbers in a box and mix them.
Step 4 : Draw one name/number randomly from the box and repeat this experiment for
4more times.
Step 5 : The corresponding 5 names/numbers in a sampling frame is a sample.
Random number table
Step 1 : Have a list of 40 student’s names
Step 2 : Generate5 different random numbers.
For example, the generated random numbers are 15, 31, 27, 02 and 11.
Step 3 : Thus the corresponding 5 random numbers in a sampling frame is a sample
2. Systematic sampling
In systematic random sampling, we first randomly select one member from the first k units.
Then every kth member, starting the first selected member, is included in the sample.
N
Range k , where N is a population size and n is a sample size.
n
Example:
Suppose a population consists of 1800 employees of BigMoney Industries. A sample of
300 employees is to be selected from that population. Use systematic sampling to
select the sample.
Segamat
Divide
Muar
Segamat
sample
Batu pahat
Mersing
Mersing
Select at random using
SRS/systematic
K.Tinggi
Pontian
J. Bahru
Example:
Suppose a company wants to make a survey on the number of households in the state of
Selangor. Steps in using cluster sampling are as follows:
Step 1 : The whole state of Selangor is divided into 12 districts which are called
clusters.
Step 2 : Then, select cluster at random using SRS, let say 5 districts from 12 districts.
Divide according to
program
Accountancy
Diploma
programM Finance
Computer Science
Strata Sample
Population
Example:
A company has 2000 employees, 55% of the employees work in manufacturing
department, 35% in painting department and 10% in inspecting department. A human
resource manager of the company wanted to conduct a survey on number of
absentee by selecting a sample of 120 employees. Use stratified sampling to obtain
120 employees from 2000 employees.
Step 1 : The employees are divided according to departments which are called strata
such that:
Department Number of employees
Manufacturing 55%x120 66
Painting 35%x120 42
Inspecting 10%x120 12
Total number of employees 120
Step 2 : Then, 66 employees are selected at random from manufacturing department,
42 employees from painting department and 12 employees from inspecting
department as a sample.
5. Multistage Sampling
This method is designed to reduce time and cost when working with samples from very
large population.
Example:
Suppose we need a random sample of 2000 residents from Malaysian population. Since
Malaysia consists of 12 states with many districts within eah state and many villages within
each district, we could apply the multi-stage sampling technique.
Step 1: Select district at random, let say 4 states was selected. (using SRS)
Step 2: Then choose 5 districts randomly from each 4 selected states.(SRS)
Step 3: Finally we select 100 individuals at random from each of the 20 districts chosen to
make up our sample of 2000 (4states x 5 districts x 100 people)
Convenience sampling Less expensive, less time Selection bias, sample not
consuming, most convenient representative, not
recommended for descriptive
or causal research
Simple Random sampling Easily applied. Results can Difficult to obtain sampling
(SRS) be projected on population frame, expensive,
sometimes no assurance of
representativeness
Direct the researcher will greet respondents and explain briefly his intention
questionnaire before giving the questionnaire to the respondent. The researcher will wait
until the questionnaire has been completed.
Mail -cheaper than personal interviews, the coverage is wider, No interviewer
questionnaire influence and More time to think of proper response
- Low response rate, might be bias, only simple questions can be asked,
nobody is on hand to explain the question.
Direct -record the real situation, not influenced by what people say or think,
observation
-require researcher with high skills and unbiased
Source of Errors
The use of inadequate frame
A poorly designed questionnaire
Recording and measurement errors
Non-response problems