Prof. Januario Flores JR
Prof. Januario Flores JR
1
05/03/2022
The word statistics has two meanings. The second meaning of statistics refers to the field
In the more common usage, statistics refers to or discipline of study.
numerical facts.
The numbers that represent the income of a Statistics is the study of how to collect,
family, the age of a student, the number of organize, analyze, and interpret
graduate students currently enrolled in doctorate
programs, and the starting salary of a typical numerical information from data.
college graduate are examples of statistics in this
sense of the word
2
05/03/2022
• Decisions made by using statistical Statistics has two aspects: theoretical and applied.
methods are called educated guesses.
Theoretical or mathematical statistics deals with the
development, derivation, and proof of statistical
• Decisions made without using statistical (or
theorems, formulas, rules, and laws.
scientific) methods are pure guesses and, hence,
may prove to be unreliable.
Applied statistics involves the applications of those
theorems, formulas, rules, and laws to solve real-
world problems.
The purpose of collecting and analyzing data is • Effective interpretation of data (inference) is
to obtain information. based on good procedures for producing data
and thoughtful examination
Statistical methods provide us tools to obtain of the data.
information from data.
Statistical methods enable us to look at information • The goal of statistics is not to perform numerous
from a small collection of people or items and calculations using the formulas, but to gain an
make inferences about a larger collection of understanding of your data.
people or items.
3
05/03/2022
Procedures for analyzing data, together with rules Broadly speaking, applied statistics can be
of inference, are central topics in the study of divided into two areas:
statistics.
descriptive statistics
We should also focus on understanding both the
and
suitability of the method and the meaning of the
result. inferential statistics.
4
05/03/2022
While descriptive statistics summarize the While descriptive statistics can only summarize a
characteristics of a data set, inferential sample’s characteristics, inferential statistics use
statistics help you come to conclusions and make your sample to make reasonable guesses about the
predictions based on your data. larger population.
When you have collected data from a sample, you With inferential statistics, it’s important to use
can use inferential statistics to understand the larger random and unbiased sampling methods. If your
population from which the sample is taken. sample isn’t representative of your population, then
you can’t make valid statistical inferences.
5
05/03/2022
We may make some decisions about the We may want to find the starting salary of a
political views of all college and university typical college graduate. To do so, we may select
students based on the political views of 1000 2000 recent college graduates, find their starting
students selected from a few colleges and salaries, and make a decision based on this
universities. information.
6
05/03/2022
A quantitative variable has a value or numerical A population consists of all elements that are being
measurement for which operations such as addition studied.
or averaging make sense. A sample is a subset of the population.
7
05/03/2022
A census is a sample of the entire population. In population data, the data are from every
individual of interest.
When we collect information on all elements of the
target population, it is called a census.
In sample data, the data are from only some of the
individuals of interest.
8
05/03/2022
If weights of all the ready-to-harvest pineapples We have categorized data as either qualitative or
in the field are included in the data, then we have a quantitative.
population. The average weight of all ready-to-harvest
pineapples in the field is a parameter. Another way to classify data is according to one of
the four levels of measurement.
The nominal level of measurement applies to data The interval level of measurement applies to data
that consist of names, labels, or categories. that can be arranged in order. In addition,
There are no implied criteria by which the differences between data values are
data can be ordered from smallest to largest. meaningful.
The ratio level of measurement applies to data that
The ordinal level of measurement applies to data can be arranged in order. In addition, both
that can be arranged in order. However, differences between data values and ratios of
differences between data values either cannot data values are meaningful.
be determined or are meaningless.
9
05/03/2022
Time Time
• In most cases, the size of the population is quite
large. Consequently, conducting a census takes
In fact, because of the amount of time needed to
a long time, whereas a sample survey can be
conduct a census, by the time the census is
conducted very quickly.
completed, the results may be obsolete.
• It is time-consuming to interview or contact
hundreds of thousands or even millions of members
of a population.
• On the other hand, a survey of a sample of a few
hundred elements may be completed in much less
time.
10
05/03/2022
Cost
The cost of collecting information from all
members of a population may easily fall outside Impossibility of Conducting a Census
the limited budget of most, if not all, surveys.
Consequently, to stay within the available
resources, conducting a sample survey may be
the best approach.
11
05/03/2022
Random sampling: Use a simple random sample from the Systematic sampling: Number all members of the population
entire population. sequentially. Then, from a starting point selected at
random, include every kth member of the population in
Stratified sampling: Divide the entire population into distinct the sample.
subgroups called strata. The strata are based on a
specific characteristic such as age, income, education Cluster sampling: Divide the entire population into pre-
level, and so on. All members of a stratum share the existing segments or clusters. The clusters are often
specific characteristic. Draw random samples from geographic. Make a random selection of clusters.
each stratum. Include every member of each selected cluster in the
sample.
12
05/03/2022
Multistage sampling: Use a variety of sampling methods to A sampling error is the difference between the
create successively smaller groups at each stage. The result obtained from a sample survey and the
final sample consists of clusters.
result that would have been obtained if the
whole population had been included in the
Convenience sampling: Create a sample by using data from
survey.
population members that are readily available.
A sampling error is the difference between The sampling error occurs because of chance, and it
measurements from a sample and cannot be avoided.
corresponding measurements from the
respective population. It is caused by the fact A sampling error can occur only in a sample survey.
that the sample does not perfectly represent It does not occur in a census.
the population.
13
05/03/2022
14