Lecture No. 1 Statistics and Probability
Lecture No. 1 Statistics and Probability
1
Statistics and Probability
WHAT IS STATISTICS?
That science which enables us to draw conclusions about
various phenomena on the basis of real data collected on
sample-basis
A tool for data-based research
Also known as Quantitative Analysis
A lot of application in a wide variety of disciplines …
Agriculture, Anthropology, Astronomy, Biology,
Economic, Engineering, Environment, Geology, Genetics,
Medicine, Physics, Psychology, Sociology, Zoology ….
virtually every single subject from Anthropology to
Zoology …. A to Z!
Any scientific enquiry in which you would like to base
your conclusions and decisions on real-life data, you
need to employ statistical techniques!
Now a days, in the developed countries of the world,
there is an active movement for of Statistical Literacy.
THE NATURE OF THIS DISCIPLINE:
DESCRIPTIVE STATISTICS
PROBABILITY
INFERENTIAL STATISTICS
Upon completion of the first segment, you
will be able to:
Sampling Methodology
X
X 156
n 10
15.6 years
Comparing the sample
mean age of 15.6 years
with the population mean
age of 15.785 years, we
note that the difference is
really quite slight, and
hence the sampling error
is equal to
Sampling Error:
X 15.6 15.785
0.185 years
And the reason for such a
small error is that we have
adopted the RANDOM
sampling method.
The basic advantage of random
sampling is that the probability is very
high that the sample will be a good
representative of the population from
which it has been drawn, and any
quantity computed from the sample
will be a good estimate of the
corresponding quantity computed
from the population!
Actually, a sample is
supposed to be a
MINIATURE
REPLICA of the
population.
Other Types of Random
Sampling
Stratified sampling
(if the population is
heterogeneous)
Systematic sampling
(practically, more convenient than
simple random sampling)
Cluster sampling
(sometimes the sampling units
exist in natural clusters)
Multi-stage sampling
etc., etc.
All these designs rest upon
random or quasi-random
sampling. They are various
forms of PROBABILITY
sampling --- that in which each
sampling unit has a known (but
not necessarily equal)
probability of being selected.
Because of this
knowledge, there exist
methods by which the
precision and the reliability
of the estimates can be
calculated OBJECTIVELY.
It should be realized that in
practice, several sampling
techniques are incorporated
into each survey design, and
only rarely will simple
random sample be used, or a
multi-stage design be
employed, without
stratification.
The point to remember is
that whatever method be
adopted, care should be
exercised at every step so as
to make the results as reliable
as possible.
The tree-diagram below presents an outline
of the various techniques
TYPES OF DATA
Qualitative Quantitative
Frequency
Curve
In today’s lecture, we will be dealing with various
techniques for summarizing and describing qualitative
data.
Qualitative
Univariate Bivariate
Frequency Frequency
Table Table
Percentages
Component Multiple
Pie Chart Bar Chart Bar Chart
Bar Chart
Medium of
f %
Institution
Urdu 719 59.9 = 60%
English 481 40.1 = 40%
1200
40,000
30,000
20,000
10,000
50,000
40,000
30,000
20,000
10,000
0
1965 1966 1967 1968 1969
Sex
Male Female Total
Med.
Urdu
English
Total
2000
1500
1000 Imports
Exports
500
0
1
5
-7
-7
-7
-7
-7
70
71
72
73
74
19
19
19
19
19
The question is, what is the basic
difference between a component bar
chart and a multiple bar chart?
• The component bar chart should be used when we have
available to us information regarding totals and their components.
For example, the total number of male students out of which some
are Urdu medium and some are English medium. The number of
Urdu medium male students and the number of English medium
male students add up to give us the total number of male students.
On the contrary, in the example of exports and imports, the
imports and exports do not add up to give us the totality of some
one thing!
IN TODAY’S LECTURE, YOU
LEARNT:
• The nature of the science of Statistics
• The importance of Statistics in various fields
• Some technical concepts such as
• The meaning of “data”
• Various types of variables
• Various types of measurement scales
• The concept of errors of measurement
IN NEXT LECTURE, YOU
WILL LEARN:
• Concept of sampling
• Random verses non-random sampling
• Simple random sampling
• A brief introduction to other types of random sampling
• Methods of data collection
In other words, you will begin your journey in a
subject with reference to which it has been said
that “statistical thinking will one day be as
necessary for efficient citizenship as the ability to
read and write”.