Chapter - 1 Notes
Chapter - 1 Notes
=>Collection of Data
•Data constitutes the base. The finding of an investigation depends on the correctness and
completeness of the relevant data.
•Sources of data are of two kinds primary source and secondary source. The term source
means origin or place from which data comes or is got.
•Census Techniques
•sample survey
=>Collection of data
The function of statistics is the collection of data. It is the basis for any statistical analysis
and the data collected must be accurate data leads to defective analysis and decision
taken will be misleading. There are two types of data
=>Primary Data
Definition
Primary data is the data that is collected for the first time through personal experiences or
evidence, particularly for research. It is also described as raw data or first-hand
information. The mode of assembling the information is costly, as the analysis is done by
an agency or an external organisation, and needs human resources and investment. The
investigator supervises and controls the data collection process directly.
The data is mostly collected through observations, physical testing, mailed questionnaires,
surveys, personal interviews, telephonic interviews, case studies, and focus groups, etc.
=>Secondary Data
Definition
Secondary data is a second-hand data that is already collected and recorded by some
researchers for their purpose, and not for the current research problem. It is accessible in
the form of data collected from different sources such as governanent publications,
censuses, internal records of the organisation, books, journal articles, websites and
reports, etc
This method of gathering data is affordable, readily available, and saves cost and time.
However, the one disadvantage is that the information assembled is for some other
purpose and may not meet the present research purpose or may not be accurate.
The investigator must decide at the outset whether he will use primary and secondary data
in an investigation. The choice between the two depends on the following considerations:
•Availability of time
The persons from whom information are collected are known as informants or
respondents.
The investigator contacts witness or neighbours or friends or some other third parties who
are capable of supplying the necessary information.
The investigator appoints local agents or correspondents in different places and complies
the information sent by them. Information to News-paper and some departments of
government are collected by this method.
A list of question is prepared and is sent to all the informants by post. The list of questions
is technically called questionnaire.
Enumerators or interviewers take the schedules, meet the informants and fill in their
replies. Often distinction is made between a schedule and a questionnaire.
Secondary data can be complied either from Published sources or Un Published sources.
Published sources
Reports and official publication of International bodies such as the International Monetary
Fund, International Finance corporation, United Nations organisation and world Health
organisation.
Private Publication- such as the publications of Trade and professional bodies such as the
Federation of Indian chambers of commerce and Institute of chartered Accountants.
Un Published Sources
All statistical material is not always published. There are various sources of unpublished
data such as records maintained by various Government and private offices, studies made
by research institutions, scholars, etc. such sources can also be used where necessary.
Primary data
Statistics data collected by various department of the government such as Labour Bureau
and central statistical Organisation are primary of the respective departments.
Secondary data
Data which is collected for the second and subsequently times is known as secondary
data.
Statistics data collected by various department of the government such as Labour Bureau
and central
Statistical Organisation are primary of the respective department. Whereas for all others
they constitute secondary data
Classification
In any statistical investigation, the collected data are called raw data. These data are huge
and may give confusion. Also they cannot be easily understood by people and they may not
be useful for further analysis.
Even if we use the phrase ‘Classification and tabulation’ the actual process in analysis is
that classification is the first step in tabulation. Proper classification helps us to make
proper tabulation.
Classification is to arrange the available data into homogenous groups (or) classes (or)
categories according to similarities of data.
Characteristics
All the data have to be arranged into homogenous groups by the process of
classification.
The classification are based on unity in diversity
It may be either real (or) imaginary
It should be flexible to accept the adjustments
It may depend on either similarities (or) dissimilarities.
Categories of classification
The classification of data depends generally on the nature of the data and particularly on
the purpose and objectives of enquiry.
Objectives of classification
1. To condense the mass of data they are classified into small groups (or) small
categories
2. To facilitate comparison through classification we can compare between variables
3. To prepare data for tabulation can be represented in tabular form
4. To study the relationship through classification and tabulation, we can study the
relationship between the variables, which is true when different characteristics of
the data, have been known
(|)Statistics can be used only for quantitative data analysis but not for qualitative data like
inteligence, honesty, poverty, etc.
(ii) Statistical data can be collected for a particular type of problem solving and can’t be
used for other situation.
Eg. If half of the MLA’s in a state are belong to same community, then you can’t conclude
that half of the people in that state are also from that community.
2.Define tabulation.
3.What is Histogram?
The bars should be side by side without a gap between adjacent bars. This is possible only
when we take true class interval. The area of each bar is proportional to the frequency of
corresponding classes. But usually in histogram, the class intervals are equal. Even if they
are not equal, we can construct histogram with areas of bars proportional to the
frequencies of the classes.
Statistics deals with numerical facts and numbers. It is for collection and arrangements of
facts and numbers.
Olden days, the kings and rulers in the ancient times were interested in their manpower
strength. They conducted census to get details about their population. From this
information, they calculated their strength and ability for wars. So those days, statistical
methods were defined as “the science of kings, politcal and science of state craft”.
Now-a-days without statistics we cannot do any projects.
Primary data
Secondary data
From the olden days the ruling kings and chiefs relied heavily on statistics in framing
suitable military and financial policies.