Methods of Data Collection
Methods of Data Collection
Data can be obtained from primary or secondary sources. Primary data refer to information
obtained firsthand by the researcher on the variables of interest for the specific purpose of the
study. Secondary data refer to information gathered from sources already existing.
Some examples of sources of primary data are individuals, focus groups, panels of respondents
specifically set up by the researcher and from whom opinions may be sought on specific issues
from time to time, or some unobtrusive sources such as a trash can. The Internet could also serve
as a primary data source when questionnaires are administered over it.
Data can also be obtained from secondary sources, as for example, company records or archives,
government publications, industry analyses offered by the media, web sites, the Internet, and so
on. In some cases, the environment or particular settings and events may themselves be sources
of data, as for example, studying the layout of a plant.
While deciding about the method of data collection to be used for the study, the researcher
should keep in mind two types of data viz., primary and secondary. The primary data are those
which are collected afresh and for the first time, and thus happen to be original in character. The
secondary data, on the other hand, are those which have already been collected by someone else
and which have already been passed through the statistical process.
We collect primary data during the course of doing experiments in an experimental research but
in case we do research of the descriptive type and perform surveys, whether sample surveys or
census surveys, then we can obtain primary data either through observation or through direct
communication with respondents in one form or another or through personal interviews. This, in
other words, means that there are several methods of collecting primary data, particularly in
surveys and descriptive researches. Important ones are:
observation method,
interview method,
through questionnaires,
through schedules, and
Observation Method
The observation method is the most commonly used method specially in studies relating to
behavioural sciences. In a way we all observe things around us, but this sort of observation is not
scientific observation. Observation becomes a scientific tool and the method of data collection
for the researcher, when it serves a formulated research purpose, is systematically planned and
recorded and is subjected to checks and controls on validity and reliability. Under the
observation method, the information is sought by way of investigator’s own direct observation
without asking from the respondent.
Interview Method
The interview method of collecting data involves presentation of oral-verbal stimuli and reply in
terms of oral-verbal responses. This method can be used through personal interviews and, if
possible, through telephone interviews.
(a) Personal interviews: Personal interview method requires a person known as the interviewer
asking questions generally in a face-to-face contact to the other person or persons.
But there are also certain weaknesses of the interview method. Among the important
weaknesses, mention may be made of the following:
It is a very expensive method, specially when large and widely spread geographical
sample is taken.
There remains the possibility of the bias of interviewer as well as that of the respondent;
there also remains the headache of supervision and control of interviewers.
Certain types of respondents such as important officials or executives or people in high
income groups may not be easily approachable under this method and to that extent the
data may prove inadequate.
This method is relatively more-time-consuming, specially when the sample is large and
recalls upon the respondents are necessary.
The presence of the interviewer on the spot may over-stimulate the respondent,
sometimes even to the extent that he may give imaginary information just to make the
interview interesting.
Under the interview method the organisation required for selecting, training and
supervising the field-staff is more complex with formidable problems.
Interviewing at times may also introduce systematic errors.
Effective interview presupposes proper rapport with respondents that would facilitate free
and frank responses.
5. There is a higher rate of response than what we have in mailing method; the non-response is
generally very low.
8. At times, access can be gained to respondents who otherwise cannot be contacted for one
reason or the other.
But this system of collecting information is not free from demerits. Some of these may be
highlighted.
1. Little time is given to respondents for considered answers; interview period is not likely to
exceed five minutes in most cases.
4. It is not suitable for intensive surveys where comprehensive answers are required to various
questions.
6. Questions have to be short and to the point; probes are difficult to handle
1. There is low cost even when the universe is large and is widely spread geographically
2.It is free from the bias of the interviewer; answers are in respondents’ own words.
4. Respondents, who are not easily approachable, can also be reached conveniently.
5 Large samples can be made use of and thus the results can be made more dependable and
reliable.
1. Low rate of return of the duly filled in questionnaires; bias due to no-response is often
indeterminate.
4. There is inbuilt inflexibility because of the difficulty of amending the approach once
questionnaires have been despatched.
5. There is also the possibility of ambiguous replies or omission of replies altogether to certain
questions; interpretation of omissions is difficult.
This method of data collection is very much like the collection of data through questionnaire,
with little difference which lies in the fact that schedules (proforma containing a set of questions)
are being filled in by the enumerators who are specially appointed for the purpose. These
enumerators along with schedules, go to respondents, put to them the questions from the
proforma in the order the questions are listed and record the replies in the space meant for the
same in the proforma. In certain situations, schedules may be handed over to respondents and
enumerators may help them in recording their answers to various questions in the said schedules.
Enumerators explain the aims and objects of the investigation and also remove the difficulties
which any respondent may feel in understanding the implications of a particular question or the
definition or concept of difficult terms
Secondary data means data that are already available i.e., they refer to the data which have
already been collected and analysed by someone else. When the researcher utilises secondary
data, then he has to look into various sources from where he can obtain them. In this case he is
certainly not confronted with the problems that are usually associated with the collection of
original data.
Secondary data may either be published data or unpublished data. Usually published data are
available in:
(b) various publications of foreign governments or of international bodies and their subsidiary
organisations;
(e) reports and publications of various associations connected with business and industry, banks,
stock exchanges, etc.;
(f) reports prepared by research scholars, universities, economists, etc. in different fields; and
(g) public records and statistics, historical documents, and other sources of published
information.
The sources of unpublished data are many; they may be found in diaries, letters, unpublished
biographies and autobiographies and also may be available with scholars and research workers,
trade associations, labour bureaus and other public/ private individuals and organisations
By way of caution, the researcher, before using secondary data, must see that they possess
following characteristics:
1. Reliability of data: The reliability can be tested by finding out such things about the said data:
2. Suitability of data: The data that are suitable for one enquiry may not necessarily be found
suitable in another enquiry. Hence, if the available data are found to be unsuitable, they should
not be used by the researcher. In this context, the researcher must very carefully scrutinise the
definition of various terms and units of collection used at the time of collecting the data from the
primary source originally. Similarly, the object, scope and nature of the original enquiry must
also be studied. If the researcher finds differences in these, the data will remain unsuitable for the
present enquiry and should not be used.
3. Adequacy of data: If the level of accuracy achieved in data is found inadequate for the purpose
of the present enquiry, they will be considered as inadequate and should not be used by the
researcher. The data will also be considered inadequate, if they are related to an area which may
be either narrower or wider than the area of the present enquiry
Thus, there are various methods of data collection. As such the researcher must judiciously select
the method/methods for his own study, keeping in view the following factors:
1. Nature, scope and object of enquiry: This constitutes the most important factor affecting the
choice of a particular method. The method selected should be such that it suits the type of
enquiry that is to be conducted by the researcher. This factor is also important in deciding
whether the data already available (secondary data) are to be used or the data not yet available
(primary data) are to be collected.
2. Availability of funds: Availability of funds for the research project determines to a large extent
the method to be used for the collection of data. When funds at the disposal of the researcher are
very limited, he will have to select a comparatively cheaper method which may not be as
efficient and effective as some other costly method. Finance, in fact, is a big constraint in
practice and the researcher has to act within this limitation.
3. Time factor: Availability of time has also to be taken into account in deciding a particular
method of data collection. Some methods take relatively more time, whereas with others the data
can be collected in a comparatively shorter duration. The time at the disposal of the researcher,
thus, affects the selection of the method by which the data are to be collected.
4. Precision required: Precision required is yet another important factor to be considered at the
time of selecting the method of collection of data