0% found this document useful (0 votes)
111 views150 pages

RM Full Note

This document provides definitions and explanations of key concepts related to business research. It defines research as a systematic process of identifying a question, developing a plan to answer it, and analyzing data to do so. Business research helps companies understand customers, markets, and competitors to reduce costs and create targeted solutions. There are different types of business research, including exploratory research to investigate ill-defined problems, and conclusive research to reach conclusions. Both quantitative and qualitative methods can be used.

Uploaded by

Anilisa
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
111 views150 pages

RM Full Note

This document provides definitions and explanations of key concepts related to business research. It defines research as a systematic process of identifying a question, developing a plan to answer it, and analyzing data to do so. Business research helps companies understand customers, markets, and competitors to reduce costs and create targeted solutions. There are different types of business research, including exploratory research to investigate ill-defined problems, and conclusive research to reach conclusions. Both quantitative and qualitative methods can be used.

Uploaded by

Anilisa
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 150

Introduction to Business research

Definitions of research
The term research is derived from the French word Recherche which means to travel. In
common man’s language refers to “ search for knowledge “.
According to Albert Scent Gyorgyi, “Research is to see what everybody else has seen and to
think what no body else had thought to do”.
OR
Definition of Research
A search for the truth.
• A movement from known to the unknown.
• An organised and systematic way of finding
answers to questions.
• A systematic process of identifying a question
or problem, setting forth a plan of action to
answer the question or resolve the problem
and rigorously collecting, analysing and
interpreting data for the purpose.
Definition of Manager
A manager is a person who is responsible for a part of a company, i.e., they ‘manage‘ the
company. Managers may be in charge of a department and the people who work in it. In some
cases, the manager is in charge of the whole business. For example, a ‘restaurant manager’ is
in charge of the whole restaurant.
OBJECTIVES OF RESEARCH
• It extends knowledge of human beings, social life and environment.
• It brings to light hidden information that might never be discovered fully during the ordinary
course of life.
• Research establishes generalisations and general laws and contributes to theory building in
various fields of knowledge.
• It verifies and tests existing facts and theory and this help improving our knowledge and
ability to handle situations and events.
• It aims to analyse inter-relationships between variables and to derive causal explanations
and thus enables us to have a better understanding of the world in which we live.
Need For Business Research
Business research is one of the most effective ways to understand customers, the market
and competitors. Such research helps companies to understand the demand and supply of
the market. Using such research will help businesses reduce costs, and create solutions or
products that are targeted to the demand in the market and the correct audience.

• Estimating expenses
• Determination of price
• Assisting managers in the decision making
process
• Evaluating market trends
• Achieving competitive advantage

TYPES OF BUSINESS RESEARCH


1. Exploratory research/ Formulative research
2. Conclusive research
3. Pure / Fundamental / Basic research
4. Applied research
5. Quantitative / Qualitative research
Exploratory research:
Exploratory research is defined as a research used to investigate a problem which is not clearly
defined. It is conducted to have a better understanding of the existing problem, but will not
provide conclusive results. For such a research, a researcher starts with a general idea and
uses this research as a medium to identify issues, that can be the focus for future research.

Eg: Different ways that the women professionals adapt to manage work family conflict.

Conclusive research:

conclusive research is meant to provide information that is useful in reaching conclusions or


decision-making. It relies on both secondary data, particularly existing databases that are
reanalyzed to shed light on a different problem than the original one for which they were
constituted, and primary research, or data specifically gathered for the current study. The
purpose of conclusive research is to provide a reliable or representative picture of
the population through the use of a valid research instrument. In the case of formal research,
it will also test hypothesis.

Conclusive research can be sub divided into two major categories:

1. Descriptive or statistical research, and


2. Causal research

Descriptive Research: It is a fact that find investigation with adequate interpretation. Definite
conclusions can be arrived at, but it does not establish a cause and effect relationship. It tries
to describe the characteristics of the respondent in relation to a particular product.

Eg: Trends in the consumption of soft drinks with respect to socioeconomic characteristics
such as age, family, income, education level.

Casual Research: This is conducted to determine the cause and effect relationship between
two variables.

Eg: Effect of Advertisement on Sales.

Pure / Fundamental / Basic Research

Basic research, also called pure research or fundamental research, is a type of scientific
research with the aim of improving scientific theories for better understanding and prediction
of natural or other phenomena. It is not directly involved with practical problem. There is no
intention to apply this in practise.

Eg: Theory of relativity by Einstein, Newton’s contributions, Galileo’s contributions

Applied Research

It aims at finding solution to a real life problem requiring an action or policy decision. It is
often referred to as a scientific method of inquiry or contractual research because it involves
the practical application of scientific methods to everyday problems.

Eg: Why have sales decreased during the last quarter?


Quantitative / Quanlitative Research

Quantitative research is expressed in numbers and graphs. It is used to test or confirm


theories and assumptions. This type of research can be used to establish generalizable facts
about a topic. Common quantitative methods include experiments, observations recorded as
numbers, and surveys with close ended questions.

Eg: To measure the purchase intentions for Nanos as a function of the demographic variables
of income, family size, and distance travelled, one would need to use quantitative methods.

Qualitative research can be used to explore, describe, or understand the reasons for a certain
phenomenon.

Eg: To understand what a low cost car means to an Indian consumer, this kind of investigation
would be required.

EXPLORATORY RESEARCH

Exploratory research is defined as a research used to investigate a problem which is not clearly
defined. It is conducted to have a better understanding of the existing problem, but will not provide
conclusive results. For such a research, a researcher starts with a general idea and uses this research
as a medium to identify issues that can be the focus for future research. An important aspect here is
that the researcher should be willing to change his/her direction subject to the revelation of new data
or insight. Such a research is usually carried out when the problem is at a preliminary stage. It is often
referred to as grounded theory approach or interpretive research as it used to answer questions like
what, why and how.

For example: Consider a scenario where a juice bar owner feels that increasing the variety of juices
will enable increase in customers, however he is not sure and needs more information. The owner
intends to carry out an exploratory research to find out and hence decides to do an exploratory
research to find out if expanding their juices selection will enable him to get more customers of if
there is a better idea.

Types and methodologies of exploratory research

While it may sound a little difficult to research something that has very little information about it,
there are several methods which can help a researcher figure out the best research design, data
collection methods and choice of subjects. There are two ways in which research can be conducted
namely primary and secondary. Under these two types, there are multiple methods which can used
by a researcher. The data gathered from these research can be qualitative or quantitative. Some of
the most widely used research designs include the following:
1.PRIMARY RESEARCH METHODS

Primary research is information gathered directly from the subject. It can be through a group of
people or even an individual. Such a research can be carried out directly by the researcher himself or
can employ a third party to conduct it on their behalf. Primary research is specifically carried out to
explore a certain problem which requires an in-depth study.

• Surveys/polls: Surveys/polls are used to gather information from a predefined group of


respondents. It is one of the most important quantitative method. Various types of
surveys or polls can be used to explore opinions, trends, etc.

For example: A survey is sent to a given set of audience to understand their opinions about
the size of mobile phones when they purchase one. Based on such information organization
can dig deeper into the topic and make business related decision.

• Interviews: While you may get a lot of information from public sources, but sometimes an in
person interview can give in-depth information on the subject being studied. Such a research
is a qualitative research method. An interview with a subject matter expert can give you
meaningful insights that a generalized public source won’t be able to provide. Interviews are
carried out in person or on telephone which have open-ended questions to get meaningful
information about the topic.

For example: An interview with an employee can give you more insights to find out the degree
of job satisfaction, or an interview with a subject matter expert of quantum theory can give
you in-depth information on that topic.

• Focus groups: Focus group is yet another widely used method in exploratory research. In
such a method a group of people is chosen and are allowed to express their insights on the
topic that is being studied. Although, it is important to make sure that while choosing the
individuals in a focus group they should have a common background and have comparable
experiences.

For example: A focus group helps a research identify the opinions of consumers if they were
to buy a phone. Such a research can help the researcher understand what the consumer value
while buying a phone. It may be screen size, brand value or even the dimensions. Based on
which the organization can understand what are consumer buying attitudes, consumer
opinions, etc.

• Observations: Observation research can be qualitative observation or quantitative


observation. Such a research is done to observe a person and draw the finding from their
reaction to certain parameters. In such a research, there is no direct interaction with the
subject.
For example: An FMCG company wants to know how it’s consumer react to the new shape of
their product. The researcher observes the customers first reaction and collects the data,
which is then used to draw inferences from the collective information.

2.SECONDARY RESEARCH METHODS

Secondary research is gathering information from previously published primary research. In such a
research you gather information from sources likes case studies, magazines, newspapers, books, etc.
• Online research: In today’s world, this is one of the fastest way to gather information on any
topic. A lot of data is readily available on the internet and the researcher can download it
whenever he needs it. An important aspect to be noted for such a research is the
genuineness and authenticity of the source websites that the researcher is gathering the
information from.

For example: A researcher needs to find out what is the percentage of people that prefer a
specific brand phone. The researcher just enters the information he needs in a search engine
and gets multiple links with related information and statistics.

• Literature research: Literature research is one of the most inexpensive method used for
discovering a hypothesis. There is tremendous amount of information available in
libraries, online sources, or even commercial databases. Sources can include
newspapers, magazines, books from library, documents from government agencies,
specific topic related articles, literature, Annual reports, published statistics from
research organizations and so on.

• Case study research: Case study research can help a researcher with finding more
information through carefully analyzing existing cases which have gone through a similar
problem. Such analysis is very important and critical especially in today’s business world.
The researcher just needs to make sure he analyses the case carefully in regards to all
the variables present in the previous case against his own case. It is very commonly used
by business organizations or social sciences sector or even in the health sector.

For example: A particular orthopaedic surgeon has the highest success rate for
performing knee surgeries. A lot of other hospitals or doctors have taken up this case to
understand and benchmark the method in which this surgeon does the procedure to
increase their success rate.

Exploratory research: Steps to conduct a research

• Identify the problem: A researcher identifies the subject of research and the problem is
addressed by carrying out multiple methods to answer the questions.
• Create the hypothesis: When the researcher has found out that there are no prior studies
and the problem is not precisely resolved, the researcher will create a hypothesis based on
the questions obtained while identifying the problem.
• Further research: Once the data has been obtained, the researcher will continue his study
through descriptive investigation. Qualitative methods are used to further study the subject
in detail and find out if the information is true or not.

Characteristics of Exploratory research

• They are not structured studies


• It is usually low cost, interactive and open ended.
• It will enable a researcher answer questions like what is the problem? What is the purpose
of the study? And what topics could be studied?
• To carry out exploratory research, generally there is no prior research done or the existing
ones do not answer the problem precisely enough.
• It is a time consuming research and it needs patience and has risks associated with it.
• The researcher will have to go through all the information available for the particular study
he is doing.
• There are no set of rules to carry out the research per se, as they are flexible, broad and
scattered.
• The research needs to have importance or value. If the problem is not important in the
industry the research carried out is ineffective.
• The research should also have a few theories which can support its findings as that will make
it easier for the researcher to assess it and move ahead in his study
• Such a research usually produces qualitative data, however in certain cases quantitative data
can be generalized for a larger sample through use of surveys and experiments.

Advantages of Exploratory research

• The researcher has a lot of flexibility and can adapt to changes as the research progresses.
• It is usually low cost.
• It helps lay the foundation of a research, which can lead to further research.
• It enables the researcher understand at an early stage, if the topic is worth investing the
time and resources and if it is worth pursuing.
• It can assist other researchers to find out possible causes for the problem, which can be
further studied in detail to find out, which of them is the most likely cause for the problem.

Disadvantages of Exploratory research

• Even though it can point you in the right direction towards what is the answer, it is
usually inconclusive.
• The main disadvantage of exploratory research is that they provide qualitative data.
Interpretation of such information can be judgmental and biased.
• Most of the times, exploratory research involves a smaller sample, hence the results
cannot be accurately interpreted for a generalized population.
• Many a times, if the data is being collected through secondary research, then there
is a chance of that data being old and is not updated.

CONCLUSIVE RESEARCH

Conclusive research design, as the name implies, is applied to generate findings that are practically
useful in reaching conclusions or decision-making. In this type of studies research objectives and data
requirements need to be clearly defined. Findings of conclusive studies usually have specific uses.
Conclusive research design provides a way to verify and quantify findings of exploratory studies.
Conclusive research design usually involves the application of quantitative methods of data collection
and data analysis. Moreover, conclusive studies tend to be deductive in nature and research
objectives in these types of studies are achieved via testing hypotheses.
The table below illustrates the main differences between conclusive and exploratory research
design:

Factor Conclusive Exploratory

Objectives To test hypothesis and relationships To get insights and understanding


• Information needs a clearly defined • Information needs are loosely defined
• Research process is formal and
• Research process is unstructured and flexible
structured
• Small, non-representative sample
Characteristics
• Large representative sample • Primary data analysis is qualitative
• Data analysis is quantitative
Findings Conclusive Only tentative

Outcome Findings used as input to decision making Generally followed by further exploratory
conclusive research

Main differences between conclusive and exploratory research design

It has to be noted that “conclusive research is more likely to use statistical tests, advanced
analytical techniques, and larger sample sizes, compared with exploratory studies. Conclusive
research is more likely to use quantitative, rather than qualitative techniques”. Conclusive
research is helpful in providing a reliable or representative picture of the population through
the application of valid research instrument.

Conclusive research design can be divided into two categories: descriptive


research and causal research.

1.Descriptive Research

Descriptive research can be explained as a statement of affairs as they are at present with the
researcher having no control over variable. Moreover, “descriptive studies may be
characterized as simply the attempt to determine, describe or identify what is, while
analytical research attempts to establish why it is that way or how it came to be”.

Descriptive research is “aimed at casting light on current issues or problems through a process
of data collection that enables them to describe the situation more completely than was
possible without employing this method.”

In its essence, descriptive studies are used to describe various aspects of the phenomenon.
In its popular format, descriptive research is used to describe characteristics and/or behaviour
of sample population.

An important characteristic of descriptive research relates to the fact that while descriptive
research can employ a number of variables, only one variable is required to conduct a
descriptive study. Three main purposes of descriptive studies can be explained as describing,
explaining and validating research findings.

Descriptive studies are closely associated with observational studies, but they are not limited
with observation data collection method. Case studies and surveys can also be specified as
popular data collection methods used with descriptive studies.
Advantages of Descriptive Research

1. Effective to analyse non-quantified topics and issues


2. The possibility to observe the phenomenon in a completely natural and unchanged
natural environment
3. The opportunity to integrate the qualitative and quantitative methods of data
collection
4. Less time-consuming than quantitative experiments

Disadvantages of Descriptive Research

1. Descriptive studies cannot test or verify the research problem statistically


2. Research results may reflect certain level of bias due to the absence of statistical tests
3. The majority of descriptive studies are not ‘repeatable’ due to their observational
nature
4. Descriptive studies are not helpful in identifying cause behind described phenomenon

2.Causal research (explanatory research)

Causal research, also known as explanatory research is conducted in order to identify the
extent and nature of cause-and-effect relationships. Causal research can be conducted in
order to assess impacts of specific changes on existing norms, various processes etc.

Causal studies focus on an analysis of a situation or a specific problem to explain the patterns
of relationships between variables. Experiments are the most popular primary data collection
methods in studies with causal research design.

The presence of cause cause-and-effect relationships can be confirmed only if specific causal
evidence exists. Causal evidence has three important components:

1. Temporal sequence. The cause must occur before the effect. For example, it would not be
appropriate to credit the increase in sales to rebranding efforts if the increase had started
before the rebranding.

2. Concomitant variation. The variation must be systematic between the two variables. For
example, if a company doesn’t change its employee training and development practices, then
changes in customer satisfaction cannot be caused by employee training and development.

3. Nonspurious association. Any covariation between a cause and an effect must be true and
not simply due to other variable. In other words, there should be no a ‘third’ factor that
relates to both, cause, as well as, effect.

Basic Research
Basic research is a type of research approach that is aimed at gaining a better understanding
of a subject, phenomenon or basic law of nature. This type of research is primarily focused
on the advancement of knowledge rather than solving a specific problem. Basic research is
also referred to as pure research or fundamental research.
Typically, basic research can be exploratory, descriptive or explanatory; although in many
cases, it is explanatory in nature. The primary aim of this research approach is to gather
information in order to improve one’s understanding, and this information can then be useful
in proffering solutions to a problem.
Applied Research
Applied research is a type of research design that seeks to solve a specific problem or provide
innovative solutions to issues affecting an individual, group or society. It is often referred to
as a scientific method of inquiry or contractual research because it involves the practical
application of scientific methods to everyday problems.
When conducting applied research, the researcher takes extra care to identify a problem,
develop a research hypothesis and goes ahead to test these hypotheses via an experiment. In
many cases, this research approach employs empirical methods in order to solve practical
problems.
Types of Applied Research
There are 3 types of applied research. These are evaluation research, research and
development, and action research.
Evaluation Research
Evaluation research is a type of applied research that analyses existing information about a
research subject to arrive at objective research outcomes or reach informed decisions. This
type of applied research is mostly applied in business contexts, for example, an organisation
may adopt evaluation research to determine how to cut down overhead costs.
Research and Development
Research and development is a type of applied research that is focused on developing new
products and services based on the needs of target markets. It focuses on gathering
information about marketing needs and finding ways to improve on an existing product or
create new products that satisfy the identified needs.
Action Research
Action research is a type of applied research that is set on providing practical solutions to
specific business problems by pointing the business in the right directions. Typically, action
research is a process of reflective inquiry that is limited to specific contexts and situational in
nature.

• Applied research to improve an organization’s hiring process.


• Applied research to improve workplace efficiency and organizational policies.
• Applied research to bridge skill gaps in the workplace.
Applied Research Example in Science
In itself, applied research is a scientific method of investigation because it applies existing
scientific knowledge to practical situations. It is useful in different fields including
thermodynamics, physics, material sciences and microbiology.
Examples of applied research in science include the following:

• Applied research to improve agricultural crop production.


• Applied research to treat or cure a specific disease.
Quantitative Research
When the data to be studied needs to be quantified and subjected to a suitable analysis in
order to generalize the findings to the population at large or to be able to quantify and explain
and predict the occurrence of a certain phenomenon.
For example- To measure the purchase intentions for Nano as a function of the demographic
variables of income, family size, and distance travelled, one would need to use quantitative
methods.
Qualitative Research
It can be used to explore, describe, or understand the reasons for a certain phenomenon.
For example, to understand what a low-cost car means to an Indian consumer, this kind of
investigation would be required.

Research Applications in Business Decisions

The types of applications in Business Decisions are :

1. Marketing
• Product identification
• Demand estimation
• Demand-supply analysis
• Product development
• Market segmentation
• Sales promotion programme
• Product launching
• Buyer behaviour
• Product Research
• Pricing Research
• Promotional research
• Place Research
• Packaging
2. Human Resources Management
• Manpower planning
• Performance appraisal systems
• Conflict management
• Study of organisational climate
• Design of incentive plans
• Leadership styles
• Training and Development
• Change management
• Negotiation and wage settlement
• Labour welfare study
3. Finance
• Economic evaluation of alternatives
• Study of financial parameters of an organisation
• Capital budgeting
• Ratio analysis
• Working capital
• Portfolio management
• Balance of payment
• Inflation
• Deflation
• Behavioural finance
4. Production & Operations Management
• Operation planning
• Demand forecasting and decision analysis
• Process planning
• Project management and maintenance management studies
• Logistics and supply chain and inventory management analysis.
• Quality estimation and assurance studies.

Cross-functional Research

Cross functional Research requires an open orientation where experts from across the
discipline contribute to and gain from the study.

For example, an area such as new product development requires commitment of the
marketing, production, and consumer insights team to exploit new opportunities.

Managerial Effectiveness
The term ‘managerial effectiveness’ could mean achievement of organizational goals,
increase in productivity, profit, workers’ satisfaction, growth, diversification etc. Managerial
effectiveness aims at optimum allocation and utilization of scarce organizational resources in
order to achieve the goals at minimum cost. It aims at deriving maximum output out of
minimum input.
1

MODULE 2

Research Methodology

The study of conducting research is Research Methodology.

Research: The word research is composed of two syllables “Re” and “Search”.

“Re” is the prefix meaning ‘Again or over again or a new’ and “Search” is
the latter meaning ‘to examine closely and carefully’ or ‘to test and try’.

Together they form, a careful, systematic, patient study and investigation


in some field of knowledge undertaken to establish principles / policies.

Research can also be defined as

1. Search for knowledge

2. Systematic and scientific search for getting relevant answers on any taken
up specific topic.

3. Scientific enquiry into a subject.

4. Research is a movement from the unknown to the known.

5. It is the voyage of discovery

Acc to Bulmer,

Research is primarily committed to establishing systematic, reliable and


valid knowledge about the social world.

Acc. To Clifford Woody,

Research comprises of

• Defining and redefining problems.

• Formulating hypothesis (basic idea)


2

• Collecting

• Organizing

• Evaluating datas

• Making decisions

• Suggesting solutions

• Reaching conclusions

• Finally, carefully testing the conclusions

To determine whether they fit the formulated Hypothesis.

Research Methods: May be understood as all those methods or techniques that


are used by a researcher for conducting a Research depending upon the methods.

(1) Library Research: analysis of historical records and documents.

- Statistical compilation, references, abstracts, guides manipulation


(handle with skill)

(2) Field Research: Observation, questionnaires, personal, Group or


telephonic interviews, case study.

(3) Laboratory Research:

Group (team) study, use of audio visual tools.

Research Methodology: is the way do systematically solve the research problem.

In it we study the various steps that are generally adopted by a


researcher in studying his research problem logically.

When we talk of Research Methodology, we not only talk of research


methods but also consider the logic behind the methods we use in the context of
our research study and explain why we are using a particular method or we are
3

not using a particular method or technique so that research results are capable
of being evaluated either by the researcher or others.

Steps:

1) Why a particular research study has been undertaken?

2) How the Research problem has been defined?

3) What way and why the hypothesis (basic idea) has been formulated?

4) Why a particular technique of analyzing data is used? (or) How the data
were collected?

5) How the collected data were interpreted?

6) What deletion was made? 7) What was the conclusion?

Finally what was the solution for the Research problem?

Importance of knowing the subject – research Methodology:

1) A student preparing himself for a career of carrying out research as


his profession –

- Will be trained better to do research

- Will help him develop disciplined thinking

- Will help him observe the field objectively.

- Will enable thoroughly to understand the logic behind the research


problem.

- Will increase the ability to evaluate the results.

- Face the evaluated results with confidence.


4

- Useful in various fields such as Govt. Business, administration,


community development & social work.

To qualify a Research or study:

To be a Good or perfect one,

The Research adapted should process certain characteristics,

It must as far as possible be 1) Controlled


2) Rigorous
3) Systematic
4) Valid
5) Verifiable
6) Empirical
7) Critical

1. Controlled: The research problem should not be affected or influenced by


external factors (i.e. variables other than the participating facts).

2. Rigorous: The procedures followed to find answers to questions should


be relevant, appropriate & justified. But the degree of rigiour may vary from one
problem to another problem.

3. Systematic: The investigation should follow a certain logical sequence


(Not in a haphazard manner)

4. Valid & Verifiable: The findings should be valid & can be verified by you
or others at any time.

5. Empirical: The conclusions drawn should be based on hard evidence,


gathered from real life experiences or observations.

6. Critical: The process of investigation must be foolproof and free from


drawbacks. The process adapted and the procedures used must be able to
withstand any critical scrutiny.
5

Types of Research

Research can be classified from the view point or perspectives as,

From the view point

Application objective Inquiry mode

1) Pure Research 1) Descriptive 1) Quantitative Research

2) Applied Research 2) Qualitative Research


2) Correlative

3) Exploratory

4) Explanatory
1) Pure Research: (Basic or Fundamental Research)

Gathering, knowledge is termed as ‘pure’ or ‘basic’ research. Just to


gather knowledge in order to formulate or generalize theories or policies.

Eg) Research on mathematics.

This types of research adds knowledge to the already existing organized


body.

Applied Research: To find an immediate solution for a pressing practical


problem.

Eg: Social, economical and political trends prevailing in a country.

Applied Vs Fundamental Based on the objectives of Research:

1) Descriptive Research:

• Survey or fact finding enquires of different kinds. It


describes the actual prevailing state of affairs, existing at
present.
6

• Otherwise known as ex post facts means existing position


of facts / issues.

• Here the variable influencing the research has no control


or the researcher has no control over the variables.

Eg: Frequency of shopping, customer preference etc.

2) Correlative Research :

• Goes on to discover the existing relationship or


interdependence between two or more aspects / variables.

• Otherwise known as comparative study.

• Investigates association between variables.

Eg: Sum of humour and job satisfaction, (related variable)

Research problem is workers turnover

Analytical Research:

The researcher has to use facts / information already existing and analyze
these data to make a critical evaluation.

Eg: document study / historical evidence.

Descriptive Vs Analytical Research:

Explanatory Research:

Attempts to clarify or explain why and how, any particular research


problem arises and can be solved.

4. Exploratory Research: Study undertake to explore a new area or an


unknown destination.

III. Based on the Inquiry Mode:


7

1) Quantitative Research:

• Relates to aspects that can be quantified and expressed in terms


of quantity.

• Otherwise known as structured Research.

• In this type of Research, the objectives, design, sample and all the
other factors influencing the research is pre determined.

The research problem and its solution will be expressed in terms of


quantity and hence statistical and economic analysis is adapted in this type of
Research.

Quantitative Research:

• Otherwise known as unstructured research.

• The aspects related to quality / kind or texture.


Eg: Behaviour science

Apart from the above, other types of Research are,

Conceptual Research: Research related to some abstract idea or theory

• Used by philosophers or thinkers for developing new concepts.

Empirical research

(based on experiments or experience)

• Otherwise known as experimental type of Research.

• The result obtained by adapting Empirical Research is considered


to be most powerful (evidence enclosed)

Based on the time consumed to complete a particular research,

a) one time Research:restricted to a single time period.

b) Longitudinal Research: Conducted over several time period.


8

Qualities of a Researcher

Top 10 qualities of a Researcher

1) Ananalytical mind: Constant analysis on a variety of factors.

2) A people person : For respondents to get the best out of interviews /


focus groups.

3) The ability to stay calm: especially when you have pressing


deadlines. Keep well focused and think logically there will always be an
end point.

4) Intelligence : Researcher requires critical analysis, but most of all


common sense.

5) Curiosity: Have curiosity and be passionate about developing deeper


to unearth more insight.

6) Quick thinker: Things don’t always go as you plan, so you need to


be able to think fast.

7) Commitment: Research is a tough job, the hours may be long, the


deadlines short. ‘

8) Excellent written and verbal communication skills: So that


different audience can clearly understand the findings.

9) Sympathetic: Having a sympathetic ear when listening to some


respondents (cry etc) is a good skill, to have.

10) Systematic: Check, check and check again. Spending a proper amount
of time for checking always pays.

According to Micheal Foster,

1) Truthful data / facts – desire for accuracy of observation.

2) No expressions like approximately, almost or nearly.


9

3) Should poccess alert mind. Nature is constantly changing, be keen and


watchful to notice such changes, no matter how small or insignificant
they may.

4) Scientific inquiry – desire for knowledge – it requires moral courage,


Steadfast (constant / not changing) endurance (to tolerate the difficulty,
suffer patiently)

- When a research scientist feel defeated or completely lost, he needs


immense courage and the sense of conviction (found guilty)

Significance or Importance of Search

1) Doubt is better than over confidence for it leads to inquiry, inquiry


leads to invention. Process or the three stages of research to bring
out economic policies.

1) Investigation of prevailing economic structure with the available facts.

2) Analyse or diagnose the data.

3) Prediction for future developments.

2) Research encourages scientific and inductive thinking.

Eg:- Role of Research in :

1) Econ omics:

Researches done on applied (production and sales of goods in a


profitable manner) economics is increasing in modern days. Govt. &
business sectors have become more complex, they face several
operational problems to solve this problems, Research is carried on.

To frame Govt. economic policies.

Govt. budget a formulation depends on the analysis of needs


& desires of the people, available of revenues needs research.
10

Decision making – requires proper research.


Allocation of a countries scarce resource – also needs research.

2) Business Decisions:

In business sectors there are both planning and operational


problems.

a) Problems Research: Investigation of


the present structure and development of the
market – relating to purchase, production, promotion and
sales.

b) Operational Research: Relates to application of logical,


mathematical and analytical techniques – to solve market
problems – there by minimize cost and profit maximization.

c) Motivational Research: Helps to determine people


behavior or consumer response.

All the above three are responsible for business decision making.

3) Social sectors: To gain knowledge on unknown aspects and do


something better and more efficiently.

Social scientist gain their knowledge for their own sake and for the
development of the society.

1. Formulating the Research problem:-

a) The formulation of a general topic into a specific Research


problem thus constitutes the first step in a scientific inquiry. Two steps

are involved in formulating the Research problem,

a) Understanding the problem thoroughly.

b) Rephrasing the same into meaningful terms from an analytical point


of view.
11

1. Identify a broad field or subject area of interest to you.

2. Dissect the broad area into small area.

3. Select what is of most interest to you.

4. Raise Research questions.

5. Formulate objectives

6. Assess your objectives

7. Double check

 The best way to understand the problem is to discuss with his own
colleague or guide.

 Examine all available literatures to get himself acquainted (get used


to ) with the selected problem.

 Review two types of literature

Conceptual literature :

Concerning concepts & theories

Empirical Literature : Concerning studies made earlier which are similar to the
one proposed.

 Outcome of the review will be the knowledge so as to pre


determine what data or materials are available for operational
purposes.

 Next step – the Researcher rephrases the problem into Analytical or


operational terms.

PUT THE PROBLEM INTO SPECIFIC TERMS

This step is of greatest importance in the entire research process.


12

The problem to be investigated must be defined unambiguously or clearly.

Prof W.A. Neiswanger States,

The statement of the objective of the Research problem is of basic


importance because,

(i) It determines the data which are to be collected

(ii) Characteristics of the relevant data

(iii) Choice of techniques to be used in these explorations

(iv) Frame a Final report

Step II

Extensive Literature Survey:

 A brief summary of the problem should be written down.

 Make extensive literature survey

 Sources of survey can be, journals, bio-graphics, Govt. reports, books,


conference proceedings etc.

Based on the nature of the problem.

 Earlier study if any which is similar to the study in hand should be


carefully studied.

 A good library will be a great help to the researcher at this stage.

Stage III:

Developing Hypothesis : (Development of working Hypothesis)

 State in clear terms the working hypothesis (Basic Idea of the


Research problem)
13

 It is a tentative assumption in order to test to logical or empirical


consequences.

 Provide the focal point for research.

 Hypothesis should be very specific and very well limited to the place of
research in hand because it has to be tested.

 Hypothesis guides the researched by limiting the area of Research and


keep him on the right track.

 It sharpens his thinking and focuses attention on important facets of the


problem.

 It indicates the type of data required for the study.

 Type of methods of data analysis done.

How to develop working Hypothesis?

1) Discuss with collogues / experts, about the problems, its origin, its
objectives and solutions.

2) Examination of data/ records if available.

3) Review similar studies / similar problems.

4) To secure greater insight into the practical aspects of the problem –


conduct personnel investigation or field interviews.

STEP IV:

Preparing the Research Design:

Research design is the conceptual structure within which research is


conduction. It constitutes the blue print for the collection, measurement and
analysis of data.
14

The function of the Research design is to provide relevant evidence with


minimal expenditure of effort, time and money. It provides an outline of what
the researcher is going to do in terms of 1) Framing the hypothesis, 2) its
operational implications and 3) finally data analysis

The Research design highlights certain decision,

1) The nature of the study

2) Purpose of the study

3) Location where the study would be conducted

4) The nature of data required

5) From where the data would be collected

6) The techniques of data collection that would be used

7) What time period the study would cover

8) The type of sample design that would be used

9) The method of data analysis that would be adapted

10)The manner in which the report would be prepared

Type of Research Design : 4 types

1) Sampling Research Design : Deal with selection of relevant items

2) Observational Research Design: Deals with the observations (field


observations) that is to be made.

3) Statistical Research design: Deals with the information on the data


collected & analysed.

4) Operational Research Design: How the above three are carried out.
15

Determining sample Design:

All the items considered in any field of inquiry constitutes a “universe” or


population. Study of the entire population without leaving out a single
item is known as “Census Study”

This type of census study is practically not possible.

So we select few items from the entire population for our study purpose.
The items so selected constitutes what is technically called “sample”.

The way of selecting such a “sample” is known as the “Sample Design”.

These samples can be either probability samples or non probability samples.

Probability: Each item in the population has on equal chance of being


selected for the study.

1) Simple random sampling

2) Systematic random sampling

3) Stratified random sampling

4) Cluster / area random sampling.

Non Probability sampling: All the items do not have an equal chance of being
selected for the study.

The selection depends upon the convenience & judgment of the


Researcher.

Mixed sampling: When more than one type of sampling technique is


used for a study, it is mixed sampling.

The sample design to be used in a Research study must be decided by the


researcher considering the nature of the study.

6. Collecting the Data:


16

“Gathering appropriate data” which are made use in Research study.

Data can be collected in several ways either through (1) Experiment (or)
(2) through surveys.

 In experimental means, when a researcher conducts a research,


some quantitative measurements are observed, based on which,
he examines the truth of the underlying hypothesis.

 In case of surveys, data are collected by

1) By observations

2) Through personnel interview

3) Through telephone interviews

4) By mailing of questionnaires

5) Through schedules / enumerators

The Researcher should select one of these methods of collecting the data
taking in account the

1) Nature of investigation

2) Objective & scope of Inquiry

3) Financial Resources

4) Time frame

5) Desired degree of Accuracy.

6) Execution of the Project: (Putting a plan)

 Important step in Research study.

 See that the project is executed in a systematic manner and in


time.
17

Eg) If the survey done in a project is via Questionnaire the answers


can be machine coded / processed

 If interview were conducted, make sure that the interviewers is


well trained – to keep the survey as much as realistic as possible.

8. Analysis of Data :

After the data are collected the researcher turns to the task of
analyzing the data the analysis of data require closely related
operations, like ‘coding, Editing & Tabulation’.

The wide data collected should be condensed into small manageable


groups, for easy analysis.

Coding: The collected data are transformed into symbols that may be
tabulated or counted.

Editing: Unwanted & irrelevant data will be removed.

Tabulation: Technical procedure where the data are put in the form of
tables.

Research Design:

 The most important step after defining the ‘Research problem’ is


preparing the Research Design

 Research design is the conceptual structure within which the research


is conducted.

 It constitutes the ‘BLUE PRINT” for collection, measurement and


analysis of data.

 Research design provides an answer to the question, what the


Researcher is going to do with regards to framing hypothesis, its
operational implications and how to analyse the data?
18

Research Design: - Decisions

Highlights certain decisions,

1) Nature of the study

2) Purpose of the study

3) Location where the study would be conducted

4) Nature of “DATA” required

5) From where the “DATA” can be collected

6) Time period of the study

7) Type of sample design to be used

8) Techniques of data collection

9) Methods of Data Analysis 10)Preparation of Report.

Research Design

May be sub divided into,

1) Sampling design: Deals with, the method of ‘selecting items’ for the study.

2) Observational design: Relates to the condition under which the


observations are to be made.

3) Statistical Design: Deals with the “no of items” selected or the study
and how the selected data will be analysed.

4) Operation design: The technique by which the sampling,


observational and statistical designs can be carried out.

Research Design – Features :

1) Helps to identify the type and source of information needed for the study.
19

2) Specifies the methods to be adopted in collecting & analyzing data.

3) Specifies the time schedule of the research and the monetary budget involved.

Concepts Relating to Research Design

1) Dependent and Independent variables :

Variables : A magnitude that varies is known as “variable”

Continuous variable : Values that can be expressed even in decimal poins are
known as continuous variables

Eg: age (4 years 3 months)

Height (5.2 cm)

Weight (45.3 kg)

Non continuous Variables:Value that can be expressed only in integer values are
called Non continuous variables

Eg: No. of students in a class ( 45)

No. of children in a family (3) Statistically

known as “discrete variables” Dependent or

Endogenous variables :

When the change in one variable depends on the change in other variable,
it is known as dependent or Endogenous variable.

Demand ----- Price (independent)

Independent or Exogenous variable

The variable that causes the change in the dependent variable is known
as independent or exogenous variable.

Demand (Dependent) ------- Price ,Income


20

Here demand is a dependent variable while price / income is an


independent variable.

Extraneous variable :

The independent variable which is not directly related to the purpose of


the study but affects the dependent variable is know as Extraneous variables.

 The influence caused by the extraneous variable on the dependent value


is technically known as “Experimental Error”

 A research study or a Research design should always be framed in such a


manner that the influence of ‘Extraneous variables’ on the dependent
variable is completely controlled and the influence of the independent
variable is clearly evident.

Control:

Good Research design should minimize the effect for Extraneous


variables.

Confounded Relationship

The relationship between dependent and independent variable is said to


be confounded by an extraneous variables.

Research Hypothesis:

When the formulated hypothesis is tested by adopting scientific methods,


it is known as Research Hypothesis.

Experimental & Non Experimental Hypothesis testing:

 When the objective of the Research is to test the hypothesis, it is


Research hypothesis.

 Research in which the independent variable are (handled with skill)


manipulated, it is experimental hypothesis testing.
21

 When the variables are not manipulated, it is non experimental


hypothesis testing.

Experimental & Control Groups:

 When a group is exposed to usual conditions in an experimental


hypothesis, research it is control Groups.

 When the group is exposed to special or certain new conditions, it is


experimental groups.

8. Treatments:

The different conditions to which the experimental & control groups are
subject to is known as treatments.

9. Experiment: Fertilizers and crops)

Process of verifying the truth.

Absolute Experiment:

 Determine the fact

Comparative Experiment:

 Determine the impact in comparison with another fact.

10. Experimental units

 Pre-determined block to which different treatments are applied.

Eg : animal testing

Types of Research Design

There are three different types of Research design,


22

1) Exploratory Research Design:

 Is a “Formulative Research design”

 Main purpose is the discovery of ideas & insights

 Should be flexible enough considering different dimensions of the problem


under study.

2) Descriptive and Diagnostic Research Design:

 Descriptive Research Design is concerned with describing the characteristics


of a particular individual or a group.

 Study concerned with narration of facts or characters related to an


individual, group or institution are descriptive research studies.

 Diagnostic Research design determines the frequency with which a variable


occurs or its relationship with another variables.

 Both the Research designs should be planned carefully.

 Research design should be Rigid (No flexibility)

3) Hypothesis testing Research Design :

 Test the hypothesis of causal relationship between two or more variables.

 Adopt procedure that not only reduce bias but enhance reliability – and
facilitates deriving Inferences (results) about the Research problem.

Importance of Research Design:

 Facilitates the smooth flow of the various stages of Research.

 Helps yield maximum information with minimum effort, time and money.

 Helps to plan in advance data collecting and analysis techniques.

 Prepare with utmost care to avoid errors.


23

 Attain reliability

Characteristics of a Good Research Design

 Posses the qualities of being flexible, suitable efficient & economical.

 Should minimize ‘bias’ and maximize reliability of data collection &


Analysis.

 No experimental error should be allowed

 Should yield maximum information

 Research problem should be viewed from different angles or dimensions.

The choice of Research design depends on,

 Nature of the Research problem

 Objectives of the Research problem

 Skills / ability of the Researcher

 Methods of gathering information

 Availability of monetary support

 Time schedule

HYPOTHESIS

A Research hypothesis is a predictive statement, which is capable of


being ‘tested’ using scientific methods, which involves independent and
24

dependent valuables. (eg) the female students perform as well as the male
students.

This statement is a hypothesis that can be objectively tested and verified.

It is a proposition that can be put to test in order to examine its validity.

Characteristics of Hypothesis

1) A hypothesis should be precise and clear. If not clear, the inferences will
not be reliable.

2) It must be capable of being put to test.

3) It should state the relationship between the variables, in case relational


hypothesis.

4) It should be stated in a simple language.

5) It should be consistant and derived from all known facts.

6) Hypothesis must be amenable to testing within a reasonable period of


time

7) Hypothesis should explain what it actually to explain. (the solution for


the Research problem). The explanation should be on empirical reference.

Concepts Relating to Testing of Hypothesis

1) Null Hypothesis & Alternative

Hypothesis (Statistical Analysis)

Null Hypothesis: Denoted by H0. If both the variables (say male or female)
or (Head or Tail) are equally good, it is Null Hypothesis.

Alternative Hypothesis: Denoted by Ha or H1. If one variable is considered


superior to other or vice versa or if there is a difference, it is alternative
hypothesis.
25

Mean Population (u) or (p)

Total / No. of variables

Null Hypothesis

Ho : u = 100

Alternative Hypothesis

Ha : u = 100

Ha : u > 100

Ha : u < 100

Aspects to be considered while formulating Null Hypothesis

1) The researcher always tries to reject Null hypothesis since Alternative


Hypothesis should be proved.

2) Null hypothesis when it is actually true, when rejected involves great risk,
the level of significance should be considered.

3) Null hypothesis should be very specific (No approximation)


The level of significance:

• Important concept of hypothesis testing.

• It is a certain percentage chosen with great ‘care, reason and thought’

(eg) let us consider the level of significance to be 5%. It means the


Researcher takes a risk of rejecting Null hypothesis (Ho) by 5% when Ho is
actually true.

3. Decision Rule

• The researcher should make a decision, if to accept or Reject Ho.


26

• The decision rule should be decided on the number of items to be tested


and the basic of which to accept or reject.

4. Type I and Type II Errors

(i) Researcher may reject Ho, when it is true – Type I Error (which must have
been accepted).

(ii) Researcher may accept Ho, when it is false – Type II Error (which must
have been rejected)

5. One tailed and Two tailed Tests:

(i) One tailed test rejects the Null hypothesis when the sample mean is
either greater or lower than the hypothesized value of the population
mean.

Two tailed Test: When the sample mean is both greater and lower than
the hypothesized value of the population mean.

Procedure for Hypothesis Testing:

1. Testing hypothesis refers whether the formulated hypothesis is valid or


not

2. Whether to Accept or Reject Null Hypothesis.

(i) Making a formal statement:

• Making a formal statement of the null hypothesis and alternative


hypothesis.

(ii) Selecting a significant level of testing

• A pre-determined level of significance should be specified.

• Either 5% or 1% level can be considered for the purpose.

(iii) Deciding the Distribution to use:


27

Measurement and Measurement Scales

• Measurement is the foundation of any scientific investigation

• Everything we do begins with the measurement of whatever it is we want to


study

• Definition: measurement is the assignment of numbers to objects

Example:

When we use a personality test such as the EPQ

(Eysenck Personality Questionnaire) to obtain a measure of Extraversion –


‘how outgoing someone is’

we are measuring that personality characteristic by assigning a number (a


score on the test) to an object (a person)

Four Types of Measurement Scales

Nominal

Ordinal

Interval

Ratio

•The scales are distinguished on the relationships assumed to exist between


objects having different scale values
28

•The four scale types are ordered in that all later scales have all the properties
of earlier scales— plus additional properties

Nominal Scale

• Not really a ‘scale’ because it does not scale objects along any dimension

• It simply labels objects

Gender is a nominal scale

Male = 1

Female = 2

Religious Affiliation

Catholic= 1

Protestant= 2

Jewish = 3

Muslim = 4

Other = 5

Categorical data are measured on nominal scales which merely assign labels to
distinguish categories

What about symptoms of depression from a psychiatric assessment?

None = 0

Mild = 1

Moderate = 2

Severe = 3
29

Ordinal Scale

•Numbers are used to place objects in order

•But, there is no information regarding the differences (intervals) between


points on the scale

Interval Scale

•An interval scale is a scale on which equal intervals between objects,


represent equal differences

•The interval differences are meaningful

•But, we can’t defend ratio relationships

Fahrenheit Scale

• Interval relationships are meaningful

• A 10-degree difference has the same meaning anywhere along the scale

• For example, the difference between 10 and 20 degrees is the same as


between 80 and 90 degrees

• But, we can’t say that 80 degrees is twice as hot as 40 degrees

• There is no ‘true’ zero, only an ‘arbitrary’ zero

Ratio Scale

•Have a true zero point

•Ratios are meaningful

•Physical scales of time, length and volume are ratio scales


30

• We can say that 20 seconds is twice as long as 10 seconds

It isn’t so straight-forward??

•The underlying construct or variable being measured defines the scale of


measurement, not the numbers themselves (Why?)

•Statistical procedures use numbers without considering the underlying


constructs that are measured

•Measurement is the foundation, but whether or not statistics can be


interpreted depends on research design issues

Variables and Constants

•The names imply their meaning in statistics

•Properties of objects that can take on different values are referred to as


variables

• A constant is a number that does not change its value (is constant) in a given
situation

Discrete and Continuous Variables

•Discrete variables have a relatively small set of possible values

gender, marital status, religious affiliation

•Continuous variables can (theoretically) assume any value between the


lowest and highest points on the scale

time, distance, weight

Independent and Dependent Variables

•Independent variables:
31

manipulated by the experimenter under the control of the experimenter

•Dependent variables:

not under the experimenter’s control usually the outcome to be measured

• Typically, we are interested in measuring the effects of independent


variables on dependent variables

What are the independent and dependent variables?

An experimenter measures blood pressure response in male and female


hypertensive subjects, randomly assigned to two drug treatment conditions:
standard medication A, and improved medication B

Statistical Notation

• Variables will be represented by an uppercase letter (e.g., X, Y, or Z)

• An individual value of that variable will be represented by the letter and a


subscript (e.g., Xi or Zi)

Example: a set of aggression scores from 5 depressed adolescents

10 12 25 7 40

the set of scores is referred to as X

X1=10 X2=12 X3=25 X4=7 X5=40

To refer to a single score, without specifying which one, we will use Xi, where i
can take on any value from 1 to 5, or 1 to N.
32

Summation Notation

One of the most common symbols in statistics is the uppercase Greek letter
sigma (Σ)

Σ means sum up, or add up

ΣXi = 10 + 12 + 25 + 7 + 40 = 94

ΣXi = ΣX

ΣX2 = 102 + 122 + 252 + 72 + 402 = 2518

Sum up the squared values of X

(ΣX)2 = (94)2 = 8836

(sum the values of X) and then square the sum

Always do what is in the parentheses first, before performing operations


outside of parentheses.
MODULE 3
SAMPLING DESIGN

Sampling design - probability and non-probability sampling methods - merits and


demerits -Review of various sampling methods - determination of sample size –
sampling errors. Data Collection: Secondary and primary methods of data
collection-data collection and data editing - coding - categorization - handling
unsatisfactory responses and missing values - questionnaire construction - type of
questions - guidelines for questionnaire designing - questionnaire testing - other
methods of data collection - observation - projective methods - goodness of fit of
data -reliability and consistency - Cronbach’s Alpha.
Sampling design

Sampling design is a mathematical function that gives you the


probability of any given sample being drawn. Since sampling is the foundation of
nearly every research project, the study of sampling design is a crucial part of
statistics, and is often a one or two semester course. It involves not only learning
how to derive the probability functions which describe a given sampling method
but also understanding how to design a best-fit sampling method for a real-life
situation.

Probability sampling methods


Probability sampling means that every member of the population has a chance of
being selected. It is mainly used in quantitative research. If you want to produce
results that are representative of the whole population, probability sampling
techniques are the most valid choice.

There are four main types of probability sample.

1. Simple random sampling

In a simple random sample, every member of the population has an equal chance
of being selected. Your sampling frame should include the whole population.

To conduct this type of sampling, you can use tools like random number
generators or other techniques that are based entirely on chance.

Example
You want to select a simple random sample of 100 employees of Company X.
You assign a number to every employee in the company database from 1 to 1000,
and use a random number generator to select 100 numbers.
2. Systematic sampling

Systematic sampling is similar to simple random sampling, but it is usually


slightly easier to conduct. Every member of the population is listed with a
number, but instead of randomly generating numbers, individuals are chosen at
regular intervals.

Example
All employees of the company are listed in alphabetical order. From the first 10
numbers, you randomly select a starting point: number 6. From number 6
onwards, every 10th person on the list is selected (6, 16, 26, 36, and so on), and
you end up with a sample of 100 people.

If you use this technique, it is important to make sure that there is no hidden
pattern in the list that might skew the sample. For example, if the HR database
groups employees by team, and team members are listed in order of seniority,
there is a risk that your interval might skip over people in junior roles, resulting
in a sample that is skewed towards senior employees.

3. Stratified sampling

Stratified sampling involves dividing the population into subpopulations that may
differ in important ways. It allows you draw more precise conclusions by ensuring
that every subgroup is properly represented in the sample.

To use this sampling method, you divide the population into subgroups (called
strata) based on the relevant characteristic (e.g., gender, age range, income
bracket, job role).

Based on the overall proportions of the population, you calculate how many
people should be sampled from each subgroup. Then you use random or
systematic sampling to select a sample from each subgroup.
Example
The company has 800 female employees and 200 male employees. You want to
ensure that the sample reflects the gender balance of the company, so you sort the
population into two strata based on gender. Then you use random sampling on
each group, selecting 80 women and 20 men, which gives you a representative
sample of 100 people.

4. Cluster sampling

Cluster sampling also involves dividing the population into subgroups, but each
subgroup should have similar characteristics to the whole sample. Instead of
sampling individuals from each subgroup, you randomly select entire subgroups.

If it is practically possible, you might include every individual from each sampled
cluster. If the clusters themselves are large, you can also sample individuals from
within each cluster using one of the techniques above. This is called multistage
sampling.

This method is good for dealing with large and dispersed populations, but there
is more risk of error in the sample, as there could be substantial differences
between clusters. It’s difficult to guarantee that the sampled clusters are really
representative of the whole population.
Example
The company has offices in 10 cities across the country (all with roughly the same
number of employees in similar roles). You don’t have the capacity to travel to
every office to collect your data, so you use random sampling to select 3 offices
– these are your clusters.
Non-probability sampling methods
In a non-probability sample, individuals are selected based on non-random
criteria, and not every individual has a chance of being included.

This type of sample is easier and cheaper to access, but it has a higher risk of
sampling bias. That means the inferences you can make about the population are
weaker than with probability samples, and your conclusions may be more
limited. If you use a non-probability sample, you should still aim to make it as
representative of the population as possible.

Non-probability sampling techniques are often used in exploratory and qualitative


research. In these types of research, the aim is not to test a hypothesis about a
broad population, but to develop an initial understanding of a small or under-
researched population.

1. Convenience sampling

A convenience sample simply includes the individuals who happen to be most


accessible to the researcher.

This is an easy and inexpensive way to gather initial data, but there is no way to
tell if the sample is representative of the population, so it can’t produce
generalizable results.

Example
You are researching opinions about student support services in your university,
so after each of your classes, you ask your fellow students to complete a survey
on the topic. This is a convenient way to gather data, but as you only surveyed
students taking the same classes as you at the same level, the sample is not
representative of all the students at your university.
2. Voluntary response sampling

Similar to a convenience sample, a voluntary response sample is mainly based on


ease of access. Instead of the researcher choosing participants and directly
contacting them, people volunteer themselves (e.g., by responding to a public
online survey).
Voluntary response samples are always at least somewhat biased, as some people
will inherently be more likely to volunteer than others.

Example
You send out the survey to all students at your university and a lot of students
decide to complete it. This can certainly give you some insight into the topic, but
the people who responded are more likely to be those who have strong opinions
about the student support services, so you can’t be sure that their opinions are
representative of all students.

3. Purposive sampling

This type of sampling, also known as judgement sampling, involves the


researcher using their expertise to select a sample that is most useful to the
purposes of the research.

It is often used in qualitative research, where the researcher wants to gain detailed
knowledge about a specific phenomenon rather than make statistical inferences,
or where the population is very small and specific. An effective purposive sample
must have clear criteria and rationale for inclusion.

Example
You want to know more about the opinions and experiences of disabled students
at your university, so you purposefully select a number of students with different
support needs in order to gather a varied range of data on their experiences with
student services.
4. Snowball sampling

If the population is hard to access, snowball sampling can be used to recruit


participants via other participants. The number of people you have access to
“snowballs” as you get in contact with more people.

Example
You are researching experiences of homelessness in your city. Since there is no
list of all homeless people in the city, probability sampling isn’t possible. You
meet one person who agrees to participate in the research, and she puts you in
contact with other homeless people that she knows in the area.

Advantages of Probability Sampling

1. The absence of systematic error and sampling bias


2. Higher level of reliability of research findings
3. Increased accuracy of sampling error estimation
4. The possibility to make inferences about the population

Disadvantages of Probability Sampling

1. Higher complexity compared to non-probability sampling


2. More time consuming
3. Usually more expensive than non-probability sampling

Advantages of Non-Probability Sampling

1. Possibility to reflect the descriptive comments about the sample


2. Cost-effectiveness and time-effectiveness compared to probability sampling
3. Effective when it is unfeasible or impractical to conduct probability
sampling
Disadvantages of Non-Probability Sampling

1. Unknown proportion of the entire population is not included in the sample


group i.e. lack of representation of the entire population
2. Lower level of generalization of research findings compared to probability
sampling
3. Difficulties in estimating sampling variability and identifying possible bias

DETERMINATION OF SAMPLE SIZE

The size of a sample depends upon the basic characteristics of the population, the
type of information required from the survey, and the cost involved.

The researcher must keep in mind the following points while determining the
sample size

• Nature of the universe


• Availability of finance
• Standard of accuracy desired
• Nature of study (whether intensive or extensive)
• Size of the universe
• Method of sampling
• Availability of time

Methods for determining sample size

• Researchers may arbitrary decide the size of the sample without giving any
explicit consideration to the accuracy of the sample results or the cost of
sampling. The arbitrary approach should be avoided.
• For some of the projects, the total budget for the field survey in a project
proposal is allocated. If the cost of sampling per sample unit is known, one can
easily obtain the sample size by dividing the total budget allocation by the cost of
sampling per unit.

• This method concentrates only on the cost aspect of sampling rather than
the value of information obtained from such a sample.

• There are other researchers who decide on the sample size based on what
was done by other researchers in similar studies. This cannot be a substitute for
the formal scientific approach.

• The most commonly used approach for determining the sample size of the
population is the confidence interval approach.

Following points are taken into account for estimating sample size in estimation
of problems involving means

• The variability of the population

• The confidence attached to the estimate

• The allowable error or margin of error

While estimating Population mean, population being infinite


n is the sample size, Z=table value and e =difference between population mean
and sample mean.

While estimating population mean, population being finite.

Determination of sample size for estimating the population proportion

Points to be noted for sample size determination

• These formulas are applicable to simple random sampling only.

• In case the universe is divided into different strata, the accuracy for
determining the sample size for each strata may be different.
SAMPLING ERROR

A sampling error is a statistical error that occurs when an analyst does not
select a sample that represents the entire population of data. As a result, the results
found in the sample do not represent the results that would be obtained from the
entire population.
Sampling is an analysis performed by selecting a number of observations
from a larger population. The method of selection can produce both sampling errors
and non-sampling errors.

Sampling Vs Non-Sampling Error

Sampling Error

• This arises when a sample is not the representative of the population.

• The difference between the sample mean and the population mean is called
sampling error.

• Sampling error arises when a sample is not representative of the population.

Non-sampling error

This may arise due to the following reasons:

1. The respondents when asked for a particular variable may not give the correct
answers.

2. The error can arise while transferring the data from the questionnaire on the
computer.

3. There can be errors at the time of coding, tabulation and computation.

• If the population of the study is not properly defined, it can lead to errors.

• The chosen respondent may not be available to answer the questions or may
refuse to be part of the study
DATA COLLECTION

Data is a collection of facts, figures, objects, symbols, and events gathered from
different sources. Organizations collect data to make better decisions. Without
data, it would be difficult for organizations to make appropriate decisions, and so
data is collected at various points in time from different audiences.

Primary Data Collection Methods

Primary data is collected from the first-hand experience and is not used in the
past. The data gathered by primary data collection methods are specific to the
research’s motive and highly accurate.

Primary data collection methods can be divided into two categories: quantitative
methods and qualitative methods.

Quantitative Methods:
Quantitative techniques for market research and demand forecasting usually
make use of statistical tools. In these techniques, demand is forecast based on
historical data. These methods of primary data collection are generally used to
make long-term forecasts. Statistical methods are highly reliable as the element
of subjectivity is minimum in these methods.

Qualitative Methods:
Qualitative methods are especially useful in situations when historical data is
not available. Or there is no need of numbers or mathematical calculations.
Qualitative research is closely associated with words, sounds, feeling, emotions,
colors, and other elements that are non-quantifiable. These techniques are based
on experience, judgment, intuition, conjecture, emotion, etc.
Quantitative methods do not provide the motive behind participants’ responses,
often don’t reach underrepresented populations, and span long periods to collect
the data. Hence, it is best to combine quantitative methods with qualitative
methods.

Surveys
Surveys are used to collect data from the target audience and gather insights into
their preferences, opinions, choices, and feedback related to their products and
services. Most survey software often a wide range of question types to select.
You can also use a ready-made survey template to save on time and effort.
Online surveys can be customized as per the business’s brand by changing the
theme, logo, etc. They can be distributed through several distribution channels
such as email, website, offline app, QR code, social media, etc. Depending on
the type and source of your audience, you can select the channel.

Once the data is collected, survey software can generate various reports and run
analytics algorithms to discover hidden insights. A survey dashboard can give
you the statistics related to response rate, completion rate, filters based on
demographics, export and sharing options, etc. You can maximize the effort
spent on online data collection by integrating survey builder with third-party
apps.

Interviews
In this method, the interviewer asks questions either face-to-face or through
telephone to the respondents. In face-to-face interviews, the interviewer asks a
series of questions to the interviewee in person and notes down responses. In
case it is not feasible to meet the person, the interviewer can go for a telephonic
interview. This form of data collection is suitable when there are only a few
respondents. It is too time-consuming and tedious to repeat the same process if
there are many participants.

Delphi Technique
In this method, market experts are provided with the estimates and assumptions
of forecasts made by other experts in the industry. Experts may reconsider and
revise their estimates and assumptions based on the information provided by
other experts. The consensus of all experts on demand forecasts constitutes the
final demand forecast.

Questionnaire
A questionnaire is a printed set of questions, either open-ended or closed-ended.
The respondents are required to answer based on their knowledge and
experience with the issue concerned. The questionnaire is a part of the survey,
whereas the questionnaire’s end-goal may or may not be a survey.

Secondary Data Collection Methods

Secondary data is the data that has been used in the past. The researcher can
obtain data from the sources, both internal and external, to the organization.

Internal sources of secondary data:


• Organization’s health and safety records
• Mission and vision statements
• Financial Statements
• Magazines

• Sales Report
• CRM Software
• Executive summaries
External sources of secondary data:

• Government reports
• Press releases
• Business journals
• Libraries

• Internet
The secondary data collection methods, too, can involve both quantitative and
qualitative techniques. Secondary data is easily available and hence, less time
consuming and expensive as compared to the primary data. However, with the
secondary data collection methods, the authenticity of the data gathered cannot
be verified.

Data Processing

• Once the data has been collected, it has to be processed and reduced so that it
can be analysed by the researcher.

• The data which is collected through a study is usually in a raw form.

• This form of data often has errors and inconsistencies which are not relevant
for the study.

• This raw data has to be transformed into a relevant set by the researcher
through the process of editing, coding, and tabulation.

• This stage is very important for effective research work, as processing the data
reduces the errors and biasness, resulting in relevant and specific data which is
appropriate for analysis.

• Data processing is the primary stage of data analysis.


• It refers to setting-up the collected data in such a format that it can be
appropriately coded and analysed through computer. Without processing the
data, it cannot be evaluated and communicated.

Data Editing

• Raw data is subjected to many errors and omissions which can occur during
the data collection process.

• In editing, these errors are corrected so that readers do not get confused or
misled. Editing converts the raw data into a presentable format, so that further
analysis and interpretations can be done efficiently.

Essentials of Editing

• Completeness

• Accuracy

• Consistency

Stages of Editing

• Field Editing

• Office/ Central Editing Field Editing


• All responses collected are checked for errors and omissions.

• While collecting the data, the researchers instantly check for accuracy,
uniformity, and completeness of the answers.

• By the researcher

• By the supervisor
Office/ Central Editing

• When all completed forms are brought at the central office, an individual or a
team of individuals performs the editing activities on these forms.

• This process is called office editing or central editing.

Coding

• Coding is the process of converting the data into meaningful categories and
then assigning symbols to each of these categories.

• These symbols are known as codes.


• The categories are formulated in such a way that these are mutually exclusive,
i.e an answer should belong to a single class only.

Example: While coding the responses, the researcher assigns a code ‘1’ to all
male respondents and ‘2’ to all female respondents. It can be seen that this
process reduces the entire data of responses into two mutually exclusive classes
of ‘Male’ and Female’. Also, the logic that governs the assigning of a code to a
data is uniform i.e ‘Gender’.

Classification/ Categorisation

• Classification is the process of creating the homogenous classes into


which the edited and coded data can be grouped on the basis of their common
characteristics.

• By classifying the data in different groups, meaningful results can be


extracted. The complete data set is divided into predefined groups in such a way
that none of the data is left unclassified.

For example, a researcher can classify a group of 100 respondents into smokers
(60) and non-smokers (40). By analysing this classification, it can be noticed
that all respondents in the respective categories are either smokers, or
nonsmokers, and hence are homogenous in the sense that they share the habit of
smoking or non-smoking.

Handling Unsatisfactory responses and Missing values

• The data gathered from the questionnaires should be clear and legible so
that process of data coding can be carried out properly.

• The researcher may also face problems like incomplete responses and
unanswered questions while analysing the recorded data.

• The researcher conducts a basic screening for inconsistency in the


answered questions. For example, a respondent has an annual income of Rs
300000 while answering the question regarding frequently purchased clothing
brand, he/ she selects high end brands.

• Sometimes, the respondents tick on more than one option while


answering a single response question.

Treatment of Unsatisfactory responses and missing values

• Returning to the field


• Assigning missing values
• Discarding unsatisfactory respondents

Assigning missing values


(When returning to the field is not possible)

This is done when,

• The number of blank or wrong answers is small.


• The number of responses per person is small.
• The important parameters to be studied do not have too many blanks,
otherwise the sample size for those variables becomes too small for
generalisations.
Discarding Unsatisfactory responses

• If the response sheet has too many blanks/ illegible or multiple responses for
a single answer, the form is not worth correcting and editing. So, it is better to
completely discard the whole questionnaire.

• If too many forms are discarded, then the sample for the study might become
too small for an analysis or generalization so, here it is advisable to carry out another
round of field visits.

QUESTIONNAIRE

• One of the most difficult steps in the research process is designing a well-
structured instrument.

• A questionnaire is the primary tool of data collection.

Steps in Designing a Questionnaire

• Convert the research objectives into information areas


• Method of administration
• Content of the questionnaire
• Motivating the respondent to answer
• Determining the types of questions
• Criteria for questionnaire designing
• Questionnaire structure
• Physical presentation of the questionnaire
• Pilot testing the questionnaire
• Administering the questionnaire
Type of Questions

DECIDE ON THE TYPES OF QUESTIONS

• Open Ended Question

These are questions where respondents are free to answer in their own words.

Ex-What is your opinion about Merriboy Ice-cream?

Note- There is no option given to this question and so the respondent is free to
answer in his own words.

• Closed End Questions

These are questions, where the respondents have an option to choose from the
question.

Ex-Which brand of icecream do you prefer?

Note- There are options given to this question and so respondent is free to
choose any one of the options.
• Dichotomous Questions
These questions have only 2 answers” Yes or No”, “true” or false”

• Multiple Choice Questions

How much do you spend on grocery products?


Less than Rs 2500
Between Rs 2500-5000
More than Rs 5000

Scales

Do you intend to buy a new car within the next six months?

Criteria for Questionnaire Designing

• Clearly specify the issue.


• Use simple terminology
• Avoid ambiguity in questioning
• Avoid leading questions
• Avoid loading questions
• Avoid implicit choices and assumptions
• Avoid double barrelled questions
Consistency of Questionnaire

• While conducting a questionnaire survey, the researcher should carefully


monitor the consistency of the answers.

• The researcher can identify some basic inconsistencies in the answers by


logically screening the questionnaire.

• For example, in a questionnaire a highly dissatisfied customer will mark the


maximum number of ticks on the poor option when the options are excellent,
good, average, and poor.

• For example, in a questionnaire with two connecting questions, the first


question asks, do you like ethnic or western clothes?
The second question asks the respondent to select some of the salient features of
an ethic or a western outfit.

It might be a possibility that a response to the initial question the respondent


ticked on the ethnic clothes option and accidentally he or she selected the salient
features of a western outfit for the second question.

TESTING QUESTIONNAIRE

Pilot Testing the Questionnaire


• Pilot testing refers to testing and administering the designed instrument on a small
group of people from the population under study.
• This is essentially done to cover any errors that might have still remained even
after the previous steps.
• If the respondent had a problem understanding a question or response category, the
investigator should verbatim record the instruction he/she gave to clarify the point
as this then would need to be incorporated in the final version of the questionnaire.
Question before Pilot testing
• If someone among your family member wants to buy new mobile phone, will you
suggest him/ her mobile phones of XYZ brand?
Yes No

Question after Pilot Testing


• If someone among your family member wants to buy a new mobile phone, will you
suggest him/ her mobile phones of XYZ brand?

Yes No
If Yes, then why:
If No, then why

Note: Here the researcher had made some amendments in the questionnaire by
adding reasons to know why the respondent will suggest the XYZ brand or why he/
she will not suggest the brand.

Physical Characteristics of the Questionnaire

• Good layout and physical attractiveness are crucial, particularly in in mail, Internet,
and other self-administered questionnaires.
• Questionnaires should be presented in a professional, attractive and uncluttered
format.
• Questionnaires should be designed to appear as short as possible.
• Always allow enough room for respondents to answer questions and provide plenty
of white space between questions.
PROJECTIVE METHODS

• Projective methods are used in those areas of qualitative research where


the researcher aims to achieve a greater understanding of the subject.
• Under this method, individuals are asked to express their feelings or
opinions on a situation or behaviour of another individual.
• Projective techniques help project those feelings and opinions of
respondents that are otherwise lying dormant by enabling them to provide their
thoughts on something other than themselves.

Types of Projective Methods


• Sentence Completion
Under this technique, participants are provided with incomplete sentences and
are encouraged to complete these sentences in their own way.
• Word Association
Under this technique, participants are provided with a list of words, one by
one, and requested to state the first thing that comes to their mind in response to
each of these words.
• Story Completion
1. Here participants are given an incomplete story which, though subtly leads
to a pre-determined topic, does not in any way provide an insight into the ending.
2. Participants are required to conclude the story in their own words
• Verbal Projection tests
1. In these tests, a situation is described to the participants either orally, or by
exploiting simple methods of visual presentation.
2. Participants are often required to provide their thoughts and opinions on
the effects the situation could have on others, not themselves.
• Pictorial Techniques
1. Pictorial technique was adopted to make photography a part of fine art.
2. In this technique, respondents are requested to describe the artistic quality
of an image, rather than the contents of the image itself.
• Play techniques
1. Play techniques are a well-known stimulating form of research where each
participant is assigned a role and then asked to improvise and enact the character.
2. This technique may be used by a researcher to observe various behaviour
traits of participants such as dominance, sympathy, submissiveness, fairness, etc.
3. Play technique is usually adopted in understanding the behaviour of
children.
• Socio-metric analysis
1. Socio-metric analysis is used to observe and understand the relationship
among individuals of a group.
2. This technique helps analyze the attractions and repulsions between
persons under varied circumstances, by encouraging them to decide who they
would or would not choose under said conditions and circumstances.

OTHER METHODS OF DATA COLLECTION

Observation Method of Research Data Collection

Observation is way of gathering data by watching behavior, events, or noting


physical characteristics in their natural setting.

Observation involves three processes, i.e.

(i)sensation (ii) attention (iii) perception.

Sensation is gained through the sense organs which depend upon the physical
alertness of the observer. The sense organs are receptive to stimuli and get
attracted leading to the first stage in observation.
Then comes attention or concentration which is largely a matter of
commitment and will-power. Adequate training and experience can make it
almost a matter of habit.

The third is perception which comprises the interpretation of sensory reports.


Thus, sensation merely reports the facts as observed but perception enables the
mind to recognize the facts.

Through this process, observation serves the purpose of

(i) studying collective behaviour and complex social situations;


(ii) following up of individual units composing the situations;
(iii) understanding the whole and the parts in their interrelation;
(iv) getting the out of the way details of the situation.

TYPES OF OBSERVATION METHOD OF DATA COLLECTION IN


RESEARCH

There are different types of observation method of data collection in research. The
important one’s are listed below:

1. Casual and Scientific Observation

An observation may be either casual or scientific.

Casual observation occurs without any previous preparation. It is a matter of chance


that the right thing is observed at the right time and in the right place.

Scientific observation, on the other hand, is carried out with due preparations and is
done with the help of right tools of measurement experienced enumerators and under
able guidance. Scientific observations yield thorough and accurate data.
2. Simple and Systematic Observation

An observation may be either Simple or Systematic.

Simple Observation is found in almost all research studies, at least in the initial
stages of exploration. Its practice is not very standardized. It befits the heuristic
nature of exploratory research. Participant studies are also usually classified as
simple observation because participant roles do not permit systematic observation.

Systematic observation, on the other hand, employs standardized procedures,


training of observers, schedules for recording and other devices to control the
observer sometimes even the subject. Clearly some systematization is valuable in
research observation, but the situation often limits what can be done. A systematic
observation is a scientific observation too.

3. Subjective and Objective Observations

An observation may be either Subjective or Objective.

In every act of observation there are two components namely, the object (or what is
observed) and the subject (or the observer). It may be that sometimes one may have
to observe one’s own immediate experience. That is called Subjective Observation
or Self-Observation or introspection. Prejudices and biases are generally parts of
subjective observation. Many data of psychological interest are gathered by the
method of subjective observation. To avoid such prejudices, the observer takes stock
of him and discovers what prejudices and biases will prevent impartial study and
disinterested points of view. Persistent self-observation and criticism by others may
ultimately overcome prejudice and biases. Such introspection may have another
social value i.e., it sensitizes the observer to the problems of others and creates
sympathetic insight which facilitates, at least to some degree, the understanding of
people’s behavior in similar circumstances and similar cultural contexts. The net
result is impartial subjective observation. When the observer is an entity apart from
the thing observed, the observation of this type is objective.

4. Factual and Inferential Observation

Observation may be either factual or inferential.

In factual observation things or phenomena observed with naked eyes are reported.

In inferential observation behaviour or psychological aspects are observed.

5. Direct and Indirect Observation

Observation may be either Direct or Indirect.

In the case of direct Observation, the observer is physically present and personally
monitors what takes place. This approach is very flexible of events and behavior as
they occur. He is also free to shift places, change the focus of the observation, on
concentrate unexpected events if they should occur.

In indirect observation recording is done by mechanical, photographic or electronic


means. For example, a special motion picture camera which takes one frame every
second is mounted in a department of a large store to study customer and employee
movement.

6. Behavioral and Non-behavioral Observations

Observation may be either behavioral or non-behavioral.


As pointed earlier the concept of observation involves not only watching but also
listening and reading. Thus, observation includes the full range of monitoring
behavioral and non-behavioral activities and conditions. Non-verbal analysis,
linguistic analysis, extra-linguistic analysis and spatial analysis are the four major
categories of behavioral observational study of persons. Record analysis, physical
condition analysis and physical process analysis are the three major categories of
non-behavioral study of persons. Non-verbal behavioral observation includes
observation of body movement, motor expressions and even exchanged glances.
Body movement, is an indicator of interest or boredom, anger or pleasure in a certain
environment. Motor expressions such as facial movements can be observed as a sign
of emotional studies. For instance, eye-blink rates are studied as indicators of
interest in advertising messages. Finally, exchanged glances might be of interest in
studies of interpersonal behavior. Linguistic behavior is a second frequently used
from of behavioral observation. One simple type, familiar to most students, is the
tally of ‘ahs’ (or other annoying words or sounds) that a professor emits during a
class.

GOODNESS-OF-FIT

The goodness-of-fit test is a statistical hypothesis test to see how well sample
data fit a distribution from a population with a normal distribution. Put
differently, this test shows if your sample data represents the data you would
expect to find in the actual population or if it is somehow skewed. Goodness-of-
fit establishes the discrepancy between the observed values and those that would
be expected of the model in a normal distribution case.

There are multiple methods for determining goodness-of-fit. Some of the most
popular methods used in statistics include the chi-square, the Kolmogorov-
Smirnov test, the Anderson-Darling test, and the Shipiro-Wilk test.
• Goodness-of-fit tests are statistical tests aiming to determine whether a set of
observed values match those expected under the applicable model.
• There are multiple types of goodness-of-fit tests, but the most common is the
chi-square test.
• Chi-square determines if a relationship exists between categorical data.
• The Kolmogorov-Smirnov test—used for large samples—determines
whether a sample comes from a specific distribution of a population.
• Goodness-of-fit tests can show you whether your sample data fit an expected
set of data from a population with normal distribution.

Goodness-of-fit tests are commonly used to test for the normality of residuals or
to determine whether two samples are gathered from identical distributions.

Why Is Goodness-of-Fit Important?

Goodness-of-Fit tests help determine if observed data aligns with what is expected.
Decisions can be made based on the outcome of the hypothesis test conducted.

For example, a retailer wants to know what product offering appeals to young
people. The retailer surveys a random sample of old and young people to identify
which product is preferred. Using chi-square, they identify that, with 95%
confidence, a relationship exists between product A and young people. Based on
these results, it could be determined that this sample represents the population of
young adults. Retail marketers can use this to reform their campaigns.
RELIABILITY AND VALIDITY IN RESEARCH

I.RELIABILITY

• Reliability refers to the degree of accuracy of measuring instruments.

• A instrument is said to be reliable if it provides consistent results.

• It can be defined as the relative absence of errors of measurement in a measuring


instrument.

For ex- A weighing scale is reliable if it gives the same reading when the same object
is weighted several times.

Two Aspects of Reliability

1. Stability
2. Equivalence

1.STABILITY

• Stability aspect stands for securing consistent results with repeated measurements
by the same person with the same instrument.

• Equivalence aspect considers how much error may get introduced by different
investigators or different sample of items being studied.

Methods of determining Reliability

A. Test-Retest method
B. Split-half method
C. Equivalence –form method
1.Test-Retest method

• The test-retest method of determining reliability involves administering the


same scale or measure to the same respondents at two separate times to test for
stability.

• If the measure is stable over time, the test administered under the same
conditions each time should obtain similiar results.

For example- The researcher measures job satisfaction and finds that 64 percent
of the population is satisfied with their jobs.

If the study is repeated a few weeks later under similiar conditions, the researcher
finds that 64 percent of the population is satisfied with their jobs. It appears that
the measure has repeatability. The high stability correlation or consistency
between the 2 measures at time 1 and time 2 indicates a high degree of reliability.

2.Split-half method

• A method of measuring the degree of internal consistency by checking one half


of the results of a set of scaled items against the other half.

• In the split-half method the researcher may take the result obtained from one
half of the scale items (eg. Odd numbered items) and check them against the
results from the other half of the items. (eg. Even numbered items).

3.Equivalent-form method

• In the equivalent-form method two alternative instruments are designed to be


as equivalent as possible.

• Each of the two measurement scales is administered to the same group of


respondents.

• If there is high correlation betweeen the 2 forms, the researcher concludes that
that the scale is reliable.

•There is a problem if there is low correspondentce between the two instruments.


II.VALIDITY

• It means the effectiveness of an instrument in measuring the specific property


which it intends to measure.

• Any measuring instrument is valid when it measures most accurately the


objects or individuals and their characteristics.

• If we are measuring certain characteristics like height, weight, length then


maintaining validity is not a great problem because for measuring them direct
and standardised measuring devices are available.

• There are certain social characteristics which are highly abstract in nature and
can be measured only indirectly.

Examples of such characteristics are achievement, aspiration, attitude.

It is difficult to measure these characteristics validity in exact amount.

Types of Validity

1. Content Validity
2. Concurrent Validity
3. Predictive Validity

1.Content Validity

• It is also called face validity.

• It refers to the investigator’s subjective evaluation of the validity of the


measuring instrument.

• An investigator may construct a scale containing 10 or 15 statements to measure


the variable “attitude”.

• He then evaluates each statement to assess whether its variable is related to


attitude and requests some specialists to assess.
• If there is an agreement among the judges, the researcher may be satisfied with
the face validity of the scale.

2.Concurrent Validity

• It is used to measure the validity of the new measuring techniques by correlating


them with the established techniques.

• It involves computing the correlation coefficient of two measures of the same


phenomena (for example, perception of an airline and image of a company)
which are administered at the same time.

• The concurrent validity would be established if there is a high correlation


between the two techniques.

3.Predictive Validity

• This refers to empirically measured association between the results produced


by the instrument and the subsequent actual outcome.

• For example- An admission test’s predictive validity may be examined by


computing the correlation coefficient between the admission test scores of the
freshmen and their grade-point averages during the first year of study.

• If this coefficient is greater than 0.50, the predictive validity of the admission
test is established.

4. Sensitivity

• It refers to an instrument’s ability to accurately measure variability in stimuli


or response.

• A dichotomous response category such as “agree or disagree” does not allow


the recording of subtle attitude changes.

• A more sensitive measure, with numerous items on the scale may be needed.
For example- adding strongly agree, mildly agree, neither agree nor disagree,
mildly disagree, and strongly disagree as categories increases a scale’s
sensitivity.

Relation between Validity and Reliability


• Reliability and Validity are interrelated.

• Reliability is a condition of validity.

• An instrument that is valid is always reliable.

• An instrument that is not valid may or may not be reliable.

• A reliable instrument need not be valid.

Conclusion
• Measuring instruments are evaluated in terms of reliability, validity and
sensitivity.

• Reliability refers to the measuring instrument’s ability to provide consistent


results in repeated uses.

• Validity refers to the degree to which the instrument measures the concept, the
researcher wants to measure.

• Sensitivity is the measuring instrument’s ability to accurately measure


variability in stimuli or responses.
Cronbach’s alpha
• Another measure which is used to test the internal consistency of a multiple
item scale is the coefficient alpha (α).

• It is the most commonly used method of computing the reliability of the


research instrument.
MODULE 4
DATA ANALYSIS AND INTERPRETATION

DATA ANALYSIS
Data analysis can be defined as the process of gathering, modelling, and transforming data so as to
get useful information, suggestions, and conclusion in decision making.

DESCRIPTIVE STATISTICS
• Descriptive statistics or descriptive analysis are used to present quantitative
descriptions in a manageable form
• Descriptive statistics help in simplifying large amount of data in a sensible way
• each descriptive statistic reduces lot of data into a simple summary .
• This sort of analysis may describe data on one variable, two variable, or more than
two variables.
• Mean ,median, mode ,variance, range and standard deviation are the widely applied
descriptive statistics .

TYPE OF DESCRIPTIVE ANALYSIS


• Univariate analysis
• Bivariate analysis
• Multivariate analysis

UNIVARIATE ANALYSIS

Univariate analysis is concerned with the description or summarization of individual variables in a


given data set.

Example: age of the respondent

If gender was measured, we would look at how many participants were men and how many were
women

BI-VARIATE ANALYSIS

Bivariate analysis is concerned with the relationships between perils of variables in a data set.

Eg: relationship between age and weight.

MULTIVARIATE ANALYSIS

multivariate analysis is done analysis of the simultaneous relationship among three or more
Phenomena.
Eg: relationship between age, weight and height.

Univariate Statistical Techniques


• Measures of central tendency
• Measures of Dispersion
Measures of Central Tendency
• Measure of central tendency is also known as measure of central value or measure of
location or average of first order.
• It is a statistical measure and calculates the location or position of central point to explain
the central tendency of the whole quantity of data.
• According to Simpson and Kafka, “A measure of central tendency (called as average) is a
single value within the range of the entire mass of data that is used to represent the whole
data.
• The single value which represents the characteristics of the entire data may termed as
central value.
• The averages/arithmetic mean are frequently called measures of central tendency because
they reflect the spirit of data.
• According to Crum & Smith, “ An average is sometimes called measures of central tendency
because individual values of variables cluster around it”
Types of Measures of Central Tendency
• Mean
• Median
• Mode
Mean
• Mean is also known as arithmetic average.
• To calculate the mean, summate all the observations and divide it by the total number of
observation.
X= Sum of all observations/ Number of observations
Median
• The median is a positional average that divides a distribution into two equal parts so that
one half of items fall above it and other half below it.
• For example, in the series 3, 4, 6, 7, 8, 9, 10, the median is 7. If the number of cases (n) is
odd, the median is the middle term.
• If n is even, the median is the midpoint of the distance between the two middle most items.
In the distribution 2, 3, 4,6,7,8 the median is midpoint between 4 and 6 ie. 5.
• According to Corrner, “ Median is that value of variable which divides the group into 2 equal
parts, one part containing all values greater than the median value and the other part
containing all the values smaller than the median values”.
Mode
• Mode is the value of that item of a series which occurs most frequently in the series.
• Suppose a set of students of a class report the following number of video movies they see
in a month. Number of movies: 10, 15, 20, 15, 15, 15,8 A glance at the series reveals that
mostly the students see 15 movies in a month. Therfore mode=15.

Measures of Dispersion
• Distribution cannot be clearly depicted by measuring the averages or central tendency.
• Averages provide the observations of only the central part of the distribution. So the study
of scatteredness of observation is very important and this study is known as measure of
dispersion.
• The word ‘dispersion’ literally means ‘fluctuation’, scatter, variation, deviation or spread.
• So the measure of dispersion shows the variation of an individual item from its average.
Methods of measuring the dispersion
• Range
• Coefficient of Range
Range
• Range is the simplest absolute measure of dispersion
• It is the difference between the highest and lowest value in a series.
Range=Highest value – Lowest value
• Range therefore measures the maximum variation in the values of a series.
Coefficient of Range
• Coefficient of range is defined as:
Largest value – Smallest value/ Largest value+ smallest value
Quartile Deviation
• Quartile deviation partially solves the limitations of range.
• It performs the calculation over middle half of the values in a data set thereby it minimizes
the influence of extreme values.
• A quartile helps break up the observations and divide into four intervals based upon the
values of the data. It also examines how far is the distribution of the observations from the
mean
• Quartiles help to measure the spread of values above and below the mean by dividing the
data into four groups.
• the observations and divide into four intervals based upon the values of the data. It also
examine show far is the distribution of the observations from the mean.
Mean Deviation
• Mean Deviation is defined as the arithmetic mean of deviations of all the values in a series
from their average, counting all such deviations as positive. The average selected may be
mean, median, or mode.
• It can be calculated from any one of the measures of central tendency such as mean,
median, and mode.
• It is also known as first moment of dispersion.
Standard Deviation
• The standard deviation is the square-root of the arithmetic average of the squares of the
deviations measured from the mean.
• Standard deviation is used to measure the spread of items in a set of observation.
• If all the observation value are identical or distribution of items of a set would be uniform,
then deviation of every value from mean is zero. When elements of the set are more
dispersed, then standard deviation becomes larger.
Descriptive analysis of bivariate data
• Bivariate analysis is concerned with the relationship between pairs of variables in a data set.
• It is the simultaneous analysis of 2 variables.
• It is usually undertaken to see if one variable, such as gender is related to another variable,
perhaps attitudes toward male/female equality.
Bivariate Statistical Techniques
• Correlation analysis
• Regression Analysis
Correlation Analysis
• It is the study of the linear relationship between two variables.
• If there are 2 variables and changes in the value of one variable will affect the value of the
other variable, then both the variables are correlated.
Example- An increase in income will lead to improved job performance.
Regression Analysis
• It is used for prediction.
• It is used to find the variations in the dependent variable due to changes in the independent
variable.

PARAMETRIC TESTS FOR HYPOTHESIS TESTING


Introduction
• These are some models with assumptions on which parametric tests are based.
• These tests can be more sophisticated methods for the analysis if the assumptions are
fulfilled appropriately. It has some assumptions which are closely related with the traits of
the parent population from which the samples are drawn.

Important Parametric Tests


• t test
• Z test
• F test
t test
• A particular frequency distribution is obtained when a very large number of small samples
are taken from a population and the mean of these samples is used to plot the frequency
distribution. It is called Students’ t-distribution.
• Sir William Gossett and R.A Fisher are considered to be the great contributors towards the
theory of small samples.
• The t-distribution can be implemented in situations when the sample size is 30 or less and
no information is available with respect to the population standard deviation.
Where t-test can be used?
• t test can be used to find out the difference of means of two groups.
Applications (uses) of t-test are:
• To test the given population mean when the sample is small and the population S.D is not
known.
• To test the equality of two sample means where the samples are small and population
standard deviation is unknown.
• To test the difference in values of two dependent samples.
• To test the significance of correlation coefficients.
Z-test
• Z-test was used in situations where the sample size is more than 30.
Uses of Z-test
• To test the given population mean when the sample is large and the population S.D is
known.
• To test the equality of two sample means when the samples are large or when the
population SD is known.
• To test the population proportion.
• To test the equality of two sample proportions.
• To test the population SD when the sample is large.
• To test the equality of two sample standard deviations when the samples are large or when
population standard deviations are known.
Analysis of Variance (ANOVA)
Analysis of variance (ANOVA) refers to a broad class of methods for studying variations among
samples under different conditions (or treatments). The simplest form of ANOVA can be used
for testing three or more population means. It can be considered as an extension of the two-
sample t-tests we discussed for comparing two population means.
The basic idea of ANOVA is to partition the total variation in a dataset into two or more
components. Associated with each of these components is a specific source of variation, so
that in the analysis it is possible to ascertain the magnitude of the contributions of each of
these sources to the total variation.
MANOVA
Multivariate analysis of variance (MANOVA) is an extension of the univariate analysis of
variance (ANOVA). In an ANOVA, we examine for statistical differences on one continuous
dependent variable by an independent grouping variable. The MANOVA extends this analysis
by taking into account multiple continuous dependent variables, and bundles them together
into a weighted linear combination or composite variable. The MANOVA will compare
whether or not the newly created combination differs by the different groups, or levels, of
the independent variable. In this way, the MANOVA essentially tests whether or not the
independent grouping variable simultaneously explains a statistically significant amount of
variance in the dependent variable.

ANCOVA
Analysis of covariance is used to test the main and interaction effects of categorical variables
on a continuous dependent variable, controlling for the effects of selected other continuous
variables, which co-vary with the dependent. The control variables are called the "covariates."
ANCOVA is used for several purposes:

* In experimental designs, to control for factors which cannot be randomized but which can
be measured on an interval scale.
* In observational designs, to remove the effects of variables which modify the relationship
of the categorical independents to the interval dependent.
* In regression models, to fit regressions where there are both categorical and interval
independents. (This third purpose has become displaced by logistic regression and other
methods.

SEM
Structural equation modeling (SEM) is a set of statistical techniques used to measure and
analyze the relationships of observed and latent variables. Similar but more powerful than
regression analyses, it examines linear causal relationships among variables, while
simultaneously accounting for measurement error.

Nonparametric Tests

Non-parametric testsare statistical tests without parameters. For these types of tests, you
need not characterize your population’s distribution based on specific parameters. They are
also referred to as distribution-free tests due to the fact that they are based on fewer
assumptions (e.g. normal distribution). These tests are particularly used for testing
hypothesis, whose data is usually non normal and resists transformation of any kind. Due to
the lesser amount of assumptions needed, these tests are relatively easier to perform. They
are also more robust. An added advantage is the reduction in the effect of outliers and
variance heterogeneity on our results. This test can be used for ordinal and sometimes even
for nominal data. However, nonparametric tests do have their own disadvantages as well.
Firstly, the results that they provide may be less powerful compared to the results provided
by the parametric tests. To overcome this problem, it is preferred that a larger number of
samples be taken if one is adopting this approach. Secondly, their results are usually more
difficult to interpret than the results of parametric tests. This is because we usually assign
ranks to samples in the case of non-parametric tests rather than using the original data. This
further complicates the system and distorts our intuitive understanding of the data. Non-
parametric tests are useful and important in many cases, but they may not provide us with
the ideal results.
When to use Non-parametric testing?
- When the outcome is a rank or an ordinal variable – For example in the case of movie
ranking etc.
- When there are a number of explicit outliers – The samples may show a continuous
pattern with some very extreme ended outliers.
- When the outcome has a clear limit of detection – This means that the outcome being
determined has been done so with some limitations or imprecision.

CHI-SQUARE TEST
Χ2 quantity
• The quantity, χ2 describes the magnitude of the frequency between theory and observation.
• It describes the magnitude of difference between observed frequencies and the frequencies
expected under certain assumptions.
• It is a statistical test which tests the signifance of difference between the observed
frequencies and the expected frequencies.
• Χ2 value ranges from 0 to infinity. It is 0 when the expected and observed frequencies
completely coincide.
• So, greater the value of Χ2 , greater is the discrepancy between observed and expected
frequencies.
Characteristics of Chi-Square test
• It is a non-parametric test. Assumptions about the form of the distribution or its parameters
is not required.
• It is a distribution free test, which can be used in any type of distribution of population.
• It analyses a set of differences between a set of observed frequencies and a set of
corresponding expected frequencies.
Applications of Chi-Square Test
• Useful for the test of goodness of fit.
• Useful for the test of independence of attributes.
• Useful for testing homogeneity
• Useful for testing given population variance

RUN TEST FOR RANDOMNESS


Run Test
• Run test is used to examine the randomness of the sample. Run- A run is defined as a
sequence of like elements that are preceded and followed by different elements or no
elements at all.
• To explain the concept of run, consider an example where the gender of a customer entering
a restaurant is noted. The following sequence is obtained:
MMFMFFFMMMMFFFMMFFFMMMMMFFMMMFFFMFFFFFMMFFFFF
Where M and F, denote the male and female respondents respectively.
MMFMFFFMMMMFFFMMFFFMMMMMFFMMMFFFMFFFFFMMFFFFF
Assumptions in run test of randomness:
1. Data level: In run test of randomness, it is assumed that the data is recorded in order and
not in a group. If data is not in order, then we have to assign the mean, median or mode value
to the data.
2. Data Scale: In run test of randomness it is assumed that data is in numeric form. This
condition is compulsory in run test of randomness, because in numeric data, it is easy to assign
run to the numeric data.
3. Distribution: Run test of randomness is a nonparametric test, so it does not assume any
assumption about the distribution.
4. In run test of randomness, the probability of run is independent.
Two Sample Sign Test

Introduction
• A two- sample sign test is a non-parametric test based upon the sign of a pair of
observations.
• Suppose a sample of respondents is selected and their views on the image of a company are
sought. After some time, these respondents are shown an advertisement, and thereafter, the
data is again collected on the image of a company.
• For those respondents, where the image has improved, there is a positive and for those
where the image has declined there is a negative sign assigned and for the one where there
is no change, the corresponding observation is dropped from the analysis and the sample size
reduced accordingly.
• The key concept underlying the test is that if the advertisement is not effective in improving
the image of the company, the number of positive signs should be approximately equal to the
number of negative signs. For small samples, a binomial distribution could be used, whereas
for a large sample, a normal approximation to the binomial distribution could be used, as
already explained in the one sample test.
Mann-Whitney U Test
Introduction

• This test was developed by HB Mann and R Whitney in the 1940’s.

• This test is used to examine whether two samples have been drawn from populations with same
mean.

• This test is an alternative to t test for testing the equality of means of two independent samples

• A two tailed hypothesis for a Mann-Whitney test could be written as

H0 : Two samples come from identical populations or Two populations have identical probability
distribution.

H1 : Two samples come from different populations or Two populations differ in locations

Procedure involved in the use of MannWhitney U test

• The two samples are combined (pooled) into one large sample and then we determine the rank of
each observation in the pooled sample. If two or more sample values in the pooled samples are
identical, if there are ties, the sample values are each assigned a rank equal to the mean of the ranks
that would otherwise be assigned.

• We determine the sum of the ranks of each sample.

Let R1 and R2 represent the sum of the ranks of the first and the second sample whereas n1 and n2
are the respective sample sizes of the first and the second sample. For convenience, choose n1 as a
small size if they are unequal so that n1< n2 . A significant difference between R1 and R2 implies a
significant difference between the samples

Mann-Whitney test for a large sample

• If n1 or n2 is greater than 10, a large sample approximation can be used for the distribution of the
Mann-Whitney U statistic.
• For this purpose, either of U1 or U2 could be used for testing a one tailed or a two-tailed test. In
this test, U2 will be used for the purpose.

• Under the assumption that the null hypothesis is true, the U2 statistic follows an approximately
normal distribution with mean uu2=n1n2 /2

If the absolute sample value of Z is greater than the absolute critical value of Z, the null hypothesis is
rejected

Example

• The table below represents the number of bounced cheques in two banks- Bank A and Bank B- on
randomly chosen 12 days for Bank A and 15 days for Bank B. Use a Mann-Whitney U test to examine
at a 5 percent level of significance whether Bank A has more bounced cheques as compared to Bank
B.

Solution H0 : Two populations have identical probability distributions

H1 : Population A is shifted to the right of population B


We pool both the samples and rank them

We consider the sample of Bank B as coming from the population B whereas that of Bank A belonging
to the population A. R1= Sum of ranks of Bank A= 249 R2= Sum of ranks of Bank B= 129
The critical value of Z at a 5 percent level of significance is given by 1.645. The sample value of Z
exceeds the critical value of Z and the null hypothesis is rejected. Therefore, Bank A has a larger
number of bounced cheques as compared to Bank B
WILCOXON SIGNED RANKED TEST FOR
PAIRED SAMPLES
Introduction

• There are instances where the sample data consists of paired observations.

• Examples of paired samples include a study where husband and wife are matched or where subjects
are studied before and after experimentation or observations are taken on a variable for brother and
sister.

• In two sample sign test, only the sign of the difference (positive or negative) was taken into account
and no weightage was attached to the magnitude of the difference

The Wilcoxon-matched pair signed rank test takes care of this limitation and attaches a greater
weightage to the matched pair with a larger difference.

• The test, therefore, incorporates and makes use of more information than the sign test

Test procedure for Wilcoxon Signed Ranked test for paired samples

• Let di denote the difference in the score for the ith matched pair. Retain signs, but discard any pair
for which d=0.

• Ignoring the signs of the difference, rank all the di ’s from the lowest to highest.

• To each rank, prefix the sign of the difference.

• Compute the sum of the absolute value of the negative and positive ranks to be denoted as T_ and
T+ respectively.

• Let T be the smaller of the two sums found in step iv.

• When the number of the pairs of observations (n) for which the difference is not zero and is greater
than 15, the T statistic follows an approximate normal distribution under the null hypothesis, that the
population differences are centered at 0.
Problem

• A sample of 16 salesmen was selected in an organisation and their score on the performance
appraisal was noted. The salesmen were sent for a three-week training programme and in the next
appraisal, their scores were noted again. The appraisal scores before and after the training are given
below:

Use a 5 percent level of significance to test the hypothesis that the training has not caused any change
in the performance appraisal system score

H0 : There is no difference in the appraisal score because of training.

H1 : There is a difference in the appraisal score because of training


T+= Sum of positive ranks =84 T-= Sum of negative ranks =52 T=Min(T- , T+ )=52
Wilcoxon Signed -rank test

The number of typing errors per page made by 17 students who joined a typing institute before and
after the training is given below. Use a 5 percent level of significance to test the hypothesis that the
average number of typing errors decreased after the training.

KRUSKAL-WALLIS TEST
Introduction

• When testing the equality of more than two population means, one way ANOVA
technique was used.

• One of the assumptions used in ANOVA is all the involved populations from where the
samples are drawn are normally distributed. If this assumption does not hold true, the F-
statistic used in ANOVA becomes invalid.

• The normality assumption may not hold true when we are dealing with ordinal data
or when the size of the sample is very small.
• The Kruskal Wallis test comes to our rescue in such situations. This is, in fact, a non-
parametric counterpart to one-way ANOVA. The test is an extension of the Mann-Whitney
U test.

• Both methods require that the scale of measurement of a sample value should be at
least ordinal.

• The hypothesis to be tested in Kruskal-Wallis test is:


H0: The k populations have identical population distributions.
H1: At least two of the populations differ in locations.

Procedure in Kruskal Wallis test

• Obtain random samples of size n ,….., n from each of the k populations. Therefore,
1 k
the total sample size is n=n1+n2+…..+nk

• Pool all the samples and rank them, with the lowest score receiving a rank of 1. Ties
are to be treated in the usual fashion by assigning an average rank to the tied positions.

• Let r =the total of the ranks from the i


i
th sample.

The Kruskal- Wallis test uses the X2 to test the null hypothesis. The test statistic is given by:
Problem
• Three machines are used in the packaging of 16kg of wheat flour. Each machine is
designed so as to pack on an average of 16 kg of flour per bag. Samples of six bags were
selected from each machine and the amount of wheat packaged in each bag is shown
below: Use a percent level of significance to test the hypothesis that the amount of wheat
packaged by the three machines is the same.

Machine 1 15.8 15.9 16.2 15.7

Machine 2 16.5 16 15.4 15.9

Machine 3 15.7 16.4 16.2 15.9

Solution
H0 : Amount of wheat packaged by the three machines is the same.
H1 : Amount of wheat packaged by at least two machines is different.

Weight Rank Machine Weight Rank Machine

15.4 1 2 16 10 2

15.7 3 1 16.1 11 2

15.7 3 3 16.2 13 1

15.7 3 3 16.2 13 2

15.8 5.5 1 16.2 13 3

15.8 5.5 1 16.3 15.5 1

15.9 8 1 16.3 15.5 3


15.9 8 3 16.4 17 3

15.9 8 2 16.5 18 2

Pool the elements of the different samples and rank them. These rankings are shown below:
r1 (Total of ranks of machine 1)=50.5
r2 (Total of ranks of machine 2)=61
r3 ( Total of ranks of machine 3)=59.5
CORRELATION ANALYSIS
What is Correlation?

• A study of the linear relationship between two variables is known as correlation.

• If there are two variables and the changes in the value of one variable will affect the value of the
other variable, then both the variables are correlated.

• For example, when the price of a commodity changes, the demand of that commodity also changes

Types of Correlation

• On the basis of direction

• On the basis of number of variables

• On the basis of ratio of the change in direction

On the basis of direction

• Positive correlation

• Negative correlation

Positive correlation
• Correlation is positive when two variables vary in the same direction.

Negative Correlation

• Two variables are said to be negatively correlated when both the variables vary in the opposite
direction. When one variable increases, then the other variable decreases and vice versa is also a
negative correlation.

• For example, consider the correlation between production and price of a crop.

On the basis of the number of variables

• Simple correlation

• Multiple correlation

• Partial correlation

Simple Correlation

• When we measure the linear relationship between two variables, then this interpretation is known
as simple correlation.

Ex- Relationship between sales and expenses, income and consumption etc.

Multiple Correlation

• It is defined as the measurement of the effect of multiple variables on one variable.

For example- If we try to find out the relationship of rainfall and temperature on the yield of wheat,
then this is known as multiple correlation

Partial Correlation
• If we have various related variables and try to find out the relationship between two variables, then
it is known as partial correlation.

For example-Consider the two variables height and weight which are partially correlated because of
the effect of the third variable ‘age’ on height and weight. In this condition, if we neglect the effect of
age, and the study the relationship between height and weight, then this correlation is known as
partial correlation.

On the basis of the ratio of change in direction

• Linear correlation

• Non-linear correlation

Linear Correlation

• In a linear correlation, change in the values of one variable has a fixed ratio to the variation in the
values of the other variable.

• When these variables are plotted on a graph, then plotted points would fall on a straight line. For
example, consider the following relationship shown in the table.

Non-Linear Correlation • In a non-linear correlation, changes in values of one variable does not have
a fixed ratio to the variation in the value of the other variable.

• When these variables are plotted on a graph, then plotted points would fall on a curve.

Degree of Correlation

• Perfect correlation- If two variables change in the same direction and in the same proportion, the
correlation between the two variables is called perfect positive correlation. The correlation coefficient
in this case is +1.
• If two variables change in the opposite direction and in the same proportion, the correlation
between the two variables is called perfect negative correlation. In this case, the coefficient of
correlation is -1

• If two series of two variables show no relation between them or change in one variable does not
lead to a change in the other variable, then it means that there is no relationship between variables.
In this case, the coefficient of correlation is zero.

REGRESSION ANALYSIS

• Regression analysis means the estimation or the prediction of the unknown value of one variable
from the known value of other variable.

• For example, if the agriculturist knows that the yield of rice and rainfall are closely related, then he
will want to know the amount of rain required to achieve a certain production. In this situation, he
will use a regression analysis

Dependent and Independent variables

• The variable whose value is influenced or is to be predicted is called dependent variable and the
variable which influences the values or is used for prediction is called independent variable

Classification of Regression

• Simple Regression

• Multiple Regression

• Linear Regression

• Non-linear Regression

Simple Regression

• When there are only two variables, the regression equation obtained is called simple regression
equation

Multiple Regression

• In multiple regression analysis, there are more than 2 variables and we try to find out the effect of
more than 2 independent variables on one dependent variable. Let X, Y, and Z, be the three variables.
X and Y are the independent variables and Z is the dependent variables.

Then we use multiple regression analysis to study the relative movement of Z, for a unit movement in
X and Y. For example, if there are three variables- yield, rainfall, and temperature. If yield is depending
on rainfall and temperature, then we get regression equation of Z on X and Y where Z is yield, X-
rainfall, and Y-temperature
Linear Regression

• On the basis of the proportion of changes in the variables, the regression can be classified as Linear
and non-linear regression.

• If the given bivariate date is plotted on a graph, the points so obtained on the scatter diagram will
more or less concentrate around a curve, called curve of regression.

• If the regression curve is a straight line, we say that there is linear relationship between the variables
under study. Mathematically, the relation between x and y in a linear regression can be expressed as
y= a+bx

• In a linear regression, the change in the dependent variable is proportionate to the changes in the
independent variable

Non-linear Regression

• If the curve of regression is not a straight line, then the regression is termed as curved or non-linear
regression. In this case, the dependent variable does not change by a constant amount of change in
the independent variable

DISCRIMINANT ANALYSIS
Introduction

• Discriminant analysis is used to predict group membership.

• This technique is used to classify individuals/objects into one of the alternative groups on the basis
of a set of predictor variables.

• The dependent variable in the discriminant analysis is categorical and on a nominal scale, whereas
the independent variables or predictor variables are either interval or ratio scale in nature.

• For example, the dependent variable may be choice of a brand of a personal computer (brand A, B,
or C) and the independent variables may be ratings of attributes of PC’s on a 7 point Likert Scale.

• When there are two groups (categories) of dependent variable, we have two-group discriminant
analysis and when there are more than two groups, it is a case of multiple discriminant analysis.

Illustration of Discriminant Analysis

• A wool manufacturer is interested in getting information on the possible commercial acceptance of


a new yarn. He wants to know the characteristics of fibers that differentiate between prospective
buyers/ non-buyers of the product. He is interested in ascertaining the relative importance of the
following yarn characteristics such as Durability, Lightness in weight, Low investment in conversion
facilities, Rot resistance
• The rating represent the judgement of buyers regarding the individual characteristic ratings and ‘buy
versus’ ‘not buy’ response.

• Thus, each respondent rates the product according to each of the four characteristics and then
indicates whether he would be a prospective buyer of the product or not.

• The rating is done on a 11-point scale (where 0 represents very and poor and 10-excellent).

We may conduct a discriminant analysis to find

• The percentage of sample that is able to classify correctly.

• Which variables (durability, light weight, low investment, and rot resistance) are relatively better in
discriminating between two groups.

Where Discriminant analysis is used?

• Those who buy our brand and those who buy competitor’s brand.

• Good salesman, poor salesman, medium salesman.

• Those who go to Food world to buy and those who buy in a Kinara shop.

• Heavy user, medium user and light user of the product.

Uses of Discriminant Analysis

• It is used for Scale construction-Discriminant analysis is used to identify the variables or statements
that are discriminating and on which people with diverse views will respond differently.

• Segment discrimination

• Perceptual mapping.

CONJOINT ANALYSIS

Introduction

• Conjoint analysis uses nominal-scale data.

• It attempts to identify the most desirable attributes that could be offered in a product or service.

• An attempt is made to determine the relative importance that consumers attach to the attributes
and the utilities that they attach to the levels of attributes.

• The values assumed by the attributes are called levels.

• The utilities describe the importance that consumer attach to the levels of each attribute.
• Here the respondents are told about the various combinations of the attribute levels and are asked
to evaluate the combinations in terms of their desirability.

• Conjoint analysis makes use of subjective evaluation of the combinations presented to the
consumer.

• Conjoint analysis is a statistical technique used in market research to determine how people value
different attributes (feature, function, benefits) that make up an individual product or service.

• Conjoint Analysis is search for the development of part-worth utility functions which describe the
utility, consumers attach to the levels of each attribute

Uses of Conjoint Analysis

• Determine relative importance of the attributes in the choice process of the consumers.

• Determine the market share of the brand that differ in attribute levels.

• Segmenting the market based on similiarity of preference for attribute levels.

WHAT ARE ATTRIBUTES AND LEVELS?

• Attributes and levels form the fundamental basis of conjoint analysis.

• The idea is that a product or service can be broken down into its constituent parts - so for instance
a mobile phone has a size, weight, battery life, size of address book, type of ring.

• Each of these elements making up a generic mobile phone is known as an attribute.

• When we compare between mobile phones each will have a different specification on each of these
attributes.

• You might have choices in terms of battery life between 72, 108, 120 hours of battery life.

• Each of these options is known as a level of the battery life attribute.

Suppose we ask a set of respondents to express their preference among movies that varied on three
attributes, each with two levels as shown below:

Hero of the movie: Sharukh Khan or Akshay Kumar

Type of Movie: Action or comedy

Price of ticket: Rs 150 or Rs 200

There are in total 2X2X2=8 combinations. Each of these features is presented to say, respondent

Number 1. The various features would look like: Hero of the movie: Sharukh Khan or Akshay Kumar

Type of Movie: Action or comedy Price of ticket: Rs 150 or Rs 200

Feature 1- Sharukh Khan, Action, Rs 150.


Feature 2- Sharukh Khan, Action, Rs 200.

Feature 3- Akshay Kumar, Action, Rs 150.

Feature 4- Akshay Kumar, Action, Rs 200.

Feature 5 – Sharukh Khan, Comedy, Rs 150.

Feature 6- Sharukh Khan, Comedy, Rs 200.

Feature 7- Akshay Kumar, Comedy, Rs 150.

Feature 8- Akshay Kumar, Comedy, Rs 200

• The respondent could be presented with the above 8 combinations and asked to give their
preferences in terms of desirability of the feature, either on an interval scale or an ordinal scale.

Steps in Conjoint Analysis

• Identification of Attributes

• Determination of attribute levels

• Determination of attribute combinations

• Nature of judgement on stimuli

• Aggregation of judgements

• Choice of technique of analysis

FACTOR ANALYSIS

Introduction

It is a very useful method to reduce a very large number of variables resulting in data
complexity to a few manageable factors.

2 commonly used Factor Analysis procedures

• Principal Component Analysis


• Common factor Analysis

• When the objective is to summarise information from a large set of variables into fewer
factors, principal component factor analysis is used.
• On the other hand, if the researcher wants to analyse the components of the main
factor, common factor analysis is used.
• Example- Common factor- Inconvenience inside a car. Common factors may be
1. Leg Room
2. Seat Arrangement
3. Entering the rare seat
4. Inadequate dickey space
5. Door locking mechanism

PRINCIPLE COMPONENT FACTOR ANALYSIS

• Purposes- Customer feedback about a two wheeler manufactured by a company.

Method – The Marketing Researcher prepares a questionnaire to study the customer feedback. The
researcher has identified 6 variables or factors for this purpose.

• Fuel efficiency (A)


• Durability (B)
• Comfort (C)
• Spare Parts Availability (D)
• Breakdown frequency (E)
• Price (F)

Grouping of variables

• A, B, D, E into factor-1
• F into Factor 2
• C into Factor 3

Factor 1 can be termed as Technical factor.


Factor 2 can be termed as Price factor.
Factor 3 can be termed as Personal factor.

Uses of Factor Analysis


• Scale construction
• Establish antecedents
• Psychographic profiling
• Segmentation analysis
• Marketing Studies

CLUSTER ANALYSIS

• Cluster is a collection of data objects.


• It is a descriptive analysis technique which groups objects (respondents, products, variables)
so that each object is similar to other objects in the cluster and different from objects in all
other clusters.
• In cluster analysis, one aims to find similarities between data according to characteristics
found in the data and grouping similar data objects into clusters.

Cluster Analysis
• Cluster analysis is a grouping technique. The basic assumption underlying the technique is
the fact that similarity is based on multiple variables, and the technique attempts to measure
the proximity in terms of the study variables.
• The emerging groups are homogenous in their composition and heterogenous as compared
to other groups.
• The grouping can be done for objects, individuals, and products.
• The researcher identifies a set of clustering variables which have been assumed as
significant for the purpose of classifying the objects into groups.
• It is also referred to as classification technique, numerical taxonomy, and Q analysis.
• Cluster analysis starts with an undifferentiated group of people, events, or objects, and
attempts to reorganize them into homogenous groups. Cluster analysis classifies objects so
that each person is similar to others in the cluster with respect to some predetermined
selection criteria.
• The resulting clusters of objects then exhibit high internal (within cluster) homogeneity and
high external (between cluster) heterogeneity. Thus if successful, the objects within cluster
will be close together when plotted geometrically, and different clusters will be far apart.
Simple cluster solution of breakfast food based on people who seek
nutrition and convenience (ease of preparation)

A person might be using different criteria for a weekday and for weekend breakfast. A bakery/
confectionary shop selling sandwiches, bread rolls would know
1. The lucrative segment
2. The segment which might be motivated to buy if one takes care of their weekday/weekend
needs
3. A segment which is currently not interested in getting a ready-toeat breakfast solution and
might not look at the bakery as an outlet to visit in the morning.
• Once the homogenous clusters emerge, the next step is to determine the profile of the
group in terms of who they are?
• What is their gender, age group, family size, etc?
• What deals motivate them to buy from a particular store when are buying eatables in
general?

Applications
• It is used to segment the market in marketing. It helps marketers discover distinct groups in
their customer bases, and then use this knowledge in targeted marketing programs.
• It is used in social networking sites in making new groups based on user’s data.
• It is also used in city planning – Identifying groups of houses according to their house type,
value , and geographical location.

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy