Statistical Treatment
Statistical Treatment
The overarching aim of a quantitative research study is to classify features, count them,
and construct statistical models in an attempt to explain what is observed.
Things to keep in mind when reporting the results of a study using quantitative
methods:
1. Explain the data collected and their statistical treatment as well as all relevant
results in relation to the research problem you are investigating. Interpretation of
results is not appropriate in this section.
2. Report unanticipated events that occurred during your data collection. Explain
how the actual analysis differs from the planned analysis. Explain your handling
of missing data and why any missing data does not undermine the validity of your
analysis.
3. Explain the techniques you used to "clean" your data set.
4. Choose a minimally sufficient statistical procedure; provide a rationale for its
use and a reference for it. Specify any computer programs used.
5. Describe the assumptions for each procedure and the steps you took to ensure
that they were not violated.
6. When using inferential statistics, provide the descriptive statistics, confidence
intervals, and sample sizes for each variable as well as the value of the test
statistic, its direction, the degrees of freedom, and the significance level [report
the actual p value].
7. Avoid inferring causality, particularly in nonrandomized designs or without
further experimentation.
8. Use tables to provide exact values; use figures to convey global effects. Keep
figures small in size; include graphic representations of confidence intervals
whenever possible.
9. Always tell the reader what to look for in tables and figures.
NOTE: When using pre-existing statistical data gathered and made available by
anyone other than yourself [e.g., government agency], you still must report on the
methods that were used to gather the data and describe any missing data that exists
and, if there is any, provide a clear explanation why the missing data does not
undermine the validity of your final analysis.
Reference:
Babbie, Earl R. The Practice of Social Research. 12th ed. Belmont, CA: Wadsworth Cengage,
2010;
Muijs, Daniel. Doing Quantitative Research in Education with SPSS. 2nd edition. London: SAGE
Publications, 2010.
Babbie, Earl R. The Practice of Social Research. 12th ed. Belmont, CA: Wadsworth
Cengage, 2010.
Brians, Craig Leonard et al. Empirical Political Analysis: Quantitative and Qualitative
Research Methods. 8th ed. Boston, MA: Longman, 2011; McNabb,
David E. Research Methods in Public Administration and Nonprofit Management:
Quantitative and Qualitative Approaches. 2nd ed. Armonk, NY: M.E.
Sharpe, 2008; Quantitative Research Methods. Writing@CSU. Colorado State
University; Singh, Kultar. Quantitative Social Research Methods. Los Angeles, CA:
Sage, 2007.
Basic Statistical Tools in Research and Data Analysis
This requires a proper design of the study, an appropriate selection of the study
sample and choice of a suitable statistical test.
Branches of Statistics
Descriptive statistics deals with the presentation and collection of data. This is usually
the first part of a statistical analysis. It is usually not as simple as it sounds, and the
statistician needs to be aware of designing experiments, choosing the right focus
group and avoid biases.
Inferential statistics, as the name suggests, involves drawing the right conclusions
from the statistical analysis that has been performed using descriptive statistics.
In the end, it is the inferences that make studies important and this aspect is dealt with
in inferential statistics.
While drawing conclusions, one needs to be very careful so as not to draw the wrong or
biased conclusions. Even though this appears like a science, there are ways in which
one can manipulate studies and results through various means. For example, data
dredging is increasingly becoming a problem as computers hold loads of information
and it is easy, either intentionally or unintentionally, to use the wrong inferential
methods.
Both descriptive and inferential statistics go hand in hand and one cannot exist without
the other. Good scientific methodology needs to be followed in both these steps of
statistical analysis and both these branches of statistics are equally important for a
researcher.
Parameters in Statistics
The most common statistics parameters are the measures of central tendency. These
tell us how the data behaves on an average basis. For example, mean, median and
mode are measures of central tendency that give us an idea about where the data
concentrates. Standard deviation tells us how the data is spread from the central
tendency, i.e. whether the distribution is wide or narrow. Such parameters are often very
useful in analysis.
Variables
Variables such as height and weight are measured by some type of scale,
convey quantitative information and are called as quantitative variables. Sex
and eye color give qualitative information and are called as qualitative variables
Descriptive Statistics
Descriptive statistics: describes and summarizes data. You are just describing
what the data shows: a trend, a specific feature, or a certain statistic (like a mean
or median).
Descriptive statistics just describes data. For example, descriptive statistics
about a college could include: the average SAT score for incoming freshmen; the
median income of parents; racial makeup of the student body. It says nothing
about why the data might exist, or what trends you might be able to see from the
data.
Descriptive statistics can be further broken down into several sub-areas, like:
Measures of central tendency.
Measures of dispersion.
Charts & graphs.
Shapes of Distributions.
Central Tendency (Measures of Location): Definition and Examples
The mean is the average of a set of numbers. Add up all the numbers in a set
of data and then divide by the number of items in the set. For example, the mean
of 2 3 5 9 11 is:
(2 + 3 + 5 + 9 + 11) / 5 = 30 / 5 = 6.
The median is the middle of a set of numbers. Think of it like the median in a
road (that grassy area in the middle that separates traffic). Place your data in
order, and the number in the exact center of a list is the median. For example:
1234567
The median is 4 because it’s in the center, with three numbers either side.
The mode is the most common number in a set of data. For example, the mode
of 1 2 2 3 5 6 is 2. Some data sets have no mode, like this one: 1 2 3 4 5 6.
Others have multiple modes, like this one: 1 1 2 3 3.