Frequency Tables, Bar Graphs, and Histograms: Handout #5
Frequency Tables, Bar Graphs, and Histograms: Handout #5
Handout #5
“Results” of Student Survey
ID V1 V2 V3 V4 V5 V6 V7 V8 V9 V10
1 1 2 3 1 1 3 2 2 2 2
2 3 1 3 3 1 2 1 5 2 1
3 2 3 3 1 2 2 2 1 2 2
4 1 2 1 3 2 1 3 4 3 2
5 1 1 1 1 1 3 3 4 2 2
6 1 2 5 5 2 2 2 4 2 2
7 1 2 5 2 1 3 3 4 2 2
8 1 1 5 1 1 3 3 1 2 2
9 3 2 2 3 2 2 1 5 2 1
10 2 2 3 5 1 2 1 1 2 1
11 1 1 1 1 2 2 2 4 2 2
12 1 1 1 1 1 3 2 1 2 2
One can’t just stare at this and grasp what the data is “saying.”
– The numbers don’t “speak for themselves”
• even apart from being numerically coded.
Data Needs to be “Boiled Down” to Reveal
Meaningful Information, Patterns, and
Relationships
• How you do this depends on the nature of the data, e.g.,
– nominal, ordinal, etc.
• Two stages:
– reduce the data to a single relatively compact table (frequency
table, crosstabulation, control table, etc.) or corresponding chart
(frequency bar graph, histogram, dot chart, box chart,
scattergram, etc.)
– reduce it further to one or several summary statistical measures
(measures of central tendency, dispersion, association,
correlation and regression coefficients, etc.).
• We first look at the process of boiling univariate data
down to frequency tables, frequency bar graphs, and
histograms.
– Then (univariate) measures of central tendency and dispersion.
Constructing Frequency Tables for Discrete
Variables in the Student Survey Data
Recall that the first question in the Student Survey was the
following:
ID V1 ID V1
1 1 6 1
2 3 7 1
3 2 8 1
4 1 9 3
5 1 10 2
Absolute (Cases Counts) vs. Relative
(Percentages) Frequencies
Count up tallies to get absolute frequencies.
Values Code Tallies IDs Abs Freqs. Rel Freqs. Adj Rel. Freqs.
Dem. 1 20 46% 49%
Ind. 2 [not 12 28% 29%
Rep. 3 9 21% 22%
Other 4 shown] 0 0%
DK 5 2 5%
NA 9 0 0%
Total 43 100% 100%
Democratic 49%
Independent 29%
Republican 22%
Total 100%
(n = 41)
Value Code Abs. Freqs. Rel. Freqs. Adj. Rel. Freqs. Cum Rel. Freqs.
0-20 1 8 19% 19% 19% 100%
21-40 2 4 9% 9% 28% 81%
41-60 3 16 37% 37% 65% 72%
61-80 4 8 19% 19% 84% 35%
81-100 5 7 16% 16% 100% 16%
Missing 9 0 0%
Total 43 100% 100%
• Note that V16 is (at least) ordinal in nature. The list of values should follow the
natural ordering.
• If the ordering runs from “Low” to “High,” the lowest value is conventionally put at the
top of the list and the highest value at the bottom (illogical though this may seem).
• There is no missing data, so Adjusted Relative Frequency = Relative Frequency.
• In this table, we have shown one other type of percentage — namely, cumulative
(adjusted relative) frequencies, where the cumulation can proceed either downward
or upward. Thus the “61-80" row of the table shows that 19% of the respondents
have 61 to 80 degrees of “warmth” toward the Democratic Party, 84% have this level
of warmth or cooler (i.e., 80 degrees or less), and 35% have this level of warmth or
warmer (i.e., 61 degrees or more).
• Cumulative frequencies make no sense if the variable in question is merely nominal
in nature (or if the table does not list ordinal values in their natural order).
SPSS Frequency Tables
for ANES Discrete Variables
V25 DEMOCRATIC CANDIDATE THERMOMETER SCORE (1972-2004)
• Since pie charts do not show values in a linear order, they are
especially appropriate for displaying frequencies of nominal
variables
• Since such charts show how a “pie” is “divided up,” they are also
especially appropriate for displaying “shares,” such as how parties
divide up popular votes, electoral votes, or seats in a legislature, or
how a budget is divided up among different spending categories.
• Using different colors (or hatching) for each slice can help the
reader quickly grasp the information in the chart.
Comparing Frequency Distributions for Subsets of Cases, for
Different (but “comparable”) Variables, or from different Data
Sets (e.g., Student Survey and SETUPS/ANES)
• Popular newspapers
(especially USA Today),
magazines, advertise-
ments, etc., like to
present bar graphs but
usually can’t resist the
temptation of making
them “cute” by letting
figures of one sort or
other take the place of
simple bars.
• Often, as the heights of
the figures vary, their
widths also vary in a
proportionate manner.
• The eye then tends to
compare areas rather
than heights, producing
distinctly misleading
impressions.
• Cuteness trumps clarity.
“Cute” Bar Charts (cont.)