Statanalysis C2a
Statanalysis C2a
Dot Plot
One of the simplest graphical summaries of data is a dot plot.
A horizontal axis shows the range of data values.
Then each data value is represented by a dot placed above the axis.
Histogram
Another common graphical display of quantitative data is a histogram.
The variable of interest is placed on the horizontal axis.
A rectangle is drawn above each class interval with its height corresponding
to the interval’s frequency, relative frequency, or percent frequency.
Unlike a bar graph, a histogram has no natural separation between rectangles
of adjacent classes.
Histograms Showing Skewness
Symmetric
- Left tail is the mirror image of the right tail
Example: Heights of People
Moderately Skewed Left
- A longer tail to the left
Example: Exam Scores
Moderately Right Skewed
- A Longer tail to the right
Example: Housing Values
Highly Skewed Right
- A very long tail to the right
Example: Executive Salaries
Cumulative Distributions
Cumulative frequency distribution - shows the number of items with values
less than or equal to the upper limit of each class.
Cumulative relative frequency distribution – shows the proportion of items
with values less than or equal to the upper limit of each class.
Cumulative percent frequency distribution – shows the percentage of items
with values less than or equal to the upper limit of each class.
The last entry in a cumulative frequency distribution always equals the total
number of observations.
The last entry in a cumulative relative frequency distribution always equals
1.00.
The last entry in a cumulative percent frequency distribution always equals
100.
Stem-and-Leaf Display
A stem-and-leaf display shows both the rank order and shape of the
distribution of the data.
It is similar to a histogram on its side, but it has the advantage of showing the
actual data values.
The first digits of each data item are arranged to the left of a vertical line.
To the right of the vertical line we record the last digit for each item in rank
order.
Each line (row) in the display is referred to as a stem.
Each digit on a stem is a leaf.
Stretched Stem-and-Leaf Display
If we believe the original stem-and-leaf display has condensed the data too
much, we can stretch the display vertically by using two stems for each
leading digit(s).
Whenever a stem value is stated twice, the first value corresponds to leaf
values of 0 - 4, and the second value corresponds to leaf values of 5 - 9.
Leaf Units
A single digit is used to define each leaf.
In the preceding example, the leaf unit was 1.
Leaf units may be 100, 10, 1, 0.1, and so on.
Where the leaf unit is not shown, it is assumed to equal 1.
The leaf unit indicates how to multiply the stem-and-leaf numbers in order to
approximate the original data.