Statistical Techniques in Business and E
Statistical Techniques in Business and E
Business &
Economics
Fifteenth Edition
Douglas A. Lind
Coastal Carolina University and The University of Toledo
William G. Marchal
The University of Toledo
Samuel A. Wathen
Coastal Carolina University
STATISTICAL TECHNIQUES IN BUSINESS & ECONOMICS
Some ancillaries, including electronic and print components, may not be available to customers
outside the United States.
1 2 3 4 5 6 7 8 9 0 RJE/RJE 1 0 9 8 7 6 5 4 3 2 1
www.mhhe.com
Dedication
To Jane, my wife and best friend, and our sons, their wives,
and our grandchildren: Mike and Sue (Steve and Courtney),
Steve and Kathryn (Kennedy and Jake), and Mark and Sarah
(Jared, Drew, and Nate).
Douglas A. Lind
William G.
Marchal
Samuel A. Wathen
A Note from
Over the years, we have received many compliments on this text and
understand that it’s a favorite among students. We accept that as the
high est compliment and continue to work very hard to maintain that
status. The objective of Statistical Techniques in Business and
Economics is
to provide students majoring in management, marketing, finance,
accounting, economics, and other fields of business administration with
an introductory survey of the many applications of descriptive and infer
ential statistics. We focus on business applications, but we also use
many exercises and examples that relate to the current world of the col
lege student. A previous course in statistics is not necessary, and the
mathematical requirement is first-year algebra.
In this text, we show beginning students every step needed to be suc
cessful in a basic statistics course. This step-by-step approach
enhances performance, accelerates preparedness, and significantly
improves moti vation. Understanding the concepts, seeing and doing
plenty of examples and exercises, and comprehending the application of
statistical methods in business and economics are the focus of this
book.
The first edition of this text was published in 1967. At that time, locat
ing relevant business data was difficult. That has changed! Today, locat
ing data is not a problem. The number of items you purchase at the gro
cery store is automatically recorded at the checkout counter. Phone
companies track the time of our calls, the length of calls, and the iden tity
of the person called. Credit card companies maintain information on the
number, time and date, and amount of our purchases. Medical devices
automatically monitor our heart rate, blood pressure, and tem perature
from remote locations. A large amount of business information is
recorded and reported almost instantly. CNN, USA Today, and MSNBC,
for example, all have websites that track stock prices with a delay of less
than 20 minutes.
Today, skills are needed to deal with a large volume of numerical
information. First, we need to be critical consumers of information pre
sented by others. Second, we need to be able to reduce large amounts
of information into a concise and meaningful form to enable us to make
effective interpretations, judgments, and decisions. All students have cal
culators and most have either personal computers or access to personal
computers in a campus lab. Statistical software, such as Microsoft Excel
and Minitab, is available on these computers. The commands necessary
to achieve the software results are available in a special section at the
end of each chapter. We use screen captures within the chapters, so the
student becomes familiar with the nature of the software output.
Because of the availability of computers and software, it is no Ionger
necessary to dwelI on calculations. We have replaced many of the calcu
lation examples with interpretative ones, to assist the student in under
standing and interpreting the statistical results. In addition, we now place
more emphasis on the conceptual nature of the statistical topics. While
making these changes, we still continue to present, as best we can, the
key concepts, along with supporting interesting and relevant examples.
iv
the Authors
What’s New in This Fifteenth Edition?
We have made changes to this edition that we think you and your stu
dents will find useful and timely.
• We have revised the learning objectives so they are more specific,
added new ones, identified them in the margin, and keyed them
directly to sections within the chapter.
• We have replaced the key example in Chapters 1 to 4. The new
example includes more variables and more observations. It presents
a realistic business situation. It is also used later in the text in Chap
ter 13.
• We have added or revised several new sections in various chapters:
Chapter 7 now includes a discussion of the exponential distribution.
Chapter 9 has been reorganized to make it more teachable and
improve the flow of the topics.
Chapter 13 has been reorganized and includes a test of hypothe
sis for the slope of the regression coefficient.
Chapter 17 now includes a graphic test for normality and the chi
square test for normality.
• New exercises and examples use Excel 2007 screenshots and the lat
est version of Minitab. We have also increased the size and clarity of
these screenshots.
• There are new Excel 2007 software commands and updated Minitab
commands at the ends of chapters.
• We have carefully reviewed the exercises within the chapters, those
at the ends of chapters, and in the Review Section. We have added
many new or revised exercises throughout. You can still find and
assign your favorites that have worked well, or you can introduce
fresh examples.
• Section numbers have been added to more clearly identify topics and
more easily reference them.
• The exercises that contain data files are identified by an icon for easy
identification.
• The Data Exercises at the end of each chapter have been revised.
The baseball data has been updated to the most current completed
season, 2009. A new business application has been added that refers
to the use and maintenance of the school bus fleet of the Buena
School District.
• There are many new photos throughout, with updated exercises in
the chapter openers.
LO8 Compute and explain the variance and the standard deviation.
The highly competitive automobile retailing industry in the United States has changed
dramatically in recent years. These changes spurred events such as the:
impor tant concepts of the previous Traditionally, a local family owned and operated the com
munity dealership, which might have included one or two man ufacturers or brands, like
Describing Data: chapter and pro vides a link to the Pontiac and GMC Trucks or Chrysler and the popular Jeep line. Recently, however,
skillfully managed and well-financed companies have been acquiring local dealer
Numerical Measures
material in the current chapter. This step-
by-step approach increases com
prehension by providing continuity across
the concepts.
Example/Solution
After important concepts are introduced, a
solved example is given to provide a how-
to illustration for students and to show a
introductory students.
2.1
Introduction
3
Learning Objectives When you have completed this chapter, you will be able to:
Self-Review 3–6 The weights of containers being shipped to Ireland are (in thousands of pounds): 95 103 105 110
Self- 104 105 112 90
vi
Engage Students
and Promote Learning?
reference and review.
Variance and standard deviation are based on squared deviations from the mean.
ˆ
Y 8.109 .08991t
Statistics in Action articles are scattered
Formulas
The slope of the trend line is .
08991. This shows that over
through out the text, usually about two per the 24 quarters the
deseasonalized sales
chapter. They provide unique and increased at a rate of 0.08991
($ million) per quarter, or
interesting applications and historical $89,910 per quarter. The value
of 8.109 is the intercept of the
insights in the field of statistics. trend line on the Y-axis (i.e.,
for t 0).
Statistics in Action
Forecasts are not al
ways correct. The re
ality is that a forecast
may just be a best
Margin Notes
guess as to what will
happen. What are
the reasons forecasts
are not correct? One
expert lists eight
Definitions
Definitions of new
terms or terms unique
to the study of
statistics are set apart
from the text and
highlighted for easy
The variance is non-negative and is zero only if all observations are the same.
Population Variance The formulas for the population variance and the sample
variance are slightly different. The population variance is considered first. (Recall that a
population is the totality of all observations being studied.) The population variance is
found by:
Exercises
Exercises are included after sections within the
chapter and at the end of the chapter. Section
exercises cover the material studied in the section.
N
Exercises
For Exercises 35–38, calculate the (a) range, (b) arithmetic mean, (c) mean deviation, and (d) interpret the values.
35. There were five customer service representatives on duty at the Electronic Super Store during last weekend’s sale. The
The
BY CHAPTER
Chapter Summary
Each chapter contains a brief summary of the chapter
material, including the vocabulary and the critical
formulas.
Pronunciation
Key
This tool lists the mathematical symbol, its mean
vii
Pronunciation Key
SYMBOL MEANING PRONUNCIATION
Lp Location of percentile L sub p First quartile Q sub 1
help the student retain the meaning of the symbol and generally based on three large data sets. These data sets are printed in
enhance course communications. Appendix A in the text and are also on the text’s web site.
These data sets present the students with real-world and more
complex applications.
Chapter Exercises
Generally, the end-of-chapter exercises are the most
Q Third quartile Q sub 3
challenging and integrate the chapter con cepts. The answers
and worked-out solutions for all odd-numbered
exercises appear at the end of the text. For 3
Exercises 03
124
The last several exercises at
the end of each chapter are
Activities
Software Commands
Software Commands 1. The Excel Commands for the descriptive statistics on page 69 are:
viii
Reinforce
Student
Learning?
seven classes.
2–5 a.
c. Class frequencies.
BY SECTION
c. If we use a class interval of 25 feet and begin
y
2–1 a. Qualitative data, because the customers’ response to the taste test is the name of a n
beverage.
b. Frequency table. It shows the 20
number of people who prefer each
beverage.
c. 10
0
would be necessary. A class interval of
e Cola-Plus Coca-Cola Pepsi Beverage
u
Lemon-Lime
reasonable. This alternative requires only
F
Section
Reviews
After selected
groups of
chapters (1–4, 5–
7, 8 and 9, 10–
12, 13 and 14, 15 A Review of Chapters 1–4
and 16, and 17 and 18), a Sec tion Review is included. Much This section is a review of the major concepts and terms introduced in Chapters 1–4. Chapter 1 began by describing the meaning and purpose of statistics. Next we described the different types of
variables and the four levels of measurement. Chapter 2 was concerned with describing a set of observations by organizing it into a frequency distribution and then portraying the frequency distri bution
review. Glossary
Practice Test
Cases Practice Test
Chapter 1
90 degrees is 10 degrees more than a temperature of 80 degrees, and so on.
Nominal measurement The “lowest” level of measure ment. If data are classified into categories and the order of those categories is
not important, it is the nominal level of E l d ( l f l) d
Descriptive statistics The techniques used to describe the important characteristics of a set of data. This includes organizing the data
The review also includes continuing cases values into a frequency distribution, computing measures of location, and computing mea
The Practice Test includes both objective questions and problems covering
Test is intended the material studied in the section.
to give students
There is a practice test at the end of each review section. The tests are in two parts. The first part contains several ob jective questions, usually in a fill-in-the-blank format. The second part is problems. In
most cases, it should take 30 to 45 minutes to complete the test. The problems require a calculator. Check the answers in the Answer Section in the back of the book.
Part 1—Objective
an idea of 1. The science of collecting, organizing, presenting, analyzing, and interpreting data to assist in making effective deci sions is called . 1. 2. Methods of organizing, summarizing, and presenting data in an
informative way is called . 2.
3. The entire set of individuals or objects of interest or the measurements obtained from all individuals or objects of in terest is called the . 3. 4. List the two types of variables. 4.
content that 5. The number of bedrooms in a house is an example of a . (discrete variable, continuous variable, qualitative variable—pick one) 5. 6. The jersey numbers of Major League Baseball players is an example
of what level of measurement? 6.
might appear on 7. The classification of students by eye color is an example of what level of measurement? 7. 8. The sum of the differences between each value and the mean is always equal to what value? 8. 9. A set of
data contained 70 observations. How many classes would you suggest in order to construct a frequency
distribution? 9. 10. What percent of the values in a data set are always larger than the median? 10. 11. The square of the standard deviation is the . 11. 12. The standard deviation assumes a negative
a test and how value when . (All the values are negative, when at least half the
values are negative, or never—pick one.) 12. 13. Which of the following is least affected by an outlier? (mean, median, or range—pick one) 13.
the test might Part 2—Problems 1. The Russell 2000 index of stock prices increased by the following amounts over the last three years.
be structured. 18% 4% 2%
The Practice
What is the geometric mean increase for the three years?
ix
What Technology
Connects
McGraw-Hill Connect™ Business
Statistics
Less
Features. Connect Business Statistics offers a number of powerful tools and features to make manag
ing assignments easier, so faculty can spend more time teaching. With Connect Business Statistics,
students can engage with their coursework anytime and anywhere, making the learning process more
accessible and efficient. Connect Business Statistics offers you the features described below.
Simple Assignment Management.
With Con
nect Business Statistics, creating
assignments
is easier than ever, so you can spend
more
time teaching and less time
managing. The
assignment management function
enables
you to:
• Create and deliver assignments
easily with
selectable end-of-chapter
questions and
test bank items.
• Streamline lesson planning, student pro
gress reporting, and assignment grading to
make classroom management more effi
cient than ever.
• Go paperless with the eBook and on
line submission and grading of student
assignments.
Excel Integrated
Data File
• Have assignments scored automatically, giving students immediate feedback on their work and side
by-side comparisons with correct answers.
• Access and review each response; manually change grades or leave comments for students to review.
• Reinforce classroom concepts with practice tests and instant quizzes.
Instructor Library. The
Connect Business
Statistics Instructor Library is
your repository
for additional resources to
improve student
engagement in and out of
class. You can
select and use any asset that
enhances your
lecture. The Connect Business Statistics
Instructor Library includes:
• eBook
• PowerPoint presentations
• Test Bank
• Solutions Manual
• Digital Image Library
Student Study Center. The Connect Business Statistics Student Study Center is the place for students to
access additional resources. The Student Study Center:
• Offers students quick access to lectures, practice materials, eBooks, and more. • Provides
instant practice material and study questions and is easily accessible on-the-go.
Guided Examples. These narrated video walkthroughs provide students with step-by-step guidelines for
solving problems similar to those contained in the text. The student is given personalized instruction on
how to solve a problem by applying the concepts presented in the chapter.
Student Progress Tracking. Connect Business Statistics keeps instructors informed about how each
student, section, and class is performing, allowing for more productive use of lecture and office hours. The
progress-tracking function
enables you to:
• View scored
work
immediately
and track
individual or
group
performance
with
assignment
and grade
reports.
• Access an
instant view of
student or
class performance
relative to learning objectives.
• Collect data and generate
reports required by many
accreditation organizations,
such as AACSB.
xi
What Technology Connects
McGraw-Hill CONNECT™ PLUS
BUSINESS STATISTICS business statistics
McGraw-Hill Connect Plus Business Statistics. McGraw-Hill reinvents the textbook learning experience for
the modern student with Connect Plus Business Statistics. A seamless integration of an eBook and
Connect Business Statistics, Connect Plus Business Statistics provides all of the Connect Business Sta
tistics features plus the following:
• An integrated eBook,
allowing
for anytime,
anywhere access
to the textbook.
• Dynamic links
between the
problems or
questions you
assign to your
students and
the location in the
eBook
where that problem
or question
is covered.
• A powerful search
function to
pinpoint and
connect key con
cepts in a snap.
xii
AACSB Statement
The McGraw-Hill Companies is
a proud corporate member of
AACSB International. Understand
ing the importance and value of
AACSB accreditation, Statistical
Techniques in Business & Eco
nomics recognizes the curricula
guidelines detailed in the AACSB
standards for business accredita
tion by connecting selected ques
tions in the text and the test bank
to the six general knowledge
and skill guidelines in the AACSB
standards.
The statements contained in Statistical Techniques in Business & Economics are
provided only as a guide for the users of this textbook. The AACSB leaves content
coverage and assessment within the purview of individual schools, the mission of
the school, and the faculty. While Statistical Techniques in Business & Economics
and the teaching package make no claim of any specific AACSB qualification or eval
uation, we have labeled selected questions within Statistical Techniques in Business
& Economics according to the six general knowledge and skills areas.
McGraw-Hill Customer Care Information
At McGraw-Hill, we understand that getting the most from new technology can be
challenging. That’s why our services don’t stop after you purchase our products.
You can e-mail our Product Specialists 24 hours a day to get product-training
online. Or you can search our knowledge bank of Frequently Asked Questions on
our support website. For Customer Support, call 800-331-5094 or visit
www.mhhe.com/support. One of our Technical Support Analysts will be able to
assist you in a timely fashion.
xiii
Minit
ab®/SPSS®/JMP®
Minitab® Student Version 14, SPSS® Student Version 18.0, and JMP® Student Edition Version 8 are
software tools that are available to help students solve the business statistics exercises in the text. Each
can be packaged with any McGraw-Hill business statistics text.
xiv
All test bank questions are available in an EZ Test electronic format. Included are a number of multiple
choice, true/false, and short-answer questions and problems. The answers to all questions are given, along
with a rating of the level of difficulty, chapter goal the question tests, Bloom’s taxonomy question type, and
the AACSB knowledge category.
WebCT/Blackboard/eCollege
All of the material in the Online Learning Center is
also available in portable WebCT, Blackboard, or
eCollege content “cartridges” provided free to
adopters of this text.
xv
ALEKS also includes a new instructor module with powerful, assignment-driven features and extensive con
tent flexibility. ALEKS simplifies course management and allows instructors to spend less time with admin
istrative tasks and more time directing student learning. To learn more about ALEKS, visit www.aleks.com.
xvi
Acknowledgments
This edition of Statistical Techniques in Business and Economics is the product of many people: students, colleagues,
reviewers, and the staff at McGraw-Hill/Irwin. We thank them all. We wish to express our sincere gratitude to the survey and
focus group participants, and the reviewers:
Acknowledgments
Margaret M. Capen Nicholas R. Farnum Pacific Union College
East Carolina University California State University, Cindy L. Hinz
Robert Carver Fullerton St. Bonaventure University
Stonehill College K. Renee Fister Johnny C. Ho
Jan E. Christopher Murray State University Columbus State University
Delaware State University Gary Franko Shaomin Huang
James Cochran Siena College Lewis-Clark State College
Louisiana Tech University Maurice Gilbert J. Morgan Jones
Farideh Dehkordi-Vakil Troy State University University of North Carolina at Chapel Hill
Western Illinois University Deborah J. Gougeon Michael Kazlow
Brant Deppa University of Scranton Pace University
Winona State University Christine Guenther John Lawrence
Bernard Dickman Pacific University California State University, Fullerton
Hofstra University Charles F. Harrington Sheila M. Lawrence
Casey DiRienzo University of Southern Indiana Rutgers, The State University of New
Elon University Jersey
Craig Heinicke
Erick M. Elder Baldwin-Wallace College Jae Lee
University of Arkansas at Little Rock State University of New York at New Paltz
George Hilton
Rosa Lemel University of Northern Iowa Debra Stiver
Kean University Paul Paschke University of Nevada, Reno
Robert Lemke Oregon State University Ron Stunda
Lake Forest College Srikant Raghavan Birmingham-Southern College
Francis P. Mathur Lawrence Technological University Edward Sullivan
California State Polytechnic University, Surekha K. B. Rao Lebanon Valley College
Pomona Indiana University Northwest Dharma Thiruvaiyaru
Ralph D. May Timothy J. Schibik Augusta State University
Southwestern Oklahoma State University of Southern Indiana Daniel Tschopp
University Carlton Scott Daemen College
Richard N. McGrath University of California, Irvine Bulent Uyar
Bowling Green State University Samuel L. Seaman University of Northern Iowa
Larry T. McRae Baylor University Lee J. Van Scyoc
Appalachian State University Scott J. Seipel University of Wisconsin–Oshkosh
Dragan Miljkovic Middle Tennessee State University Stuart H. Warnock
Southwest Missouri State University Sankara N. Sethuraman Tarleton State University
John M. Miller Augusta State University Mark H. Witkowski
Sam Houston State University Daniel G. Shimshak University of Texas at San Antonio
Cameron Montgomery University of Massachusetts, Boston William F. Younkin
Delta State University Robert K. Smidt University of Miami
Broderick Oluyede California Polytechnic State University Shuo Zhang
Georgia Southern University William Stein State University of New York, Fredonia
Andrew Paizis Texas A&M University Zhiwei Zhu
Queens College Robert E. Stevens University of Louisiana at Lafayette
Andrew L. H. Parkes University of Louisiana at Monroe
Their suggestions and thorough reviews of the previous edition and the manuscript of this edi
tion make this a better text.
Special thanks go to a number of people. Debra K. Stiver, University of Nevada–Reno, reviewed
the manuscript and page proofs, checking text and exercises for accuracy. Joan McGrory, South
west Tennessee Community College, checked the Test Bank for accuracy. Professor Kathleen Whit
comb of the University of South Carolina prepared the study guide. Dr. Samuel Wathen of Coastal
Carolina University prepared the quizzes and the Test Bank. Professor René Ordonez of Southern
Oregon University prepared the PowerPoint presentation, many of the screencam tutorials, and the
guided examples in Connect. Ms. Denise Heban and the authors prepared the Instructor’s Manual.
We also wish to thank the staff at McGraw-Hill. This includes Steve Schuetz, Executive Edi tor;
Wanda Zeman, Senior Development Editor; Diane Nowaczyk, Senior Project Manager; and oth ers
we do not know personally, but who have made valuable contributions.
xviii
Enhancements to Statistical
Techniques in Business &
Economics, 15e
Changes Made in All Chapters and system.
Major Changes to Individual Chapters: • Updated screens for Excel 2007, Minitab, and MegaStat.
• Changed Goals to Learning Objectives and identified the location in • Revised the core example in Chapters 1–4 to reflect the current
the chapter where the learning objective is discussed. economic conditions as it relates to automobile dealers. This
example is also discussed in Chapter 13 and 17.
• Added section numbering to each main heading.
• Added a new section in Chapter 7 describing the expo nential
• Identified exercises where the data file is included on the text website.
distribution.
• Revised the Major League Baseball data set to reflect the latest
• Added a new section in Chapter 13 describing a test to determine
complete season, 2009.
whether the slope of the regression line dif fers from zero.
• Revised the Real Estate data to ensure the outcomes are more
• Added updates and clarifications throughout.
realistic to the current economy.
• Added a new data set regarding school buses in a pub lic school
Chapter 1 What Is Statistics? and 82 (Kentucky Derby payoffs). Exer cises 30 to 34 were revised to
include the most recent data.
• New photo and chapter opening exercise on the “Nook” sold by
Barnes and Nobel.
Chapter 4 Describing Data: Displaying and
• Census updates on U.S. population, sales of Boeing air craft, and Exploring Data
Forbes data in “Statistics in Action” feature.
• New exercise 22 with 2010 salary data for the New York Yankees.
• New chapter exercises 17 (data on 2010 vehicle sales) and 19
• New chapter exercise 36 (American Society of Peri Anesthesia nurses
(ExxonMobil sales prior to Gulf oil spill).
component membership).
Enhancements to Statistical
Techniques in Business &
Economics, 15e
• A revised discussion of determining the confidence interval for the breakfast), and 60 (daily water usage).
population mean.
• Expanded section on calculating sample size.
Chapter 11 Two-Sample Tests of Hypothesis
• New exercise 12 (milk consumption), 33 (cost of apart ments in • New exercises 15 (2010 New York Yankee salaries), 37 (Consumer
Milwaukee), 47 (drug testing in the fashion industry), and 48 (survey Confidence Survey), and 39 (pets as listeners).
of small-business owners regarding healthcare).
Chapter 12 Analysis of Variance
• The discussion of the finite correction factor has been relocated in the
chapter. • Revised the names of airlines in the one-way ANOVA example.
• New exercise 30 (flight times between Los Angeles and San
Chapter 10 One-Sample Tests of Hypothesis Francisco).
• New exercises 17 (daily water consumption), 19 (number of text
messages by teenagers), 35 (household size in the United States), 49 Chapter 13 Correlation and Linear
(Super Bowl coin flip results), 54 (failure of gaming industry slot Regression • Rewrote the introduction section to the chapter.
machines), 57 (study of the percentage of Americans that do not eat
• Added a new section using the Applewood Auto Group data from • Updated economic data.
chapters 1 to 4.
• Added a section on testing the slope of a regression line.
Chapter 17 Nonparametric Methods:
Goodness-of-Fit Tests
• Added discussion of the regression ANOVA table with Excel
examples. • Reworked the Example/Solution on the chi-square goodness-of-fit
test with equal cell frequencies (favorite meals of adults).
• Rewrote and relocated the section on the coefficient of determination.
• Added a section and corresponding examples describing the
• Updated exercise 60 (movie box office amounts). Chapter goodness-of-fit test for testing whether sample data are from a
normal population.
14 Multiple Regression Analysis • Added a section and corresponding examples using graphical
methods for testing whether sample data are from a normal
• Rewrote the section on evaluating the multiple regression equation. population.
• More emphasis on the regression ANOVA table.
• Enhanced the discussion of the p-value in decision making. Chapter 18 Nonparametric Methods:
• Added a separate section on qualitative variables in regression Analysis of Ranked Data
analysis. • Revised the Example/Solution for the Kruskal-Wallis test (waiting
• Moved the “Stepwise Regression” section to improve the sequence of times in the emergency room).
topics. • Revised the Example/Solution for the Spearman coeffi cient of rank
• Added a summary problem at the end of the chapter to review the correlation (comparison of recruiter and plant scores for trainees).
major concepts.
Chapter 19 Statistical Process Control
Chapter 15 Index Numbers and Quality Management
• Updated census and economic data. • Updated the section on the Malcolm Baldrige National Quality Award.
• Reworked and updated the section on Six Sigma.
Chapter 16 Time Series and Forecasting
xx
Brief Contents
1 What Is Statistics? 1
2 Describing Data: Frequency Tables, Frequency
Distributions, and Graphic Presentation 21
3 Describing Data: Numerical Measures 57
4 Describing Data: Displaying and Exploring 8 Sampling Methods and the Central Limit
Data 102 5 A Survey of Probability Concepts Theorem 265 9 Estimation and Confidence
144 Intervals 297
6 Discrete Probability Distributions 186 10 One-Sample Tests of Hypothesis 333
7 Continuous Probability Distributions 222 11 Two-Sample Tests of Hypothesis 371
12 Analysis of Variance 410 Introduction to Decision Theory On the website:
Review Section
13 Correlation and Linear Regression 461
14 Multiple Regression Analysis 512
15 Index Numbers 573
Review Section Review Section
16 Time Series and Forecasting 604
17 Nonparametric Methods: Goodness-of-Fit
Tests 648 18 Nonparametric Methods: Analysis
Review Section Review Section Review Section Review
of Ranked Data 680 19 Statistical Process
Control and Quality Management 720 20 An
Section
www.mhhe.com/lind15e
Appendixes: Data Sets, Tables, Answers 753
Photo Credits 829
Index 831
xxi
Contents
1.8 Computer Applications 14
Chapter Summary 16
Chapter Exercises 16
Data Set Exercises 19
Answers to Self-Review 20
Chapter
xxii
6 Discrete Probability
Distributions 186 7.5 The Normal Approximation to the Binomial 242
6.1 Introduction 187
Continuity Correction Factor 242
6.2 What Is a Probability Distribution? 187 How to Apply the Correction Factor 244 Exercises 245
xxiv Contents
7.6 The Family of Exponential
Distributions 246
12 Analysis of Variance 410 12.1 Coefficient, the Coefficient of Determination, and the
Standard Error of Estimate 488
Introduction 411
Exercises 490
12.2 The F Distribution 411
13.8 Interval Estimates of Prediction 490
12.3 Comparing Two Population
Assumptions Underlying Linear
Variances 412
Regression 490
Exercises 415 Constructing Confidence and Prediction
12.4 ANOVA Assumptions 416 Intervals 492
12.5 The ANOVA Test 418 Exercises 494
Exercises 425 13.9 Transforming Data 495
12.6 Inferences about Pairs of Treatment Means 426 Exercises 497
Exercises 429 Chapter Summary 498
12.7 Two-Way Analysis of Variance 430 Exercises Pronunciation Key 499
434 Chapter Exercises 500
12.8 Two-Way ANOVA with Interaction 435 Data Set Exercises 509
Interaction Plots 436 Software Commands 510
Hypothesis Test for Interaction 437 Answers to Self-Review 511
Exercises 440
Chapter Summary 442
Pronunciation Key 443 Chapter
Chapter Exercises 443 14 Multiple Regression
Data Set Exercises 451
Analysis 512
Software Commands 452
14.1 Introduction 513 Simple Average of the Price Indexes 579 Simple
Aggregate Index 580
14.2 Multiple Regression Analysis 513
15.6 Weighted Indexes 581
Laspeyres Price Index 581
Paasche Price Index 582
Exercises 517 Fisher’s Ideal Index 584
Contents xxvii Exercises 578 Exercises 584
15.7 Value Index 585
14.9 Review of Multiple Regression 546 Chapter 16.2 Components of a Time Series 605
1
Answers to Self-Review 752
Appendixes 753
Appendix A: Data Sets 754
Appendix B: Tables 764
Appendix C: Answers to Odd-Numbered Chapter
Exercises and Review Exercises and Solutions
to Practice Tests 782
Photo Credits 829
Index 831
charts.
Barnes & Noble stores recently began selling the Nook. With this
device, you can download over 1,500 books electronically and read
the book on a small monitor instead of purchasing the book. Assume
you have the number of Nooks sold each day for the last month at the
Barnes & Noble store at the Market Commons Mall in Riverside,
California. Describe a condition in which this information could be
considered a sample. Illustrate a second situation in which the same
data would be regarded as a population. (See Exercise 11 and LO3.)
2 Chapter 1
1.1 Introduction
More than 100 years ago, H. G. Wells, an English author and historian, suggested
that one day quantitative reasoning will be as necessary for effective citizenship as
the ability to read. He made no mention of business because the Industrial Revo
lution was just beginning. Mr. Wells could not have been more correct. While “busi
ness experience,” some “thoughtful guesswork,” and “intuition” are key
attributes of successful managers, today’s business problems tend to
be too complex for this type of decision making alone.
One of the tools used to make decisions is statistics. Statistics is
used not only by businesspeople; we all also apply statistical concepts
in our lives. For example, to start the day you turn on the shower and
let it run for a few moments. Then you put your hand in the shower to
sample the temperature and decide to add more hot water or more cold
water, or determine that the temperature is just right and then enter the
shower. As a second example, suppose you are at Costco Wholesale
and wish to buy a frozen pizza. One of the pizza makers has a stand,
and they offer a small wedge of their pizza. After sampling the pizza, you
decide whether to purchase the pizza or not. In both the shower and pizza
examples, you make a decision and select a course of action based on a sample.
Businesses face similar situations. The Kellogg Company must ensure that the
mean amount of Raisin Bran in the 25.5-gram box meets label specifications. To do
so, it sets a “target” weight somewhat higher than the amount specified on the label.
Each box is then weighed after it is filled. The weighing machine reports a distribu
tion of the content weights for each hour as well as the number “kicked-out” for
being under the label specification during the hour. The Quality Inspection Depart
ment also randomly selects samples from the production line and checks the qual ity
of the product and the weight of the contents of the box. If the mean product weight
differs significantly from the target weight or the percent of kick-outs is too large, the
process is adjusted.
As a student of business or economics, you will need basic knowledge and
skills to organize, analyze, and transform data and to present the information. In this
text, we will show you basic statistical techniques and methods that will develop
your ability to make good personal and business decisions.
LO1 List ways that statistics is used. What are the differences in the sta tistics courses taught in the
Engineering College, the Psychology or Sociology Departments
in the Liberal Arts College, and the College of Business? The
biggest difference is the examples used. The course content is
basically the same. In the College of Business we are interested
in such things as profits, hours worked, and wages.
Psychologists are interested in test scores, and engineers are
interested in how many units are manufactured on a particular
machine. However, all three are interested in what is a typical
value and how much variation there is in the data. There may
also be a difference in the level of mathematics required. An
engi neering statistics course usually requires calculus.
Statistics courses in colleges of business and education usually
teach the course at a more applied level. You should be able to
handle the mathematics in this text if you have completed high
school algebra.
Examples of why we study statistics
So why is statistics required in so many majors? The first
reason is that numer ical information is everywhere. Look in the
1.2 Why Study Statistics? newspapers (USA Today), news maga zines (Time, Newsweek,
U.S. News and World Report), business magazines (Busi
If you look through your university catalog, you will find that nessWeek, Forbes), or general interest magazines (People),
statistics is required for many college programs. Why is this so? women’s magazines
What Is Statistics? 3
(Ladies Home Journal or Elle), or sports magazines (Sports Illustrated, ESPN The
Magazine), and you will be bombarded with numerical information. Here are some
examples:
• The average increase in weekly earnings, in 1982–84 dollars, from January 2009
to January 2010 was $8.32.
• In January 2010 the average amount of credit card debt per household was
$7,394. This is a decrease from $7,801 in July 2009. A 2010 Federal Reserve
survey found that 75 percent of U.S. households have at least one credit card.
• The following table summarizes the number of commercial aircraft manufactured
by Boeing, Inc. between 2006 and 2009.
Sales of Boeing Aircraft
Type of Aircraft
Year 737 747 767 777 787 Total
2006 733 72 8 77 160 1,050
2007 850 25 36 143 369 1,423
2008 488 4 29 54 94 669
2009 197 5 7 30 24 263
• Go
to
the
A second reason for taking a statistics course is that statistical techniques are
used to make decisions that affect our daily lives. That is, they affect our personal
welfare. Here are a few examples:
• Insurance companies use statistical analysis to set rates for home, automobile,
life, and health insurance. Tables are available showing estimates that a 20-year
old female has 60.25 years of life remaining, an 87-year-old woman 4.56 years
remaining, and a 50-year-old man 27.85 years remaining. Life insurance premi
ums are established based on these estimates of life expectancy. These tables
are available at www.ssa.gov/OACT/STATS/table4cb.html. [This site is sensitive
to capital letters.]
4 Chapter 1
• The
56%
Frito-Lay Rest of Industry
The subject of statistics, as we will explore it in this text, has a much broader
meaning than just collecting and publishing numerical information. We define sta
tistics as:
LO2 Know the differences between descriptive and DESCRIPTIVE STATISTICS Methods of organizing,
inferential statistics. summarizing, and presenting data in an informative way.
Descriptive Statistics
The definition of statistics given earlier referred to “organizing, For instance, the United States government reports the
presenting, analyz ing . . . data.” This facet of statistics is population of the United States was 179,323,000 in 1960;
usually referred to as descriptive statistics. 203,302,000 in 1970; 226,542,000 in 1980; 248,709,000 in
1990; 265,000,000 in 2000; and 308,400,000 in 2010. This LO3 Understand the differences between a sample and a
informa tion is descriptive statistics. It is descriptive statistics if population.
we calculate the percent age growth from one decade to the
next. However, it would not be descriptive sta tistics if we used
these to estimate the population of the United States in the year Reasons for sampling
2020 or the percentage growth from 2010 to 2020. Why? The What Is Statistics? 7
reason is these sta tistics are not being used to summarize past
populations but to estimate future pop ulations. The following
are some other examples of descriptive statistics. decimals, and percentages; and only 77 percent of high school
seniors correctly totaled the cost of a salad, burger, fries, and a
• There are a total of 46,837 miles of interstate highways in the cola on a restaurant menu. Since these are inferences about a
United States. The interstate system represents only 1 percent
population (all high school seniors) based on sample data, we
of the nation’s total roads but carries more than 20 percent of refer to them as inferential statistics. You might think of
the traffic. The longest is I-90, which stretches from Boston to
inferential statistics as a “best guess” of a population value
Seattle, a distance of 3,099 miles. The shortest is I-878 in New based on sample information.
York City, which is 0.70 of a mile in length. Alaska does not
have any interstate highways, Texas has the most interstate
miles at 3,232, and New York has the most interstate routes INFERENTIAL STATISTICS The methods used to estimate a
with 28. property of a population on the basis of a sample.
• The average person spent $103.00 on traditional Valentine’s
Day merchandise in 2010. This is an increase of $0.50 from
Note the words population and sample in the definition of
2009. As in previous years, men will spend nearly twice the
inferential statistics. We often make reference to the population
amount women spend on the holiday. The average man spent
of 308.8 million people living in the United States or the 1,336.1
$135.35 to impress the people in his life while women only
million people living in China. However, in statistics the word
spent $72.28. Family pets will also feel the love, the average
population has a broader meaning. A population may consist
person spending $3.27 on their furry friends, up from $2.17 last
of individuals—such as all the stu dents enrolled at Utah State
year.
University, all the students in Accounting 201, or all the CEOs
Masses of unorganized data—such as the census of from the Fortune 500 companies. A population may also consist
population, the weekly earnings of thousands of computer of objects, such as all the Cobra G/T tires produced at Cooper
programmers, and the individual responses of 2,000 registered Tire and Rubber Company in the Find lay, Ohio, plant; the
voters regarding their choice for president of the United States accounts receivable at the end of October for Lorrange Plastics,
— are of little value as is. However, statistical techniques are Inc.; or auto claims filed in the first quarter of 2010 at the
available to organize this type of data into a meaningful form. Northeast Regional Office of State Farm Insurance. The
Data can be organized into a frequency dis tribution. (This measurement of interest might be the scores on the first
procedure is covered in Chapter 2.) Various charts may be examination of all students in Accounting 201, the tread wear of
used to describe data; several basic chart forms are also the Cooper Tires, the dollar amount of Lorrange Plastics’s
presented in Chapter 4. accounts receivable, or the amount of auto insur ance claims at
Specific measures of central location, such as the mean, State Farm. Thus, a population in the statistical sense does not
describe the central value of a group of numerical data. A always refer to people.
number of statistical measures are used to describe how
closely the data cluster about an average. These measures of
POPULATION The entire set of individuals or objects of interest
central tendency and dispersion are discussed in Chapter 3.
or the measurements obtained from all individuals or objects of
interest.
Inferential Statistics
The second type of statistics is inferential statistics—also To infer something about a population, we usually take a
called statistical infer ence. Our main concern regarding sample from the population.
inferential statistics is finding something about a population
from a sample taken from that population. For example, a
recent survey showed only 46 percent of high school seniors SAMPLE A portion, or part, of the population of interest.
can solve problems involving fractions,
Why take a sample instead of studying every member of the
population? A sam ple of registered voters is necessary
because of the prohibitive cost of contacting millions of voters
before an election. Testing wheat for moisture content destroys
the wheat, thus making a sample imperative. If the wine tasters
tested all the wine, none would be available for sale. It would be
physically impossible for a few marine biologists to capture and
tag all the seals in the ocean. (These and other reasons for
sampling are discussed in Chapter 8.)
As noted, using a sample to learn something about a population
is done exten sively in business, agriculture, politics, and
government, as cited in the following examples:
We strongly suggest you do the Self-Review exercise. method of solution are given at the end of the chapter. You can
Following is a self-review problem. There are a number of them find the answer to the following Self-Review on page 19. We
interspersed throughout each chapter. They test your recommend that you solve each one and then check your
comprehension of the preceding material. The answer and answer.
Qualitative Quantitative
• Brand of PC
• Marital status • Hair color • TV sets owned
Discrete Continuous
Quantitative variable Discrete variables can assume only certain values, and there
are “gaps” between the values. Examples of discrete variables
are the number of bedrooms in a house (1, 2, 3, 4, etc.), the
LO5 Compare the differences between discrete and continuous num ber of cars arriving at Exit 25 on I-4 in Florida near Walt
variables. Disney World in an hour (326, 421, etc.), and the number of
students in each section of a statistics course (25 in section A,
42 in section B, and 18 in section C). We count, for example,
the number of cars arriving at Exit 25 on I-4, and we count the
number of statistics stu dents in each section. Notice that a
home can have 3 or 4 bedrooms, but it can not have 3.56
bedrooms. Thus, there is a “gap” between possible values.
Typically, discrete variables result from counting.
Observations of a continuous variable can assume any value
within a specific range. Examples of continuous variables are
the air pressure in a tire and the weight of a shipment of
tomatoes. Other examples are the amount of raisin bran in a
box and the duration of flights from Orlando to San Diego.
Grade point average (GPA) is a continuous variable. We could
LO6 Recognize the levels of measurement in data. report the GPA of a particular student as 3.2576952. The usual
practice is to round to 3 places—3.258. Typically, continuous
variables result from measuring.
Nominal-Level Data
For the nominal level of measurement, observations of a
qualitative variable can only be classified and counted. There is
no particular order to the labels. The classifica tion of the six
colors of M&M’s milk chocolate candies is an example of the
outcomes. 2. There is no natural order to the outcomes.
nomi nal level of measurement. We simply classify the candies
by color. There is no nat ural order. That is, we could report the USA TODAY Snapshot
brown candies first, the orange first, or any of the colors first.
Gender is another example of the nominal level of 03/15/2007-updated 11:51 PM ET
measurement. Suppose we count the number of students
entering a football game with a student ID and report how many Workers say they
are men and how many are women. We could report either the prefer higher salaries
to any other perks.
men or the women first. For the nominal level, the only
measurement involved consists of counts. Sometimes, for
better reader understanding, we convert these counts to
percentages. The following “Snapshot” from USA Today shows
the results from a survey of workers. The variable of interest is
“Perks” and there are five pos sible outcomes: “More money,”
“Better healthcare,” “Better retirement,” “Work/ family balance,”
and, we will assume, “Other.” The outcome “Other” is not
shown on the chart, but is necessary to make the percent of
respondents total 100 percent. There is no natural order to the
outcomes, we could have put “Better healthcare” first instead
of “More money.”
To process the data, such as the information regarding worker
perks, or infor mation on gender, employment by industry, or
state of birth of a student, we often numerically code the
information. That is, we assign students from Alabama a code
of 1, Alaska a code of 2, Arizona as 3, and so on. Using this
procedure, Wiscon sin is coded 49 and Wyoming 50. This
coding facilitates counting by a computer. However, because
we have assigned numbers to the various categories, this does
not give us license to manipulate the numbers. To explain, 1 2
does not equal 3; that is, Alabama Alaska does not yield
Arizona.
To summarize, the nominal level has the following properties:
By Anne R. Carey and Chad Palmer, USA Today Reprinted with permission (March 15, 2007) USA TODAY.
Source: hudson-index.com
What Is Statistics? 11
Ordinal-Level Data
The next higher level of data is the ordinal level. Table 1–1 lists the student ratings
of Professor James Brunner in an Introduction to Finance course. Each student in
the class answered the question “Overall, how did you rate the instructor in this
class?” The variable rating illustrates the use of the ordinal scale of measurement.
One clas sification is “higher” or “better” than the next one. That is, “Superior” is
better than “Good,” “Good” is better than “Average,” and so on. However, we are
not able to distinguish the magnitude of the differences between groups. Is the
difference between “Superior” and “Good” the same as the difference between
“Poor” and “Infe rior”? We cannot tell. If we substitute a 5 for “Superior” and a 4 for
“Good,” we can conclude that the rating of “Superior” is better than the rating of
“Good,” but we can not add a ranking of “Superior” and a ranking of “Good,” with
the result being mean ingful. Further we cannot conclude that a rating of “Good”
(rating is 4) is necessarily twice as high as a “Poor” (rating is 2). We can only
conclude that a rating of “Good” is better than a rating of “Poor.” We cannot
conclude how much better the rating is.
Rating Frequency
Superior 6
Good 28
Average 25
Poor 12
Inferior 3
Interval-Level Data
The interval level of measurement is the next highest level. It includes all the
characteristics of the ordinal level, but in addition, the difference between val ues is
a constant size. An example of the interval level of measurement is tem perature.
Suppose the high temperatures on three consecutive winter days in Boston are 28,
31, and 20 degrees Fahrenheit. These temperatures can be eas ily ranked, but we
can also determine the difference between temperatures. This is possible because 1
degree Fahrenheit represents a constant unit of measure ment. Equal differences
between two temperatures are the same, regardless of their position on the scale.
That is, the difference between 10 degrees Fahrenheit
12 Chapter 1
Why is the “size” scale an interval measurement? Observe as the size changes by 2
units (say from size 10 to size 12 or from size 24 to size 26) each of the
measurements increases by 2 inches. To put it another way, the intervals are the
same.
There is no natural zero point for dress size. A “size 0” dress does not have
“zero” material. Instead, it would have a 24-inch bust, 16-inch waist, and 27-inch
hips. Moreover, the ratios are not reasonable. If you divide a size 28 by a size 14,
you do not get the same answer as dividing a size 20 by 10. Neither ratio is equal to
two as the “size” number would suggest. In short, if the distances between the
numbers make sense, but the ratios do not, then you have an interval scale of
measurement.
The properties of the interval-level data are:
1. Data classifications are ordered according to the amount of the characteristic
they possess.
2. Equal differences in the characteristic are represented by equal differences in the
measurements.
Ratio-Level Data
Practically all quantitative data is recorded on the ratio level of measurement. The
ratio level is the “highest” level of measurement. It has all the characteristics of the
interval level, but in addition, the 0 point is meaningful and the ratio between two
numbers is meaningful. Examples of the ratio scale of measurement include wages,
units of production, weight, changes in stock prices, distance between branch
offices, and height. Money is a good illustration. If you have zero dollars, then you
have no money. Weight is another example. If the dial on the scale of a correctly
calibrated device is at 0, then there is a complete absence of weight. The ratio of
two numbers is also meaningful. If Jim earns $40,000 per year selling insur ance and
Rob earns $80,000 per year selling cars, then Rob earns twice as much as Jim.
Table 1–2 illustrates the use of the ratio scale of measurement. It shows the
incomes of four father-and-son combinations.
What Is Statistics? 13
Observe that the senior Lahey earns twice as much as his son. In the Rho fam
ily, the son makes twice as much as the father.
In summary, the properties of the ratio-level data are:
1. Data classifications are ordered according to the amount of the characteristics
they possess.
2. Equal differences in the characteristic are represented by equal differences in the
numbers assigned to the classifications.
3. The zero point is the absence of the characteristic and the ratio between two
numbers is meaningful.
Levels of Measurement
• Jersey numbers of football • Your rank in class • Team • Temperature • Dress size • Number of sales calls made
players • Make of car standings in the Pac-10 • Number of patients seen • Distance to class
Self-Review 1–2 What is the level of measurement reflected by the following data? (a) The age of each person
in a sample of 50 adults who listen to one of the 1,230 talk radio stations in the United States is:
35 29 41 34 44 46 42 42 37 47
30 36 41 39 44 39 43 43 44 40
47 37 41 27 33 33 39 38 43 22
44 39 35 35 41 42 37 42 38 43
35 37 38 43 40 48 42 31 51 34
(b) In a survey of 200 luxury-car owners, 100 were from California, 50 from New York, 30
from Illinois, and 20 from Ohio.
14 Chapter 1
Exercises
The answers to the odd-numbered exercises are at the end of the book.
bundled with many home computers. In this text, we use both Excel and Minitab for
the applications. We also use an Excel add-in called MegaStat. This add-in gives
Excel the capability to produce additional statistical reports.
The following example shows the application of computers in statistical analy
sis. In Chapters 2, 3, and 4, we illustrate methods for summarizing and describing
data. An example used in these chapters refers to profit, as well as other variables,
on each of the 180 vehicles sold last month by the Applewood Auto Group. The
following Excel output reveals, among other things, (1) there were 180 vehicles sold,
the mean (average) profit per vehicle was $1,843.17, and the amount of profit ranged
from $294 to $3,292.
The following output is from the Minitab system. It contains much of the same
information.
Had we used a calculator to arrive at these measures and others needed to fully
analyze the selling prices, hours of calculation would have been required. The
16 Chapter 1
likelihood of an error in arithmetic is high when a large number of values are con
cerned. On the other hand, statistical software packages and spreadsheets can
provide accurate information in seconds.
At the option of your instructor, and depending on the software system avail
able, we urge you to apply a computer package to the exercises in the Data Set
Exercises section in each chapter. It will relieve you of the tedious calculations and
allow you to concentrate on data analysis.
Chapter Summary
I. Statistics is the science of collecting, organizing, presenting, analyzing, and interpreting
data to assist in making more effective decisions.
II. There are two types of statistics.
A. Descriptive statistics are procedures used to organize and summarize data.
B. Inferential statistics involve taking a sample from a population and making estimates
about a population based on the sample results.
1. A population is an entire set of individuals or objects of interest or the measure
ments obtained from all individuals or objects of interest.
2. A sample is a part of the population.
III. There are two types of variables.
A. A qualitative variable is nonnumeric.
1. Usually we are interested in the number or percent of the observations in each
category.
2. Qualitative data are usually summarized in graphs and bar charts.
B. There are two types of quantitative variables and they are usually reported numerically.
1. Discrete variables can assume only certain values, and there are usually gaps
between values.
2. A continuous variable can assume any value within a specified range.
IV. There are four levels of measurement.
A. With the nominal level, the data are sorted into categories with no particular order to the
categories.
B. The ordinal level of measurement presumes that one classification is ranked higher than
another.
C. The interval level of measurement has the ranking characteristic of the ordinal level of
measurement plus the characteristic that the distance between values is a con
stant size.
D. The ratio level of measurement has all the characteristics of the interval level, plus there
is a 0 point and the ratio of two values is meaningful.
Chapter Exercises
5. Explain the difference between qualitative and quantitative variables. Give an example of
qualitative and quantitative variables.
6. Explain the difference between a sample and a population.
7. Explain the difference between a discrete and a continuous variable. Give an example of
each not included in the text.
8. For the following questions, would you collect information using a sample or a popula
tion? Why?
a. Statistics 201 is a course taught at a university. Professor A. Verage has taught nearly
1,500 students in the course over the past 5 years. You would like to know the aver
age grade for the course.
b. As part of a research project, you need to report the average profitability of the num
ber one corporation in the Fortune 500 for the past 10 years.
c. You are looking forward to graduation and your first job as a salesperson for one of five
large pharmaceutical corporations. Planning for your interviews, you will need to
know about each company’s mission, profitability, products, and markets.
What Is Statistics? 17
d. You are shopping for a new MP3 music player such as the Apple iPod. The manu
facturers advertise the number of music tracks that can be stored in the memory. Usu
ally, the advertisers assume relatively short, popular music to estimate the number of
tracks that can be stored. You, however, like Broadway musical tunes and they are
much longer. You would like to estimate how many Broadway tunes will fit on your
MP3 player.
9. Exits along interstate highways were formerly numbered successively from the western or
southern edge of a state. However, the Department of Transportation has recently
changed most of them to agree with the numbers on the mile markers along the highway.
a. What level of measurement were data on the consecutive exit numbers? b. What level
of measurement are data on the milepost numbers?
c. Discuss the advantages of the newer system.
10. A poll solicits a large number of college undergraduates for information on the following
variables: the name of their cell phone provider (AT&T, Verizon, and so on), the numbers
of minutes used last month (200, 400, for example), and their satisfaction with the ser vice
(Terrible, Adequate, Excellent, and so forth). What is the data scale for each of these three
variables?
11. Barnes & Noble stores recently began selling the Nook. With this device, you can
download over 1,500 books electronically and read the book on a small monitor instead
of purchasing the book. Assume you have the number of Nooks sold each day for the last
month at the Barnes & Noble store at the Market Commons Mall in Riverside, California.
Describe a condition in which this information could be considered a sample. Illustrate a
second situation in which the same data would be regarded as a population.
12. Utilize the concepts of sample and population to describe how a presidential election is
unlike an “exit” poll of the electorate.
13. Place these variables in the following classification tables. For each table, summarize your
observations and evaluate if the results are generally true. For example, salary is reported
as a continuous quantitative variable. It is also a continuous ratio-scaled variable. a.
Salary
b. Gender
c. Sales volume of MP3 players
d. Soft drink preference
e. Temperature
f. SAT scores
g. Student rank in class
h. Rating of a finance professor
i. Number of home computers
Quantitative a. Salary
Discrete Continuous
Nominal Ordinal
Interval
Ratio a. Salary
18 Chapter 1
14. Using data from such publications as the Statistical Abstract of the United States, The
World Almanac, Forbes, or your local newspaper, give examples of the nominal, ordinal,
interval, and ratio levels of measurement.
15. The Struthers Wells Corporation employs more than 10,000 white collar workers in its
sales offices and manufacturing facilities in the United States, Europe, and Asia. A sam
ple of 300 of these workers revealed 120 would accept a transfer to a location outside the
United States. On the basis of these findings, write a brief memo to Ms. Wanda Carter,
Vice President of Human Services, regarding all white collar workers in the firm
and their willingness to relocate.
16. AVX Stereo Equipment, Inc., recently began a “no-hassles” return policy. A sample of 500
customers who had recently returned items showed 400 thought the policy was fair, 32
thought it took too long to complete the transaction, and the rest had no opin ion. On the
basis of this information, make an inference about customer reaction to the new policy.
17. The following table reports the number of cars and light duty trucks sold by the eight
largest automakers in the first two months of 2010 compared to the first two months of
2009.
Year-to-Date Sales
Through February Through February
Manufacturer 2010 2009
General Motors Corp. 287,242 252,701
Ford Motor Company 249,514 185,825
Chrysler LLC 141,592 146,207
Toyota Motor Sales USA Inc. 198,823 226,870
American Honda Motor Co. Inc. 148,150 142,606
Nissan North America Inc. 132,761 108,133
Hyundai Motor America 64,507 55,133
Mazda Motor of America Inc. 32,748 31,821
a. Compare the total sales for the eight automakers. Has there been an increase or a
decrease in sales for 2010 compared to the same period in 2009?
b. Compute the market share for each of the companies. Has there been a large change
in the market share for any of the companies?
c. Compare the percentage increases for each of the eight companies. What significant
changes are there from 2009 to 2010 for each of the companies?
18. The following chart depicts the average amounts spent by consumers on holiday gifts.
Write a brief report summarizing the amounts spent during the holidays. Be sure to
include the total amount spent, and the percent spent by each group.
What Is Statistics? 19
19. The following chart depicts the earnings in billions of dollars for ExxonMobil for the period
2003 until 2009. Write a brief report discussing the earnings at ExxonMobil during the
period. Was one year higher than the others? Did the earnings increase, decrease, or stay
the same over the period?
Data Set Exercises
20. Refer to the Real Estate data at the back of the text, which report information on homes
sold in the Goodyear, Arizona, area last year. Consider the following variables: selling
price, number of bedrooms, township, and distance from the center of the city. a. Which
of the variables are qualitative and which are quantitative?
b. Determine the level of measurement for each of the variables.
21. Refer to the Baseball 2009 data, which report information on the 30 Major League Base
ball teams for the 2009 season. Consider the following variables: number of wins, team
salary, season attendance, whether the team is in the American or National League, and
the number of home runs hit.
a. Which of these variables are quantitative and which are qualitative?
b. Determine the level of measurement for each of the variables.
22. Refer to the Buena School District bus data, which report information on the school dis
trict’s bus fleet.
a. Which of the variables are qualitative and which are quantitative?
b. Determine the level of measurement for each variable.
20 Chapter 1
Chapter 1 Answers to
Self-Review
1–1 a. On the basis of the sample of 1,960 consumers, we estimate
that, if it is marketed, 60 percent of all consumers will purchase the
Frequency Tables, Frequency
chicken dinner (1,176/1,960) 100 60 percent. Distributions, and Graphic
b. Inferential statistics, because a sample was used to
draw a conclusion about how all Presentation
consumers in the population would react if the chicken
dinner were marketed.
1–2 a. Age is a ratio-scale variable. A 40-year-old is twice as old
as someone 20 years old.
b. Nominal scale. We could arrange the states in any order.
Describing Data:
2
Learning Objectives When you have completed this
chapter, you will be able to:
2.1 Introduction
The highly competitive automobile retailing industry in the United States has
changed dramatically in recent years. These changes spurred events such as the:
• bankruptcies of General Motors and Chrysler in 2009.
• elimination of well-known brands such as Pontiac and
Saturn.
• closing of over 1,500 local dealerships.
• collapse of consumer credit availability.
• consolidation dealership groups.
Traditionally, a local family owned and operated the com
munity dealership, which might have included one or two man
ufacturers or brands, like Pontiac and GMC Trucks or Chrysler
and the popular Jeep line. Recently, however, skillfully managed
and well-financed companies have been acquiring local dealer
ships in large regions of the country. As these groups acquire the local dealerships,
they often bring standardized selling practices, common software and hardware
technology platforms, and management reporting techniques. The goal of these new
organizations is to provide an improved buying experience for the consumer, while
increasing profitability. Megadealerships often employ over 10,000 people, generate
several billion dollars in annual sales, own more than 50 franchises, and are traded
on the New York Stock Exchange or NASDAQ. Today, the largest megadealership is
AutoNation (ticker symbol AN). Others include Penske Auto Group (PAG and second
largest), Asbury Automotive Group (ABG), and Hendrick Auto Group (which is
privately held).
The Applewood Auto Group is an ownership group that includes four deal
erships. The group sells a wide range of vehicles, including the inexpensive but
popular Korean brands Kia and Hyundai, BMW and Volvo sedans and luxury SUVs,
and a full line of Ford and Chevrolet cars and trucks.
Ms. Kathryn Ball is a member of the senior management team at Applewood
Auto Group, which has its corporate offices adjacent to Hilltop Motors. She is
responsible for tracking and analyzing vehicle sales and the profitability of those
vehicles. Kathryn would like to summarize the profit earned on the vehicles sold with
tables, charts, and graphs that she would review monthly. She wants to know the
profit per vehicle sold, as well as the lowest and highest amount of profit. She is also
interested in describing the demographics of the buyers. What are their ages? How
many vehicles have they previously purchased from one of the Applewood
dealerships? What type of vehicle did they purchase?
The Applewood Auto Group oper
ates four dealerships:
• Tionesta Ford Lincoln Mercury sells
the Ford, Lincoln, and Mercury cars and
trucks.
• Olean Automotive Inc. has the Nis
san franchise as well as the General
Motors brands of Chevrolet, Cadillac,
and GMC Trucks.
• Sheffield Motors Inc. sells Buick,
GMC trucks, Hyundai, and Kia.
• Hilltop Motors offers the Chrysler,
Dodge, and Jeep line as well as BMW
and Volvo.
Every month, Ms. Ball collects data from
each of the four dealerships and enters it
TABLE 2–1 Frequency Table for Vehicles Sold Last Month at Applewood Auto Group by Location
relationship between a class total and the total number of observations. In the vehi
cle sales example, we may want to know the percentage of total cars sold at each of
the four locations. To convert a frequency distribution to a relative frequency dis
tribution, each of the class frequencies is divided by the total number of observa
tions. For example, the fraction of vehicles sold last month at the Kane location is
0.289, found by 52 divided by 180. The relative frequency for each location is shown
in Table 2–2.
TABLE 2–2 Relative Frequency Table of Vehicles Sold by Type Last Month at
Applewood Auto Group
Presentation 25 50
dl
S
30 Sheffield Tionesta
s
20
c
he
V
f
10
r
u
0
N
Kane Olean
40
LO3 Present a set of data using a pie chart. Amount Percentage
Location Use of Sales ($ million) of Sales
Prizes 1,460.0 60
Education 702.3 29
CHART 2–1 Number of Vehicles Sold by Location Bonuses 150.0 6
Expenses 124.3 5
location is of nominal scale, so the order of the locations on the Total 2,436.6 100
horizontal axis does not matter. Listing this variable
alphabetically or by some type of geograph ical arrangement
might also be appropriate. The first step to develop a pie chart is to record the
Another useful type of chart for depicting qualitative information percentages 0, 5, 10, 15, and so on evenly around the
is a pie chart. circumference of a circle (see Chart 2–2). To plot the 60 percent
share awarded for prizes, draw a line from the center of the
circle to 0 and another line from the center of the circle to 60
PIE CHART A chart that shows the proportion or percentage percent. The area in this “slice” represents the lottery proceeds
that each class represents of the total number of frequencies. that were awarded in prizes. Next, add the 60 per cent of
expenses awarded in prizes to the 29 percent payments to
education; the result is 89 percent. Draw a line from the center
We explain the details of constructing a pie chart using the of the circle to 89 percent, so the area between 60 percent and
information in Table 2–3, which shows a breakdown of the 89 percent depicts the payments made to education.
expenses for the Ohio State Lottery in 2009.
Continuing, add the 6 percent for bonuses, which gives us a total of 95 percent.
Draw a line from the center of the circle to 95, so the “slice” between 89 percent and
95 percent represents the payment of bonuses. The remainder, 5 percent, is for
operating expenses.
Because each slice of the pie represents the relative share of each component,
we can easily compare them:
We can use software to quickly create a visually appealing and informative pie
chart. The following chart, using the information in Table 2–3, depicts the uses of
Ohio Lottery expenses in 2009.
Pie charts and bar charts serve much the same function. What are the criteria
for selecting one over the other? In most cases, pie charts are the most informative
Awesome 102
ExampleSolution Excellent 58
Good 30
Poor 10
The data are measured on an ordinal scale. That is, the scale is
ranked in relative ease when moving from “poor” to
“awesome.” Also, the interval between each rating is unknown
so it is impossible, for example, to conclude that a rating of
good is twice the value of a poor rating.
Describing Data: Frequency Tables, Frequency We can use a bar chart to graph the data. The vertical scale
shows the rel ative frequency and the horizontal shows the
Distributions, and Graphic Presentation 27
values of the ease of navigation scale.
when the goal is to compare the relative difference in the Ease of Navigation of SkiLodges.com Web Page
percentage of observa tions for each of the nominal scale
variables. Bar charts are preferred when the goal is to compare 60
the number of observations in each category.
50
40
y
c
q
30
e
i
20
t
10
0
Poor Good Excellent Awesome
Ease of Navigation
A pie chart can also be used to graph this data. The pie chart emphasizes that more
than half of the respondents rate the relative ease of using the website awesome.
28 Chapter 2 Good
15%
Awesome
51%
Excellent
Ease of Navigation of SkiLodges.com Web Page 29%
Poor
5%
Beverage Number
Cola-Plus 40
Coca-Cola 25
Pepsi 20
Lemon-Lime 15
Total 100
Exercises
The answers to the odd-numbered exercises are at the end of the book.
1. A pie chart shows the relative market share of cola products. The “slice” for Pepsi-Cola has
a central angle of 90 degrees. What is its market share?
2. In a marketing study, 100 consumers were asked to select the best digital music player from
the iPod, the iRiver, and the Magic Star MP3. To summarize the consumer responses with
a frequency table, how many classes would the frequency table have?
3. A total of 1,000 residents in Minnesota were asked which season they preferred. The results
were 100 liked winter best, 300 liked spring, 400 liked summer, and 200 liked fall. If the
data were summarized in a frequency table, how many classes would be used? What
would be the relative frequencies for each class?
4. Two thousand frequent midwestern business travelers are asked which midwestern city
they prefer: Indianapolis, Saint Louis, Chicago, or Milwaukee. The results were 100 liked
Indianapolis best, 450 liked Saint Louis, 1,300 liked Chicago, and the remainder preferred
Milwaukee. Develop a frequency table and a relative frequency table to summarize this
information.
LO4 Create a frequency distribution for a data set. Bright white 130
Describing Data: Frequency Tables, Frequency Metallic black 104
Distributions, and Graphic Presentation 29 Magnetic lime 325
Tangerine orange 455
Fusion red 286
5. Wellstone Inc. produces and markets replacement covers for cell
phones in a variety of colors. The company would like to allocate its
a. What is the table called?
production plans to five different colors: bright white, metallic black,
b. Draw a bar chart for the table.
magnetic lime, tangerine orange, and fusion red. The com pany set up
c. Draw a pie chart.
a kiosk in the Mall of America for several hours and asked randomly
d. If Wellstone Inc. plans to produce 1 million cell phone covers, how
selected people which cover color was their favorite. The results follow:
many of each color should it produce?
6. A small business consultant is investigating the performance of
several companies. The fourth quarter sales for last year (in thousands
of dollars) for the selected companies were:
Fourth-Quarter Sales section, using the Applewood Automotive Group data, we
Corporation ($ thousands) summarized a qualitative variable, location of the sale, using a
Hoden Building Products $ 1,645.2 frequency table, a relative frequency table, and a bar chart.
J & R Printing Inc. 4,757.0 The Applewood Auto Group data also includes several
Long Bay Concrete Construction 8,913.0 quantitative variables: the age of the buyer, the profit earned on
Mancell Electric and Plumbing 627.1 the sale of the vehicle, and the number of previous purchases.
Maxwell Heating and Air Conditioning 24,612.0 Suppose Ms. Ball wants to summarize last month’s sales by
Mizelle Roofing & Sheet Metals 191.9 profit earned. We can describe profit using a frequency
distribution.
reports the profit on each of the 180 vehicles sold last month at
Example the four Applewood locations. What is the typical profit on each
We return to the situation where Ms. Kathryn Ball of the
sale? What is the largest profit on any sale? What is the lowest
Applewood Auto Group wants to develop tables, charts, and
profit on any sale? Around what value did the profits tend to
graphs to show the typical profit for each sale. Table 2–4
cluster?
TABLE 2–4 Profit on Vehicles Sold Last Month by the Highest
Applewood Auto Group
$1,387 $2,148 $2,201 $ 963 $ 820 $2,230 $3,043 $2,584 $2,370 1,754 2,207 996 1,298
1,266 2,341 1,059 2,666 2,637 1,817 2,252 2,813 1,410 1,741 3,292 1,674 2,991 1,426
1,040 1,428 323 1,553 1,772 1,108 1,807 934 2,944 1,273 1,889 352 1,648 1,932 1,295
2,056 2,063 2,147 1,529 1,166 482 2,071 2,350 1,344 2,236 2,083 1,973 3,082 1,320
1,144 2,116 2,422 1,906 2,928 2,856 2,502 1,951 2,265 1,485 1,500 2,446 1,952 1,269
2,989 783 2,692 1,323 1,509 1,549 369 2,070 1,717 910 1,538 1,206 1,761 1,638 2,348
978 2,454 1,797 1,536 2,339 1,342 1,919 1,961 2,498 1,238 1,606 1,955 1,957 2,700
443 2,357 2,127 294 1,818 1,680 2,199 2,240 2,222 754 2,866 2,430 1,115 1,824 1,827
2,482 2,695 2,597 1,621 732 1,704 1,124 1,907 1,915 2,701 1,325 2,742 870 1,464 1,876
1,532 1,938 2,084 3,210 2,250 1,837 1,174 1,626 2,010 1,688 1,940 2,639 377 2,279
2,842 1,412 1,761 2,165 1,822 2,197 842 1,220 2,626 2,434 1,809 1,915 2,231 1,897
2,646 1,963 1,401 1,501 1,640 2,415 2,119 2,389 2,445 1,461 2,059 2,175 1,752 1,821
1,546 1,766 335 2,886 1,731 2,338 1,118 2,058 2,487
Lowest
Table 2–4 shows the profits from the 180 sales. We refer to this unorganized infor
Solution mation as
raw data or ungrouped data. With a little searching, we can find the smallest profit
($294) and the highest profit ($3,292), but that is about all. It is diffi cult to determine
a typical amount of profit. It is also difficult to visualize where the profits tend to
cluster. The raw data are more easily interpreted if organized into a frequency
distribution.
Statistics in Action
In 1788, James
Madison, John Jay, and Alexander
Hamilton anony mously published a series of essays enti tled The
Federalist. These Federalist pa pers were an attempt to convince the
peo ple of New York that they should ratify the Constitution. In the
course of history, the authorship of most of these papers became
known, but 12 remained con tested. Through the use of statistical
analysis, and particu larly the study of the frequency of the use of
various words, we
can now conclude that James Madison is the likely author of the 12
papers. In fact, the statistical evidence that Madi son is the author is
overwhelming.
Describing Data: Frequency Tables, Frequency
Distributions, and Graphic Presentation 31
example, there were 180 vehicles
sold. So n 180. If we try k 7,
which means we would use 7
classes, 27 128, which is less
than 180. Hence, 7 is too few
classes. If we let k 8, then 28
256, which is greater than 180.
So the recommended number of
classes is 8.
Step 2: Determine the class interval or class width.
Generally the class interval or class width is the same for all
classes. The classes all taken together must cover at least the
distance from the lowest value in the data up to the highest
value. Expressing these words in a formula:
i H L
k
where i is the class interval, H is the highest observed value, L
is the lowest observed value, and k is the number of classes.
For the Applewood Auto Group, the lowest value is $294 and
the highest value is $3,292. If we need 8 classes, the interval
should be:
i H L
k $3,292 $294
8 $374.75
In practice, this interval size is usually rounded up to some
convenient number, such as a multiple of 10 or 100. The value
of $400 is a rea sonable choice.
In frequency distributions, equal class intervals are preferred.
How ever, unequal class intervals may be necessary in certain
situations to avoid a large number of empty, or almost empty,
classes. Such is the case in Table 2–6. The Internal Revenue
Service used unequal-sized class intervals to report the
adjusted gross income on individual tax returns. Had they used Number of Returns
an equal-sized interval of, say, $1,000, more than 1,000 classes Adjusted Gross Income (in thousands)
would have been required to describe all the incomes. A No adjusted gross income 178.2
frequency distribution with 1,000 classes would be difficult to $ 1 up to $ 5,000 1,204.6
inter pret. In this case, the distribution is easier to understand in 5,000 up to 10,000 2,595.5
spite of the unequal classes. Note also that the number of 10,000 up to 15,000 3,142.0
income tax returns or “frequencies” is reported in thousands in 15,000 up to 20,000 3,191.7
this particular table. This also makes the information easier to 20,000 up to 25,000 2,501.4
understand. 25,000 up to 30,000 1,901.6
30,000 up to 40,000 2,502.3
TABLE 2–6 Adjusted Gross Income for Individuals Filing
Income Tax Returns 40,000 up to 50,000 1,426.8
50,000 up to 75,000 1,476.3
75,000 up to 100,000 338.8
100,000 up to 200,000 223.3
200,000 up to 500,000 55.2
500,000 up to 1,000,000 12.0
1,000,000 up to 2,000,000 5.1
2,000,000 up to 10,000,000 3.4
10,000,000 or more 0.6
32 Chapter 2
Step 3: Set the individual class limits. State clear class limits so you can put each
observation into only one category. This means you must avoid overlapping
or unclear class limits. For example, classes such as “$1,300–$1,400”
and “$1,400–$1,500” should not be used, because it is not clear whether
the value $1,400 is in the first or second class. Classes stated as
“$1,300–$1,400” and “$1,500–$1,600” are frequently used, but may also be
confusing without the additional common convention of rounding all data at
or above $1,450 up to the second class and data below $1,450 down to the
first class. In this text, we will generally use the format $1,300 up to $1,400
and $1,400 up to $1,500 and so on. With this format it is clear that $1,399
goes into the first class and $1,400 in the second.
Because we round the class interval up to get a convenient class size,
we cover a larger than necessary range. For example, using 8 classes
with a width of $400 in the Applewood Auto Group example results in
a range of 8($400) $3,200. The actual range is $2,998, found by
($3,292 $294). Comparing that value to $3,200, we have an excess
of $202. Because we need to cover only the distance (H L), it is nat
ural to put approximately equal amounts of the excess in each of the
two tails. Of course, we should also select convenient class limits. A
guideline is to make the lower limit of the first class a multiple of the
class interval. Sometimes this is not possible, but the lower limit should
at least be rounded. So here are the classes we could use for this data.
Classes
$ 200 up to $ 600
600 up to 1,000
1,000 up to 1,400
1,400 up to 1,800
1,800 up to 2,200
2,200 up to 2,600
2,600 up to 3,000
3,000 up to 3,400
Step 4: Tally the vehicle profit into the classes. To begin, the profit from the
sale of the first vehicle in Table 2–4 is $1,387. It is tallied in the $1,000
up to
$1,400 class. The second profit in the first row of Table 2–4 is $2,148. It is
tallied in the $1,800 up to $2,200 class. The other profits are tallied in a sim
ilar manner. When all the profits are tallied, the table would appear as:
Profit Frequency
$ 200 up to $ 600 |||
||||
600 up to 1,000 |
|||| ||||
1,000 up to 1,400 |||
|||| |||| |||| ||||
1,400 up to 1,800 |||
|||| |||| |||| |||| |||| |||| ||||
1,800 up to 2,200 |||| |||| |||| |||| |||| |||| |||| |||| |||| |||| |||| |||| |||| |||| ||||
2,200 up to 2,600 ||
2,600 up to 3,000 ||||
|||| |||| ||||
3,000 up to 3,400 ||||
Total
Step 5: Count the number of items in each class. The number of observations
in each class is called the class frequency. In the $200 up to $600 class
there are 8 observations, and in the $600 up to $1,000 class there are 11
observations. Therefore, the class frequency in the first class is 8 and
the class frequency in the second class is 11. There are a total of
Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation 33
TABLE 2–7 Frequency Distribution of Profit for Vehicles Sold Last Month at
Applewood Auto Group
Profit Frequency
$ 200 up to $ 600 8
600 up to 1,000 11
1,000 up to 1,400 23
1,400 up to 1,800 38
1,800 up to 2,200 45
2,200 up to 2,600 32
2,600 up to 3,000 19
3,000 up to 3,400 4
Total 180
Now that we have organized the data into a frequency distribution, we can sum
marize the pattern in the profits of the vehicles for the Applewood Auto Group.
Observe the following:
1. The profit from a vehicle ranged between $200 and $3,400.
2. The profits are concentrated between $1,000 and $3,000. The profit on 157 vehi
cles or 87 percent was within this range.
3. The largest concentration, or highest frequency, is in the $1,800 up to $2,200
class. There are 45 observations. The middle of this class is $2,000. So we say
that the typical profit on selling a vehicle is $2,000.
By presenting this information to Ms. Ball, we give her a clear picture of the distri
bution of the vehicle profits for last month.
We admit that arranging the information on profits into a frequency distribution
does result in the loss of some detailed information. That is, by organizing the data
into a frequency distribution, we cannot pinpoint the exact profit on any vehicle,
such as $1,387, $2,148, or $2,201. Further, we cannot tell that the actual lowest
amount of profit for any vehicle sold is $294 or that the most profit was $3,292.
However, the lower limit of the first class and the upper limit of the largest class
convey essentially the same meaning. Likely, Ms. Ball will make the same judgment
if she knows the lowest price is about $200 that she will if she knows the exact profit
is $292. The advantages of condensing the data into a more understandable and
organized form more than offset this disadvantage.
Self-Review 2–2 The commissions earned for the first quarter of last year by the 11 members of the sales
staff at Master Chemical Company are:
$1,650 $1,475 $1,510 $1,670 $1,595 $1,760 $1,540 $1,495 $1,590 $1,625 $1,510
(a) What are the values such as $1,650 and $1,475 called?
(b) Using $1,400 up to $1,500 as the first class, $1,500 up to $1,600 as the second class,
and so forth, organize the quarterly commissions into a frequency distribution.
(c) What are the numbers in the right column of your frequency distribution called?
(d) Describe the distribution of quarterly commissions, based on the frequency distribution.
What is the largest concentration of commissions earned? What is the smallest, and the
largest? What is the typical amount earned?
We will use two other terms frequently: class midpoint and class interval. The
midpoint is halfway between the lower limits of two consecutive classes. It is com
puted by adding the lower limits of consecutive classes and dividing the result by 2.
34 Chapter 2
Referring to Table 2–7, the lower class limit of the first class is $200 and the next
class limit is $600. The class midpoint is $400, found by ($600 $200) 2. The mid
point of $400 best represents, or is typical of, the profits of the vehicles in that class.
To determine the class interval, subtract the lower limit of the class from the
lower limit of the next class. The class interval of the Applewood data is $400, which
we find by subtracting the lower limit of the first class, $200, from the lower limit of
the next class; that is, $600. ($600 $200 $400.) You can also determine the class
interval by finding the difference between consecutive midpoints. The midpoint of
the first class is $400 and the midpoint of the second class is $800. The difference is
$400.
Frequency Distribution-Quantitative
Profit Cumulative
Lower Upper Midpoint Width Frequency Percent Frequency Percent
200 600 400 400 8 4.4 8 4.4
600 1,000 800 400 11 6.1 19 10.6
1,000 1,400 1,200 400 23 12.8 42 23.3
1,400 1,800 1,600 400 38 21.1 80 44.4
1,800 2,200 2,000 400 45 25.0 125 69.4
2,200 2,600 2,400 400 32 17.8 157 87.2
2,600 3,000 2,800 400 19 10.6 176 97.8
3,000 3,400 3,200 400 4 2.2 180 100.0
180 100.0
Self-Review 2–3 Barry Bonds of the San Francisco Giants established a new single-season Major League
Baseball home run record by hitting 73 home runs during the 2001 season. The longest of these home runs
traveled 488 feet and the shortest 320 feet. You need to construct a frequency distribution of these home run
lengths.
(a) How many classes would you use?
(b) What class interval would you suggest?
(c) What actual classes would you suggest?
2.5 Relative Frequency Distribution
LO5 Understand a relative frequency distribution. centage of the vehicle profits are in the $1,000 up to $1,400
class. In another study, we may want to know what percentage
of the employees used 5 up to 10 personal leave days last year.
To convert a frequency distribution to a relative frequency dis
A relative frequency converts the frequency to a percentage. tribution, each of the class frequencies is divided by the total
It may be desirable, as we did earlier with qualitative data, to number of observa tions. From the distribution of vehicle
convert class frequen cies to relative class frequencies to show profits, Table 2–7, the relative frequency for the $1,000 up to
the proportion of the total number of observations in each $1,400 class is 0.128, found by dividing 23 by 180. That is,
class. In our vehicle profits, we may want to know what per profit
Describing Data: Frequency Tables, Frequency Distributions, and Graphic Presentation 35
on 12.8 percent of the vehicles sold is between $1,000 and $1,400. The relative fre
quencies for the remaining classes are shown in Table 2–8.
TABLE 2–8 Relative Frequency Distribution of Profit for Vehicles Sold Last Month at
Applewood Auto Group
Self-Review 2–4 Refer to Table 2–8, which shows the relative frequency distribution for the profit earned
on vehicles sold last month at the Applewood Auto Group.
(a) How many vehicles are in the $1,800 up to $2,200 class?
(b) What proportion of the vehicles sold for a profit of between $1,800 up to $2,200? (c)
What proportion of the vehicles sold for a profit of $2,200 or more?
Exercises
7. A set of data consists of 38 observations. How many classes would you recommend for
the frequency distribution?
8. A set of data consists of 45 observations between $0 and $29. What size would you rec
ommend for the class interval?
9. A set of data consists of 230 observations between $235 and $567. What class interval
would you recommend?
10. A set of data contains 53 observations. The lowest value is 42 and the largest is 129. The
data are to be organized into a frequency distribution.
a. How many classes would you suggest?
b. What would you suggest as the lower limit of the first class?
11. Wachesaw Manufacturing Inc. produced the following number of units in the last 16 days.
This icon indicates that the data is available at the text website: www.mhhe.com/lind15e. You will be able to download the
data directly into Excel or Minitab from this site.
27 27 27 28 27 25 25 28
26 28 26 28 31 30 26 26
65 98 55 62 79 59 51 90 72 56
70 62 66 80 94 79 63 73 71 85
c. What lower limit would you recommend for the first class?
d. Organize the number of oil changes into a frequency distribution.
e. Comment on the shape of the frequency distribution. Also determine the relative fre
quency distribution.
13. The manager of the BiLo Supermarket in Mt. Pleasant, Rhode Island, gathered the fol
lowing information on the number of times a customer visits the store during a month.
The responses of 51 customers were:
5 3314 4 56 42666 71
1 14 1 2 4 4 4 5 6 3 5 3 4 5 6
8 4 7 6 5 9 11 3 12 4 7 6 5 15 1
1 10 8 9 2 12
a. Starting with 0 as the lower limit of the first class and using a class interval of 3, orga
nize the data into a frequency distribution.
b. Describe the distribution. Where do the data tend to cluster?
c. Convert the distribution to a relative frequency distribution.
14. The food services division of Cedar River Amusement Park Inc. is studying the amount
families who visit the amusement park spend per day on food and drink. A sample of 40
families who visited the park yesterday revealed they spent the following amounts:
$77 $18 $63 $84 $38 $54 $50 $59 $54 $56 $36 $26 $50 $34 $44
41 58 58 53 51 62 43 52 53 63 62 62 65 61 52
60 60 45 66 83 71 63 58 61 71
a. Organize the data into a frequency distribution, using seven classes and 15 as the
lower limit of the first class. What class interval did you select?
b. Where do the data tend to cluster?
c. Describe the distribution.
d. Determine the relative frequency distribution.
Example Below is the frequency distribution of the profits on vehicle sales last month at the
Applewood Auto Group.
Profit Frequency
$ 200 up to $ 600 8
600 up to 1,000 11
1,000 up to 1,400 23
1,400 up to 1,800 38
1,800 up to 2,200 45
2,200 up to 2,600 32
2,600 up to 3,000 19
3,000 up to 3,400 4
Total 180
Construct a histogram. What observations can you reach based on the information
presented in the histogram?
yc
16
l a
n
l
ci
e
he c
V
(
f
24
u
8 811
q
r
e
eb
r
m
23
f
u s
s
N
From Chart 2–3 we note the profit on eight vehicles was $200 up to $600.
Therefore, the height of the column for that class is 8. There are 11 vehicles sales
where the profit was $600 up to $1,000. So, logically, the height of that column is 11.
The height of the bar represents the number of observations in the class.
This procedure is continued for all classes. The complete histogram is shown in
Chart 2–4. Note that there is no space between the bars. This is a feature of the
histogram. Why is this so? Because the variable plotted on the horizontal axis is
quantitative and the ratio scale of measurement. In a bar chart, the scale of mea
surement is nominal and the vertical bars are separated. These are important dis
tinctions between the histogram and the bar chart.
From Chart 2–4 we can make the following statement:
1. The profit from a vehicle ranged from about $200 up to about $3,400. 2. The
profits are concentrated between $1,000 and $3,000. The profit on 157 vehicles, or
87 percent, was within this range.
3. The largest concentration, or highest frequency, is in the $1,800 up to $2,200
class. The middle of this class is $2,000. So we say that the typical profit on
selling a vehicle is $2,000.
35 45 32
l r
Statistics in Nightingale is c
i
f
s
8 11 4
Action known as the
s
he
V
c a
l
dl
n
23
c
Florence
o
e
(
32 24 16 8 19
S
u
founder of the nurs ing profession. How ever, 200 600 1000 1400 1800 2200 2600 3000 3400 Profit $
she also saved many lives by using
that class. The class frequency is the number of observations in a particular class.
The profit earned on the vehicles sold last month by the Applewood Auto Group is
repeated below.
As noted previously, the $200 up to $600 class is represented by the mid point
$400. To construct a frequency polygon, move horizontally on the graph to the
midpoint, $400, and then vertically to 8, the class frequency, and place a dot. The X
and the Y values of this point are called the coordinates. The coordinates of the next
point are X 800 and Y 11. The process is continued for all classes. Then the points
are connected in order. That is, the point representing the lowest class is joined to
the one representing the second class and so on. Note in Chart 2–5 that, to
complete the frequency polygon, midpoints of $0 and $3,600 are added to the X-
axis to “anchor” the polygon at zero frequencies. These two values, $0 and $3,600,
were derived by subtracting the class interval of $400 from the lowest mid point
($400) and by adding $400 to the highest midpoint ($3,200) in the frequency
distribution.
48
40
yc
e
24 0 400
u
16
Profit $
e
8
F
32
CHART 2–5 Frequency Polygon of Profit on 180 Vehicles Sold at Applewood Auto Group
Both the histogram and the frequency polygon allow us to get a quick picture of
the main characteristics of the data (highs, lows, points of concentration, etc.).
Although the two representations are similar in purpose, the histogram has the
advantage of depicting each class as a rectangle, with the height of the rectangu lar
bar representing the number in each class. The frequency polygon, in turn, has an
advantage over the histogram. It allows us to compare directly two or more
40 Chapter 2
frequency distributions. Suppose Ms. Ball wants to compare the profit per vehicle
sold at Applewood Auto Group with a similar auto group, Fowler Auto in Grayling,
Michigan. To do this, two frequency polygons are constructed, one on top of the
other, as in Chart 2–6. Two things are clear from the chart:
• The typical vehicle profit is larger at Fowler Motors—about $2,000 for Apple
wood and about $2,400 for Fowler.
• There is less dispersion in the profits at Fowler Motors than at Applewood. The
lower limit of the first class for Applewood is $0 and the upper limit is
$3,600. For Fowler Motors, the lower limit is $800 and the upper limit is the
same: $3,600.
The total number of cars sold at the two dealerships is about the same, so a
direct comparison is possible. If the difference in the total number of cars sold is
large, then converting the frequencies to relative frequencies and then plotting the
two distributions would allow a clearer comparison.
yc
e
24 800 1200 1600 2000 2400 2800 3200 3600
Profit $
u
e
16
r
8
F
56
48 0 400
40 Fowler Motors
32 Applewood
CHART 2–6 Distribution of Profit at Applewood Auto Group and Fowler Motors
Self-Review 2–5 The annual imports of a selected group of electronic suppliers are shown in the following
frequency distribution.
Exercises
15. Molly’s Candle Shop has several retail stores in the coastal areas of North and South
Carolina. Many of Molly’s customers ask her to ship their purchases. The following chart
shows the number of packages shipped per day for the last 100 days.
13 10
yc
ne
23
18
u
30 20 28
q
35
r
10
0
5 10 15 20 25 30 35
Number of Packages
yc
e
30 0
u
2 4 6 8 10 12 Number of Patients
q
e
20
r
10
d. A frequency polygon is to be drawn. What are the coordinates of the plot for the first
class?
e. Construct a frequency polygon.
f. Interpret the frequent flier miles accumulated using the two charts.
18. Ecommerce.com, a large Internet retailer, is studying the lead time (elapsed time between
when an order is placed and when it is filled) for a sample of recent orders. The lead
times are reported in days.
Example The frequency distribution of the profits earned at Applewood Auto Group is repeated from
Table 2–7.
Profit Frequency
$ 200 up to $ 600 8
600 up to 1,000 11
1,000 up to 1,400 23
1,400 up to 1,800 38
1,800 up to 2,200 45
2,200 up to 2,600 32
2,600 up to 3,000 19
3,000 up to 3,400 4
Total 180
the profit earned was less than $600. Those 8 vehicles, plus the 11 in the next higher
class, for a total of 19, earned a profit of less than $1,000. The cumulative frequency
for the next higher class is 42, found by 8 11 23. This process is continued for all
the classes. All the vehicles earned a profit of less than $3,400. (See Table 2–9.)
TABLE 2–9 Cumulative Frequency Distribution for Profit on Vehicles Sold Last
Month at Applewood Auto Group
To plot a cumulative frequency distribution, scale the upper limit of each class
along the X-axis and the corresponding cumulative frequencies along the Y-axis. To
provide additional information, you can label the vertical axis on the left in units and
the vertical axis on the right in percent. In the Applewood Auto Group, the vertical
axis on the left is labeled from 0 to 180 and on the right from 0 to 100 percent. The
value of 50 percent corresponds to 90 vehicles.
To begin, the first plot is at X 200 and Y 0. None of the vehicles sold for a
profit of less than $200. The profit on 8 vehicles was less than $600, so the next plot
is at X 600 and Y 8. Continuing, the next plot is X 1,000 and Y 19. There were
19 vehicles that sold for a profit of less than $1,000. The rest of the points are
plotted and then the dots connected to form the chart below.
dl V
S
20
s
f
e
0
l
o
he
V
f 100 t
r
r
e
e
25
m
u
75 P
d
N
l
60 s
40
c
CHART 2–7 Cumulative Frequency Polygon for Profit on Vehicles Sold Last Month
at Applewood Auto Group
44 Chapter 2
To find the amount of profit earned on 75 percent of the cars sold, draw a hor
izontal line from the 75 percent mark on the right-hand vertical axis over to the poly
gon, then drop down to the X-axis and read the amount of profit. The value on the X-
axis is about $2,300, so we estimate that 75 percent of the vehicles sold earned a
profit for the Applewood group of $2,230.
To find the profit earned on 60 vehicles, we locate the value of 60 on the left
hand vertical axis. Next, we draw a horizontal line from the value of 60 to the poly
gon and then drop down to the X-axis and read the profit. It is about $1,590, so we
estimate that 60 of the vehicles sold for a profit of less than $1,590. We can also
make estimates of the percentage of vehicles that sold for less than a partic ular
amount. To explain, suppose we want to estimate the percentage of vehicles that
sold for a profit of less than $1,600. We begin by locating the value of $1,600 on the
X-axis, move vertically to the polygon, and then horizontally to the vertical axis on
the right. The value is about 56 percent, so we conclude that 56 percent of the
vehicles sold for a profit of less than $1,600.
Self-Review 2–6 A sample of the hourly wages of 15 employees at Home Depot in Brunswick, Georgia,
was organized into the following table.
Exercises
19. The following chart shows the hourly wages of a sample of certified welders in the Atlanta,
Georgia area.
yc
e
50
c
r
u
e
q
P
e
10
F
40
100
30
75
t 25
n
20
0 5 10 15 20 25 30 Hourly Wage
c. About how many welders earn less than $10.00 per hour?
d. About 75 percent of the welders make less than what amount?
e. Ten of the welders studied made less than what amount?
f. What percent of the welders make less than $20.00 per hour?
20. The following chart shows the selling price ($000) of houses sold in the Billings, Montana
area.
yc e
n
100 P
25
e
75
q
r
t
F
n
200 150 100 50 50 0 100 150 200 250 350 50 300 Selling Price
($000)
e
a. How many orders were filled in less than 10 days? In less than 15 days? b.
Convert the frequency distribution to a cumulative frequency distribution. c.
Develop a cumulative frequency polygon.
d. About 60 percent of the orders were filled in less than how many days?
46 Chapter 2
Chapter Summary
I. A frequency table is a grouping of qualitative data into mutually exclusive classes show ing
the number of observations in each class.
II. A relative frequency table shows the fraction of the number of frequencies in each class.
III. A bar chart is a graphic representation of a frequency table.
IV. A pie chart shows the proportion each distinct class represents of the total number of
frequencies.
V. A frequency distribution is a grouping of data into mutually exclusive classes showing the
number of observations in each class.
A. The steps in constructing a frequency distribution are:
1. Decide on the number of classes.
2. Determine the class interval.
3. Set the individual class limits.
4. Tally the raw data into classes.
5. Count the number of tallies in each class.
B. The class frequency is the number of observations in each class.
C. The class interval is the difference between the limits of two consecutive classes.
D. The class midpoint is halfway between the limits of consecutive classes.
VI. A relative frequency distribution shows the percent of observations in each class.
VII. There are three methods for graphically portraying a frequency distribution.
A. A histogram portrays the number of frequencies in each class in the form of a rectangle.
B. A frequency polygon consists of line segments connecting the points formed by the
intersection of the class midpoint and the class frequency.
C. A cumulative frequency distribution shows the number or percent of observations
below given values.
Chapter Exercises
23. Describe the similarities and differences of qualitative and quantitative variables. Be sure to
include:
a. What level of measurement is required for each variable type?
b. Can both types be used to describe both samples and populations?
24. Describe the similarities and differences of a frequency table and a frequency distribution.
Be sure to include which requires qualitative data and which requires quantitative data.
25. Alexandra Damonte will be building a new resort in Myrtle Beach, South Carolina. She
must decide how to design the resort based on the type of activities that the resort will
offer to its customers. A recent poll of 300 potential customers showed the following
results about customers’ preferences for planned resort activities:
On-time On-time Early Late On-time On-time On-time On-time Late On-time Early On-time On-time Early On-time
On-time On-time On-time On-time On-time Early On-time Early On-time On-time On-time Early On-time On-time
On-time Early On-time On-time Late Early Early On-time On-time On-time Early
On-time Late Late On-time On-time On-time On-time On-time On-time On-time On-time Late Early On-time Early
On-time Lost On-time On-time On-time Early Early On-time On-time Late Early Lost On-time On-time On-time On-
time On-time Early On-time Early On-time Early On-time Late On-time On-time Early On-time On-time On-time Late
On-time Early On-time On-time On-time On-time On-time On-time On-time Early Early On-time On-time On-time
a. What scale is used to measure delivery performance? What kind of variable is delivery
performance?
b. Construct a frequency table for delivery performance for last month.
c. Construct a relative frequency table for delivery performance last month.
d. Construct a bar chart of the frequency table for delivery performance for last month. e.
Construct a pie chart of on-time delivery performance for last month.
f. Analyze the data summaries and write an evaluation of last month’s delivery perfor
mance as it relates to Speedy Swift’s performance objectives. Write a general recom
mendation for further analysis.
27. A data set consists of 83 observations. How many classes would you recommend for a
frequency distribution?
28. A data set consists of 145 observations that range from 56 to 490. What size class inter val
would you recommend?
29. The following is the number of minutes to commute from home to work for a group of
automobile executives.
28 25 48 37 41 19 32 26 16 23 23 29 36
31 26 21 32 25 31 43 35 42 38 33 28
31. A social scientist is studying the use of iPods by college students. A sample of 45 stu
dents revealed they played the following number of songs yesterday.
468 79637767147 7
4 6 4 10 2 4 6 3 4 6 8 4 3 3 6
8 8 4 6 4 6 5 5 9 6 8 8 6 5 10
8 8 6 11 11 9 8 5 11 4 8 5 14 7 12 8 6 11 9 7
9 15 8 8 12 5 9 8 5 9 10 11 3 9 8 6
34. The Journal of Finance made its content available on the Internet starting in July of 2005.
The table below shows the number of times a monthly version was downloaded and the
number of articles that were viewed during each month. Suppose you wish to make a
frequency distribution of the number of downloads.
312 2,753 2,595 6,057 7,624 6,624 6,362 6,575 7,760 7,085 7,272
5,967 5,256 6,160 6,238 6,709 7,193 5,631 6,490 6,682 7,829 7,091
6,871 6,230 7,253 5,507 5,676 6,974 6,915 4,999 5,689 6,143 7,086
Presentation 49 35. The following histogram shows the scores on the first exam for a
statistics class.
yc
e
10 14 21
u
q
5
0 12
3
e
25 20 15 6
50 60 70 80 90 100
Score
yc
e
150 100 200 250 300 350
u
q
100 75
50
e
50
r
250
n
0 50 100 150
e
25
cr
200
eP
Selling Price ($000)
$140 $ 82 $265 $168 $ 90 $114 $172 $230 $142 86 125 235 212 171 149 156 162 118
139 149 132 105 162 126 216 195 127 161 135 172 220 229 129 87 128 126 175 127
149 126 121 118 172 126
a. Arrive at a suggested class interval. Use six classes, and let the lower limit of the first
class be $70.
b. What would be a better class interval?
c. Organize the data into a frequency distribution using a lower limit of $80. d.
Interpret your findings.