NHUMUOI Ws4 Spring25 Summary Statistics

The document is a worksheet for an introductory statistics course, covering topics such as distribution shapes, variance, standard deviation, quantiles, and robust statistics. It includes examples and calculations related to Facebook friends, housing prices, noise levels, and income variability, highlighting the importance of using median and IQR in the presence of skewed data and outliers. The worksheet provides practical exercises for students to apply statistical concepts and methods.

Uploaded by

lathinhumuoi.work

MATH105-Intro to Stats Worksheet #4 Spring 25

Name: La Thi Nhu Muoi    Date: 16/01/2025

1. [Distribution shape: mean vs median]

50% of Facebook users have 100 or more friends, so the median is 100, while the average
number of friends (mean) is 190. Because the mean (190) is well above the median (100),
a small number of users with very many friends must be pulling the mean upward, which
implies that the distribution of the number of Facebook friends is right-skewed.
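
As a rough illustration of the reasoning above, the sketch below uses a small made-up sample (invented for this example, not real Facebook data) in which most counts are modest and a few are very large. The variable names are hypothetical.

```python
from statistics import mean, median

# Hypothetical friend counts, invented for illustration only:
# most values are modest, a few are very large (long right tail).
friend_counts = [20, 45, 60, 80, 100, 100, 120, 150, 400, 825]

sample_mean = mean(friend_counts)
sample_median = median(friend_counts)

# The few large values pull the mean up; the median barely notices them.
print(f"mean = {sample_mean}, median = {sample_median}")
print("mean > median:", sample_mean > sample_median)
```

The sample was chosen so that its mean and median match the figures quoted in the problem; any sample with a long right tail would show the same ordering.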

2. [Shape of distribution]

(a) The housing prices are right-skewed: 25% of houses cost below $350,000,
50% below $450,000, and 75% below $1,000,000, with a significant number of houses
costing more than $6,000,000. The gap Q3 − Q2 = $550,000 is much larger than
Q2 − Q1 = $100,000, and the large gap between Q3 = $1,000,000 and the extreme
values above $6,000,000 indicates a long right tail. Therefore, the median is the best
measure of central tendency, and the IQR is the best measure of variability.
(b) 25% of houses cost below $300,000, 50% below $600,000, and 75% below $900,000,
with very few houses exceeding $1,200,000. The gaps between consecutive quartiles are
equal ($300,000 each) and there are few extreme values, so the distribution is
approximately symmetric. The median represents the typical house price well, and the IQR
captures the variability while reducing the influence of rare expensive homes. Because
the distribution is nearly symmetric, the mean and standard deviation would also be
reasonable summaries here.
(c) This distribution is right-skewed: because most people don't drink, the lower
quartiles (Q1 and possibly Q2) are at or near zero, while the few who drink excessively
create a long right tail. The median best represents typical consumption, and the IQR
measures variability by focusing on the middle 50% of the data, so it is not inflated by
the heavy drinkers.
(d) In this case, most employees earn similar salaries, but a few high-level executives
earn disproportionately more. This creates a right-skewed distribution with a large gap
between the upper quartile (Q3) and the extreme salaries. The skew is pronounced only
when the executives' salaries are far above everyone else's. The median is the most
suitable measure of central tendency, and the IQR best represents the spread of typical
salaries by ignoring the extreme outliers.
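
One way to make the "compare the gaps between quartiles" argument concrete is Bowley's quartile skewness coefficient. The worksheet itself does not use this statistic; it is brought in here only as an illustrative sketch, and the function name is hypothetical. The inputs are the quartiles quoted in parts (a) and (b).

```python
def quartile_skewness(q1, q2, q3):
    """Bowley's quartile skewness: (Q3 - Q2) - (Q2 - Q1), scaled by the IQR."""
    return ((q3 - q2) - (q2 - q1)) / (q3 - q1)

# Quartiles quoted in parts (a) and (b), in dollars.
skew_a = quartile_skewness(350_000, 450_000, 1_000_000)
skew_b = quartile_skewness(300_000, 600_000, 900_000)

print(f"(a) {skew_a:.2f}")  # positive: the upper half is more stretched out
print(f"(b) {skew_b:.2f}")  # zero: the quartiles are evenly spaced
```

A value near 0 suggests rough symmetry in the middle of the data, while a clearly positive value suggests a right skew. The coefficient ignores the tails, so the houses above $6,000,000 in (a) are extra evidence that it does not capture.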

3. [Variance and standard deviation (std)] The time between an electric light
stimulus and a bar press to avoid a shock was noted for each of the five conditioned rats.
Use the definition (formula) to compute the sample variance and the standard deviation
(std). Shock avoidance times (in seconds) are:

a. 5, 4, 3, 1, 3

b. 3, 3.5, 3.5, 2.8, 3.2

Compute and then compare the mean and std in (a) and (b).

(a)
\[ \bar{x} = \frac{5+4+3+1+3}{5} = 3.2 \]
\[ s^2 = \frac{(5-3.2)^2+(4-3.2)^2+(3-3.2)^2+(1-3.2)^2+(3-3.2)^2}{5-1} = \frac{8.8}{4} = 2.2 \]
\[ s = \sqrt{2.2} \approx 1.48 \]

(b)
\[ \bar{x} = \frac{3+3.5+3.5+2.8+3.2}{5} = 3.2 \]
\[ s^2 = \frac{(3-3.2)^2+(3.5-3.2)^2+(3.5-3.2)^2+(2.8-3.2)^2+(3.2-3.2)^2}{5-1} = \frac{0.38}{4} = 0.095 \]
\[ s = \sqrt{0.095} \approx 0.31 \]

Both datasets have the same mean of 3.2 seconds. Dataset (a) has the higher variance (2.2)
and standard deviation (1.48), so its times are more spread out around the mean. Dataset (b)
has a much lower variance (0.095) and standard deviation (0.31), meaning that the times in
(b) are more consistent.
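
The hand calculation can be checked with a short script. This is a minimal sketch that applies the definition of the sample variance directly (dividing by n − 1); the function name is hypothetical.

```python
from math import sqrt

def sample_variance(values):
    """Sample variance from the definition: sum of squared deviations / (n - 1)."""
    n = len(values)
    x_bar = sum(values) / n
    return sum((x - x_bar) ** 2 for x in values) / (n - 1)

times_a = [5, 4, 3, 1, 3]
times_b = [3, 3.5, 3.5, 2.8, 3.2]

for label, times in (("a", times_a), ("b", times_b)):
    var = sample_variance(times)
    print(f"({label}) mean = {sum(times) / len(times):.2f}, "
          f"s^2 = {var:.3f}, s = {sqrt(var):.2f}")
```

Python's standard library also offers `statistics.variance` and `statistics.stdev`, which use the same n − 1 divisor.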

4. [Quantiles] The following data give noise levels (in decibels) measured at
different times directly outside of Grand Central Station in Manhattan.
82, 89, 94, 110, 74, 122, 112, 95, 100, 78, 65, 60, 90, 83, 87, 75, 114, 85

a) Determine the quartiles and IQR.

Arranged data:
60, 65, 74, 75, 78, 82, 83, 85, 87, 89, 90, 94, 95, 100, 110, 112, 114, 122
Because there is an even number of data values (18), the median is the mean of the ninth
and tenth values.
(87+ 89)/2 = 88 => Q2 = 88
To find the first quartile, we look at the median of the bottom half of the data set.
This half has nine values, so its median is in the fifth position:
60, 65, 74, 75, 78, 82, 83, 85, 87
=> The first quartile is Q1 = 78.
To find the third quartile, we look at the median of the top half of the original data set.
89, 90, 94, 95, 100, 110, 112, 114, 122
=> The third quartile is Q3 = 100.
IQR = Q3 − Q1 = 100 − 78 = 22
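
The median-of-halves procedure used above can be sketched in a few lines. The function name is hypothetical, and the rule for odd-sized data (leave the middle value out of both halves) is an assumption, since the worksheet only covers the even case.

```python
from statistics import median

noise = [82, 89, 94, 110, 74, 122, 112, 95, 100, 78, 65, 60,
         90, 83, 87, 75, 114, 85]

def quartiles_by_halves(values):
    """Q1, Q2, Q3 using the median-of-halves rule from the worksheet."""
    data = sorted(values)
    half = len(data) // 2
    lower = data[:half]    # bottom half
    upper = data[-half:]   # top half (skips the middle value if n is odd)
    return median(lower), median(data), median(upper)

q1, q2, q3 = quartiles_by_halves(noise)
print("Q1 =", q1, "Q2 =", q2, "Q3 =", q3, "IQR =", q3 - q1)
```

Statistical software often uses interpolation-based conventions for quartiles, so a library function may report slightly different values of Q1 and Q3 for the same data.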

b) Draw a boxplot of the noise levels.
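
The numbers needed to draw the boxplot by hand can be computed as follows. This sketch assumes the common 1.5 × IQR rule for whiskers and outliers, which the worksheet does not state explicitly, and takes the quartiles from part (a).

```python
noise = sorted([82, 89, 94, 110, 74, 122, 112, 95, 100, 78, 65, 60,
                90, 83, 87, 75, 114, 85])

# Quartiles taken from part (a).
q1, q2, q3 = 78, 88, 100
iqr = q3 - q1

# Conventional 1.5 * IQR fences for flagging outliers (an assumed convention).
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

inside = [x for x in noise if lower_fence <= x <= upper_fence]
outliers = [x for x in noise if x < lower_fence or x > upper_fence]

print("box:", q1, q2, q3)
print("fences:", lower_fence, upper_fence)
print("whiskers:", min(inside), max(inside))
print("outliers:", outliers)
```

Under this rule every observation lies inside the fences, so the whiskers extend to the minimum and maximum of the data and no points are drawn as outliers.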

c) Find the top 20% noise levels

Percentile (P) = 80
Total count of values (N) = 18
Percentile = (n/N) x 100
Solving this formula for the position n gives
n = (P x N)/100
= (80 x 18)/100
= 14.4
So the 80th percentile lies above 100 (the 14th value) but below 110 (the 15th value).
=> The top 20% of noise levels are 110, 112, 114, 122.

d) Find the bottom 10% noise levels.

Percentile = 10

Total count of values (N) = 18
Percentile = (n/N) x 100
Solving this formula for the position n gives
n = (P x N)/100
= (10 x 18)/100
= 1.8
So the 10th percentile lies between the 1st and 2nd values in the sorted data (60 and 65).
=> The bottom 10% of noise levels contains only the lowest value, 60.
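
The position formula used in parts (c) and (d) can be sketched as below. The helper name is hypothetical, and the rule for turning a fractional position into a cut (positions above 14.4 for the top 20%, positions at or below 1.8 for the bottom 10%) follows the reasoning in the answers above.

```python
noise = sorted([82, 89, 94, 110, 74, 122, 112, 95, 100, 78, 65, 60,
                90, 83, 87, 75, 114, 85])

def percentile_position(p, count):
    """Position n solving p = (n / N) * 100."""
    return p * count / 100

N = len(noise)
pos_80 = percentile_position(80, N)   # 14.4
pos_10 = percentile_position(10, N)   # 1.8

# 1-based positions strictly above pos_80 form the top 20%;
# 1-based positions at or below pos_10 form the bottom 10%.
top_20 = [x for i, x in enumerate(noise, start=1) if i > pos_80]
bottom_10 = [x for i, x in enumerate(noise, start=1) if i <= pos_10]

print("top 20%:", top_20)
print("bottom 10%:", bottom_10)
```

With only 18 observations the cuts cannot fall exactly at 20% and 10%: four values make up about 22% of the data, and one value about 6%.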

5. [Robust statistics]

(a) The median would best represent the typical income of the 42 patrons at this coffee shop.
Before adding the two extremely high incomes ($225,000 and $250,000), the mean was $65,090
and the median was $65,240. The two values were very close, which is consistent with a roughly
symmetric distribution. After adding these high incomes, the mean jumped to $73,300, while the
median increased only slightly to $65,350. This large change in the mean demonstrates its
sensitivity to extreme values, whereas the median remained stable. The median is therefore more
robust than the mean when outliers are present, making it a better measure of typical income in
this situation.
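
The jump in the mean can be verified from the quoted figures alone. The sketch below assumes there were 40 patrons before the two high earners arrived (42 minus 2). The raw incomes are not given, so the median and standard deviation cannot be recomputed this way.

```python
# Figures quoted in the answer above.
old_mean = 65_090
new_incomes = [225_000, 250_000]
n_after = 42
n_before = n_after - len(new_incomes)   # assumed: 40 patrons originally

# Total income = old total + the two new incomes; divide by the new count.
new_mean = (n_before * old_mean + sum(new_incomes)) / n_after
print(new_mean)
```

Two incomes out of 42 are enough to raise the mean by more than $8,000, which is the sensitivity to outliers described above.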

(b) The interquartile range (IQR) would best represent the variability in the incomes of the 42
patrons. Before adding the two high incomes, the standard deviation was $2,122, but it
drastically increased to $37,321 after the outliers were introduced. This sharp increase shows that
the standard deviation is highly sensitive to outliers, making it an unreliable measure of
variability in this case. In contrast, the IQR focuses on the middle 50% of the data and remains
largely unaffected by extreme values. Therefore, the IQR is a more robust and reliable measure
of variability compared to the standard deviation in the presence of outliers.
