0% found this document useful (0 votes)

5 views7 pages

Statistics

The document discusses key statistical concepts including measures of spread such as range, interquartile range, and standard deviation, along with their calculations. It also covers the relationship between variables through correlation coefficients and highlights the importance of identifying outliers in data analysis. Additionally, it touches on probability, particularly focusing on mutually exclusive events and the binomial distribution.

Uploaded by

mcvoidrunner1952

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

5 views7 pages

Statistics

Uploaded by

mcvoidrunner1952

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

Statistics

16 – Working with Data

frequeny
Frequency density =
class width

The range is the difference between the largest and smallest values in a data set, and the
interquartile range is the difference between the upper and lower quartile. While they can be
a measure of spread, neither consider all the values given.

To counter this, we can use the standard deviation, usually given by the symbol σ.

Let’s consider the data 2,5,8, which has the mean (usually denoted by x̄ ) of 5.

We can look at the difference of each data point from the mean:

( x represents the data x x−x̄

point)
2 -3

5 0

8 3

The mean of the differences will always be 0, as the negatives will cancel out the positives.
This means that it cannot be used as a measure of spread. Because of this, we can square
the differences to ensure that they are non-negative.
x ( x - x̄ )²

2 9

5 0

8 9

The average is given by adding all the values of ( x - x̄ )² and dividing by n , the number of data
items. The symbol for adding up all the values is Σ.

9+0+ 9 18
In our case, the average would be = =6
3 3
However, we would need to undo the squaring to ensure the measure has the same units as
x . This means that the standard deviation for our data would be √ 6

√
2
Standard deviation: σ = Σ ( x− x̄ )
n

Standard deviation can also be thought of as:

σ=
√
Σ x2
n
− x̄ ²

Which can also be written as ‘the mean of the squares minus the square of the means’:

σ =(x ²)−(x)²
Variance (σ ² ) is the square of standard deviation and has very useful mathematical
properties.

Calculations from frequency tables:

Σ fx ²
x=
n
Where f is the frequency of each x value and n is the total frequency

2 Σ fx ²
σ = −x ²
n

Now, let’s have a look if there is a relationship between two variables. Data that comes in
pairs int this fashion is said to be bivariate. When we have these two sets of data, there may
or may not be a relationship between them. We can describe the relationship between them
by investigation their correlation.

However, instead of describing the correlation with words, we can use a numerical value, the
correlation coefficient, r, which can only take values of −1 ¿ r ¿1
As x increases, y generally
Strong posi- increases. r ≈ 1
tive correla-
tion

As x increases, y generally
Strong nega- decreases. r ≈−1
tive

No clear linear relationship

No correla- between x and y . r ≈ 0
tion

If there is perfect correlation, r =±1

However, just because r ≈ 0 doesn't mean that there is no relationship between the two
variables – it just means that there is no linear relationship.

Scatter diagrams can also reveal if there are 2 separate groups within the data

However, you must remember that correlation does not equal causation. Such correlation
may be due to a coincidence, or due to a third hidden variable. For example, there might be
a strong correlation between ice cream sales and number of swimmers at a beach. Clearly,
eating ice cream doesn’t make you want to swim; instead, the hidden variable of
temperature could cause both to rise.
When working with real-world data, there may be errors, missing data, or extreme values
that can distort results.

Often the most useful thing to do is to look at your data graphically. And if the underlying
pattern is strong, outliers can become obvious.

There are also some calculations you can do to check for outliers:

 An outlier is any number more than 1.5 interquartile ranges away from the nearest
quartile

 Any outlier is more than 2 standard deviations away from the mean

Once an outlier has been spotted, you must decide then decide whether to include it in your
calculation. This often requires you to look at the data in context:

 If an outlier is clearly an error (e.g. wrong units/impossible value) then it should be

excluded from the data

 If there are several outliers it might be a distinctly different group which should be
analysed separately.
17 – Probability

Events are mutually exclusive if they both cannot happen at the same time e.g. rolling a 6
and a 5 on a die in 1 roll. If the events are mutually exclusive:

P ( A∧B ) =0

P ( A∨B ) =P ( A )+ P (B)

P ( A∧B ) =P ( A ) × P (B)

P ( A ) + P ( ' not A ' ) =1

Bionomial distribution

Full Statistics For Managers Using Microsoft Exce Global 8th Edition Levine Solutions Manual All Chapters
100% (3)
Full Statistics For Managers Using Microsoft Exce Global 8th Edition Levine Solutions Manual All Chapters
49 pages
Maths Stats
No ratings yet
Maths Stats
15 pages
STD 10 Chap 4 Data Merging Notes
No ratings yet
STD 10 Chap 4 Data Merging Notes
4 pages
Methanol Talk
No ratings yet
Methanol Talk
9 pages
Onyx Stats Notes CH 1 6 Dg0r5q
No ratings yet
Onyx Stats Notes CH 1 6 Dg0r5q
63 pages
MLCourse Slides
No ratings yet
MLCourse Slides
427 pages
1.1 CS3352-FDS - Unit 1
No ratings yet
1.1 CS3352-FDS - Unit 1
42 pages
Research For RA2 - Hess & Hess's Law
No ratings yet
Research For RA2 - Hess & Hess's Law
1 page
Research For RA3
No ratings yet
Research For RA3
1 page
BP 4 Methanol Talk
No ratings yet
BP 4 Methanol Talk
1 page
File Acc Praktikum
No ratings yet
File Acc Praktikum
51 pages
MAT 161 Lesson - 4
No ratings yet
MAT 161 Lesson - 4
26 pages
Stats - The Theory 2
No ratings yet
Stats - The Theory 2
25 pages
Statistics Notes
No ratings yet
Statistics Notes
32 pages
Full Bound Reference
No ratings yet
Full Bound Reference
83 pages
Data Visualization
No ratings yet
Data Visualization
37 pages
Auronova Consulting
No ratings yet
Auronova Consulting
8 pages
Unit 3
No ratings yet
Unit 3
6 pages
ML Course Slides
No ratings yet
ML Course Slides
356 pages
BCA Mathematics
No ratings yet
BCA Mathematics
25 pages
Data Analysis and Visualization EDA
No ratings yet
Data Analysis and Visualization EDA
51 pages
Chapter 02-Describing Distributions With Numbers
No ratings yet
Chapter 02-Describing Distributions With Numbers
21 pages
Chapter 10 Data Analysis-Quantitative
No ratings yet
Chapter 10 Data Analysis-Quantitative
93 pages
A Level Maths - Statistics Revision Notes
No ratings yet
A Level Maths - Statistics Revision Notes
9 pages
DSILYTC Session 5 - Descriptive Statistics
No ratings yet
DSILYTC Session 5 - Descriptive Statistics
99 pages
Week Probability and Statistics
No ratings yet
Week Probability and Statistics
17 pages
Data Mining Unit 1
No ratings yet
Data Mining Unit 1
39 pages
Measures of Dispersion Updated
No ratings yet
Measures of Dispersion Updated
38 pages
CH 4
No ratings yet
CH 4
49 pages
Chapter 3
No ratings yet
Chapter 3
23 pages
Measures of The Spread of The Data (Ch2Sec7)
No ratings yet
Measures of The Spread of The Data (Ch2Sec7)
24 pages
Comprehensive MSC Biostatistics Notes
No ratings yet
Comprehensive MSC Biostatistics Notes
5 pages
Stats Notes by Warad
No ratings yet
Stats Notes by Warad
5 pages
Stats 1 Formulae
No ratings yet
Stats 1 Formulae
26 pages
RM EBBA Class 8 CH0 11 Quatitative Analysis
No ratings yet
RM EBBA Class 8 CH0 11 Quatitative Analysis
37 pages
Prob and Stats Notes
No ratings yet
Prob and Stats Notes
12 pages
BA Computer Lab 1-Data Preprocessing
No ratings yet
BA Computer Lab 1-Data Preprocessing
6 pages
PMT Mock 2 QP
No ratings yet
PMT Mock 2 QP
20 pages
107 Final Q. Solve 50 Batch
No ratings yet
107 Final Q. Solve 50 Batch
63 pages
AP Statistics Portfolio Q2
No ratings yet
AP Statistics Portfolio Q2
17 pages
C955 Formulas and Key Concepts
100% (1)
C955 Formulas and Key Concepts
14 pages
Assignment 2
No ratings yet
Assignment 2
33 pages
Data Analytics Compendium BITeSys 2024
No ratings yet
Data Analytics Compendium BITeSys 2024
46 pages
Chapter 3
No ratings yet
Chapter 3
59 pages
Unit 3
No ratings yet
Unit 3
31 pages
Stastics For Data Science1 (Quiz1 Notes)
No ratings yet
Stastics For Data Science1 (Quiz1 Notes)
2 pages
Clinical Practice: Development and Validation of A New Index To Measure Emergency Department Crowding
No ratings yet
Clinical Practice: Development and Validation of A New Index To Measure Emergency Department Crowding
5 pages
Statistical Analysis in Excel by Golden MCpherson
No ratings yet
Statistical Analysis in Excel by Golden MCpherson
315 pages
Data Visualization 2
No ratings yet
Data Visualization 2
3 pages
Descriptive Statistics - Book
No ratings yet
Descriptive Statistics - Book
101 pages
4x @6ote ) 'Btda2@m
No ratings yet
4x @6ote ) 'Btda2@m
55 pages
MLCourse Slides
No ratings yet
MLCourse Slides
356 pages
Mathematics 2024 BOT Grade 10 Term 3 Learner Notes 240617 141106
No ratings yet
Mathematics 2024 BOT Grade 10 Term 3 Learner Notes 240617 141106
25 pages
Ge 4 - Topic 2-Statistics
No ratings yet
Ge 4 - Topic 2-Statistics
8 pages
DV Stat
No ratings yet
DV Stat
39 pages
Statistics Introduction
No ratings yet
Statistics Introduction
37 pages
(2017) Capturing Channelized Reservoir Connectivity Uncertainty With Amalgamation Curves
No ratings yet
(2017) Capturing Channelized Reservoir Connectivity Uncertainty With Amalgamation Curves
42 pages
7CCMMS61 Statistics For Data Analysis: Francisco Javier Rubio Department of Mathematics
No ratings yet
7CCMMS61 Statistics For Data Analysis: Francisco Javier Rubio Department of Mathematics
13 pages
C15 Statistics TI84
No ratings yet
C15 Statistics TI84
178 pages
AP ECON 2500 Session 2
No ratings yet
AP ECON 2500 Session 2
22 pages
Chapter 4 Analysis and Interpretation of Assessment Results
No ratings yet
Chapter 4 Analysis and Interpretation of Assessment Results
36 pages
Module I. Basic Calculations. Average, Standard Deviation by Excel
No ratings yet
Module I. Basic Calculations. Average, Standard Deviation by Excel
48 pages
Statistics Learners' Working Manual
No ratings yet
Statistics Learners' Working Manual
25 pages
SBST1303 - MAY2020 - Take Home Exam
No ratings yet
SBST1303 - MAY2020 - Take Home Exam
9 pages
Basic Stats Session
No ratings yet
Basic Stats Session
16 pages
002 Probability-and-Statistics-Part-1-Data
No ratings yet
002 Probability-and-Statistics-Part-1-Data
84 pages
GE MODMAT Unit 4 Statistics 1
No ratings yet
GE MODMAT Unit 4 Statistics 1
14 pages
Quantitative Data Analysis
No ratings yet
Quantitative Data Analysis
22 pages
Statistics S1 Theory
No ratings yet
Statistics S1 Theory
8 pages
Manual
No ratings yet
Manual
46 pages
Ch.2 Measures of Location and Spread
No ratings yet
Ch.2 Measures of Location and Spread
1 page
Screenshot 2024-10-16 at 8.23.19 PM
No ratings yet
Screenshot 2024-10-16 at 8.23.19 PM
68 pages
Stats 2024
No ratings yet
Stats 2024
14 pages
Measures of Dispersion and Relative Standing
No ratings yet
Measures of Dispersion and Relative Standing
11 pages
Business Statistics and Analysis Course 2&3
No ratings yet
Business Statistics and Analysis Course 2&3
42 pages
Day 01-Basic Statistics
No ratings yet
Day 01-Basic Statistics
36 pages
1.1 - Statistical Analysis PDF
No ratings yet
1.1 - Statistical Analysis PDF
10 pages
Module 4 (Data Management) - Math 101
No ratings yet
Module 4 (Data Management) - Math 101
8 pages
ML Course Slides
No ratings yet
ML Course Slides
345 pages
Algebra 1 Unit 6 Describing Data Notes
No ratings yet
Algebra 1 Unit 6 Describing Data Notes
13 pages
Probability and Statistics in Engineering
No ratings yet
Probability and Statistics in Engineering
24 pages
Prob and Stats Notes PDF
No ratings yet
Prob and Stats Notes PDF
12 pages
Probability and Statistics in Engineering
No ratings yet
Probability and Statistics in Engineering
24 pages
CM6 - Mathematics As A Tool - Dispersion and Correlation
No ratings yet
CM6 - Mathematics As A Tool - Dispersion and Correlation
18 pages
Gse Mathematics-Glossary-K-12
No ratings yet
Gse Mathematics-Glossary-K-12
10 pages
AS Level Mathematics Statistics (New)
No ratings yet
AS Level Mathematics Statistics (New)
49 pages
Intro To Probability and Statistics
100% (3)
Intro To Probability and Statistics
70 pages
4 - IB Math Applications & Interpretations SL Notes - Unit 4 Statistics
No ratings yet
4 - IB Math Applications & Interpretations SL Notes - Unit 4 Statistics
17 pages
Frequency Distribution Table: Measure of Dispersion: Range, Variance, Standard Deviation
No ratings yet
Frequency Distribution Table: Measure of Dispersion: Range, Variance, Standard Deviation
4 pages
8ma0 21 0624 MS
No ratings yet
8ma0 21 0624 MS
11 pages
Review of Basic Statistical Concepts Hanke
No ratings yet
Review of Basic Statistical Concepts Hanke
28 pages
YMS Topic Review (Chs 1-8)
No ratings yet
YMS Topic Review (Chs 1-8)
7 pages
Statistics 1 AQA Revision Notes
No ratings yet
Statistics 1 AQA Revision Notes
7 pages
Math for Computer Applications
From Everand
Math for Computer Applications
The Editors of REA
No ratings yet
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Statistics

Uploaded by

Statistics

Uploaded by

Statistics

16 – Working with Data

( x represents the data x x−x̄

Standard deviation can also be thought of as:

Calculations from frequency tables:

No clear linear relationship

If there is perfect correlation, r =±1

 If an outlier is clearly an error (e.g. wrong units/impossible value) then it should be

P ( A ) + P ( ' not A ' ) =1

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.