
Statistics

The document explains key statistical measures including mean, median, mode, variance, and standard deviation, highlighting their definitions and calculations. It also discusses how to handle missing data by suggesting when to replace missing values with the median or average based on the standard deviation. Additionally, it provides guidance on using Google spreadsheets functions for these calculations.

Uploaded by

v82514791


Handling Missing Data: Business Problems

Statistical Function

Mean
In statistics, the mean, also known as the arithmetic mean or average, is a
measure of central tendency of a set of numerical values. It is calculated by
adding up all the values in the data set and then dividing the sum by the total
number of values. The formula for calculating the mean is:

m = Σx / n

Here, m = mean. The mean is a useful tool for summarizing a set of data and
understanding its general properties. It can be affected by extreme values or
outliers, so it is important to consider other measures of central tendency,
such as the median and mode, as well as the variability of the data when
analyzing a dataset.

In this course, you will use the AVERAGE function in Google spreadsheets to
compute the mean.

Here, m denotes the mean value and Σx denotes the sum of the n values, where n
is the number of values in the sample.
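As a rough illustration of the calculation described above, the following Python sketch adds up the values and divides by their count. The function name `mean` and the `marks` list are hypothetical and not taken from the course dataset; the second call shows how a single extreme value can pull the mean upward.

```python
def mean(values):
    """Arithmetic mean: the sum of the values divided by how many there are."""
    if not values:
        raise ValueError("the mean of an empty data set is undefined")
    return sum(values) / len(values)


# Hypothetical marks, used only for illustration.
marks = [60, 65, 70, 75, 80]
print(mean(marks))          # 70.0

# One extreme value shifts the mean noticeably.
with_outlier = marks + [400]
print(mean(with_outlier))   # 125.0
```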
Median
The median is a statistical measure that represents the middle value of a
dataset when it is arranged in order of magnitude. It is the value that divides
the data set into two equal halves, such that half of the values are above the
median and half are below it.

To find the median, the data set is first arranged in ascending or descending
order.
1. If the data set contains an odd number of values, then the median is the
middle value. For example, the median for a sorted list of 15
observations is the 8th value.
2. If the data set contains an even number of values, then the median is
the average of the two middle values. For example, the median for a
sorted list of 16 observations is the average of the 8th and 9th values.

Unlike the mean, the median is not influenced by extreme values or outliers in
the data set, making it a useful measure of central tendency in skewed or
asymmetric distributions. A skewed or asymmetric distribution is a type of
data distribution where the values are not evenly spread out around the
average or middle of the data. In this type of distribution, the data tends to be
concentrated on one side of the center, and the other side has fewer values
that are more spread out.

In this course, you will use the MEDIAN function in Google spreadsheets to
compute the median.
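The odd and even cases described in this section can be sketched in Python as follows. This is a minimal illustration, not the spreadsheet's implementation; the function name `median` and the sample lists are assumptions made for the example.

```python
def median(values):
    """Middle value of the sorted data; average of the two middle values if the count is even."""
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("the median of an empty data set is undefined")
    mid = n // 2
    if n % 2 == 1:
        # Odd count: for 15 values, index 7 is the 8th value.
        return ordered[mid]
    # Even count: for 16 values, indexes 7 and 8 are the 8th and 9th values.
    return (ordered[mid - 1] + ordered[mid]) / 2


print(median(list(range(1, 16))))      # 8    (15 observations)
print(median(list(range(1, 17))))      # 8.5  (16 observations)

# Hypothetical marks with one extreme value: the median stays at 70.
print(median([60, 65, 70, 75, 400]))   # 70
```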

Mode
In statistics, the mode is a measure of central tendency that represents the
most frequent value in a dataset. More specifically, the mode is the value that
occurs with the highest frequency in a set of observations or data points. It is
one of the three main measures of central tendency, along with the mean and
median.

The mode is particularly useful when dealing with categorical or discrete data,
such as the number of times a certain event occurs, or the most common color
of cars in a parking lot. It is also useful when dealing with continuous data that
can be grouped into categories or bins.

Unlike the mean and median, the mode does not take into account the actual
values of the data, only their frequency of occurrence. This makes it less
sensitive to outliers or extreme values that may affect the mean.
However, it may not be a representative measure of central tendency if there
are multiple modes in the dataset or if several values occur with nearly the
same frequency. You can use the MODE function to compute the mode.
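To make the idea of "most frequent value" concrete, here is a small Python sketch that counts occurrences and returns every value tied for the highest count, so a dataset with multiple modes is visible rather than hidden. The function name `modes` and the example data are hypothetical.

```python
from collections import Counter


def modes(values):
    """Return all values that occur with the highest frequency."""
    counts = Counter(values)
    if not counts:
        return []
    highest = max(counts.values())
    return [value for value, count in counts.items() if count == highest]


# Categorical data: the most common car color in a (made-up) parking lot.
colors = ["red", "blue", "red", "white", "blue", "red"]
print(modes(colors))            # ['red']

# Two values tie for the highest frequency, so there are two modes.
print(modes([1, 1, 2, 2, 3]))   # [1, 2]
```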

Variance
In statistics, variance is a measure of how spread out a dataset is. More
specifically, it measures the average squared difference between each data
point and the mean of the dataset. Variance is represented by the symbol σ²
for a population and s² for a sample.
Variance is commonly used in statistics to describe the variability or spread of
a dataset. It is a useful tool for comparing the spread of two or more datasets,
as well as for identifying outliers or extreme values in a dataset.

The formula for variance depends on whether you are calculating the
variance of a population or a sample.

• Population Variance (σ²)
The population variance is calculated using the following formula:

σ² = Σ(x - μ)² / N
o σ² represents the population variance
o x represents each data point in the dataset
o μ represents the population mean
o N represents the total number of data points in the dataset

This formula calculates the average squared difference between each
data point and the population mean.

• Sample Variance (s²)

The sample variance is calculated using a similar formula, but with n-1 in the
denominator instead of N to account for the fact that the sample mean is an
estimate of the population mean:

s² = Σ(x - x̄)² / (n-1)

o s² represents the sample variance
o x represents each data point in the dataset
o x̄ represents the sample mean
o n represents the sample size

This formula calculates the sum of the squared differences between each
data point and the sample mean, divided by n-1 rather than n.

Both formulas involve squaring the differences between each data point and
the mean, which gives more weight to larger differences and emphasizes the
spread of the dataset. The result is a measure of how much the data points in
a dataset vary from the mean. In Google spreadsheets, the VAR function uses
the sample variance formula; the VARP function computes the population variance.
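The two formulas differ only in the denominator, which the following Python sketch tries to make visible. The function names and the `data` list are assumptions made for the example; the results are cross-checked against Python's standard `statistics` module.

```python
import statistics


def population_variance(values):
    """σ² = Σ(x - μ)² / N"""
    n = len(values)
    mu = sum(values) / n
    return sum((x - mu) ** 2 for x in values) / n


def sample_variance(values):
    """s² = Σ(x - x̄)² / (n-1)"""
    n = len(values)
    if n < 2:
        raise ValueError("sample variance needs at least two values")
    x_bar = sum(values) / n
    return sum((x - x_bar) ** 2 for x in values) / (n - 1)


# Hypothetical data: mean is 5 and the squared differences sum to 32.
data = [2, 4, 4, 4, 5, 5, 7, 9]
print(population_variance(data))   # 4.0   (32 / 8)
print(sample_variance(data))       # 32 / 7, roughly 4.57

# The standard library gives the same results.
print(statistics.pvariance(data), statistics.variance(data))
```

The sample variance is always a little larger than the population variance for the same data, because the same sum is divided by n-1 instead of N.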

Standard Deviation
Standard deviation is a statistical measure that is used to quantify the amount
of variability or dispersion in a set of data. It is defined as the square root of
the variance and is typically denoted by the symbol σ (sigma) for a population
and s for a sample.
The standard deviation tells us how spread out the data is from the mean, or
average, value. A low standard deviation means that the data points tend to
be close to the mean, while a high standard deviation means that the data
points are spread out over a wider range.
To calculate the standard deviation, first find the mean of the data set. Then,
for each data point, subtract the mean and square the result. Next, find the
average of these squared differences, which is the variance. Finally, take the
square root of the variance to get the standard deviation. The formula for the
standard deviation is:

σ = √( Σ(x - μ)² / N )

For a sample, the corresponding formula is s = √( Σ(x - x̄)² / (n-1) ).

The higher the standard deviation, the more variability or spread you have in
your data. Small standard deviations mean that most of your data is
clustered around the mean.

In the graph, values with a low standard deviation are clustered around the
mean, while values with a high standard deviation are spread out. You can use
the STDEV function to compute the standard deviation.
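The calculation steps described in this section (mean, squared differences, variance, square root) can be sketched in Python as below. The function name `std_dev`, its `sample` flag, and the two example lists are assumptions for illustration; both lists have the same mean of 50 but very different spreads.

```python
import math


def std_dev(values, sample=True):
    """Standard deviation, following the steps: mean, squared differences, variance, square root."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two values")
    mean = sum(values) / n                              # step 1: the mean
    squared_diffs = [(x - mean) ** 2 for x in values]   # step 2: squared differences
    divisor = n - 1 if sample else n                    # n-1 for a sample, N for a population
    variance = sum(squared_diffs) / divisor             # step 3: the variance
    return math.sqrt(variance)                          # step 4: the square root


# Hypothetical data sets with the same mean (50) but different spread.
tight = [48, 49, 50, 51, 52]
wide = [10, 30, 50, 70, 90]
print(std_dev(tight))   # about 1.58: values cluster around the mean
print(std_dev(wide))    # about 31.62: values are spread out
```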

How to impute missing values?
1. If the standard deviation is close to or larger than the average, replace
the missing value with the median.
2. If the standard deviation is small compared with the average, which means
the values are clustered near the average, replace the missing value with
the average.
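The two rules above could be sketched in Python as follows. The text does not say how close "near" is, so the `ratio_threshold` parameter (default 0.75) is purely an assumption for this example, as are the function name `impute_missing`, the use of `None` to mark missing values, and the made-up marks.

```python
import statistics


def impute_missing(values, ratio_threshold=0.75):
    """Fill None entries with the median or the mean, chosen by comparing
    the standard deviation with the average.

    ratio_threshold is a hypothetical cut-off: the standard deviation counts
    as "near or bigger than" the average when it is at least this fraction of it.
    """
    observed = [v for v in values if v is not None]
    if len(observed) < 2:
        raise ValueError("need at least two observed values")
    average = statistics.mean(observed)
    spread = statistics.stdev(observed)   # sample standard deviation
    use_median = spread >= ratio_threshold * abs(average)
    fill = statistics.median(observed) if use_median else average
    method = "median" if use_median else "mean"
    return [fill if v is None else v for v in values], method


# Clustered marks: small standard deviation, so the mean (71) is used.
print(impute_missing([70, 72, None, 68, 74, 71]))
# ([70, 72, 71, 68, 74, 71], 'mean')

# Widely spread marks: large standard deviation, so the median (10) is used.
print(impute_missing([5, 8, None, 10, 12, 95]))
# ([5, 8, 10, 10, 12, 95], 'median')
```

Returning the chosen method alongside the filled list makes it easy to check which rule was applied to a given column.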

Here is the implementation of all the mentioned statistical operations
in a given Student dataset. The statistical operations were performed using
the “Marks” column in the dataset.

Read More
• https://support.google.com/docs/answer/3094063?hl=en
