SD Final 2
SD Final 2
Introduction
The standard deviation is a measure of the spread of scores within a set of data. Usually, we
are interested in the standard deviation of a population. However, as we are often presented
with data from a sample only, we can estimate the population standard deviation from a
sample standard deviation. These two standard deviations - sample and population standard
deviations - are calculated differently. In statistics, we are usually presented with having to
calculate sample standard deviations, and so this is what this article will focus on, although
the formula for a population standard deviation will also be shown.
This statistical measurement of dispersion about an average, depicts how widely a mutual
fund's returns varied over a certain period of time. Investors use the standard deviation of
historical performance to try to predict the range of returns that are most likely for a given
fund. When a fund has a high standard deviation, the predicted range of performance is wide,
implying greater volatility.
Standard deviation is most appropriate for measuring the risk a fund that is an investor's only
holding. The figure cannot be combined for more than one fund because the standard
deviation for a portfolio of multiple funds is a function of not only the individual standard
deviations, but also of the degree of correlation among the funds' returns.
If a fund's returns follow a normal distribution, then approximately 68% of the time they will
fall within one standard deviation of the mean return for the fund, and 95% of the time within
two standard deviations.
Noun
A measure of dispersion obtained by extracting the squareroot of the mean of the squared de
viations of the observed values fromtheir mean in a frequency distribution.
Medicine
Symbol σ
A statistic used as a measure of the dispersion or variation in a distribution,equal to the squar
e root of the arithmetic mean of the squares of thedeviations from the arithmetic mean.
Science
A statistic used as a measure of the dispersion or variation in a distribution,equal to the squar
e root of the arithmetic mean of the squares of thedeviations from the arithmetic mean.
Culture
In statistics, a measure of how much the data in a certain collection arescattered around the m
ean. A low standard deviation means that the dataare tightly clustered; a high standard deviati
on means that they are widelyscattered.
Technology
A measure of the range of values in a set of numbers. Standarddeviation is a statistic used as
a measure of the dispersion or variation in adistribution, equal to the square root of the arithm
etic mean of the squaresof the deviations from the arithmetic mean.
Standard deviation
The reported margin of error of a poll is computed from the standard error of the mean (or
alternatively from the product of the standard deviation of the population and the inverse of
the square root of the sample size, which is the same thing) and is typically about twice the
standard deviation—the half-width of a 95 percent confidence interval. In science,
researchers commonlyreport the standard deviation of experimental data, and only effects that
fall much farther than two standard deviations away from what would have been expected are
considered statistically significant—normal random error or variation in the measurements is
in this way distinguished from causal variation. The standard deviation is also important in
finance, where the standard deviation on the rate of returnon an investment is a measure of
the volatility of the investment.
When only a sample of data from a population is available, the term standard deviation of the
sample or sample standard deviation can refer to either the above-mentioned quantity as
applied to those data or to a modified quantity that is a better estimate of the population
standard deviation (the standard deviation of the entire population)
The square root of the sample variance of a set of values is the sample standard deviation
The sample standard deviation distribution is a slightly complicated, though well-studied and
well-understood, function.
However, consistent with widespread inconsistent and ambiguous terminology, the square
root of the bias-corrected variance is sometimes also known as the standard deviation,
Physical scientists often use the term root-mean-square as a synonym for standard deviation
when they refer to the square root of the mean squared deviation of a quantity from a given
baseline.
The standard deviation arises naturally in mathematical statistics through its definition in
terms of the second central moment. However, a more natural but much less frequently
encountered measure of average deviation from the mean that is used in descriptive statistics
is the so-called mean deviation.
Advantages:
Lends itself to computation of other stable measures (and is a prerequisite for many
ofthem).
Average of deviations around the mean
Majority of data within one standarddeviation above or below the mean.
Not expressed in squared units, so makesmore sense descriptively.
Shows how much data is clustered around a mean value
It gives a more accurate idea of how the data is distributed
Not as affected by extreme values
The standard deviation is the best measure of variation because of its mathematical
characteristics. It is based on every item of the distribution. Also it is amenable to
algebraic treatment and is less affected by fluctuations of sampling than most other
measures of dispersion.
It is possible to calculate the combined standard deviation of two or more groups.
This is not possible with any other measure.
For comparing the variability of two or more distribution coefficient of variation is
considered to be most appropriate and this is based on mean standard deviation.
Standard deviation is most prominently used in further statistical work. For example,
in computing skewness, correlation, etc., use is made of standard deviation. It is
keynote in sampling and provides a unit of measurement for the normal distribution.
It is rigidly defined and its value is always definite and based on all the
observationsand the actual signs of deviations are used.
As it is based on arithmetic mean, it has all the merits of arithmetic mean.
It is the most important and widely used measure of dispersion.
It is possible for further algebraic treatment.
It is less affected by the fluctuations of sampling and hence stable.
It is the basis for measuring the coefficient of correlation and sampling.
Demerits
As compared to other measures it is difficult to compute. However, it does not reduce
the importance of this measure because of high degree of accuracy of results it gives.
It gives more weight to extreme items and less to those which are near the mean. It is
because of the fact that the squares of the deviations which are big in size would be
proportionately greater than the squares of those deviations which are comparatively
small. The deviations 2 and 8 are in the ratio of 1:4 but their squares, i.e., 4 and 64
would be in the ratio of 1:16.
It is not easy to understand and it is difficult to calculate.1.
It gives more weight to extreme values because the values are squared up.
As it is an absolute measure of variability, it cannot be used for the purpose
ofcomparison.
Uses:
Despite the drawbacks mentioned above the standard deviation is the best measure of
dispersion and should be used wherever possible.
Just as mean is the best measure of central tendency (leaving exceptional cases)
standard deviation is the best measure of dispersion, excepting a few cases where
mean deviation or quartile deviation may give better results
However since standard deviation gives greater weight to extreme items, it does not
find much favor with economists and businessmen who are more interested in the
results of the model class.
The first thing to note about absolute deviation is that it does not rank distances at all.
Suppose that we have two observations, where one observation is 1 unit away from
the mean, while the other is 3 units from the mean. Absolute deviation says that this is
the same amount of dispersion as having both observations be 2 units away from the
mean. That's fishy--3 units deviation is further from the mean than 2 units of
deviation, yet mean absolute deviation is telling us that these two distributions have
the same amount of dispersion. However, standard deviation actually gives greater
weight to larger deviations than smaller ones, so the standard deviation tells us,
correctly, that actually the former distribution is more spread out.
Computer usage
Excel provides almost a dozen statistical functions for calculating standard deviations and
variances. A standard deviation describes dispersion (spread of data) about (around) the data
set’s mean.
The STDEV function calculates the standard deviation of a sample, a measure of how
widely values in a data set vary around the mean — and a common input to other
statistical calculations. The function uses the syntax=STDEV(number1,[number2])
To calculate the standard deviation of the worksheet range A1:A5 using the STDEV
function, for example, use the formula=STDEV(A1:A5)
If the worksheet range holds the values 1, 4, 8, 9 and 11, the function returns the
standard deviation value 4.037326.
The STDEV function lets you include up to 255 arguments as inputs; those arguments
can be values, cell references, formulas, and range references. The STDEV function
ignores logical values, text, and empty cells.
The STDEVA function calculates the standard deviation of a sample, but unlike the
STDEV function, STDEVA doesn’t ignore the logical values TRUE(which is 1)
and FALSE (which is 0). The function uses the syntax
=STDEVA(number1,[number2])
STDEVA arguments, which can number up to 255, can be values, cell references,
formulas, and range references.
=STDEVP(number1,[number2])
To calculate the standard deviation of the worksheet range A1:A5 using the STDEVP
function, for example, use the formula
=STDEVP(A1:A5)
If the worksheet range holds the values 1, 4, 8, 9 and 11, the function returns the
standard deviation value 3.611094.
The STDEVP function lets you include up to 255 arguments as inputs; the arguments
can be values, cell references, formulas, and range references. The STDEV function
ignores logical values, text, and empty cells.
=STDEVPA(number1,[number2])
These all are the key words used in excel sheet to calculate standard deviation.
For Example
1. Click on cell D12 to make it the active cell - the location where the results for the
STDEV function will be displayed
2. Click on the Formulas tab.
3. Choose More Functions > Statistical from the ribbon to open the function drop down
list.
4. Click on STDEV in the list to bring up the function's dialog box.
5. Highlight cells A5 to D7 in the worksheet to enter the range into the dialog box as
theNumber argument
6. Click OK to close the dialog box and return to the worksheet
7. The answer 2.37 should present in cell D12.
8. This number represents the estimated standard deviation of each number in the list
from the average value of 4.5
9. When you click on cell E8 the complete function =STDEV(A5:D7) appears in
the formula bar above the worksheet
10. After you have made your selections, click on OK at the bottom of the dialog box.
Articles :
Standard deviation.
9. Andy Kiersz Dec 2 ,2014 Here's What Statisticians Mean By A Standard Deviation.
10. Hunkar Ozyasar Demand Media April 4,2015Rate of Return & Standard Deviation in
small business .
ARTICLE-1
BardeMP,Barde PJ July 3, 2012 What to use to express the variability of data: Standard
deviation.
The terms “standard error” and “standard deviation” are often confused. The contrast between
these two terms reflects the important distinction between data description and inference, one
that all researchers should appreciate.
The standard deviation (often SD) is a measure of variability. When we calculate the standard
deviation of a sample, we are using it as an estimate of the variability of the population from
which the sample was drawn. For data with a normal distribution,about 95% of individuals
will have values within 2 standard deviations of the mean, the other 5% being equally
scattered above and below these limits. Contrary to popular misconception, the standard
deviation is a valid measure of variability regardless of the distribution. About 95% of
observations of any distribution usually fall within the 2 standard deviation limits, though
those outside may all be at one end. We may choose a different summary statistic, however,
when data have a skewed distribution.
When we calculate the sample mean we are usually interested not in the mean of this
particular sample, but in the mean for individuals of this type in statistical terms, of the
population from which the sample comes. We usually collect data in order to generalise from
them and so use the sample mean as an estimate of the mean for the whole population. Now
the sample mean will vary from sample to sample; the way this variation occurs is described
by the “sampling distribution” of the mean. We can estimate how much sample means will
vary from the standard deviation of this sampling distribution, which we call the standard
error (SE) of the estimate of the mean. As the standard error is a type of standard deviation,
confusion is understandable. Another way of considering the standard error is as a measure of
the precision of the sample mean.
The standard error of the sample mean depends on both the standard deviation and the sample
size, by the simple relation SE = SD/√(sample size). The standard error falls as the sample
size increases, as the extent of chance variation is reduced—this idea underlies the sample
size calculation for a controlled trial, for example. By contrast the standard deviation will not
tend to change as we increase the size of our sample.
So, if we want to say how widely scattered some measurements are, we use the standard
deviation. If we want to indicate the uncertainty around the estimate of the mean
measurement, we quote the standard error of the mean. The standard error is most useful as a
means of calculating a confidence interval. For a large sample, a 95% confidence interval is
obtained as the values 1.96×SE either side of the mean. We will discuss confidence intervals
in more detail in a subsequent Statistics Note. The standard error is also used to calculate P
values in many circumstances.
The principle of a sampling distribution applies to other quantities that we may estimate from
a sample, such as a proportion or regression coefficient, and to contrasts between two
samples, such as a risk ratio or the difference between two means or proportions. All such
quantities have uncertainty due to sampling variation, and for all such estimates a standard
error can be calculated to indicate the degree of uncertainty.
In many publications a ± sign is used to join the standard deviation (SD) or standard error
(SE) to an observed mean—for example, 69.4±9.3 kg. That notation gives no indication
whether the second figure is the standard deviation or the standard error (or indeed something
else). A review of 88 articles published in 2002 found that 12 (14%) failed to identify which
measure of dispersion was reported (and three failed to report any measure of variability).The
policy of the BMJ and many other journals is to remove ± signs and request authors to
indicate clearly whether the standard deviation or standard error is being quoted. All journals
should follow this practice.
ARTICLE-2
Deborah J. Rumsey from Statistics For Dummies, 2nd EditionJanuary 31, 2013 How
Standard deviation can be difficult to interpret as a single number on its own. Basically, a
small standard deviation means that the values in a statistical data set are close to the mean of
the data set, on average, and a large standard deviation means that the values in the data set
are farther away from the mean, on average.
The standard deviation measures how concentrated the data are around the mean; the more
concentrated, the smaller the standard deviation.
A small standard deviation can be a goal in certain situations where the results are restricted,
for example, in product manufacturing and quality control.
A particular type of car part that has to be 2 centimeters in diameter to fit properly had better
not have a very big standard deviation during the manufacturing process.
A big standard deviation in this case would mean that lots of parts end up in the trash because
they don’t fit right; either that or the cars will have problems down the road.
But in situations where you just observe and record data, a large standard deviation isn’t
necessarily a bad thing; it just reflects a large amount of variation in the group that is being
studied.
For example, if you look at salaries for everyone in a certain company, including everyone
from the student intern to the CEO, the standard deviation may be very large.
On the other hand, if you narrow the group down by looking only at the student interns, the
standard deviation is smaller, because the individuals within this group have salaries that are
less variable. The second data set isn’t better, it’s just less variable.
Similar to the mean, outliers affect the standard deviation (after all, the formula for standard
deviation includes the mean). Here’s an example: the salaries of the L.A. Lakers in the 2009–
2010 season range from the highest, $23,034,375 (Kobe Bryant) down to $959,111 (Didier
Ilunga-Mbenga and Josh Powell).
Lots of variation, to be sure! The standard deviation of the salaries for this team turns out to
be $6,567,405; it’s almost as large as the average. However, as you may guess, if you remove
Kobe Bryant’s salary from the data set, the standard deviation decreases because the
remaining salaries are more concentrated around the mean. The standard deviation becomes
$4,671,508
Here are some properties that can help you when interpreting a standard deviation:
The standard deviation can never be a negative number, due to the way it’s calculated
and the fact that it measures a distance (distances are never negative numbers).
The smallest possible value for the standard deviation is 0, and that happens only in
contrived situations where every single number in the data set is exactly the same (no
deviation).
The standard deviation has the same units as the original data.
ARTICLE 3:
Standard deviation is a more difficult concept than the others we've covered. And unless you
are writing for a specialized, professional audience, you'll probably never use the words
"standard deviation" in a story. But that doesn't mean you should ignore this concept.
The standard deviation is kind of the "mean of the mean," and often can help you find the
story behind the data. To understand this concept, it can help to learn about what statisticians
call "normal distribution" of data.
A normal distribution of data means that most of the examples in a set of data are close to the
"average," while relatively few examples tend to one extreme or the other.
Let's say you are writing a story about nutrition. You need to look at people's typical daily
calorie consumption. Like most data, the numbers for people's typical consumption probably
will turn out to be normally distributed. That is, for most people, their consumption will be
close to the mean, while fewer people eat a lot more or a lot less than the mean.
When you think about it, that's just common sense. Not that many people are getting by on a
single serving of kelp and rice. Or on eight meals of steak and milkshakes. Most people lie
somewhere in between.
If you looked at normally distributed data on a graph, it would look something like this:
The x-axis (the horizontal one) is the value in question... calories consumed, dollars earned or
crimes committed, for example. And the y-axis (the vertical one) is the number of datapoints
for each value on the x-axis... in other words, the number of people who eat xcalories, the
number of households that earn x dollars, or the number of cities with x crimes committed.
Now, not all sets of data will have graphs that look this perfect. Some will have relatively flat
curves, others will be pretty steep. Sometimes the mean will lean a little bit to one side or the
other. But all normally distributed data will have something like this same "bell curve" shape.
The standard deviation is a statistic that tells you how tightly all the various examples are
clustered around the mean in a set of data. When the examples are pretty tightly bunched
together and the bell-shaped curve is steep, the standard deviation is small. When the
examples are spread apart and the bell curve is relatively flat, that tells you you have a
relatively large standard deviation.
Computing the value of a standard deviation is complicated. But let me show you graphically
what a standard deviation represents...
One standard deviation away from the mean in either direction on the horizontal axis (the two
shaded areas closest to the center axis on the above graph) accounts for somewhere around 68
percent of the people in this group. Two standard deviations away from the mean (the four
areas closest to the center areas) account for roughly 95 percent of the people. And three
standard deviations (all the shaded areas) account for about 99 percent of the people.
If this curve were flatter and more spread out, the standard deviation would have to be larger
in order to account for those 68 percent or so of the people. So that's why the standard
deviation can tell you how spread out the examples in a set are from the mean.
Why is this useful? Here's an example: If you are comparing test scores for different schools,
the standard deviation will tell you how diverse the test scores are for each school.
Let's say Springfield Elementary has a higher mean test score than Shelbyville Elementary.
Your first reaction might be to say that the kids at Springfield are smarter.
But a bigger standard deviation for one school tells you that there are relatively more kids at
that school scoring toward one extreme or the other. By asking a few follow-up questions you
might find that, say, Springfield's mean was skewed up because the school district sends all
of the gifted education kids to Springfield. Or that Shelbyville's scores were dragged down
because students who recently have been "mainstreamed" from special education classes have
all been sent to Shelbyville.
In this way, looking at the standard deviation can help point you in the right direction when
asking why information is the way it is.
Of course, you'll want to seek the advice of a trained statistician whenever you try to evaluate
the worth of any scientific research. But if you know at least a little about standard deviation
going in, that will make your talk with him or her much more productive.
For each value x, subtract the overall avg (x) from x, then multiply that result by itself
(otherwise known as determining the square of that value). Sum up all those squared values.
Then divide that result by (n-1). Got it? Then, there's one more step... find the square root of
that last number. That's the standard deviation of your set of data.
Now, remember how I told you this was one way of computing this? Sometimes, you divide
by (n) instead of (n-1). It's too complex to explain here. So don't try to go figuring out a
standard deviation if you just learned about it on this page. Just be satisfied that you've now
got a grasp on the basic concept.
Or, use...
ARTICLE 4:
DataStar, Inc , Posted January 31, 2013 How to Interpret Standard Deviation and Standard
Error in Survey Research:
Standard Deviation and Standard Error are perhaps the two least understood statistics
commonly shown in data tables. The
following article is intended to explain their meaning and provide additional insight on how
they are used in data analysis.
Standard Deviation and Standard Error are perhaps the two least understood statistics
commonly shown in data tables. The following article is
intended to explain their meaning and provide additional insight on how they are used in data
analysis. Both statistics are typically shown with
the mean of a variable, and in a sense, they both speak about the mean. They are often
referred to as the "standard deviation of the mean" and
the "standard error of the mean." However, they are not interchangeable and represent very
di᯿అ erent concepts.
Standard Deviation
Standard Deviation (often abbreviated as "Std Dev" or "SD") provides an indication of how
far the individual responses to a question vary or
"deviate" from the mean. SD tells the researcher how spread out the responses are -- are they
concentrated around the mean, or scattered far
& wide? Did all of your respondents rate your product in the middle of your scale, or did
some love it and some hate it?
Let's say you've asked respondents to rate your product on a series of attributes on a 5-point
scale. The mean for a group of ten respondents
(labeled 'A' through 'J' below) for "good value for the money" was 3.2 with a SD of 0.4 and
the mean for "product reliability" was 3.4 with a SD of
2.1. At 1 rst glance (looking at the means only) it would seem that reliability was rated higher
than value. But the higher SD for reliability could
indicate (as shown in the distribution below) that responses were very polarized, where most
respondents had no reliability issues (rated the
attribute a "5"), but a smaller, but important segment of respondents, had a reliability problem
and rated the attribute "1". Looking at the mean
alone tells only part of the story, yet all too often, this is what researchers focus on. The
distribution of responses is important to consider and the
SD provides a valuable descriptive measure of this.
A 3 1
B 3 1
C 3 1
D 3 1
E 4 5
F 4 5
G 3 5
H 3 5
I 3 5
J 3 5
Mean 3.2 3.4
Std Dev 0.4 2.1
Two very different distributions of responses to a 5-point rating scale can yield the same
mean. Consider the following example showing
response values for two different ratings. In the 1rst example (Rating "A") the Standard
Deviation is zero because ALL responses were exactly the
mean value. The individual responses did not deviate at all from the mean. In Rating "B",
even though the group mean is the same (3.0) as the
1rst distribution, the Standard Deviation is higher. The Standard Deviation of 1.15 shows that
the individual responses, on average, were a little
over 1 point away from the mean.
Sample: Mean
1st 3.2
2nd 3.4
3rd 3.3
4th 3.2
5th 3.1
Mean 3.3
Std Dev 0.13
Think about this. If the SD of this distribution helps us to understand how far a sample mean
is from the true population mean, then we can use
this to understand how accurate any individual sample mean is in relation to the true mean.
That is the essence of the Standard Error. In actuality
we have only drawn a single sample from our population, but we can use this result to
provide an estimate of the reliability of our observed
sample mean.
In fact, SE tells us that we can be 95% condent that our observed sample mean is plus or
minus roughly 2 (actually 1.96) Standard Errors from
the population mean.
The below table shows the distribution of responses from our 1rst (and only) sample used for
our research. The SE of 0.13, being relatively small,
gives us an indication that our mean is relatively close to the true mean of our overall
population. The margin of error (at 95% condence) for our
mean is (roughly) twice that value (+/- 0.26), telling us that the true mean is most likely
between 2.94 and 3.46.
Respondent: Rating:
A 3
B 3
C 3
D 3
E 4
F 4
G 3
H 3
I 3
J 3
Mean 3.2
Std Err 0.13
Summary
Many researchers fail to understand the distinction between Standard Deviation and Standard
Error, even though they are commonly included in
data analysis. While the actual calculations for Standard Deviation and Standard Error look
very similar, they represent two very different, but
complementary, measures. SD tells us about the shape of our distribution, how close the
individual data values are from the mean value. SE tells
us how close our sample mean is to the true mean of the overall population. Together, they
help to provide a more complete picture than the
mean alone can tell us.
References :
https://getrevising.co.uk/grids/standarddeviation
http://www.stretchgo.com/standard-deviation-definition-advantages-and-disadvantages-and-
its-alternatives.html
http://www.transtutors.com/homework-help/statistics/measures-of-dispersion/features-
standard-deviation.aspx
https://statistics.laerd.com/statistical-guides/measures-of-spread-standard-deviation.php
https://en.wikipedia.org/wiki/Standard_deviation
Accessed on 1st April 2016
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3487226
http://spreadsheets.about.com/od/excelstatisticalfunctions/ss/excel-2007-stdev-function.htm
accessed on 2nd April 2016