Group 4 - Normal Distributions
Group 4 - Normal Distributions
Candaba Campus
College of Education
Bachelor of Secondary Education Major in Filipino
Submitted by:
Jhonalyn M. Alfaro
Jovelyn S. Cruz
Karla Mae S. Reyes
Sunshine Mela D. Canete
Avie C. Dungo
Jerico F. Mamawan
Louiejie G. Reyes
Submitted to:
Engr. Arlon G. Calma, CLSSYB
Instructor
Introduction
Don Honorio Ventura State University
Candaba Campus
A frequency distribution indicates how often each different value in data set occurs.
The most widely used graph to illustrate frequency distribution is histogram. It look very
much like a bar map, but the variations between them are major. This valuable method for
data collection and analysis is considered essential tool of quality.
The Empirical Rule notes that 99.7% of the data observed after a normal distribution
lies within the mean of 3 standard deviations. According to this rule, 68% of the information
falls within one standard deviation, 95% within two standard deviations, and 99.7% within
three standard deviations from the mean.
A normal distribution with a mean of zero and a standard deviation of 1 is the standard
normal distribution. The standard normal distribution is centered at zero and the standard
deviation gives the degree to which a given measurement deviates from the mean.
NORMAL DISTRIBUTION
Don Honorio Ventura State University
Candaba Campus
In statistics, the normal distribution is the most relevant and most commonly used
distribution. It is often called the "bell curve," although it would be less than pleasing to have
the tonal features of such a bell. After the mathematician Karl Friedrich Gauss, it is also
called the "Gaussian curve". Abraham de Moivre is the one who first discovered the normal
Below is the density of the normal distribution (the height on the x axis for a given
value). The μ and σ parameters are, respectively, the mean and standard deviation and
determine the normal distribution. The e symbol is the basis of the natural logarithm and the
constant pi is the π symbol.
7. Approximately 95% of the area of a normal distribution is within two standard deviations
of the mean.
Discussions
Table 2.2.1 contains the amount of rent paid every month for 24 students from a statistics
course. Make a relative frequency distribution using 7 classes.
1500,1350,350,1200,850,900,1500,1500,1500,900,1400,1100,1250,600,610,960,890,1325,90
0,800,2550,495,1200, and 690
Solution:
Don Honorio Ventura State University
Candaba Campus
=350+664
2
=507,
=665+979
2
=822
table:
Histogram: a graph of the frequencies on the vertical axis and the class boundaries on the
horizontal axis. Rectangles where the height is the frequency and the width is the class width
are draw for each class.
Solution:
The class boundaries are plotted on the horizontal axis and the frequencies are plotted on the
vertical axis. You can plot the midpoints of the classes instead of the class boundaries. Graph
2.2.1 was created using the midpoints because it was easier to do with the software that
created the graph.
Don Honorio Ventura State University
Candaba Campus
Notice the graph has the axes labeled, the tick marks are labeled on each axis, and there is a
title. It is important that your graphs (all graphs) are clearly labeled.
Reviewing the graph you can see that most of the students pay around $750 per month for
rent, with about $1500 being the other common value. You can see from the graph, that most
students pay between $600 and $1600 per month for rent. Of course, these values are just
estimates from the graph. There is a large gap between the $1500 class and the highest data
value. This seems to say that one student is paying a great deal more than everyone else. This
Don Honorio Ventura State University
Candaba Campus
value could be considered an outlier. An outlier is a data value that is far from the rest of the
values. It may be an unusual value or a mistake. It is a data value that should be investigated.
In this case, the student lives in a very expensive part of town, thus the value is not a mistake,
and is just very unusual. There are other aspects that can be discussed, but first some other
concepts need to be introduced.
Don Honorio Ventura State University
Candaba Campus
Areas of the normal distribution are often represented by tables of the standard normal
distribution. A standard normal table, also called the unit normal table or Z table, is a
mathematical table for the values of Φ, which are the values of the
cumulative distribution function of the normal distribution. It is used to find
Don Honorio Ventura State University
Candaba Campus
The standard normal distribution table are shown in table 1 and 2 below. These table values
represent area to the left of the Z score.
The first column with the name 'Z' refers to the standard normal distribution values; the
second column refers to the are below Z. The Z column is proportional to the number of
standard deviations below (or above) the mean, since the distribution has a mean of 0 and a
standard deviation of 1. For instance, a Z of -2.5 represents a standard deviation value of 2.5
below the mean. 0.0062 is the region below Z.
It is possible to obtain the same information using the following Java applet. Figure 1
illustrates how the area below a value of -2.5 on the standard normal distribution can be
Don Honorio Ventura State University
Candaba Campus
measured using it. Note that the mean has been set to 0 and the default variance has been set
to 1.
Figure 1. An example from the applet.
Using the following formula below, a value from any normal distribution can be
translated into its corresponding value in a standard normal distribution.
Z = (X - μ)/σ
Wherein Z is the value on the standard normal distribution, X is the value on the original
distribution, μ is the mean of the original distribution, and σ is the standard deviation of the
original distribution.
For example, assume that the data on Baruch graduates' first-year salaries are
normally distributed with the mean population μ = $60000 and the standard population
deviation σ = $15000. Find the chance of receiving less than $45000 annually from a
randomly selected Baruch graduate. We have to locate the portion of the region under
the usual curve from 45 all the way to the left to answer this question.
?
45
You can apply complex mathematical formulas to find this formula, or you can use the
Z table, where statisticians have already applied these formulas for you. Since tables for any
possible combination of mean and standard deviation could not be developed, statisticians
developed one standardized and simplified normal distribution with a mean of 0 and a
standard deviation of 1.
Don Honorio Ventura State University
Candaba Campus
All other distributions with different : and σ can be converted into a standardized
normal distribution using the transformation formula: Z = X −µ σ
Using the transformation formula, find the value of Z and then find the number that
corresponds to that Z in the body of Z-table:
Z= = −1.
The value of the table corresponding to -1 is 0.1587. This number shows the area from
45 all the way to the left under the curve. It also reveals that 15.87 percent is the chance of
randomly picking a Baruch graduate who makes less than $45000 a year.
If all the values are translated into Z scores in a distribution, then the distribution will
have a mean of 0 and a standard deviation of 1. Standardizing the distribution is the process
of transforming a distribution to one with a mean of 0 and a standard deviation of 1.
Don Honorio Ventura State University
Candaba Campus
References
http://onlinestatbook.com/2/normal_distribution/intro.html
https://stats.libretexts.org/Courses/Las_Positas_College/Math_40%3A_Statistics_and_Probab
ility/02%3A_Frequency_Distributions_and_Graphs/2.05%3A_Frequency_Distributions_and_
Histograms
https://www.baruch.cuny.edu/sacc/documents/NormalDistribution.pdf