Lecture On Probability and Stats
Lecture On Probability and Stats
Chapter 02
2
Frequency distribution
• The arrangement of a set of data in a table showing the
distribution of the data into classes or groups together with the
number of observations in each class or group is called a Frequency
Distribution.
• The number of observations falling in a particular class is referred
to as the class frequency of simply frequency, and is denoted by f.
• Summarized data by distributing into classes or categories with
their frequencies.
3
What is Grouped and Ungrouped data?
4
4
• Class limits/Intervals: - The class limits are defined as the
numbers or the values of the variables which describe the
classes, the smaller number is called lower class limit and
the larger number is called upper class limit.
• Class limits should be non-overlapping i.e. 10-14, 15-19,
20-24, 25-29 etc.
• The values equal to upper/lower limit and falling between
them should be included to make frequency table.
5
Class boundaries: -
• The numbers which separate one class from another are called
class boundaries.
• A class boundary is located midway between the upper limit of a
class and the lower limit of the next higher class.
• For the above example it is 9.5 – 14.5, 14.5 – 19.5, 19.5 – 24.5,
24.5 – 29.5 etc.
• The upper class boundary and the lower class boundary of next
class are same.
• Class boundary takes always one more decimal place than the
class limit, so that the possibility of any observation falling exactly
on the boundary is avoided.
6
Midpoint / Class mark: -
• A midpoint is number which divides each class into two
equal parts.
• It is obtained by dividing either the sum of the lower and
upper limits or boundaries of a class by 2.
• For the above example, these are 12, 17, 22, 27 etc.
7
• Class Width: - The class width is equal to the
difference between the class boundaries.
Or
• Difference between two successive midpoints
Or
• Difference between two successive lower class limits.
• Class width of the above example is 5.
8
Constructing a grouped frequency distribution:
Step-01 Determine the number of classes:
Now how to make classes, if number of classes is not
mentioned?
• First decide the number of classes.
• There are no any specific rules for deciding the exact number
of classes. But as a general rule the number of classes should
be between 5 and 20.
• Actually the number of classes depends on the size of data.
• then find class width.
9
Step-02 Determine the Class width (h):
10
Step-03 Deciding Starting Point (Lower limit of the first class):
76 79 81 86 86 89 89 90 91 94
95 96 96 97 99 100 100 101 101 101
102 104 104 104 105 106 106 107 108 108
109 109 109 110 110 111 112 113 114 115
116 116 117 117 118 119 120 121 122 123
124 125 128 129 129 131 134 134 135 135
12
Solution:
• From this table 76, 81, 86, 91,… are called lower Grouped Frequency Distribution
limits of class intervals while 80, 85, 90,……. are Class interval Frequency
called upper limit of the class intervals. 76 – 80 2
81 – 85 1
• The length of each class is said to be the size or 86 – 90 5
width of the class interval denoted by h. here h=5 91 – 95 3
96 – 100 6
• The above frequency distribution is said to be 101 – 105 8
discrete type of frequency distribution. 106 – 110 10
111 – 115 5
• But some times we are required continuous type of 116 – 120 7
frequency distribution, for this purpose we convert 121 – 125 5
class limits into class boundaries by adding and 126 – 130 3
subtracting a certain amount in the class limits. 131 – 135 5
Total 60
13
Example:02 The weights of 50 football players are given
below. Construct a frequency distribution table with 10
classes beginning with 190.
193 240 217 283 268 212 251 263 275 208
230 288 259 225 252 236 243 247 280 234
250 236 277 218 245 268 231 269 224 259
258 231 255 228 202 245 246 271 249 255
265 235 243 219 255 245 238 257 254 284
14
Solution:
Range = largest number – smallest number
Class interval Tally Marks Frequency
= 290 – 190 = 100,
190 – 199 / 1
Class width (h) = 100/10 200 – 209 // 2
h = 10 210 – 219 //// 4
220 – 229 /// 3
230 – 239 /// 8
240 – 249 //// 9
250 – 259 / 11
260 – 269 5
270 – 279 /// 3
280 – 289 //// 4
Total 50
15
Example:03 The following data showing the marks of 25
students;
26 35 42 43 51
28 36 42 43 51
29 39 42 46 52
31 40 43 46 52
32 40 43 46 55
Arrange in a simple frequency distribution and grouped
frequency distribution.
16
Solution: Marks No. of Marks No. of students/
students/ Frequency
Frequency
26 1 40 2
28 1 42 3
29 1 43 4
31 1 46 3
32 1 51 2
36 1 52 2
39 1 55 1
Total f 25
19
Example-01: Find relative frequency distribution to the
following frequency distribution.
Frequency (f) 1 3 4 5 2 15
20
Solution:
Since, total of all frequencies : f 15
Class Interval Frequencies (f) Relative Frequency
2-4 1 1/15=0.07
5-7 3 3/15=0.20
8-10 4 4/15=0.27
11-13 5 5/15=0.33
14-16 2 2/15=0.13
Total 15 1.00
In above table each class has its relative frequency and the sum of all relative
frequencies is unity.
Therefore, we have a relative frequency distribution.
21
Cumulative Frequency:
•The total frequency of a variable from its one end to a certain value
(usually upper class boundary in grouped data), is known as cumulative
frequency. Cumulative frequency of the last class is the sum of all
frequencies in the distribution.
•In a frequency distribution the number of values or frequencies occur
between the upper and lower limits (or boundaries).
•But If the number of values occur under the upper boundary of a class that
frequency is known as “less than cumulative frequency distribution” .
•Similarly, if the number of values occur above or over the lower boundary
of a class that frequency is known as “more than cumulative frequency”.
•The distributions for less than or more than values obtained in this way are
called cumulative frequency distributions for less than and more than types.
22
Example-01:
Find cumulative frequency distributions for less than and more than
values of the following frequency distribution.
Frequency (f) 1 3 8 6 2 20
24
Cumulative frequency for less than Cumulative frequency for more than
Less than or U.C.B C.f. More than or L.C.B C.f.
Where U.C.B means upper class boundary and L.C.B means lower class boundary
25
Example-02:
Following raw data showing ages in years of 20 children's studying in a school.
4.9 3.2 4.3 3.5 6.1 5.3 2.6 3.8 3.3 4.6
4.1 4.6 3.6 5.1 3.9 5.8 4.3 4.6 3.2 3.3
a) Construct a frequency table by taking a class interval of one year.
b) Find Class boundaries and mid-values.
c) Find relative frequency distribution.
d) Prepare “less than” and “more than” types of cumulative frequency tables.
26
Solution:
Since Mini mum age=2.6 and Maximum age=6.1 therefore, frequency
distribution(table) for ages of 20 children is given below
a) Construct a frequency table by taking a class interval of one year.
Ages in years Tally Marks Number of children
C.I. f
2.5-3.4 //// 4
3.5-4.4 8
4.5-5.4 / 6
5.5-6.4 // 2
Total 20
27
b) Find Class boundaries and mid-values.
28
c) Find relative frequency distribution.
29
d) Prepare “less than” and “more than” types of cumulative frequency tables.
30
Question 3: The table below gives the values of temperature recorded in
Hyderabad for 25 days in summer. Represent the data in the form of less than
type cumulative frequency distribution:
20-25 2 2
25-30 10 12
30-35 13 25