Assignment One - Statistics - Abdisami Muhyadin

The document discusses statistics concepts including descriptive statistics, frequency distribution tables, measures of central tendency and dispersion. It contains sample data and calculations for these concepts across multiple questions and examples.

Uploaded by

Abdel Samie

Assignment-One

Sheet-1

Course: Applied Statistics

Submitted to: Dr. Mohamed Gad

Submitted by: Abdisami Muhyadin Abdulle

Student Code: 202210784

11/March/2024
Q.1
First, we arrange the data in ascending order
6 6 8 9 10 11 12 14 16 16 17 17 18
19 19 19 20 21 21 21 22 24 25 25 29
Range = Largest No. - Smallest No. = 29 - 6 = 23
The number of classes is usually between 5 and 15; we select No. of classes = 5.
Class interval = Range / No. of classes = 23/5 = 4.6; we round up and take Class Interval = 5.
Class Boundaries: 6, 11, 16, 21, 26, 31 (each class includes its lower boundary and excludes its upper boundary)

Frequency Distribution Table and Percentage Distribution Table

Class    Class Midpoint   Frequency   Relative Frequency   Percentage
6-11     8.5              5           0.20                 20
11-16    13.5             3           0.12                 12
16-21    18.5             9           0.36                 36
21-26    23.5             7           0.28                 28
26-31    28.5             1           0.04                 4
Sum                       25          1.00                 100

The cumulative “less than” percentage distribution (less than the upper limit)

Less than   Cumulative Frequency   Relative Frequency   Percentage
6           0                      0.00                 0
11          5                      0.20                 20
16          8                      0.32                 32
21          17                     0.68                 68
26          24                     0.96                 96
31          25                     1.00                 100

The cumulative “more than” percentage distribution (the lower limit or more)

Lower limit   Cumulative Frequency   Relative Frequency   Percentage
6             25                     1.00                 100
11            20                     0.80                 80
16            17                     0.68                 68
21            8                      0.32                 32
26            1                      0.04                 4
31            0                      0.00                 0
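The tables above can be rebuilt programmatically. The following is a minimal Python sketch, assuming each class includes its lower boundary and excludes its upper one; the function name `frequency_table` and the dictionary keys are illustrative choices, not part of the assignment.

```python
from itertools import accumulate

# Q.1 data, already sorted in ascending order.
data = [6, 6, 8, 9, 10, 11, 12, 14, 16, 16, 17, 17, 18,
        19, 19, 19, 20, 21, 21, 21, 22, 24, 25, 25, 29]
boundaries = [6, 11, 16, 21, 26, 31]


def frequency_table(values, bounds):
    """Count the values in each class [lower, upper); the upper boundary is excluded."""
    n = len(values)
    rows = []
    for lower, upper in zip(bounds, bounds[1:]):
        freq = sum(lower <= v < upper for v in values)
        rows.append({
            "class": (lower, upper),
            "midpoint": (lower + upper) / 2,
            "frequency": freq,
            "relative": freq / n,
            "percent": 100 * freq / n,
        })
    return rows


table = frequency_table(data, boundaries)
frequencies = [row["frequency"] for row in table]

# Cumulative "less than" counts at each boundary (0 at the first, n at the last).
less_than = [0] + list(accumulate(frequencies))
# Cumulative "more than" counts: values at or above each boundary.
more_than = [len(data) - c for c in less_than]

for row in table:
    print(row)
print(less_than, more_than)
```

Dividing the cumulative counts by 25 gives the relative-frequency and percentage columns of the two cumulative tables.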
Q.2
Population Size, N=5, Sample Size, n=3
a) If order is important, then the permutation is used:
${}_{N}P_{n} = {}_{5}P_{3} = 60$ possible samples can be selected.
b) If order is not important, then the combination is used:
${}_{N}C_{n} = {}_{5}C_{3} = 10$ different samples can be selected.
c) The ten different samples in b) are HIJ, HIK, HIL, HJK, HJL, HKL, IJK, IJL, IKL, JKL.
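As a quick check of the counts in a) to c), here is a short Python sketch using the standard library. The population labels H to L are taken from part c); the variable names are illustrative.

```python
from itertools import combinations
from math import comb, perm

population = ["H", "I", "J", "K", "L"]  # N = 5
n = 3                                   # sample size

ordered_count = perm(len(population), n)    # order matters: 5P3
unordered_count = comb(len(population), n)  # order ignored: 5C3

# Enumerate every unordered sample of size 3.
samples = ["".join(s) for s in combinations(population, n)]

print(ordered_count, unordered_count)
print(samples)
```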
Q.3
First, we arrange the data in ascending order:
208 210 215 223 229 230 250 254 258 265 267 270 274
275 276 276 278 280 289 300 306 315 334 340 384
Range = Largest No. - Smallest No. = 384 - 208 = 176
The number of classes is usually between 5 and 15; we select No. of classes = 5.
Class interval = Range / No. of classes = 176/5 = 35.2; we round up and take Class Interval = 36.
Class Boundaries: 208, 244, 280, 316, 352, 388
a) Using the Raw data

Xi      Xi-Ẋ       (Xi-Ẋ)^2
208     -64.24     4126.778
210     -62.24     3873.818
215     -57.24     3276.418
223     -49.24     2424.578
229     -43.24     1869.698
230     -42.24     1784.218
250     -22.24     494.6176
254     -18.24     332.6976
258     -14.24     202.7776
265     -7.24      52.4176
267     -5.24      27.4576
270     -2.24      5.0176
274     1.76       3.0976
275     2.76       7.6176
276     3.76       14.1376
276     3.76       14.1376
278     5.76       33.1776
280     7.76       60.2176
289     16.76      280.8976
300     27.76      770.6176
306     33.76      1139.738
315     42.76      1828.418
334     61.76      3814.298
340     67.76      4591.418
384     111.76     12490.3
Sum = 6806         43518.6

Results
Mean = Ẋ           272.24
Median             274
Mode               276
Variance           1813.273
Sta. Deviation     42.58255
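The raw-data results can be cross-checked with Python's standard `statistics` module. This is a sketch for verification only; note that `statistics.variance` and `statistics.stdev` divide by n - 1 (sample statistics), which matches the variance reported above.

```python
import statistics

# Q.3 data in ascending order (n = 25).
data = [208, 210, 215, 223, 229, 230, 250, 254, 258, 265, 267, 270, 274,
        275, 276, 276, 278, 280, 289, 300, 306, 315, 334, 340, 384]

mean = statistics.mean(data)
median = statistics.median(data)      # middle (13th) value for n = 25
mode = statistics.mode(data)          # most frequent value
variance = statistics.variance(data)  # sample variance, divides by n - 1
std_dev = statistics.stdev(data)      # square root of the sample variance

print(mean, median, mode, round(variance, 3), round(std_dev, 5))
```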

b) Using Grouped Data

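Class      Class Midpoint=Xi   Frequency=fi   Xi*fi   Xi-Ẋ      (Xi-Ẋ)^2   fi*(Xi-Ẋ)^2
208-244    226                 6              1356    -44.64    1992.73    11956.38
244-280    262                 11             2882    -8.64     74.6496    821.1456
280-316    298                 5              1490    27.36     748.5696   3742.848
316-352    334                 2              668     63.36     4014.49    8028.979
352-388    370                 1              370     99.36     9872.41    9872.41
Sum                            25             6766                         34421.76

In the formulas below, $\bar{X}$ is the mean (written Ẋ in the tables), $L_1$ is the lower boundary of the median or modal class, $c$ is the class width, $F$ is the cumulative frequency before the median class, and $\Delta_1$, $\Delta_2$ are the differences between the modal class frequency and the frequencies of the classes before and after it.

Mean: $\bar{X} = \frac{\sum f_i X_i}{\sum f_i} = \frac{6766}{25} = 270.64$

Median: $L_1 + \frac{\frac{n}{2} - F}{f_{median}} \cdot c = 244 + \frac{\frac{25}{2} - 6}{11} \cdot (280 - 244) = 265.27$

Mode: $L_1 + \frac{\Delta_1}{\Delta_1 + \Delta_2} \cdot c = 244 + \frac{11 - 6}{(11 - 6) + (11 - 5)} \cdot (280 - 244) = 260.36$

Variance: $S^2 = \frac{1}{n - 1} \sum f_i (X_i - \bar{X})^2 = \frac{1}{25 - 1} \cdot 34421.76 = 1434.24$

Standard Deviation: $S = \sqrt{S^2} = 37.87$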
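The grouped-data calculations can be sketched in Python as follows. This assumes equal class widths and uses the interpolation formulas for the grouped median and mode; the variable names are illustrative.

```python
from math import sqrt

boundaries = [208, 244, 280, 316, 352, 388]
frequencies = [6, 11, 5, 2, 1]

n = sum(frequencies)
width = boundaries[1] - boundaries[0]  # equal class width (36)
midpoints = [(lo + hi) / 2 for lo, hi in zip(boundaries, boundaries[1:])]

# Grouped mean: frequency-weighted average of the class midpoints.
mean = sum(f * x for f, x in zip(frequencies, midpoints)) / n

# Median class: first class whose cumulative frequency reaches n / 2.
cumulative = 0
for i, f in enumerate(frequencies):
    if cumulative + f >= n / 2:
        median = boundaries[i] + (n / 2 - cumulative) / f * width
        break
    cumulative += f

# Modal class: the class with the highest frequency.
m = frequencies.index(max(frequencies))
before = frequencies[m - 1] if m > 0 else 0
after = frequencies[m + 1] if m < len(frequencies) - 1 else 0
d1, d2 = frequencies[m] - before, frequencies[m] - after
mode = boundaries[m] + d1 / (d1 + d2) * width

# Sample variance of the grouped data (divides by n - 1).
variance = sum(f * (x - mean) ** 2 for f, x in zip(frequencies, midpoints)) / (n - 1)
std_dev = sqrt(variance)

print(round(mean, 2), round(median, 2), round(mode, 2),
      round(variance, 2), round(std_dev, 2))
```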


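c) Comment: the results are close in both cases, i.e. when we use the raw data and when we use the grouped data, but the raw-data results are more accurate because grouping replaces each value with its class midpoint.
For the grouped data, mean (270.64) > median (265.27) > mode (260.36), so the distribution is skewed to the right. The raw-data values (mean 272.24, median 274, mode 276) do not follow this ordering.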
Q.4
a)

Range of Grades for Months

            Student Number
Months      1    2    3    4    5    6    7    8    9    10    Range   Mean   Variance   Sta. Dev.   CV(%)
October     5    7    18   22   22   24   24   26   41   43    38      23.2   149.067    12.2093     52.6262
November    4    5    15   17   21   25   30   35   42   41    38      23.5   185.389    13.6158     57.9394
December    11   12   20   22   22   23   24   25   31   33    22      22.3   48.9       6.99285     31.3581
January     2    4    5    7    23   25   42   43   45   45    43      24.1   344.767    18.5679     77.0452
February    5    37   38   39   40   41   42   43   42   44    39      37.1   132.1      11.4935     30.9797
March       4    7    3    6    5    6    8    12   12   44    41      10.7   146.011    12.0835     112.93

Descriptive Statistics for Every Student


                St.1      St.2      St.3      St.4      St.5      St.6      St.7       St.8    St.9      St.10
October         5         7         18        22        22        24        24         26      41        43
November        4         5         15        17        21        25        30         35      42        41
December        11        12        20        22        22        23        24         25      31        33
January         2         4         5         7         23        25        42         43      45        45
February        5         37        38        39        40        41        42         43      42        44
March           4         7         3         6         5         6         8          12      12        44
Range           9         33        35        33        35        35        34         31      33        12
Mean            5.16667   12        16.5      18.8333   22.1667   24        28.33333   30.7    35.5      41.6667
Variance        9.36667   157.6     158.7     146.967   122.967   123.2     165.4667   145     155.5     19.8667
Sta. Deviation  3.0605    12.5539   12.5976   12.123    11.089    11.0995   12.86338   12      12.47     4.4572
C.V(%)          59.2355   104.616   76.3492   64.3698   50.0257   46.2481   45.40018   39.3    35.1267   10.6973

b) The grades are most dispersed in March (the largest CV, 112.93%).
c) The grades are most centralized around the mean in February (the smallest CV, 30.98%).
d) The student whose grades are most dispersed over the six months is Student 2 (the largest CV, 104.62%).
e) The student whose grades are most centralized around the mean over the six months is Student 10 (the smallest CV, 10.70%).
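The answers to b) to e) follow from comparing coefficients of variation. A minimal Python sketch of that comparison, assuming CV = 100 * (sample standard deviation) / mean; the helper name `cv_percent` is illustrative.

```python
import statistics

# Rows are months, columns are students 1 to 10.
grades = {
    "October":  [5, 7, 18, 22, 22, 24, 24, 26, 41, 43],
    "November": [4, 5, 15, 17, 21, 25, 30, 35, 42, 41],
    "December": [11, 12, 20, 22, 22, 23, 24, 25, 31, 33],
    "January":  [2, 4, 5, 7, 23, 25, 42, 43, 45, 45],
    "February": [5, 37, 38, 39, 40, 41, 42, 43, 42, 44],
    "March":    [4, 7, 3, 6, 5, 6, 8, 12, 12, 44],
}


def cv_percent(values):
    """Coefficient of variation in percent, using the sample standard deviation."""
    return 100 * statistics.stdev(values) / statistics.mean(values)


month_cv = {month: cv_percent(row) for month, row in grades.items()}

# zip(*rows) transposes the table, so each tuple holds one student's six grades.
student_cv = {
    number: cv_percent(column)
    for number, column in enumerate(zip(*grades.values()), start=1)
}

most_dispersed_month = max(month_cv, key=month_cv.get)
most_centralized_month = min(month_cv, key=month_cv.get)
most_dispersed_student = max(student_cv, key=student_cv.get)
most_centralized_student = min(student_cv, key=student_cv.get)

print(most_dispersed_month, most_centralized_month)
print(most_dispersed_student, most_centralized_student)
```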
Q.5
Q.6
Additional Problems:
Q.1

Set A has the smallest standard deviation, since the numbers in Set A are grouped most closely around the center (the mean). Set C has the largest standard deviation, since the numbers in Set C are the most widely dispersed from the center (the mean).
Q.2

                 Line 1        Line 2
                 2.18          1.62
                 2.12          2.52
                 2.24          1.69
                 2.31          1.79
                 2.02          2.49
                 2.09          1.67
                 2.23          2.04
                 2.02          1.98
                 2.19          2.66
                 2.32          1.99

Mean             2.172         2.045
Variance         0.01166222    0.14682778
Sta. Deviation   0.10799177    0.38318113
CV(%)            4.97199674    18.7374635
b) Line 2 is the most dispersed because it has the higher standard deviation (0.383 against 0.108).
c) Production line 1 produces ball diameters that are most centralized around the mean, because it has the smaller standard deviation.
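The comparison in b) and c) rests on the two sample standard deviations. A short Python sketch for checking them; the function name `summarize` is illustrative.

```python
import statistics

line_1 = [2.18, 2.12, 2.24, 2.31, 2.02, 2.09, 2.23, 2.02, 2.19, 2.32]
line_2 = [1.62, 2.52, 1.69, 1.79, 2.49, 1.67, 2.04, 1.98, 2.66, 1.99]


def summarize(values):
    """Return mean, sample variance, sample standard deviation and CV in percent."""
    mean = statistics.mean(values)
    std_dev = statistics.stdev(values)
    return mean, statistics.variance(values), std_dev, 100 * std_dev / mean


summary = {"Line 1": summarize(line_1), "Line 2": summarize(line_2)}

# Index 2 of each summary tuple is the standard deviation.
most_dispersed = max(summary, key=lambda name: summary[name][2])
most_centralized = min(summary, key=lambda name: summary[name][2])

print(most_dispersed, most_centralized)
```

The line with the smaller standard deviation is the one whose diameters cluster most tightly around its mean.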
Q.3
First, we arrange the sample (minutes) in ascending order, n = 50:
1.5, 2, 2, 2.5, 2.5, 2.5, 3, 3, 3.5, 3.5, 4.5, 4.5, 5, 5, 5, 5, 5, 5, 5.5, 5.5, 5.5, 5.5, 6, 6,
6.5, 6.5, 6.5, 6.5, 7, 7, 7, 7, 7, 7, 7, 7.5, 7.5, 7.5, 7.5, 8, 8.5, 8.5, 9, 9.5, 10, 10, 10,
10, 10.5.

Range = Largest No. - Smallest No. = 10.5 - 1.5 = 9
The number of classes is usually between 5 and 15; we select No. of classes = 5.
Class interval = Range / No. of classes = 9/5 = 1.8; we round up and take Class Interval = 2.
Class Boundaries: 1, 3, 5, 7, 9, 11

a) We grouped the data into five classes with a minimum value of 1.0.

Class Frequency
1--3 7
3--5 6
5--7 16
7--9 14
9--11 7

b) The percentage of response times less than seven minutes is 58% (29 values out of 50).
c) The percentage of response times greater than three minutes and less than seven minutes is 40% (20 values out of 50).
d) The percentage of response times of nine minutes or more is 14% (7 values out of 50).
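Parts b) and d) can be read directly from the cumulative class frequencies, as the sketch below illustrates in Python. Part c) is not included because 3 is a class boundary, so excluding the values equal to 3 requires the raw data rather than the frequency table.

```python
from itertools import accumulate

boundaries = [1, 3, 5, 7, 9, 11]
frequencies = [7, 6, 16, 14, 7]
n = sum(frequencies)

# Number of values below each class boundary.
below = dict(zip(boundaries, [0] + list(accumulate(frequencies))))

pct_less_than_7 = 100 * below[7] / n        # part b)
pct_9_or_more = 100 * (n - below[9]) / n    # part d)

print(pct_less_than_7, pct_9_or_more)
```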
Q.4

a) There are actually 3 numbers between the left endpoint of that interval and 8 (the hatched area).
b)
Case-1
Class Midpoint=Xi   Frequency=fi   Xi*fi
0                   2              0
1                   7              7
2                   12             24
3                   7              21
4                   6              24
5                   7              35
6                   1              6
7                   3              21
8                   3              24
10                  2              20
Sum                 50             182

Statistics of the Xi*fi column:
Mean             18.2
Sta. Deviation   10.5809052
CV               0.58136842

Case-2
Class Midpoint=Xi   Frequency=fi   Xi*fi
0.5                 4              2
1.5                 11             16.5
2.5                 9              22.5
3.5                 9              31.5
4.5                 5              22.5
5.5                 4              22
6.5                 1              6.5
7.5                 3              22.5
8.5                 2              17
9.5                 2              19
Sum                 50             182

Statistics of the Xi*fi column:
Mean             18.2
Sta. Deviation   8.505553741
CV               0.467338118

Conclusion:
Using the statistical measures for the two cases, we conclude that the second histogram represents the data better, since it has the smaller standard deviation; in other words, its data lie closer to the center (the mean).
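The mean of 18.2 and the standard deviations reported for the two cases coincide with the plain mean and sample standard deviation of the Xi*fi column. As an alternative check, the sketch below applies the frequency-weighted grouped-data formulas from Q.3 b) to both cases. The function name `grouped_stats` is illustrative, and this is offered as a cross-check under that assumption rather than as the method the assignment used.

```python
from math import sqrt


def grouped_stats(midpoints, frequencies):
    """Frequency-weighted mean, sample standard deviation and CV for grouped data."""
    n = sum(frequencies)
    mean = sum(f * x for f, x in zip(frequencies, midpoints)) / n
    variance = sum(f * (x - mean) ** 2
                   for f, x in zip(frequencies, midpoints)) / (n - 1)
    std_dev = sqrt(variance)
    return mean, std_dev, std_dev / mean


case_1 = grouped_stats([0, 1, 2, 3, 4, 5, 6, 7, 8, 10],
                       [2, 7, 12, 7, 6, 7, 1, 3, 3, 2])
case_2 = grouped_stats([0.5, 1.5, 2.5, 3.5, 4.5, 5.5, 6.5, 7.5, 8.5, 9.5],
                       [4, 11, 9, 9, 5, 4, 1, 3, 2, 2])

print(case_1)
print(case_2)
```

Under this weighting both cases have the same mean (182/50 = 3.64), and the second case still has the smaller standard deviation, so the direction of the conclusion is unchanged.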
