Sa Lab Manual Final Merged
Sa Lab Manual Final Merged
GOVERNMENT POLYTECHNIC,
GAURIBIDANUR – 561208
Academic Year :
Name : ___________________________________
Reg No : ___________________________________
Programme : ___________________________________
Batch No : ___________________________________
Prepared by
Venugopal C L M.Sc , B.Ed
Lecturer ,
Dept of Science
Government Polytechnic, Gauribidanur
INSTRUCTIONS TO THE STUDENTS
1. The following materials are required for every Practical Class.
i) SA Lab Manual
ii) Practical Record
iii) Observation Book
2. There are two Laboratory classes every week (each class 3 hours duration)
3. 75% of attendance is compulsory for appearing for examination
4. CIE two written tests for 20 marks will be conducted. CIE three skill tests will be conducted for 100
marks(3 hours duration) and obtained marks are scaled down to 20 marks. One Student activity
must be submitted at the end for 20 marks.
Sl Questions Answers
No
1 Was this your first visit to our Restaurant? Yes / No
2 Are our hours convenient ? Yes / No
3 Is our restaurant would be a good choice for Yes / No
Celebration?
4 How did you hear about our restaurant? Social media / Personal
recommendation / Advertisement
5 Was it easy to find our restaurant? Yes / No
6 Did you have any trouble with parking? Yes / No
7 Were the bath rooms and other facilities clean ? Yes / No
8 Were the seats and tables comfortable? Yes / No
9 How was the lighting level ? too bright/too dim /just right
10 How was the music volume ? too loud / too soft / just right
11 Did everything comeout exactly as you ordered it ? Yes / No
12 Was your food warm when it arrived? Yes / No
13 Did you order alcoholoic beverages? Yes / No
14 Was your service friendly and helpful Yes / No
15 Did the host greet you upon arrival? Yes / No
16 Did your food come out fast enough? Yes / No
17 Did the manager visit your table? Yes / No
18 Did your server provide recommendations? Yes / No
19 Was your water refilled in a timely manner? Yes / No
20 Was the food tasty ? Yes / No
21 Did you enjoy viewing our menu on your phone? Yes / No
22 Did you use any discount on special offers? Yes / No
23 Did you make reservation for your visit? Yes / No
24 How likely that you would come back? Daily/Sometimes/never
25 Are you satisfied with the online ordering of our food
on the website? Yes / No
Assignment : 1. Prepare a questionnaire containing 20 closed end questions about the facilities and
services provided to passengers in KSRTC bus stand.
2. Prepare a questionnaire containing 20 closed end questions about the feedback from a student
in a Government school
3.Prepare a questionnaire containing 20 closed end questions about the feedback from a patient in a
Government Hospital
Experiment No 2
Prepare a Google form for a specified problem statement to collect the data set
(For example : Questionnaire to conduct online Quiz)
Aim: To send out a survey for a Problem Statement to 20 persons by google forms and collect the
dataset in spreadsheet.
Procedure :
1. Prepare a online survey in google form with the steps as explained in experiment 2
(transform the same questions into google form)
2. Send the link to at least 25 people via E-mail or whatsapp and ask them to fill and submit.
3. Open concerned survey google form
4. At the top, click “Responses”. You can see the submitted responses in summary,
question, or Individual.
5. Click “individual” to see response one by one . To move between individuals , click
‘Previous’ < or ‘Next’>.
6. To collect the data click on 3 dots .
7. Collect the data into your E-mail ID or select the destination in your google drive or
download into your PC or take printout or delete all collected data.
Result /Inference : Google form to survey for a Problem statement was sent to 20 persons and their
response are collected in a spread sheet.
Experiment No 3[Output]
Send out a survey for a Problem Statement to 20 persons by google forms and collect the dataset in
spreadsheet.
Experiment No 4
Remove Duplicate or Irrelevant observations from the dataset provided , including
Duplicate observations or Irrelevant observations.
Aim: To Remove Duplicate or Irrelevant observations from the dataset provided , including Duplicate
observations or Irrelevant observations
Procedure :
1. Go to google.com and log in to your google account.
2. Click on google apps ( 9 dots ) -> Select Drive -> click on My drive
3. Open Experiment no 3 Google form -> Click on responses -> click on create spread sheet-> the
responses of Experiment no 3 will be displayed.
4. To identify the duplicates follow below steps.
i. For instance , select Column A -> Format -> Conditional formatting
ii. Under format rules open the drop-down list and select “Custom formula is”
iii. Enter the value for custom formula as “ =count if (A1:A,A1)>1”
iv. Below Format rules, clicking on Formatting styles, set a different colour for highlighted
duplicates. To do that Click the “Fill colour” icon and select the preferred shade(Ex : Red)
v. Once finished , Click “Done” to highlight the duplicates in a single column.
vi. Similarly , if we have to do it for column C, the formula becomes , “ =count if (C1:C,C1)>1” and
we do it so on for other columns as well.
5. To delete the duplicates follow below steps.
i. Select a column from where you want to remove the duplicates.
ii. Click Data -> Remove Duplicates.
iii. We will see a pop-up window -> Tick the box next to Data has header now -> Click ‘Remove
duplicates’ -> Click ‘’Done’’ .
iv. Repeat the steps for other columns as well.
Experiment No 4 (Output)
Remove Duplicate or Irrelevant observations from the dataset provided , including
Duplicate observations or Irrelevant observations
In the data set given below contains the malnutrition in kids in different states of India, based on
the three reasons is given. The data set contains duplicates which are highlighted. Now Duplicates
are removed as shown below
8. Then copy the Class interval column and frequency column separately and paste in the separate
column . This is the Frequency distribution table,
Out put : The Frequency Distribution Table for the given data is constructed.
Experiment No 5 (Output)
Experiment No 5
To draw the frequency distribution table for the randomly generated data
6. Then Click Data-> Click “ Data analysis” -> select “Histogram”-> Click “OK” .
7. Then in the page displayed, select “entire marks” for input range, select “Bin values” for bin range
, Tick label box, and select a “blank cell“ for out put range and click “OK”. Then the following table
get displayed.
Bin Frequency
20 7
40 8
60 10
80 10
100 15
8. Then copy the Class interval column and frequency column separately and paste in the separate
column . This is the Frequency distribution table,
9. Find the “Total Frequency ” and insert another column for “Relative Frequency” and use the formula
“frequency/Total frequency [ = frequency value/50 ] and press “Enter”. Then go to the corner of
the cell and drag it to the end to get relative frequency. The total of all relative frequencies should
be 1.
Output : The relative frequency distribution table for the given data is constructed.
Experiment No 6 (Output)
EXPERIMENT NO 6 :
To draw the relative frequency distribution table for the randomly generated data
Name of student Marks class interval Bin (upperlimit) Bin (upperlimit) Frequency
student 1 69 1-20 20 20 9
student 2 12 21-40 40 40 13
student 3 25 41-60 60 60 9
student 4 70 61-80 80 80 13
student 5 81 81-100 100 100 6
student 6 87 More 0
student 7 24
student 8 22
student 9 77
student 10 66 class interval Frequency relative frequency
student 11 94 1-20 9 0.18
student 12 23 21-40 13 0.26
student 13 68 41-60 9 0.18
student 14 61 61-80 13 0.26
student 15 50 81-100 6 0.12
student 16 23 Total 50 1
student 17 44 Relative frequency distribution table
student 18 78
student 19 95
student 20 62
student 21 52
student 22 68
student 23 35
student 24 17
student 25 43
student 26 58
student 27 32
student 28 62
student 29 10
student 30 41
student 31 56
student 32 28
student 33 29
student 34 28
student 35 7
student 36 24
student 37 77
student 38 37
student 39 1
student 40 90
student 41 78
student 42 43
student 43 34
student 44 8
student 45 57
student 46 67
student 47 81
student 48 16
student 49 15
student 50 4
Experiment No 7
Using Microsoft Excel spread sheet plot bar graph for the data collected from 50 people(
for example, conduct a survey on the favorite fruit of a person in your locality(restricting
to 5 to 6 fruits). Explain the bar graph with minimum 30 words.
Aim: Bar graph for the data collected survey on the favourite fruits from 50 persons.
Procedure :
1. Choose 04 to 05 fruits such as Apple,Orange,Banana,Grapes and enter these fruits randomly to 50
persons .
Name of Person Fruit name
Person 1 Apple
person 2 Orange
Person 3 Banana
Person 4 Grapes
Person 5 Apple
Person 6 Orange
2. Now write Fruits names and Frequency (count of fruits ) in a table as given below
Fruit name Count
Apple
Orange
Banana
Grapes
3. Now use “COUNTIF“ formula to find the count of each place by using [=COUNTIF(range,criteria)]
and select the data range and select the the place name ,close bracket and press enter to get the
count of that place.
4. Repeat the above steps to find the count of other places
Fruit name Count
Apple 15
Orange 14
Banana 14
Grapes 7
5. Now select the Fruit name and count, and press enter to get the bargraph and it can be modified
using chart elements.
Output : The barchart is plotted for the given data .
Experiment No 7(Output)
EXPT NO 7
To draw a Bargrap for survey of favourite fruits of 50 persons in your locality (restricted to 4 to 6 fruits)
10
Person 15 Apple 8 7
Person 16 Orange 6
Person 17 Banana 4
Person 18 Grapes 2
0
Person 19 Apple Apple Orange Banana Grapes
Person 20 Orange Count 15 14 14 7
Person 21 Banana Fruit name
Person 22 Apple
Person 23 Orange
Person 24 Banana
Person 25 Grapes
Person 26 Apple
Person 27 Orange
Person 28 Banana
Person 29 Apple
Person 30 Orange
Person 31 Banana
Person 32 Grapes
Person 33 Apple
Person 34 Orange
Person 35 Banana
Person 36 Apple
Person 37 Orange
Person 38 Banana
Person 39 Grapes
Person 40 Apple
Person 41 Orange
Person 42 Banana
Person 43 Apple
Person 44 Orange
Person 45 Banana
Person 46 Grapes
Person 47 Apple
Person 48 Orange
Person 49 Banana
Person 50 Apple
Experiment No 8
Using Microsoft Excel spread sheet plot pie chart for the data collected from 50 people(
for example, conduct a survey on the smokers with respect to their ages in your locality.
Explain the pie chart with minimum 30 words.
Aim : To plot Pie chart for a Survey on smokers with respect to age in MS excel spread sheet.
Procedure :
1. Open MS excel sheet and Type ‘ Smoker count ‘ in one column and ‘Age ’ in another column
2. Click ‘Insert” -> Pivot table -> OK
3. In Pivot table click “Table/range”-> Select Age column -> click and Choose Existing worksheet and
click ‘Location’ -> select any blank cell -> click OK
4. On rightside of the screen ,Pivot table fields get displayed
5. Tick “Age of Smokers” and drag to “Row Labels” . Then Click Value Field settings-> Select “count of
age” click ok
6. Then click on any one cell of Row labels -> Right click -> select Group-> for grouping ->Define starting
at ending value with common difference like 10-> Click OK. Table will look like as shown below.
Now Row labels indicates Age Range
Row Labels Count of Age
20-29 7
30-39 10
40-49 8
50-59 9
60-69 1
70-79 10
80-90 5
Grand Total 50
7. Select above table values -> Click insert -> Pie chart . Pie chart get displayed.
8. Then Right click on Pie graph -> Click “Format data Labels” -> Tick values, percentage, label outside
-> click OK
Output : Pie chart for a Survey on smokers with respect to age in MS excel spread sheet is
constructed
Experiment No 8 (Output)
EXPERIMENT NO 8 :
To plot Pie chart for a Survey on smokers with respect to age in MS excel spread sheet.
Smoker23 77 1, 2% 60-69
Smoker24 43 70-79
Smoker25 65 80-90
9, 18% 8, 16%
Smoker26 49
Smoker27 39
Smoker28 68
Smoker29 33
Smoker30 66
Smoker31 42
Smoker32 79
Smoker33 90
Smoker34 22
Smoker35 87
Smoker36 78
Smoker37 41
Smoker38 73
Smoker39 78
Smoker40 69
Smoker41 35
Smoker42 90
Smoker43 41
Smoker44 62
Smoker45 43
Smoker46 22
Smoker47 53
Smoker48 70
Smoker49 38
Smoker50 72
Experiment No 9
Using Microsoft Excel spread sheet draw a line graph for the given dataset.
Aim: To draw the line graph for the daily pocket expenses of 50 students in MS excel spreadsheet
Procedure:
1. Open MS excel sheet and Type ‘ Name of student ‘ in one column and ‘Expenses’ in another column
2. Under Name of student column type Student 01 and Student 02 drag it up to student 50
3. Generate the pocket expenses randomly for 50 students (or enter manually)
4. Then select complete table -> Click “insert” -> click “Line chart”
5. Then Line chart get displayed.
Output: Line graph for the given data is plotted.
Experiment No 9 (Output)
Experiment No 9
To draw a Line graph for the given data set
student 3
student 9
student 1
student 5
student 7
student 15
student 21
student 23
student 29
student 35
student 41
student 47
student 49
student 11
student 13
student 17
student 19
student 25
student 27
student 31
student 33
student 37
student 39
student 43
student 45
student 14 264
student 15 268
student 16 64
student 17 148
student 18 127
student 19 126
student 20 257
student 21 289
student 22 122
student 23 143
student 24 144
student 25 80
student 26 75
student 27 228
student 28 184
student 29 296
student 30 104
student 31 144
student 32 101
student 33 153
student 34 36
student 35 108
student 36 171
student 37 36
student 38 32
student 39 114
student 40 284
student 41 61
student 42 252
student 43 176
student 44 77
student 45 255
student 46 198
student 47 204
student 48 71
student 49 15
student 50 124
Experiment No 10
Using Microsoft Excel spread sheet draw frequency polygon and frequency curve for the
data collected from 50 people.
Aim : To draw frequency polygon and frequency curve for 50 students in a class in 5 subjects in previous
exam
Procedure :
1. Open MS excel sheet and Type ‘ Name of student ‘ in one column SUB1, SUB2,SUB3,SUB4,
SUB 5, in another columns
2. Under Name of student column type Student 01 and Student 02 drag it up to student 50
3. Enter the marks in 5 subjects manually (or generate randomly)
4. Create a “class interval” and “upper limit” of data set table manually
5. Click ‘Data’ -> Data analysis -> Histogram -> OK
6. Click input range tab -> select SUB 1 marks of all students
Click Bin range tab -> select all values of upper limit
Click in output range -> Select few empty cells
Tick cumulative percentage -> OK
7. Repeat step 5 and 6 for all subjects
8. Frequency distribution table get displayed for all 5 subjects
9. Now by Pressing control Key , select the frequency values of all 5 subjects
10. Then Click -> insert-> Line chart . Then Frequency polygon get displayed .
Out put : The frequency polygon is plotted for the given data
Experiment No 10 (Output)
Experiment no 10
To draw Frequency polygon and curve for the marks obtained by 50 students in 5 subjects in previous exam
SUB1
Name of student SUB1 SUB2 SUB3 SUB4 SUB5 Class interval Bin Bin FrequencyCumulative %
student 1 97 54 45 49 10 1-20 20 20 5 10.00%
student 2 59 25 82 48 39 21-40 40 40 9 28.00%
student 3 52 31 69 22 70 41-60 60 60 10 48.00%
student 4 72 40 91 91 39 61-80 80 80 13 74.00%
student 5 38 31 11 87 67 81-100 100 100 13 100.00%
student 6 37 90 51 98 66 More 0 100.00%
student 7 54 33 80 33 98
student 8 62 92 59 59 36 SUB 2 SUB 3
student 9 34 13 76 94 81 Bin Frequency Cumulative % Bin Frequency Cumulative %
student 10 87 24 53 60 84 20 6 12.00% 20 5 10.00%
student 11 90 29 33 43 38 40 17 46.00% 40 11 32.00%
student 12 19 33 15 90 86 60 4 54.00% 60 11 54.00%
student 13 71 12 33 98 69 80 12 78.00% 80 10 74.00%
student 14 44 70 27 56 70 100 11 100.00% 100 13 100.00%
student 15 33 16 78 91 85 More 0 100.00% More 0 100.00%
student 16 32 27 98 70 90
student 17 92 37 81 53 61 SUB 4 SUB 5
student 18 54 89 59 72 42 Bin Frequency Cumulative % Bin Frequency Cumulative %
student 19 85 84 57 81 91 20 1 2.00% 20 6 12.24%
student 20 84 72 95 80 73 40 9 20.00% 40 8 28.57%
student 21 62 72 82 92 76 60 15 50.00% 60 7 42.86%
student 22 68 19 41 79 94 80 13 76.00% 80 14 71.43%
student 23 69 53 93 65 86 100 12 100.00% 100 14 100.00%
student 24 12 76 66 27 69 More 0 100.00% More 0 100.00%
student 25 10 53 81 91 76
student 26 81 71 47 80 24
student 27 71 97 76 22 77
student 28 49 99 55 64 80
student 29 12 26 49 82 14
student 30 93 29 39 33 57 FREQUENCY POLYGON
student 31 66 12 49 53 88 18
student 32 37 78 12 65 95 16
student 33 72 72 95 89 63 14
student 34 21 90 10 53 11 12
student 35 69 69 89 55 44 10
student 36 45 24 91 30 16
FREQUENCY->
8
student 37 68 32 28 65 88 6
student 38 90 84 71 58 92 4
student 39 67 61 74 39 54 2
student 40 47 77 30 41 32 0
student 41 87 25 83 32 11 1 2 3 4 5 6
student 42 92 95 24 64 91 SUB->
student 43 17 66 18 53 23
student 44 45 83 21 72 57
student 45 88 19 28 11 29
student 46 34 26 24 38 46
student 47 92 90 37 60 42
student 48 32 78 75 79 19
student 49 65 46 77 49 65
student 50 55 24 85 74 25
Experiment No 11
Using Microsoft Excel spread sheet construct a Boxplot curve for the given data set
Aim: To construct a Boxplot for the number of passengers in a Flat form at different time in a day
Procedure :
1. Open MS excel sheet and Type ‘ Sl no ‘ in one column , ‘Timings ‘ in another column and No of
passangers in one more column .
2. Enter the timings and No of Passangers manually (or generate randomly)
3. Find the Minimum[= MIN(DATA) ] ,Quartile1[=QUARTILE(DATA,1)],
Median[=MEDIAN(DATA)], Quartile3[=QUARTILE (DATA, 3) and maximum[=MAX(DATA)] values .
4. Then Select No of passengers column and Click’ Insert’-> recommended charts-> click ALL-> Select
Boxplot chart
5. Then the Boxplot chart get displayed.
Output : The Box plot is constructed for the given data set.
Experiment No 11(Output)
Experiment no 11
To construct a Boxplot for the given Data set.
Sl no Timings No of Passangers
1 6.00 8 MIN 4
2 6.30 81 Q1 34.5
3 7.00 86 MEDIAN 94.5
4 7.30 15 Q3 136.75
5 8.00 68 MAX 199
6 8.30 16
7 9.00 45
8 9.30 142 0
9 10.00 113
10 10.30 103
11 11.00 124
12 11.30 96
13 12.00 51
14 12.30 27
15 13.00 188
16 13.30 187
17 14.00 97
18 14.30 128
19 15.00 152
20 15.30 56
21 16.00 175
22 16.30 93
23 17.00 73
24 17.30 71
25 18.00 118
26 18.30 103
27 19.00 107
28 19.30 47
29 20.00 75
30 20.30 19
31 21.00 4
32 21.30 108
33 22.00 195
34 22.30 25
35 23.00 23
36 23.30 36
37 24.00 152
38 1.15 24
39 2.15 12
40 3.15 30
41 4.15 36
42 6.15 187
43 7.15 151
44 8.15 135
45 9.15 13
46 10.15 104
47 11.15 149
48 12.15 199
49 12.45 129
50 12.55 157
Experiment No 12
Using Microsoft Excel spread sheet construct a leaf plot for the given dataset. Explain the
graph with minimum 30 words.
Aim: To construct a leaf plot for the given data set (marks scored in a subject by 50 students)
Procedure:
1. Open MS excel sheet and Type ‘ Name of student ‘ in one column and ‘Marks’ in another column
2. Under Name of student column type Student 01 and Student 02 drag it up to student 50
3. Using “ RANDBETWEEN” formula by taking limits 10 to 99 [ =RANDBETWEEN(10,99)] in a separate
column , generate the marks for 50 students.
4. Copy these values and paste in the Marks Column (otherwise values change every time when you
press enter)
6. Find the Minimum[= MIN(DATA) ] and maximum[=MAX(DATA)] values of the
5. Enter ‘stem value’ in one column and ‘Leaf value’ in another column
6. Enter 1 to 9 in stem value column
7. To find leaf value for stem value 1 use the following formula
=REPT("0",COUNTIF($C$6:$C$55,E16*10+0))
&REPT("1",COUNTIF($C$6:$C$55,E16*10+1))
&REPT("2",COUNTIF($C$6:$C$55,E16*10+2))
&REPT("3",COUNTIF($C$6:$C$55,E16*10+3))
&REPT("4",COUNTIF($C$6:$C$55,E16*10+4))
&REPT("5",COUNTIF($C$6:$C$55,E16*10+5))
&REPT("6",COUNTIF($C$6:$C$55,E16*10+6))
&REPT("7",COUNTIF($C$6:$C$55,E16*10+7))
&REPT("8",COUNTIF($C$6:$C$55,E16*10+8))
&REPT("9",COUNTIF($C$6:$C$55,E16*10+9))
[Note : to get $ symbol , after selecting data in COUNTIF function, by holding “fn” key press F4 key]
8. Then Click on the first leaf value and drag it up to 9 th value . Leaf values for all stem values get
displayed.
Output : A Leaf plot for the given data set is constructed
Stem Leaf
1 0 8 9
2 02566889
3 2237
4 112234445566
5 6
6 2334578
7 02378
8 01377
9 0 2 3 5 8
Experiment No 12(Output)
Experiment no 12
To construct a Leaf plot for the given data set
6. Then select entire data(height) -> click “insert”->select “Recommended charts”->click “All charts”->
select “Histogram” -> click “OK”
7. Then Histogram chart get displayed. Rename the tittle as Histogram .
Out put : Mean, Median and Mode for the given data is found and the data is represented in
Histogram.
Experiment No 13(Output)
Experiment No 13
To find Mean,Median and Mode for heights of 50 students in a class and represent them in Histogram
Aim : To generate a 50 random data sample (even and odd number dataset) using Microsoft Excel
spread sheet and determine the range and Quartiles
Procedure:
1. Open MS excel sheet and Type ‘ Sl no ‘ in one column and ‘Even data set’ in another column and
also “Odd data set” in one more column
2. Enter 50 even numbers randomly and 50 odd numbers manually
3. Then Find Min, Quartile 1, Quartile 2, Quartile 3, and Max, values for even data set using the formula
Out Put : The Range and Quartiles for Even data set and Odd data set is determined.
Experiment No 14 (Output)
Experiment No 14
To generate a 50 random data sample (even and odd number dataset) using Microsoft Excel spread sheet and determine the range and Quartiles
Sl No Even data set For Even data Sl No Odd data set For Odd data
1 10 1 21
2 12 Min 2 2 23 Min 1
3 14 Quartile1(Q1) 25.5 3 25 Quartile1(Q1) 24.5
4 16 Quartile2(Q2) 51 4 27 Quartile2(Q2) 50
5 18 Quartile3(Q3) 76.5 5 29 Quartile3(Q3) 75.5
6 20 Max 100 6 31 Max 99
7 22 7 33
8 2 Range= Max-Min 98 8 15 Range= Max-Min 98
9 4 IQR= Q3-Q1 51 9 17 IQR= Q3-Q1 51
10 6 10 19
11 8 11 1
12 24 12 3
13 26 13 5
14 28 14 7
15 30 15 9
16 32 16 11
17 34 17 13
18 36 18 35
19 38 19 37
20 40 20 39
21 42 21 41
22 44 22 43
23 46 23 45
24 48 24 47
25 50 25 49
26 52 26 51
27 54 27 53
28 56 28 55
29 58 29 57
30 60 30 59
31 62 31 61
32 64 32 63
33 66 33 65
34 68 34 67
35 70 35 69
36 72 36 71
37 74 37 73
38 76 38 75
39 78 39 77
40 80 40 79
41 82 41 81
42 84 42 83
43 86 43 85
44 88 44 87
45 90 45 89
46 92 46 91
47 94 47 93
48 96 48 95
49 98 49 97
50 100 50 99
Experiment No 15
Collect the current Yield of a crop from 50 different persons in your locality and
determine Mean deviation and Quartile deviation in Microsoft Excel spread sheet.
Aim: To collect the current yield of Ragi crop from 50 different persons and to determine Mean
deviation and Quartile deviation in MS Excel spread sheet
Procedure:
1. Open MS excel sheet and Type ‘ Person Name ‘ in first column , ‘Ragi crop yield(Quintal)’ in second
column, “Deviation” in third column and “Absolute deviation(ABS)” in fourth column.
2. Under “Person name” column type Person 01 and Person 02 drag it up to Person 50
3. Generate random yield data of 50 persons using “ RANDBETWEEN” formula by taking limits 10 to 50
“ =RANDBETWEEN(10,50)” in a separate column
4. Copy these values and paste as values in the Ragi yield Column (otherwise values change every time
when you press enter)
5. Then Find Mean, Quartile 1 and Quartile 3 values for crop yield data set using the formula
Output: Mean deviation and Quartile deviation for the given data (yield of Ragi crop from 50
different persons) is determined.
Experiment No 15 (Output)
Experiment No 15
To dtermine Mean deviation and Quartile deviation in MS Excel spread sheet for the data of current yield of crop (Ragi) collected
from 50 different persons.
Deviation Absolute
Person Name Ragi crop yield ( Quintals) (Yield-Mean) deviation(ABS) Mean 31.38
person1 50 18.62 18.62 Quartile 1(Q1) 22
person2 20 -11.38 11.38 Quartile 3(Q3) 43
person3 42 10.62 10.62
person4 33 1.62 1.62 Quartile deviation 10.5
person5 23 -8.38 8.38 (Q3-Q1)/2
person6 36 4.62 4.62
person7 20 -11.38 11.38 Mean deviation 10.2696
person8 34 2.62 2.62
person9 47 15.62 15.62
person10 38 6.62 6.62
person11 11 -20.38 20.38
person12 21 -10.38 10.38
person13 34 2.62 2.62
person14 22 -9.38 9.38
person15 46 14.62 14.62
person16 28 -3.38 3.38
person17 22 -9.38 9.38
person18 46 14.62 14.62
person19 23 -8.38 8.38
person20 45 13.62 13.62
person21 43 11.62 11.62
person22 34 2.62 2.62
person23 26 -5.38 5.38
person24 23 -8.38 8.38
person25 43 11.62 11.62
person26 25 -6.38 6.38
person27 48 16.62 16.62
person28 39 7.62 7.62
person29 44 12.62 12.62
person30 22 -9.38 9.38
person31 17 -14.38 14.38
person32 23 -8.38 8.38
person33 46 14.62 14.62
person34 18 -13.38 13.38
person35 34 2.62 2.62
person36 35 3.62 3.62
person37 10 -21.38 21.38
person38 38 6.62 6.62
person39 14 -17.38 17.38
person40 44 12.62 12.62
person41 41 9.62 9.62
person42 44 12.62 12.62
person43 33 1.62 1.62
person44 44 12.62 12.62
person45 10 -21.38 21.38
person46 23 -8.38 8.38
person47 19 -12.38 12.38
person48 43 11.62 11.62
person49 27 -4.38 4.38
person50 18 -13.38 13.38
Experiment No 16
Collect the data of any 2 livestock population from 50 different houses in your locality
and determine standard deviation for both the two separately in Microsoft excel spread
sheet and brief your inference with less than 30 words.
Aim: To collect the data of Goat population and Sheep population from 50 different houses and to
determine standard deviation for both the two separately in MS Excel spread sheet.
Procedure:
1. Open MS excel sheet and Type ‘ House name ‘ in first column , ‘Goats’ in second column, “Sheep”
in third column .
2. Under “House name” column type H1 and H2 drag it up to H50.
3. Generate randomly Goat population of 50 Houses using “ RANDBETWEEN” formula by taking limits
1 to 50 “ =RANDBETWEEN(10,50)” in a separate column.
4. Copy these values and paste as values in the Goat column. (otherwise values change every time
when you press enter)
5. Similarly Generate Sheep population randomly and Copy the values and paste as values in the Sheep
column.
6. Then find the standard deviation of the Goat population using the formula
“=STDEV.P(Select all data values of Goat)”
7. Similarly find the standard deviation of the Sheep population using the formula
“=STDEV.P(Select all data values of Sheep)”
Out put : The standard deviation for Goat population and Sheep population are determined.
Experiment No 16 (Output)
Experiment No 16
To determine Standard Deviation for the data of Goats poulation and Sheep
population collected from 50 different houses.
Experiment No 17
To Collect the data of two wheeler (with a rider and a pillion) crossing a busy junction in the peak hours and
to determine the variance of the data in Microsoft excel spread sheet
Place : Bangalore
Time : 5.00 pm To 6.00 pm
Procedure:
1. Open MS excel sheet and Type ‘ Sl no ‘ in first column , ‘Data values’ in second column.
2. Under “Sl no ” column type “1” and “2” drag it up to 50.
3. Generate randomly “Data values” using “ RANDBETWEEN” formula by taking limits 1 to 100
“ =RANDBETWEEN(1,100)” in a separate column.
4. Copy these values and paste as values in the ‘Data values ’column.
5. Click “Data” -> Data analysis ->Select “ Descriptive statistics” -> Click OK
6. Click “input range” -> select all values -> Tick “ summary statistics” -Click “OK”. Then the required
values get displayed.
Output : Mean, Median , Mode, Skewness, Kurtosis and other values are determined for the given data
set.
Experiment No 18(Output)
Experiment No 18
Using Microsoft Excel spread sheet draw a Skewness graph and kurtosis graph for randomly generated dataset.
Sl no Data values
1 68
2 89
3 53
4 54
5 7
6 35 68
7 60
8 8 Mean 42.95918
9 11 Standard Error 3.562245
10 19 Median 47
11 45 Mode 54
12 52 Standard Deviation 24.93572
13 29 Sample Variance 621.79
14 74 Kurtosis -0.72516
15 93 Skewness 0.161283
16 54 Range 91
17 94 Minimum 3
18 22 Maximum 94
19 26 Sum 2105
20 47 Count 49
21 52
22 47
23 36
24 76
25 14
26 60
27 32
28 54
29 53
30 22
31 4
32 36
33 59
34 3
35 38
36 52
37 69
38 42
39 12
40 33
41 62
42 48
43 70
44 14
45 17
46 62
47 17
48 86
49 60
50 3
PYTHON
Python program execution procedure
Step 1 :- Start -> “All Programs”-> Python3.10-> IDLE Python (3.10 64 bit)
Step 2:- The IDLE shell 3.10.5 window will open as shown below
Step 3 :- Click “File”-> New File-> A new untitled window will open.
Step 5 :- Click “File”-> save -> give the File name -> save. Then Click “Run”-> “Run module”.
Step 6 :- The result get displayed in IDLE shell 3.10.5 window as shown below .
Experiment No 19
Write a python program to convert Decimal to Binary , octal and Hexadecimal.
Aim : To write a python program to convert Decimal (Ex: 25) to Binary, Octal, and Hexadecimal.
Procedure :
1 . Open “New file” in “Python IDLE shell 3.10.5” and write the following python program code.
dec=25
b=bin(25)
o=oct(25)
h=hex(25)
print("decimal number is:",dec)
print("binary value is:",b)
print("octal value is: ",o)
print("hexadecimal value is: ",h)
Out put:
decimal number is: 25
binary value is: 0b11001
octal value is: 0o31
hexadecimal value is: 0x19
Experiment No 20
Write a python program to add 2 integers and 2 strings and print the result.
Aim : To write a python program to add 2 integers (Ex ; 200 and 800) and 2 strings (Govt Polytechnic,
Gauribidanur) and print the result
Procedure :
1 . Open “New file” in “Python IDLE shell 3.10.5” and write the following python program code.
Out put:
1 . Open “New file” in “Python IDLE shell 3.10.5” and write the following python program code.
n=int(input("Enter a number:"))
sum1=0
while(n>0):
sum1=sum1+n
n=n-1
print("The sum of first n natural numbers is",sum1)
Out put:
Result : Python program to find the sum of first 10 natural numbers was executed.
Experiment No 22
Write a python program to find whether the number is odd or even.
Aim : To write a python program to find whether the number is odd or even.
Procedure :
1 . Open “New file” in “Python IDLE shell 3.10.5” and write the following python program code.
Out put:
Enter any number:55
55 is an odd number
Result : Python program to find whether the number is odd or even was executed.
Experiment No 23
Write a python program to find the variance and standard deviation for the given data.
Aim : To write a python program to find the variance and standard deviation for the given data.
Procedure :
1 . Open “New file” in “Python IDLE shell 3.10.5” and write the following python program code.
import statistics
data = [486,148,565,122,534]
var = statistics.pvariance(data)
print("The variance is :",var)
sd = statistics.pstdev(data)
print("The standard deviation is: ", sd)
Out put:
The variance is : 37832
The standard deviation is: 194.50449866262733
Result : Python program to find the variance and standard deviation for the given data was executed.
Experiment No 24
Result : Python program to display student marks from the record was executed.
UNIT 1 STATISTICAL DATA COLLECTION AND TYPES
1. Define statistics.
Statistics is the science of collecting, presenting, analyzing, and interpreting the data , as
well as of making decisions based on such analysis.
2. Define sample.
Sample is a collection of elements from the population that represents the
characteristics of the population.
3. Define data .
A collection of facts such as numbers , words, measurements, observations,
description of things is called data.
14. Mention the basic rules for constructing frequency distribution table for large number of observations .
Following are the basic rules for constructing a frequency distribution table for large number of
observations.
i. Find the lowest and highest values of the variables.
ii. Decide the width of class intervals
iii. Count all the possible values of the variables in that class interval.
16. Mention the steps of enabling “ Analysis tool pack “ in Excel for windows.
1. Click the File tab, click Options, and then click the Add-Ins category.
2. In the Manage box, select Excel Add-ins and then click Go.
3. In the Add-Ins box, check the Analysis ToolPak check box, and then click OK.
a. If Analysis ToolPak is not listed in the Add-Ins available box, click Browse to locate it.
b. If you are prompted that the Analysis ToolPak is not currently installed on your computer,
click Yes to install it.
19. Mention the different types of graphs that are used to summarize and organize data.
Types of Graphical Representation
1. Line graph
2. Bar graph
3. Pie chart
4. Histogram
5. Frequency polygon
6. Box Plot
20. Define the following
a) Bar graph b) Pie chart c) Line graph d) Histogram d) Frequency polygon e) Box plot
f) Leaf stem plot.
a) Bar graph : A Bar graph is a graph that represents the categorical data using rectangular bars. There are
two types of Bar graphs.
i. Horizontal Bar graph
ii. Vertical Bar graph
b) Pie chart : A pie chart is a type of graph that represents the data in the circular graph. The slices of pie
show the relative size of the data.
c) Line graph : A line graph is a graph that uses lines to connect individual data points. A line graph displays
quantitative values over a specified time interval.
e) Frequency polygon : A frequency polygon is defined as a line graph created by joining all of the top points
of a histogram.
f) Box plot : A box and whisker plot (Box plot ) displays the five-number summary of a set of data. The five-
number summary is the minimum, first quartile, median, third quartile, and maximum.
g) Leaf stem plot : A stem and leaf plot is a unique table where values of data are split into a stem and leaf.
The first digit or digits will be written in stem and the last digit will be written in leaf.
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
UNIT 3 : MEASURE OF LOCATION AND DISPERSION
8. Mention the different Absolute measures of Dispersion for the given data.
The different types of Absolute measures of dispersion are
i) Range
ii) Variance
iii) Quartile deviation
iv) Mean Deviation
v) Standard Deviation
9. Mention the different Relative measures of Dispersion for the given data
The different types of Relative measures of dispersion are
i) Coefficient of Range
ii) Coefficient of variation
iii) Coefficient of Mean deviation
iv) Coefficient of Quartile deviation
10. Define the following with Excel formula.
a) Range b) Quartile c) Quartile deviation. d) Variance e) Standard Deviation.
a) Range : It is the difference between the maximum value and the minimum value given in a data
set.
Range = Maximum value – minimum value.
b) Quartile : Quartile refers to the values which divide the set into quarters.
c) Quartile deviation : It is half of the difference between First and Third quartile ((Q3-Q1)/2).
d) Variance: It gives us the info about how far the data is spread from its mean.
e) Standard deviation : In statistics, the standard deviation is a measure of the amount of variation or
dispersion of a set of values.
A low standard deviation indicates that the values tend to be close to the mean (also called
the expected value) of the set, while a high standard deviation indicates that the values are spread
out over a wider range.
Kurtosis in statistics describes the distribution of the data set. It shows to what extent the data
set points of a particular distribution differ from the data of a normal distribution. It is also used to
determine whether a distribution contains extreme values.
There are 3 types of Kurtosis.
1. Mesokurtic
2. Leptokurtic
3. Platykurtic
If the Kurtosis of data falls close to zero or equals zero, it is referred to as Mesokurtic. It
means that the data set follows a normal distribution
If the Kurtosis of data falls more than zero (kurtosis + ve), it is referred to as leptokurtic.
Leptokurtic has steep curves on both sides, indicating the large population of outliers in the data set.
If the Kurtosis of data falls less than zero (kurtosis - ve), it is referred to as Platykurtic.
Platykurtic has pale or flat curve, and that curve indicates the small number of outliers in a distribution.
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
UNIT 4 : INTRODUCTION TO PYTHON PROGRAMMING
1. What is Programming ?
Programming refers to a technological process for telling a computer which tasks to perform in order to
solve problems.
2. What is Python ?
Python is a high-level, general-purpose programming language.
1. Easy to Read, Learn and Write: Python is a high-level programming language that has English-like
syntax. This makes it easier to read and understand the code.
2. Improved Productivity : Python is a very productive language. Due to the simplicity of Python,
developers can focus on solving the problem. They don’t need to spend too much time in
understanding the syntax or behavior of the programming language. You write less code and get
more things done.
3. Interpreted Language : Python is an interpreted language which means that Python
directly executes
the code line by line. In case of any error, it stops further execution and reports back the error
which has occurred. Python shows only one error even if the program has multiple errors. This
makes debugging easier.
4. Dynamically Typed : Python doesn’t know the type of variable until we run the code. It
automatically
assigns the data type during execution. The programmer doesn’t need to worry about declaring
variables and their data types.
5. Free and Open-Source : Python comes under the OSI approved open-source license. This makes
it free to use and distribute. You can download the source code, modify it and even distribute
your version of Python. This is useful for organizations that want to modify some specific behavior
and use their version for development.
6. Vast Libraries Support : The standard library of Python is huge, you can find almost all the functions
needed for your task. So, you don’t have to depend on external libraries.But even if you do,
a Python package manager (pip) makes things easier to import other great packages from
the Python package index (PyPi). It consists of over 200,000 packages.
7. Portability : In many languages like C/C++, you need to change your code to run the program on
different platforms. That is not the same with Python. You only write once and run it anywhere.
a) Numbers in Python:
There are three numerical types in Python.
i) Int :- Int, or integer, is a whole number, positive or negative, without
decimals, of unlimited length.
Ex: x = 35656222554887711
ii) Float :- Float, or "floating point number" is a number, positive or negative,
containing one or more decimals. Ex : x = 1.10
iii) Complex :- Complex numbers are written with a+ib form where a is real part
and b is imaginary part .
b) String in Python : String is a collection of alphabets, words or other characters. It is one of
the primitive data structures and are the building blocks for data manipulation. Python has a
built-in string class named str . Strings in python are surrounded by either single quotation
marks, or double quotation marks.
c) ) Python Lists : A list in Python is used to store the sequence of various types of data. A
list can be defined as a collection of values or items of different types. The items in the list are
separated with the comma (,) and enclosed with the square brackets [].
d) Python Dictionary: A dictionary is a kind of data structure that stores items in key-value pairs.
A key is a unique identifier for an item, and a value is the data associated with that key.
Dictionaries often store information such as words and definitions, but they can be used for much
more. Dictionaries are mutable in Python, which means they can be changed after they are
created. They are also unordered, indicating the items in a dictionary are not stored in any
particular order.
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx