WEEK 1 - 2 - Intro To WEKA
LAB EXERCISE
ii. Using the weather.numeric dataset, find the frequency of each value of each variable.
Outlook Frequency
Sunny 5
Overcast 4
Rainy 5
Windy Frequency
TRUE 6
FALSE 8
Play Frequency
YES 9
NO 5
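The frequencies above can be read from WEKA's Preprocess tab. As a cross-check outside WEKA, the counting can be sketched in Python. This is a minimal sketch, not WEKA's own parser: the function name nominal_frequencies and the four-row SAMPLE text are hypothetical, and the parser assumes simple ARFF files with unquoted attribute names.

```python
import csv
from collections import Counter


def nominal_frequencies(arff_text):
    """Count value frequencies for each nominal attribute in simple ARFF text."""
    attributes = []  # (name, is_nominal) in declaration order
    counts = {}
    in_data = False
    for raw in arff_text.splitlines():
        line = raw.strip()
        if not line or line.startswith("%"):  # skip blanks and comments
            continue
        if not in_data:
            lower = line.lower()
            if lower.startswith("@attribute"):
                # Assumption: attribute names contain no spaces or quotes.
                _, name, spec = line.split(None, 2)
                is_nominal = spec.strip().startswith("{")
                attributes.append((name, is_nominal))
                if is_nominal:
                    counts[name] = Counter()
            elif lower.startswith("@data"):
                in_data = True
            continue
        values = next(csv.reader([line], skipinitialspace=True))
        for (name, is_nominal), value in zip(attributes, values):
            if is_nominal and value != "?":  # "?" marks a missing value
                counts[name][value.strip()] += 1
    return counts


# Hypothetical excerpt in the style of weather.numeric, for illustration only.
SAMPLE = """\
@relation sample
@attribute outlook {sunny, overcast, rainy}
@attribute temperature numeric
@attribute windy {TRUE, FALSE}
@attribute play {yes, no}
@data
sunny,85,FALSE,no
overcast,83,FALSE,yes
rainy,70,TRUE,yes
sunny,72,TRUE,no
"""

freqs = nominal_frequencies(SAMPLE)
for attribute, counter in freqs.items():
    print(attribute, dict(counter))
```

To check the real answers, read the text of weather.numeric.arff from WEKA's data folder and pass it to the function instead of SAMPLE.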
Exercise 2:
Load the labor.arff file into WEKA. The file is in the data folder of WEKA. This dataset classifies individuals
described by a set of attributes.
1. How many instances and attributes does the dataset contain?
Number of instances: 57
Number of attributes: 17
2. Identify the type of the attribute vacation, the values of vacation, and the number of instances for each value.
Type: Nominal
Values of vacation: 3
Number of instances: 18, 17, 16
3. For the third value of attribute vacation, give the count of each class.
Vacation = generous
Class=good – 14
Class=bad – 2
Exercise 3:
i. Go to the following link: https://www.data.gov.my/data/ms_MY/dataset/senarai-lokaliti-hotspot-denggi-di-malaysia
ii. Download the dataset
iii. Prepare or preprocess the data until it can be loaded into WEKA.
Note: If you are still unable to do it, see this link: https://youtu.be/itixU0jIX3Q
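One possible way to prepare a downloaded CSV file is to convert it to ARFF before opening it in WEKA. The sketch below is only an illustration under stated assumptions: the function csv_to_arff, the relation name, and the sample columns (state, locality, cases) are hypothetical and are not taken from the actual dengue hotspot file. It assumes that the first row is a header, that every row has the same number of columns, and that empty cells mean missing values.

```python
import csv
import io


def _is_number(text):
    """Return True if the cell can be parsed as a number."""
    try:
        float(text)
        return True
    except ValueError:
        return False


def _quote(value):
    """Single-quote a name or label, escaping backslashes and quotes."""
    escaped = value.replace("\\", "\\\\").replace("'", "\\'")
    return "'" + escaped + "'"


def csv_to_arff(csv_text, relation="dataset"):
    """Convert CSV text (header row first) into ARFF text."""
    rows = [row for row in csv.reader(io.StringIO(csv_text)) if row]
    header = [name.strip() for name in rows[0]]
    data = [[cell.strip() for cell in row] for row in rows[1:]]

    lines = ["@relation " + _quote(relation), ""]
    numeric = []
    for i, name in enumerate(header):
        column = [row[i] for row in data if row[i] != ""]
        # A column is numeric only if every non-empty cell parses as a number.
        is_num = bool(column) and all(_is_number(v) for v in column)
        numeric.append(is_num)
        if is_num:
            lines.append("@attribute %s numeric" % _quote(name))
        else:
            labels = ",".join(_quote(v) for v in sorted(set(column)))
            lines.append("@attribute %s {%s}" % (_quote(name), labels))

    lines += ["", "@data"]
    for row in data:
        cells = []
        for i, value in enumerate(row):
            if value == "":
                cells.append("?")  # ARFF marker for a missing value
            elif numeric[i]:
                cells.append(value)
            else:
                cells.append(_quote(value))
        lines.append(",".join(cells))
    return "\n".join(lines) + "\n"


# Hypothetical sample rows, for illustration only.
SAMPLE_CSV = "state,locality,cases\nSelangor,Taman A,12\nJohor,Kampung B,\n"
arff = csv_to_arff(SAMPLE_CSV, relation="hotspots")
print(arff)
```

Treating every non-numeric column as nominal keeps the sketch short; a free-text column with many distinct values may be better declared as a string attribute or removed before loading.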
Exercise 4:
i. Find a COVID-19-related dataset on Kaggle.
ii. Download the dataset.
iii. Prepare the dataset until it can be loaded into WEKA. Report the problems that you encounter and
briefly explain how you solved them.
iv. Identify the knowledge that you think can be discovered from this dataset.
Example of knowledge: trends of the COVID-19 outbreak (see Mehrotra & Agarwal, 2021).
Mehrotra, A., & Agarwal, R. (2021). A Review of Use of Data Mining during COVID-19 Pandemic. Turkish Journal of Computer and
Mathematics Education (TURCOMAT), 12(6), 4547-4552.
Found problem when opening the data.