0% found this document useful (0 votes)
15 views3 pages

WEEK 1 - 2 - Intro To WEKA

The document discusses exercises using WEKA to analyze datasets. It includes loading and exploring weather and labor datasets, identifying attribute types and values. It also discusses downloading and preparing COVID-19 and dengue hotspot data for analysis in WEKA.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
15 views3 pages

WEEK 1 - 2 - Intro To WEKA

The document discusses exercises using WEKA to analyze datasets. It includes loading and exploring weather and labor datasets, identifying attribute types and values. It also discusses downloading and preparing COVID-19 and dengue hotspot data for analysis in WEKA.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

ITS665/ISP565 DATA MINING 2022

HARIS ISKANDAR BIN ZEIFERI IDZLIN 2022827876 RAS2034A

LAB EXERCISE

INTRODUCTION INTO WEKA


Exercise 1:
i. Using weather.numeric dataset, find the min, max and average for each variable.

variable Max Min Average


Temperature 85 64 73.571
Humidity 96 65 81.643

ii. Using weather.numeric dataset, find the frequencies for each value for each variable.

Outlook Frequency
Sunny 5
Overcast 4
Rainy 5

Windy Frequency
TRUE 6
FALSE 8

Play Frequency
YES 9
NO 5

Exercise 2:
No Activity
Load the labor.arff file into WEKA. The file is in the data folder of WEKA. This dataset classifies individuals
described by a set of attributes.
1. How many instances and attributes contained in the dataset?

Number of 57
instances
Number of 17
attributes

2. Identify type of attribute for vacation, the values of vacation and the number of instances for each value.

Type : Nominal
Values of vacation : 3
Number of instances: 18,17,16

Prepared and Updated by Dr Sofianita / Dr Shuzlina Semester Oct. 2021


ITS665/ISP565 DATA MINING 2022

3. For the third value of attribute vacation, give the count of each class.

Vacation = generous
Class=good – 14
Class=bad – 2

Exercise 3:
i. Go to the following link: https://www.data.gov.my/data/ms_MY/dataset/senarai-lokaliti-hotspot-
denggi-di-malaysia
ii. Download the dataset
iii. Try to prepare or process the data until it can be uploaded to WEKA
Note: If you still unable to do it, see this link: https://youtu.be/itixU0jIX3Q

Prepared and Updated by Dr Sofianita / Dr Shuzlina Semester Oct. 2021


ITS665/ISP565 DATA MINING 2022

Exercise 4:
i. Find any related data of COVID19 using Kaggle.
ii. Download the dataset.
iii. Prepare the dataset until it can be uploaded to WEKA. Report the problems that you encounter and
briefly explain how you solve it.
iv. Identify the knowledge that you think can be discovered from this dataset.
e.g., of Knowledge: Trends of COVID19 outbreak (See Mehrota & Agarwal, 2021)

Mehrotra, A., & Agarwal, R. (2021). A Review of Use of Data Mining during COVID-19 Pandemic. Turkish Journal of Computer and
Mathematics Education (TURCOMAT), 12(6), 4547-4552.

Found
problem
when
opening the
data.

Prepared and Updated by Dr Sofianita / Dr Shuzlina Semester Oct. 2021

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy