Introduction to Data Mining
DATA MINING
PTIK
Week 10
Topics
12/01/23
Background of Data Mining (1)
Data Abundance
– Automated tools and database technology generate data continuously,
so the volume recorded in databases and other storage media
keeps growing
Background of Data Mining (2)
Top 10 Largest Databases, 2012
No  Organization                   Data Volume
1   World Data Centre for Climate  • 20 terabytes of web data
                                   • 6 petabytes of additional data
3   AT&T                           • 23 terabytes of information
                                   • 1.9 trillion phone call records
4   Google                         • 1 million searches per day
Source: http://www.siliconindia.com/news/enterpriseit/Top-10-Largest-Databases-in-the-World-nid-118891-cid-7.html
Global Data Growth (1)
Global Data Growth (2)
Global Data Growth (3)
"It is projected that just four years from now, the world’s
information base will be doubling in size every 11 hours. So rapid is
the growth in the global stock of digital data that the very
vocabulary used to indicate quantities has had to expand to keep
pace. A decade or two ago, professional computer users and
managers worked in kilobytes and megabytes. Now school children
have access to laptops with tens of gigabytes of storage, and
network managers have to think in terms of the terabyte (1,000
gigabytes) and the petabyte (1,000 terabytes). Beyond those lie the
exabyte, zettabyte and yottabyte, each a thousand times bigger
than the last."
(IBM Global Technology Services white paper published in July 2006, titled "The toxic terabyte: How data-dumping threatens
business efficiency.")
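The unit ladder in the quotation can be checked with a few lines of arithmetic. The following is a minimal Python sketch, assuming decimal units (each step is a factor of 1,000, as the quotation states). The names UNITS, bytes_in, and convert are illustrative and do not come from the slides.

```python
# Decimal storage units; each one is 1,000 times the previous one.
UNITS = ["kilobyte", "megabyte", "gigabyte", "terabyte",
         "petabyte", "exabyte", "zettabyte", "yottabyte"]


def bytes_in(unit: str) -> int:
    """Return the number of bytes in one decimal unit, e.g. 'terabyte' -> 10**12."""
    return 1000 ** (UNITS.index(unit) + 1)


def convert(amount: float, src: str, dst: str) -> float:
    """Convert an amount expressed in unit src into unit dst."""
    return amount * bytes_in(src) / bytes_in(dst)
```

For example, convert(6, "petabyte", "terabyte") gives 6000.0, so 6 petabytes corresponds to 6,000 terabytes.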
Just a Joke...
Definition of Data Mining
Origins of Data Mining
Types of Data in Data Mining
Relationship Between DM, DB, and DW
Data Mining & Business Intelligence
Layers from top to bottom; the potential to support business decisions increases toward the top (role in parentheses):
Making Decisions (End User)
Data Exploration: Statistical Analysis, Querying and Reporting
Data Warehouses / Data Marts: OLAP, MDA (DBA)
Data Sources: Paper, Files, Information Providers, Database Systems, OLTP
Data Mining Tasks
Predictive Methods
– Use some variables to predict the unknown or future values
of other variables
Examples:
Classification
Regression
Deviation Detection
Descriptive Methods
– Find human-interpretable patterns that describe
the data
Examples:
Clustering
Association Rule Discovery
Sequential Pattern Discovery
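To make the distinction between the two families concrete, here is a small Python sketch using only the standard library. The first half shows classification (a predictive task) with a 1-nearest-neighbour rule; the second half shows association rule discovery (a descriptive task) by computing support and confidence. All records, transactions, and function names are hypothetical toy examples, not data or methods from this course.

```python
from math import dist

# --- Predictive: classification with a 1-nearest-neighbour rule ---
# Hypothetical training records: (age, income) -> label "buys product?"
train = [
    ((25, 3.0), "no"),
    ((30, 4.0), "no"),
    ((45, 9.0), "yes"),
    ((50, 12.0), "yes"),
]


def predict(x):
    """Predict the unknown label of x from its closest training record."""
    _, label = min(train, key=lambda rec: dist(rec[0], x))
    return label


# --- Descriptive: support and confidence of an association rule ---
# Hypothetical market-basket transactions.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk"},
]


def support(itemset):
    """Fraction of transactions that contain every item in itemset."""
    return sum(itemset <= t for t in transactions) / len(transactions)


def confidence(lhs, rhs):
    """Of the transactions containing lhs, the fraction that also contain rhs."""
    return support(lhs | rhs) / support(lhs)
```

Here predict((28, 3.5)) returns a label for a record that was never seen, while confidence({"bread"}, {"milk"}) only summarises a pattern already present in the data. This is the difference between predicting an unknown value and describing the data.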
Data Mining Functionality (1)
Data Mining Applications (1)
Major Issues