0% found this document useful (0 votes)

246 views3 pages

Activity 1 PDF

Data mining is defined as the process of extracting useful information and patterns from large datasets. It involves analyzing massive amounts of data to discover insights that can help businesses solve problems or seize opportunities. Data mining results from the evolution of both database technology and machine learning research, combining disciplines like algorithms, statistics, and pattern recognition to extract knowledge from data in a more complex way than simple transformations. The key steps in data mining as a knowledge discovery process are data cleaning, integration, selection, transformation, mining patterns, and presenting knowledge. A data warehouse differs from a standard database in that it is a central repository that integrates data from multiple sources for analysis to guide management decisions.

Uploaded by

John Michael Reyes

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

246 views3 pages

Activity 1 PDF

Uploaded by

John Michael Reyes

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

Reyes, John Michael E.

Activity 1

1. What is data mining? In your answer, address the following:

- Data mining is defined as a process used to extract usable data from a larger set
of any raw data. It implies analysing data patterns in large batches of data using
one or more software. Data mining is also known as Knowledge Discovery in
Data (KDD). Also refers to the process of extracting or mining interesting
knowledge or patterns from large amounts of data. It analyzes massive volumes
of data to discover insights that help businesses solve problems, mitigate risks,
or seize new opportunities.

a. Is it another hype?

- From what I know and from what I’ve read and learned, it is not another
hype.Data mining grows because of its wide availability to everyone. It became
so vast because there is too much data that can be turned into information or
knowledge. It is somewhat we can always see from the future. We all know that
there will be changes, new things will pop up and represent themselves and
there is no reason to believe that there will be no changes in the future.

b. Is it a simple transformation or application of technology developed from

databases, statistics, machine learning, and pattern recognition?

- I think it is not a simple transformation or application of technology developed

from databases, statistics, machine learning, and pattern recognition. It is more
than that, because it includes combinations or amalgamation of disciplines. Like
the one that is presented in our lecture earlier which are algorithm, database
technology, statistics, machine learning, visualization, pattern recognition and
other disciplines. I think it is one of the most complex transformations or
applications of technology, it can affect what will happen in the future and you
can tell that it is really important. It is too vast that it can’t be called just a simple
transformation.

c. We have presented a view that data mining is the result of the evolution of
database technology. Do you think that data mining is also the result of the
evolution of machine learning research? Can you present such views based on
the historical progress of this discipline? Address the same for the fields of
statistics and pattern recognition.

- We all know that it all started at some data collection that guide to an efficient
development that can be used with such data that has been stored. I can say it is
somewhat a result of the evolution of machine learning because it is also capable
of analyzing and creating something new. It is what we call a relationship with
each other. Data mining is used on an existing dataset to find patterns. Machine
learning, on the other hand, is trained on a 'training’ data set, which teaches the
computer how to make sense of data, and then to make predictions about new
data sets.

d. Describe the steps involved in data mining when viewed as a process of

knowledge discovery.

- Data cleaning, a process of detecting and correcting inconsistent data.

- Data integration, from the word integration, it is where multiple data sources may
be combined.
- Data selection, where data relevant to the analysis task are retrieved from the
database
- Data transformation, where data are transformed or converted into forms
appropriate for mining or into one format.
- Data mining is defined as a process used to extract usable data from a larger set
of any raw data.
- Pattern evaluation, a process that identifies the truly interesting patterns
representing knowledge based on some interestingness measures.
- Knowledge presentation, where visualization and knowledge representation
techniques are used to present the mined knowledge to the user.

2. How is a data warehouse different from a database? How are they similar?

- First is a data warehouse is I think what we call a repository of information more

than a database. It is a large store of data accumulated from a wide range of
sources within a company and used to guide management decisions. It is a
central repository of integrated data from one or more disparate sources. While
Database is an organized collection of data, generally stored and accessed
electronically from a computer system. It is also a collection of structured data to
make it easily accessible, manageable and update. The similarities are they are
both storage and both of them have been storing huge amounts of persistent
data.

3. Define each of the following functionalities: characterization, discrimination,

association and correlation analysis, classification, regression, clustering and
outlier analysis.

- Characterization, is a summarization of general features of objects in a target

class, and produces what is called characteristic rules.This refers to summarizing
data of class under study.
- Discrimination, it refers to the mapping or classification of a class with some
predefined group or class. Somewhat a comparison of the general features of
target class data objects with the general features of objects from one or a set of
contrasting classes.
- Association and correlation analysis, Correlation analysis explores the
association between two or more variables and makes inferences about the
strength of the relationship. Technically, association refers to any relationship
between two variables, whereas correlation is often used to refer only to a linear
relationship between two variables.
- Classification, it is the organization of data in given classes and it predicts the
class of objects whose class label is unknown. Its objective is to find a derived
model that describes and distinguishes data classes or concepts.
- Regression, it is a technique used to fit an equation to a dataset. A function that
predicts a number.
- Clustering and outlier analysis, it is similar to classification, clustering is the
organization of data in classes.However, unlike classification, in clustering, class
labels are unknown and it is up to the clustering algorithm to discover acceptable
classes. It analyzes data objects without consulting a known class label. The
objects are clustered or grouped based on the principle of maximizing the
intraclass similarity and minimizing the interclass similarity. Each cluster that is
formed can be viewed as a class of objects. While outlier analysis are data
elements that cannot be grouped in a given class or cluster.They are often very
important to identify. Sometimes outliers are being discarded and considered as
noise in some applications but still can still be usable.

BCA Data Mining
No ratings yet
BCA Data Mining
116 pages
DMM Finals
No ratings yet
DMM Finals
30 pages
DW and DM Notes
No ratings yet
DW and DM Notes
89 pages
DMA QB Solved
No ratings yet
DMA QB Solved
42 pages
DMW - Unit 1
No ratings yet
DMW - Unit 1
21 pages
Unit 1 Data Mining Task
No ratings yet
Unit 1 Data Mining Task
7 pages
Data Mining Unit-1
No ratings yet
Data Mining Unit-1
59 pages
DWDM Unit-II Notes
No ratings yet
DWDM Unit-II Notes
29 pages
How To Upgrade SAP Kernel Windows
No ratings yet
How To Upgrade SAP Kernel Windows
5 pages
Unit III
No ratings yet
Unit III
101 pages
TY 2022 Pattern Computer Engineering
No ratings yet
TY 2022 Pattern Computer Engineering
93 pages
DM-Unit-I Introduction To Association-1
No ratings yet
DM-Unit-I Introduction To Association-1
97 pages
Unit-2 Introduction To Data Mining
100% (1)
Unit-2 Introduction To Data Mining
11 pages
Subject Data Warehouse
No ratings yet
Subject Data Warehouse
42 pages
Data Mining - Digital Notes (Unit I To V)
No ratings yet
Data Mining - Digital Notes (Unit I To V)
85 pages
Data Mining
No ratings yet
Data Mining
157 pages
p144 Data Mining
100% (3)
p144 Data Mining
11 pages
CHAPTER1 Datamining
No ratings yet
CHAPTER1 Datamining
33 pages
LECTURE NOTES ON DATA MINING and DATA WA
No ratings yet
LECTURE NOTES ON DATA MINING and DATA WA
84 pages
CA Business Intelligence For CAServDeskMgr PDF
No ratings yet
CA Business Intelligence For CAServDeskMgr PDF
395 pages
DMDW Unit1
No ratings yet
DMDW Unit1
31 pages
Unit-4 DWM
No ratings yet
Unit-4 DWM
73 pages
Data Mining Notes UNIT I
No ratings yet
Data Mining Notes UNIT I
21 pages
Data Mining-CH5
No ratings yet
Data Mining-CH5
49 pages
Unit 1 Mining
No ratings yet
Unit 1 Mining
15 pages
DM Module1 Notes
No ratings yet
DM Module1 Notes
25 pages
Module 1
No ratings yet
Module 1
41 pages
Advanced Database System Chapter 5
No ratings yet
Advanced Database System Chapter 5
22 pages
Unit 1
No ratings yet
Unit 1
21 pages
Unit 1 Datamining
No ratings yet
Unit 1 Datamining
16 pages
DWDMunit 2
No ratings yet
DWDMunit 2
27 pages
Data Mining Real
No ratings yet
Data Mining Real
19 pages
HBase (Unit 4)
No ratings yet
HBase (Unit 4)
37 pages
Data Mining Notes
No ratings yet
Data Mining Notes
25 pages
Data Mining and Its Applications
No ratings yet
Data Mining and Its Applications
60 pages
Yunita 2021 J. Phys. Conf. Ser. 1898 012044
No ratings yet
Yunita 2021 J. Phys. Conf. Ser. 1898 012044
15 pages
Screenshot 2023-10-19 at 11.36.57
No ratings yet
Screenshot 2023-10-19 at 11.36.57
27 pages
DM-unit 1
No ratings yet
DM-unit 1
22 pages
My Resume
No ratings yet
My Resume
1 page
Unit-1 Notes
No ratings yet
Unit-1 Notes
24 pages
Data Mining
No ratings yet
Data Mining
25 pages
DWM 4
No ratings yet
DWM 4
23 pages
Sites For Best Learning
No ratings yet
Sites For Best Learning
2 pages
Query Processing
No ratings yet
Query Processing
3 pages
Neso Note (5 DBMS Roles Including)
No ratings yet
Neso Note (5 DBMS Roles Including)
14 pages
Semantic Search Demo Booklet
No ratings yet
Semantic Search Demo Booklet
20 pages
Research Article
No ratings yet
Research Article
5 pages
Data Minng
No ratings yet
Data Minng
20 pages
CS-DM Module - 1
No ratings yet
CS-DM Module - 1
27 pages
Chapter 06 - ABAP Repository Information System
No ratings yet
Chapter 06 - ABAP Repository Information System
14 pages
DBMS
No ratings yet
DBMS
19 pages
UNIT-1 Introduction To Data Mining
No ratings yet
UNIT-1 Introduction To Data Mining
29 pages
Csi Zg518 Ec-3r First Sem 2023-2024
No ratings yet
Csi Zg518 Ec-3r First Sem 2023-2024
8 pages
Data Mining Tutorials
No ratings yet
Data Mining Tutorials
52 pages
Python - SQL - Project Class 12th
No ratings yet
Python - SQL - Project Class 12th
6 pages
Unit I DATA MINING AAGAC
No ratings yet
Unit I DATA MINING AAGAC
27 pages
6 TheRealTimeFaceDetectionandRecognitionSystem
No ratings yet
6 TheRealTimeFaceDetectionandRecognitionSystem
48 pages
Lecture Notes 1.1 & 1.2
No ratings yet
Lecture Notes 1.1 & 1.2
8 pages
ITP4903 Laboratory 8 (v2.1 - LWL) - Answer Sheet
No ratings yet
ITP4903 Laboratory 8 (v2.1 - LWL) - Answer Sheet
4 pages
Soln 1
100% (1)
Soln 1
6 pages
Data Mining 1 2 and 3
No ratings yet
Data Mining 1 2 and 3
20 pages
E - 20221018 MT940 Conf
No ratings yet
E - 20221018 MT940 Conf
4 pages
Monthly Sales Data of Air Compressors at Kirkland Industries
No ratings yet
Monthly Sales Data of Air Compressors at Kirkland Industries
16 pages
Whats App
No ratings yet
Whats App
23 pages
Data Binding
No ratings yet
Data Binding
9 pages
DM Module1
No ratings yet
DM Module1
15 pages
Data Warehousing & Mining: Unit - Ii
No ratings yet
Data Warehousing & Mining: Unit - Ii
41 pages
Data Mining
No ratings yet
Data Mining
7 pages
Error "Switch MDG - DRF - MAIN - 05 Is Inactive" When Running DRFIMG
No ratings yet
Error "Switch MDG - DRF - MAIN - 05 Is Inactive" When Running DRFIMG
2 pages
Data Mining - Prashant
No ratings yet
Data Mining - Prashant
10 pages
18mca52c U1
No ratings yet
18mca52c U1
17 pages
DM Unit1 Intro
No ratings yet
DM Unit1 Intro
12 pages
Interpretation:: Monthly Sales Data of Air Compressors at Kirkland Industries
No ratings yet
Interpretation:: Monthly Sales Data of Air Compressors at Kirkland Industries
4 pages
Introduction To Data Mining
No ratings yet
Introduction To Data Mining
11 pages
Module 3: File and Database Organization: Test-Your-Knowledge Questions
No ratings yet
Module 3: File and Database Organization: Test-Your-Knowledge Questions
21 pages
Storage Space of Disk: Average Compression Ratio by LZH Alogrithm (Not Correct Nos) in %
No ratings yet
Storage Space of Disk: Average Compression Ratio by LZH Alogrithm (Not Correct Nos) in %
2 pages
HDDScan Eng PDF
No ratings yet
HDDScan Eng PDF
18 pages
Project PDF
No ratings yet
Project PDF
4 pages
Table Index For RTCIS
No ratings yet
Table Index For RTCIS
24 pages
Technical Education Department: Academic Year: 2019 - 2020
No ratings yet
Technical Education Department: Academic Year: 2019 - 2020
10 pages
Data Storage Technologies and Networks - (Quiz 1)
No ratings yet
Data Storage Technologies and Networks - (Quiz 1)
5 pages
Act 2 - Ethics
No ratings yet
Act 2 - Ethics
1 page
Types of Taxes (Whichever Is Applicable) Per Month Per Annum
No ratings yet
Types of Taxes (Whichever Is Applicable) Per Month Per Annum
1 page
CS-505 Introduction To Data Mining Exercises: Page 1 of 4
No ratings yet
CS-505 Introduction To Data Mining Exercises: Page 1 of 4
4 pages
Data Mining Models and Tasks
No ratings yet
Data Mining Models and Tasks
6 pages
Q.1. What Is Data Mining?
No ratings yet
Q.1. What Is Data Mining?
15 pages
Useful SAP System Transactions
No ratings yet
Useful SAP System Transactions
9 pages
DATA MINING-Knowledge Discovery in Databases
No ratings yet
DATA MINING-Knowledge Discovery in Databases
6 pages
Sheet 1 Solution1
No ratings yet
Sheet 1 Solution1
4 pages
Data Mining Is Defined As The Procedure of Extracting Information From Huge Sets of Data
No ratings yet
Data Mining Is Defined As The Procedure of Extracting Information From Huge Sets of Data
6 pages
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
From Everand
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
Artem Kovera
No ratings yet
SaaS Security Questionnaires
No ratings yet
SaaS Security Questionnaires
6 pages
Database Testing
No ratings yet
Database Testing
3 pages
Pattern Recognition: Fundamentals and Applications
From Everand
Pattern Recognition: Fundamentals and Applications
Fouad Sabry
No ratings yet
Data Mining: Fundamentals and Applications
From Everand
Data Mining: Fundamentals and Applications
Fouad Sabry
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Activity 1 PDF

Uploaded by

Activity 1 PDF

Uploaded by

Reyes, John Michael E.

1. What is data mining? In your answer, address the following:

b. Is it a simple transformation or application of technology developed from

- I think it is not a simple transformation or application of technology developed

d. Describe the steps involved in data mining when viewed as a process of

- Data cleaning, a process of detecting and correcting inconsistent data.

- First is a data warehouse is I think what we call a repository of information more

3. Define each of the following functionalities: characterization, discrimination,

- Characterization, is a summarization of general features of objects in a target

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Activity 1 PDF

Uploaded by

Activity 1 PDF

Uploaded by

Reyes, John Michael E.

1. What is data mining? In your answer, address the following:

b. Is it a simple transformation or application of technology developed from

- I think it is not a simple transformation or application of technology developed

d. Describe the steps involved in data mining when viewed as a process of

- Data cleaning, a process of detecting and correcting inconsistent data.

- First is a data warehouse is I think what we call a repository of information more

3. Define each of the following functionalities: characterization, discrimination,

- Characterization, ​is a summarization of general features of objects in a target

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

- Characterization, is a summarization of general features of objects in a target