0% found this document useful (0 votes)

3 views5 pages

Data Mining Unit-i

Data mining is the process of extracting insights from large datasets using various techniques, aimed at discovering hidden patterns for informed decision-making. It encompasses functionalities like data characterization, discrimination, association analysis, and classification, and can be classified based on databases, knowledge types, techniques, and applications. The Knowledge Discovery in Databases (KDD) process involves steps such as selection, pre-processing, transformation, data mining, interpretation, evaluation, and deployment, with advantages including improved decision-making and efficiency, but also facing challenges like privacy concerns and data quality issues.

Uploaded by

kundurathin06101964

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views5 pages

Data Mining Unit-i

Uploaded by

kundurathin06101964

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

DATA MINING (UNIT-I)

1. What is Data Mining?

Ans.Data mining is the process of extracting knowledge or insights from large
amounts of data using various statistical and computational techniques. The data
can be structured, semi-structured or unstructured, and can be stored in various
forms such as databases, data warehouses, and data lakes.
The primary goal of data mining is to discover hidden patterns and relationships in
the data that can be used to make informed decisions or predictions.

2. What Is Motivated Data Mining? Why Is It Important?

Ans. Data mining has attracted a great deal of attention in the information industry
and in society as a whole in recent years, due to the wide availability of huge
amounts of data and imminent need for turning such data into information and
knowledge. The information and. knowledge gained, can be used for applications
ranging, from market analysis, fraud detection, and customer retention, to production
control and science exploration.

3. What are the functionalities in data mining?

Ans. There are various data mining functionalities which are as follows −
 Data characterization − It is a summarization of the general
characteristics of an object class of data. The data corresponding to the
user-specified class is generally collected by a database query. The
output of data characterization can be presented in multiple forms.
 Data discrimination − It is a comparison of the general characteristics
of target class data objects with the general characteristics of objects
from one or a set of contrasting classes. The target and contrasting
classes can be represented by the user, and the equivalent data
objects fetched through database queries.
 Association Analysis − It analyses the set of items that generally
occur together in a transactional dataset. There are two parameters
that are used for determining the association rules −
o It provides which identifies the common item set in the
database.
o Confidence is the conditional probability that an item
occurs in a transaction when another item occurs.
 Classification − Classification is the procedure of discovering a model
that represents and distinguishes data classes or concepts, for the
objective of being able to use the model to predict the class of objects
whose class label is anonymous.
 Evolution analysis − It defines the trends for objects whose behaviour
changes over some time.
4.What are the classification of data mining?
Ans. Data mining can be classified into the following systems:

Classification Based on the mined Databases: A data mining system can

be classified based on the types of databases that have been mined. A
database system can be further segmented based on distinct principles,
such as data models, types of data, etc., which further assist in classifying
a data mining system.

Classification Based on the type of Knowledge Mined: A data mining

system categorized based on the kind of knowledge mind may have the
following functionalities:

1. Characterization
2. Discrimination
3. Association and Correlation Analysis
4. Classification
5. Prediction
6. Outlier Analysis
7. Evolution Analysis

Classification Based on the Techniques Utilized: A data mining system

can also be classified based on the type of techniques that are being
incorporated. These techniques can be assessed based on the
involvement of user interaction involved or the methods of analysis
employed.

Classification Based on the Applications Adapted: Data mining systems

classified based on adapted applications adapted are as follows:

1. Finance
2. Telecommunications
3. DNA
4. Stock Markets
5. E-mail

5.What are the integration of a data mining system with a database?

Ans. The list of Integration Schemes is as follows −
 No Coupling − In this scheme, the data mining system does not utilize
any of the database or data warehouse functions. It fetches the data
from a particular source and processes that data using some data
mining algorithms. The data mining result is stored in another file.
 Loose Coupling − In this scheme, the data mining system may use
some of the functions of database and data warehouse system. It
fetches the data from the data respiratory managed by these systems
and performs data mining on that data. It then stores the mining result
either in a file or in a designated place in a database or in a data
warehouse.
 Semi−tight Coupling − In this scheme, the data mining system is
linked with a database or a data warehouse system and in addition to
that, efficient implementations of a few data mining primitives can be
provided in the database.
 Tight coupling − In this coupling scheme, the data mining system is
smoothly integrated into the database or data warehouse system. The
data mining subsystem is treated as one functional component of an
information system.

6.What are the issues in data mining?

Ans. The major issues regarding data mining are-
i)Mining Methodology and User Interaction- It refers to the following kinds of
issues
 Mining different kinds of knowledge in databases − Different users may be
interested in different kinds of knowledge. Therefore it is necessary for data
mining to cover a broad range of knowledge discovery task.
 Interactive mining of knowledge at multiple levels of abstraction − The
data mining process needs to be interactive because it allows users to focus
the search for patterns, providing and refining data mining requests based on
the returned results.
 Incorporation of background knowledge − To guide discovery process and
to express the discovered patterns, the background knowledge can be used.
 Pattern evaluation − The patterns discovered should be interesting because
either they represent common knowledge or lack novelty.

ii)Performance Issues- There can be performance-related issues such as follows −

 Efficiency and scalability of data mining algorithms − In order to
effectively extract the information from huge amount of data in
databases, data mining algorithm must be efficient and scalable.
 Parallel, distributed, and incremental mining algorithms − The
factors such as huge size of databases, wide distribution of data, and
complexity of data mining methods motivate the development of parallel
and distributed data mining algorithms.

iii)Diverse Data Types Issues-

 Handling of relational and complex types of data − The database may
contain complex data objects, multimedia data objects, spatial data, temporal
data etc. It is not possible for one system to mine all these kind of data.
 Mining information from heterogeneous databases and global
information systems − The data is available at different data sources on
LAN or WAN. These data source may be structured, semi structured or
unstructured.

6.Explain KDD process.

Ans. KDD (Knowledge Discovery in Databases) is a process that involves
the extraction of useful, previously unknown, and potentially valuable
information from large datasets. The KDD process in data mining typically
involves the following steps:
 Selection: Select a relevant subset of the data for analysis.
 Pre-processing: Clean and transform the data to make it ready for
analysis. This may include tasks such as data normalization, missing
value handling, and data integration.
 Transformation: Transform the data into a format suitable for data
mining, such as a matrix or a graph.
 Data Mining: Apply data mining techniques and algorithms to the data
to extract useful information and insights.
 Interpretation: Interpret the results and extract knowledge from the
data. This may include tasks such as visualizing the results, evaluating
the quality of the discovered patterns and identifying relationships and
associations among the data.
 Evaluation: Evaluate the results to ensure that the extracted
knowledge is useful, accurate, and meaningful.
 Deployment: Use the discovered knowledge to solve the business
problem and make decisions.

7.State the advantages and disadvantages of KDD process.

Ans. Advantages of KDD:

1. Improves decision-making: KDD provides valuable insights and

knowledge that can help organizations make better decisions.
2. Increased efficiency: KDD automates repetitive and time-
consuming tasks and makes the data ready for analysis, which
saves time and money.
3. Better customer service: KDD helps organizations gain a better
understanding of their customers’ needs and preferences, which
can help them provide better customer service.
4. Fraud detection: KDD can be used to detect fraudulent activities
by identifying patterns and anomalies in the data that may
indicate fraud.
Disadvantages of KDD:

1. Privacy concerns: KDD can raise privacy concerns as it involves

collecting and analyzing large amounts of data, which can
include sensitive information about individuals.
2. Complexity: KDD can be a complex process that requires
specialized skills and knowledge to implement and interpret the
results.
3. Data Quality: KDD process heavily depends on the quality of data,
if data is not accurate or consistent, the results can be misleading
4. High cost: KDD can be an expensive process, requiring significant
investments in hardware, software, and personnel.

8.State the differences between KDD and data mining.

Ans. Difference Between KDD and Data Mining

Paramete
KDD Data Mining
r

KDD refers to a process of Data Mining refers to a

identifying valid, novel, potentially process of extracting useful
Definition useful, and ultimately and valuable information or
understandable patterns and patterns from large data
relationships in data. sets.

To find useful knowledge from To extract useful information

Objective
data. from data.

Data cleaning, data integration,

Association rules,
data selection, data
classification, clustering,
Technique transformation, data mining,
regression, decision trees,
s Used pattern evaluation, and
neural networks, and
knowledge representation and
dimensionality reduction.
visualization.

Structured information, such as Patterns, associations, or

rules and models, that can be insights that can be used to
Output
used to make decisions or improve decision-making or
predictions. understanding.

Fundamentals of Data Science Unit 1
No ratings yet
Fundamentals of Data Science Unit 1
29 pages
DM passing package
No ratings yet
DM passing package
38 pages
EN - Security Center Administrator Guide 5.9
100% (1)
EN - Security Center Administrator Guide 5.9
1,260 pages
Chapter 1&2
No ratings yet
Chapter 1&2
91 pages
2 unit
No ratings yet
2 unit
15 pages
DWM Q1-10 - 240426 - 090822
No ratings yet
DWM Q1-10 - 240426 - 090822
13 pages
DMA_qb_solved
No ratings yet
DMA_qb_solved
42 pages
DATA_MINING_UNIT_1
No ratings yet
DATA_MINING_UNIT_1
13 pages
Unit 1
No ratings yet
Unit 1
46 pages
Intro To Data Minning
No ratings yet
Intro To Data Minning
24 pages
IDW Lecture 31--Basic Concepts About Data Mining
No ratings yet
IDW Lecture 31--Basic Concepts About Data Mining
9 pages
Subject Data Warehouse
No ratings yet
Subject Data Warehouse
42 pages
Unit 1 Data Mining
No ratings yet
Unit 1 Data Mining
30 pages
Datawarehouse&Data mining_ALL
No ratings yet
Datawarehouse&Data mining_ALL
46 pages
DWDM UNIT-2
No ratings yet
DWDM UNIT-2
13 pages
Tense Class 6
100% (1)
Tense Class 6
3 pages
data mining introduction
No ratings yet
data mining introduction
52 pages
DWM
No ratings yet
DWM
12 pages
My Notes DWDM
No ratings yet
My Notes DWDM
18 pages
Unit 4 Introduction To Data Mining
No ratings yet
Unit 4 Introduction To Data Mining
22 pages
CHAPTER1-datamining
No ratings yet
CHAPTER1-datamining
33 pages
Chapter 1___Data Mining and Data Warehouse
No ratings yet
Chapter 1___Data Mining and Data Warehouse
44 pages
Lesson 1
No ratings yet
Lesson 1
32 pages
How to Setup n8n Self Hosting - By Waseem
No ratings yet
How to Setup n8n Self Hosting - By Waseem
17 pages
Unit 3 Data Mining
No ratings yet
Unit 3 Data Mining
21 pages
datamining&warehousing
No ratings yet
datamining&warehousing
65 pages
DMW - Unit 1
No ratings yet
DMW - Unit 1
21 pages
Unit 1
No ratings yet
Unit 1
59 pages
Data Mining
No ratings yet
Data Mining
20 pages
Mekelle University-Mekelle Institute of Technology Department of Information Technology Data Mining and Knowledge Discovery
No ratings yet
Mekelle University-Mekelle Institute of Technology Department of Information Technology Data Mining and Knowledge Discovery
36 pages
unit-III
No ratings yet
unit-III
101 pages
Computers Are Your Future: Twelfth Edition
No ratings yet
Computers Are Your Future: Twelfth Edition
39 pages
Unit 1 Data Mining task
No ratings yet
Unit 1 Data Mining task
7 pages
DM Module1 notes
No ratings yet
DM Module1 notes
25 pages
Lec02 - Crypto
No ratings yet
Lec02 - Crypto
30 pages
1.1 - Data Mining
No ratings yet
1.1 - Data Mining
18 pages
Module II
No ratings yet
Module II
124 pages
3-OLAP Operations-13!08!2021 (13-Aug-2021) Material I 13-Aug-2021 Data Mining - Introductory Slides
No ratings yet
3-OLAP Operations-13!08!2021 (13-Aug-2021) Material I 13-Aug-2021 Data Mining - Introductory Slides
37 pages
data mining unit I notes
No ratings yet
data mining unit I notes
24 pages
6G and Next-Generation Internet - Under Blockchain Web3 Economy by Abdeljalil Beniiche
No ratings yet
6G and Next-Generation Internet - Under Blockchain Web3 Economy by Abdeljalil Beniiche
133 pages
Dwdm Unit-II Notes
No ratings yet
Dwdm Unit-II Notes
29 pages
p144 Data Mining
100% (3)
p144 Data Mining
11 pages
Os Paper Solutions
100% (1)
Os Paper Solutions
52 pages
CYB2203 Lecture Note Complete
No ratings yet
CYB2203 Lecture Note Complete
68 pages
DMW Notes by Me
No ratings yet
DMW Notes by Me
45 pages
2017 Planning Guide For Identity and Access Management: Key Findings
No ratings yet
2017 Planning Guide For Identity and Access Management: Key Findings
24 pages
SA3 - Notes Booklet
No ratings yet
SA3 - Notes Booklet
19 pages
Module 2 Data Mining
No ratings yet
Module 2 Data Mining
49 pages
Unit I
No ratings yet
Unit I
19 pages
Apps L6+Applied+Digital+Technology+Programme+Guide
No ratings yet
Apps L6+Applied+Digital+Technology+Programme+Guide
18 pages
TechnoEssentials Module 2 Where Are We TechnoKids PH
No ratings yet
TechnoEssentials Module 2 Where Are We TechnoKids PH
14 pages
DM Module1
No ratings yet
DM Module1
15 pages
DM Chapter 1
No ratings yet
DM Chapter 1
10 pages
Whats App
No ratings yet
Whats App
23 pages
DM Unit1 Intro
No ratings yet
DM Unit1 Intro
12 pages
7 The Internet
No ratings yet
7 The Internet
27 pages
Introduction To Data Mining-Week1
No ratings yet
Introduction To Data Mining-Week1
43 pages
Chap 1
No ratings yet
Chap 1
32 pages
CS1004 DWM 2marks 2013
No ratings yet
CS1004 DWM 2marks 2013
22 pages
Getting Started With Apache Kafka
No ratings yet
Getting Started With Apache Kafka
21 pages
Unit 1 Data Mining
No ratings yet
Unit 1 Data Mining
15 pages
Sap Introduction
No ratings yet
Sap Introduction
14 pages
Conceptual Model of UML
No ratings yet
Conceptual Model of UML
24 pages
DM-Model Question Paper Solutions
No ratings yet
DM-Model Question Paper Solutions
27 pages
Digital forensic activity 1
No ratings yet
Digital forensic activity 1
8 pages
Data Mining: An Overview From A Database Perspective
No ratings yet
Data Mining: An Overview From A Database Perspective
30 pages
Advantages and Disadvantages
No ratings yet
Advantages and Disadvantages
2 pages
AWS Cloud Confident Twitch Resources
No ratings yet
AWS Cloud Confident Twitch Resources
7 pages
Copy of Incident Response Plan Template
No ratings yet
Copy of Incident Response Plan Template
7 pages
Carl Teneng
No ratings yet
Carl Teneng
4 pages
Data Mining Summaries PDF
No ratings yet
Data Mining Summaries PDF
22 pages
Compiler
No ratings yet
Compiler
6 pages
Data Mining Unit-II
No ratings yet
Data Mining Unit-II
4 pages
DWDM R13 Unit 1 PDF
No ratings yet
DWDM R13 Unit 1 PDF
10 pages
Q.1. What Is Data Mining?
No ratings yet
Q.1. What Is Data Mining?
15 pages
Cs1004: Data Warehousing and Mining Two Marks Questions and Answers Unit I
No ratings yet
Cs1004: Data Warehousing and Mining Two Marks Questions and Answers Unit I
31 pages
Oracle BI Answers and Interactive Dashboards: Submitted By, Dheeraj Rekula Bhavik Shah
No ratings yet
Oracle BI Answers and Interactive Dashboards: Submitted By, Dheeraj Rekula Bhavik Shah
9 pages
RGPV Notes _ Data Analytics
No ratings yet
RGPV Notes _ Data Analytics
3 pages
10 Ssc Holiday Homework (25-26)-1
No ratings yet
10 Ssc Holiday Homework (25-26)-1
3 pages
Wipro ATS Resume Joya Khan
No ratings yet
Wipro ATS Resume Joya Khan
3 pages
Instructions IHTASim RAVEN Installation
No ratings yet
Instructions IHTASim RAVEN Installation
2 pages
Document 2402362.1 - Create WO With Oper and MAte
No ratings yet
Document 2402362.1 - Create WO With Oper and MAte
3 pages
Complete Windows Hacking With Kali and Python Course Content
No ratings yet
Complete Windows Hacking With Kali and Python Course Content
2 pages
01 Assignment 1
No ratings yet
01 Assignment 1
1 page
UNIT-1 Introduction To Data Mining
No ratings yet
UNIT-1 Introduction To Data Mining
29 pages
Student Enrollment Form
No ratings yet
Student Enrollment Form
1 page
Tense 2 Class 6
100% (1)
Tense 2 Class 6
8 pages
The Secret Of Machine Learning
From Everand
The Secret Of Machine Learning
Mhd Arjunanta
No ratings yet
Principles of Data Mining
From Everand
Principles of Data Mining
Subodh Keshari
No ratings yet
Introduction to Robotics
From Everand
Introduction to Robotics
Swarnalata Verma
No ratings yet
Mastering Data Mining Techniques
From Everand
Mastering Data Mining Techniques
Dhaanyalakshmi Ahuja
No ratings yet
Databases: System Concepts, Designs, Management, and Implementation
From Everand
Databases: System Concepts, Designs, Management, and Implementation
Jonathan Rigdon
No ratings yet
Practical Data Strategies and Recipes
From Everand
Practical Data Strategies and Recipes
Tom Henricksen
No ratings yet
Data Mining: Fundamentals and Applications
From Everand
Data Mining: Fundamentals and Applications
Fouad Sabry
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Data Mining Unit-i

Uploaded by

Data Mining Unit-i

Uploaded by

DATA MINING (UNIT-I)

1. What is Data Mining?

2. What Is Motivated Data Mining? Why Is It Important?

3. What are the functionalities in data mining?

Classification Based on the mined Databases: A data mining system can

Classification Based on the type of Knowledge Mined: A data mining

Classification Based on the Techniques Utilized: A data mining system

Classification Based on the Applications Adapted: Data mining systems

5.What are the integration of a data mining system with a database?

6.What are the issues in data mining?

ii)Performance Issues- There can be performance-related issues such as follows −

iii)Diverse Data Types Issues-

6.Explain KDD process.

7.State the advantages and disadvantages of KDD process.

Ans. Advantages of KDD:

1. Improves decision-making: KDD provides valuable insights and

1. Privacy concerns: KDD can raise privacy concerns as it involves

8.State the differences between KDD and data mining.

Ans. Difference Between KDD and Data Mining

KDD refers to a process of Data Mining refers to a

To find useful knowledge from To extract useful information

Data cleaning, data integration,

Structured information, such as Patterns, associations, or

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.