Chapter 1

Uploaded by

rahmansadaf46

CHAPTER 1

Introduction
1.1 Background

Data mining extracts implicit, potentially useful knowledge from large amounts of data. It
is also called knowledge mining, knowledge extraction, data/sequence/pattern analysis, data
archaeology and data dredging. In other words, data mining is the act of drilling through
huge volumes of data to discover relationships or answer queries that are too generalized
for traditional query tools.
In general, data mining tasks can be classified into two categories:

Descriptive mining: This is the process of drawing out the essential characteristics or general
properties of the data in the database. Clustering, association and sequential mining are among
the descriptive mining techniques.

Predictive mining: This is the process of inferring sequences from data to make
predictions. Classification, regression and deviation detection are predictive mining techniques.

Data mining techniques are useful in various areas, such as market basket analysis, decision
support, fraud detection, business management and telecommunications. Data mining draws on
database technology, machine learning, artificial intelligence, neural networks, statistics,
pattern recognition, knowledge-based systems, knowledge acquisition, information retrieval,
high-performance computing and data visualization.

Many methods have been developed to extract this information. Sequential sequence mining is
one of the most important of these techniques, as it supports decision making in various
applications. The mining problem was first proposed by Agrawal and Srikant [10]. It discovers
sequences that occur frequently in a sequence database.

In medicine, hospital databases record information such as diseases, treatments and durations
of hospital stay, from which time interval sequences of diseases can be found. However, events
such as suffering from a disease, being cured of it, or the occurrence of symptoms are all
interval-based. Conventional sequential sequence mining is not appropriate for discovering
sequences in such events. On the other hand, time interval sequences are more useful for
identifying whether a patient suffers from a certain disease. They can also help predict the
symptoms of a patient who has a certain disease.

In investment, whether a certain stock will rise or fall is one of the most important things
that stock investors want to know. Business owners, likewise, are concerned about the stock
trend of their own businesses. Stockholders and industry analysts also want to know about the
rise or fall of certain stocks, which is useful information that can be extracted from the time
interval sequences of stock prices. Stock prices are recorded for every transaction, and these
records serve as historical data. Time interval stock sequences can then be found from the
stock interval event database.

In e-marketing, some Internet vendors provide new selling methods such as group buying
offers. In a group buying offer, a vendor sells a product at a lower price when someone
gathers a group of people to buy it. The duration from when an individual joins a group
buying session for a certain product until the session closes is considered an interval-based
event. Since many group buying customers may join buying sessions for a number of products
concurrently or at later times, these interval-based events form a set of sequences, which may
include some interesting time-oriented sequences. Discovering time-oriented sequences from
group buying records will help in understanding the purchasing behavior of customers and in
making effective marketing strategies.
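
As an illustration of what an interval-based event could look like in code, the following is a minimal sketch and not part of the thesis's method. The `IntervalEvent` class, the `relation` function, the product labels and the integer time stamps are all hypothetical names and values chosen for this example. It classifies two buying sessions as concurrent ("overlaps") or as one following the other ("before"/"after").

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class IntervalEvent:
    """Hypothetical interval-based event: a customer's stay in a buying session."""
    label: str   # product of the group buying session
    start: int   # time the customer joined (arbitrary integer time unit)
    end: int     # time the session closed


def relation(a: IntervalEvent, b: IntervalEvent) -> str:
    """Return a coarse temporal relation of a to b: 'before', 'after' or 'overlaps'."""
    if a.end < b.start:
        return "before"
    if b.end < a.start:
        return "after"
    return "overlaps"


# Illustrative sessions only; the values are made up.
camera = IntervalEvent("camera", 1, 5)
phone = IntervalEvent("phone", 3, 8)
laptop = IntervalEvent("laptop", 9, 12)

print(relation(camera, phone))   # the two sessions run concurrently
print(relation(camera, laptop))  # the camera session closes first
```

Only three coarse relations are distinguished here to keep the sketch short; a finer-grained set of interval relations could be substituted without changing the representation.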

Traditional association rule mining [10] works on transactional data. It considers the items
purchased in a single transaction of a particular customer. It does not take into account that
the same customer purchases items in different transactions. Sequential sequence mining was
introduced to address this: it considers the items purchased across different transactions,
covering the case where the same customer purchases items in more than one transaction and at
more than one time. However, the current state-of-the-art techniques have limitations in
memory usage and execution time, which are the focus of our work.

Sequential sequence mining mines sequential sequences from a database with efficient
support counting. It is used to find the frequent subsequences, that is, those that occur with
at least a minimum support value. Unlike simple association rule mining, sequential sequence
mining focuses on sequences of events that occur frequently in a given dataset. For example, a
customer in an electronics retail shop purchases a Computer System and then, after some amount
of time, purchases a Scanner. That is, the Scanner is purchased after the Computer System. The
order of the items plays a major role. We use an ordered dataset, in which all events are
stored in a particular order. Traditional sequential sequence mining does not take into account
the time between the purchases of items.
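
The idea of support counting over ordered purchases can be sketched as follows. This is a minimal illustration under stated assumptions, not the MySSM algorithm: the function names `contains` and `support`, the representation of a customer sequence as a list of item sets, and the sample database are all hypothetical. Support is taken here as the fraction of customer sequences that contain the candidate sequence in order.

```python
from typing import List, Set

# Assumed representation: one customer sequence is an ordered list of
# transactions, and each transaction is a set of purchased items.
CustomerSequence = List[Set[str]]


def contains(sequence: CustomerSequence, pattern: CustomerSequence) -> bool:
    """Return True if the pattern's item sets occur, in order, in the sequence."""
    pos = 0
    for itemset in pattern:
        # Advance to the earliest remaining transaction that includes this item set.
        while pos < len(sequence) and not itemset <= sequence[pos]:
            pos += 1
        if pos == len(sequence):
            return False
        pos += 1  # the next item set must match a strictly later transaction
    return True


def support(database: List[CustomerSequence], pattern: CustomerSequence) -> float:
    """Fraction of customer sequences in the database that contain the pattern."""
    if not database:
        return 0.0
    return sum(1 for seq in database if contains(seq, pattern)) / len(database)


# Made-up sample data for illustration only.
database = [
    [{"Computer System"}, {"Scanner"}],
    [{"Computer System", "Printer"}, {"Paper"}, {"Scanner"}],
    [{"Scanner"}, {"Computer System"}],  # wrong order, so it does not count
    [{"Printer"}],
]
pattern = [{"Computer System"}, {"Scanner"}]
print(support(database, pattern))  # 0.5
```

The third customer bought both items but in the opposite order, so that sequence does not support the pattern. A candidate would be reported as frequent when its support reaches the chosen minimum support value.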

The goal of my research work is to develop and evaluate new algorithms of MySSM
that efficiently produce sequential sequences from a large database, with a significant
improvement in execution time and memory usage.

1.2 Thesis organization

Chapter 1 introduces the thesis. It also presents the organization of the thesis and the
aim of our research work.

Chapter 2 focuses on work related to our research. The first part of the chapter is a
literature survey. The second section discusses various sequential sequence mining
techniques. The third section focuses on state-of-the-art techniques for sequential sequence
mining and compares each of them with closely related techniques. The fourth section discusses
the results of an empirical analysis of the state-of-the-art methods. This chapter helped us
to strengthen our technique by considering various parameters of the evaluation matrix in the
area of sequential sequence mining.

Chapter 3 provides the motivation for our research work. It focuses on what inspired us to
do research in sequential sequence mining. The deficiencies of the state-of-the-art
methods motivated us to develop a new sequential sequence mining technique.

Chapter 4 focuses on the scope of work of our algorithm MySSM. Chapter 5 discusses the
proposed algorithms, including the steps of our algorithm MySSM. The proposed algorithms,
named SYNTIM, MySSM, GCON, FS, GSGT, GAS, CMEM and OUTR, are all discussed in that chapter.

Chapter 6 experimentally validates the claims of efficiency in terms of time and memory.
In addition, we have empirically analyzed the algorithm on large databases with various
parameters, such as the support value, the number of items per transaction, the number of
transactions per customer and the number of customers per database.

Chapter 7 summarizes the thesis and discusses the future scope of the work. This chapter is
followed by the references used in our thesis.

1.3 Aim of the Research

The fundamental aim of my thesis is to study and develop a new sequential sequence
mining technique that produces sequential sequences from a large database. It considers the
time gap between successive items purchased by customers. It produces the sequential
sequences with a reasonable amount of time and memory.
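
One possible way to take the time gap between successive purchases into account is a maximum-gap constraint, sketched below. This is an assumption made for illustration and is not claimed to be how MySSM handles time gaps. The function name `occurs_within_gap`, the `(time, item)` event representation and the sample purchase history are hypothetical.

```python
from typing import List, Optional, Tuple

# Assumed representation: a purchase is (time, item), and a customer's
# purchase history is a list of such events sorted by time.
Event = Tuple[int, str]


def occurs_within_gap(
    events: List[Event],
    pattern: List[str],
    max_gap: int,
    start: int = 0,
    prev_time: Optional[int] = None,
) -> bool:
    """Return True if the items of pattern occur in order in events, with at
    most max_gap time units between each pair of successive matched items."""
    if not pattern:
        return True
    for i in range(start, len(events)):
        time, item = events[i]
        if prev_time is not None and time - prev_time > max_gap:
            break  # events are sorted, so every later event is also too far away
        if item == pattern[0] and occurs_within_gap(
            events, pattern[1:], max_gap, i + 1, time
        ):
            return True
    return False


# Made-up purchase history: (day, item).
history = [(1, "Computer System"), (5, "Computer System"), (6, "Scanner")]
pattern = ["Computer System", "Scanner"]

print(occurs_within_gap(history, pattern, max_gap=2))  # True, via the day-5 purchase
print(occurs_within_gap(history, pattern, max_gap=0))  # False
```

The search backtracks over candidate matches instead of always taking the earliest one, because under a gap constraint the earliest occurrence of an item (day 1 here) can fail while a later occurrence (day 5) succeeds.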
