0% found this document useful (0 votes)

37 views10 pages

CS37300 Data Mining & Machine Learning: Anomaly Detection

This document provides an overview of anomaly detection from a machine learning course. It defines anomalies as data points that are considerably different from the majority of data. It discusses different types of anomalies including point, contextual, and collective anomalies. It also outlines challenges in anomaly detection like defining outliers and evaluating models with skewed class distributions. The document concludes with outlining supervised, semi-supervised, and unsupervised approaches to anomaly detection.

Uploaded by

sanjay

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

37 views10 pages

CS37300 Data Mining & Machine Learning: Anomaly Detection

Uploaded by

sanjay

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 10

CS37300

Data Mining & Machine Learning

Anomaly Detection
Module 1: Overview
Prof. Chris Clifton
7 April 2020

Some materials from Introduction to Data Mining by Tan, Steinbach and Kumar
Task
• Anomalies/outliers: data points that are considerably
“different” from the remainder of the data
• Variants:
– Find all points with anomaly scores > threshold
– Find point with largest anomaly score
– Given a database D with mostly normal points, compute the
anomaly score of a point x with respect to D
Examples
• Fraud detection
• Intrusion detection
• Ecosystem disturbances
• System monitoring
• Biosurveillance/public health
• Data preprocessing
Types of anomalies
• Data from different classes
– “An outlier is an observation that differs so much from other
observations as to arouse suspicion that it was generated by a
different mechanism”
• Natural variation
– Extreme or unlikely variations are often interesting
• Data measurement and collection errors
– Preprocess to remove
Defining an outlier
• Notion of outlier is highly subjective and domain
dependent
• However, most definitions can be viewed as defining a
distribution for “normal” data and then looking for
deviations from that distribution

Source: Osmar Zaiane, UAlberta, PKDD

Point anomalies
• An individual data instance is anomalous with respect to
the data Y

N1 o1
O3

Source: Lazarevic et al, ECML/PKDD’08 Tutorial

Contextual anomalies
• An individual data instance is anomalous within a
context
• Requires a notion of context
• Also referred to as conditional anomalies (Song et. al,
TDKE ’06)

Anomaly
Normal

Source: Lazarevic et al, ECML/PKDD’08 Tutorial

Collective anomalies
• A collection of related data instances is anomalous
• Requires a relationship among data instances, e.g.:
– Sequential, Spatial, Graph Data
• The individual instances within a collective anomaly are
not anomalous by themselves

Anomalous Subsequence
Source: Lazarevic et al, ECML/PKDD’08 Tutorial
Anomaly detection
• Challenges
– How many attributes are used to define an outlier?
– How many outliers are there in the data?
– Class labels are costly (evaluation can be challenging)
– Skewed class distribution (finding needles in haystack)
• Working assumption:
– There are considerably more “normal” observations than
“abnormal” observations in the data
Approaches
• Supervised
– Labels available for both normal data and anomalies
– Similar to classification with imbalanced classes
• Semi-supervised
– Labels available only for normal data
• Unsupervised
– No labels assumed
– Based on the assumption that anomalies are very rare compared
to normal data

Anomaly Detection For Cyber Security
No ratings yet
Anomaly Detection For Cyber Security
31 pages
(Ebook PDF) The Anthropology of Language: An Introduction To Linguistic Anthropology 4th Edition PDF Download
100% (4)
(Ebook PDF) The Anthropology of Language: An Introduction To Linguistic Anthropology 4th Edition PDF Download
47 pages
Format For Enrolment Position For School Uniform 2025-26
No ratings yet
Format For Enrolment Position For School Uniform 2025-26
41 pages
Introduction to Robotics
From Everand
Introduction to Robotics
Swarnalata Verma
No ratings yet
741 Outlier Detection
No ratings yet
741 Outlier Detection
55 pages
ff12 Deep Learning For Anomaly Detection
No ratings yet
ff12 Deep Learning For Anomaly Detection
71 pages
IsiXhosa HL P1 May-June 2023
No ratings yet
IsiXhosa HL P1 May-June 2023
13 pages
On The Nature and Types of Anomalies: A Review of Deviations in Data
No ratings yet
On The Nature and Types of Anomalies: A Review of Deviations in Data
35 pages
5 Anomaly Detection Annotated Section 100 300
No ratings yet
5 Anomaly Detection Annotated Section 100 300
48 pages
Unit 3
No ratings yet
Unit 3
37 pages
Outlier Analysis
No ratings yet
Outlier Analysis
18 pages
Anomaly Detection
No ratings yet
Anomaly Detection
13 pages
Unit 5 - Lecture 1 - Outlier Detection
No ratings yet
Unit 5 - Lecture 1 - Outlier Detection
30 pages
Lecture Notes - Anomaly Detection in Time Series
No ratings yet
Lecture Notes - Anomaly Detection in Time Series
43 pages
Unit5 OutliersDetection
No ratings yet
Unit5 OutliersDetection
37 pages
Cambridge International As Amp A Level Further Mathematics Further Pure Mathematics 1 9781510422018 1510422013
100% (1)
Cambridge International As Amp A Level Further Mathematics Further Pure Mathematics 1 9781510422018 1510422013
211 pages
Tengeru Institute of Community Development
No ratings yet
Tengeru Institute of Community Development
48 pages
07 Outlier Detection
No ratings yet
07 Outlier Detection
54 pages
Quantum Simulation of Schrödingers Equation
No ratings yet
Quantum Simulation of Schrödingers Equation
50 pages
Unit 5
No ratings yet
Unit 5
47 pages
Anomoly Detection - Ensemble - Classifiers
No ratings yet
Anomoly Detection - Ensemble - Classifiers
68 pages
Introtoanomalydetection 170421012904
No ratings yet
Introtoanomalydetection 170421012904
53 pages
Datamining Seminar
No ratings yet
Datamining Seminar
19 pages
Anomaly-Fraud-Detection
No ratings yet
Anomaly-Fraud-Detection
50 pages
Iva 4
No ratings yet
Iva 4
43 pages
WP S-Ax Key Steps To Detect An Anomaly in Real-time-JAN10
No ratings yet
WP S-Ax Key Steps To Detect An Anomaly in Real-time-JAN10
10 pages
Conjuntions (Updated Feb4 2025)
No ratings yet
Conjuntions (Updated Feb4 2025)
7 pages
17 dm2 Anomaly Detection 2022 23
No ratings yet
17 dm2 Anomaly Detection 2022 23
113 pages
From Zero to Oracle Hero: A Journey Through SQL, PL/SQL, and DBA Dark Arts
From Everand
From Zero to Oracle Hero: A Journey Through SQL, PL/SQL, and DBA Dark Arts
Scott Markham
No ratings yet
T6 - QMchange Point Anomaly
No ratings yet
T6 - QMchange Point Anomaly
11 pages
Explainable Contextual Anomaly Detection
No ratings yet
Explainable Contextual Anomaly Detection
48 pages
고3 모의 3개동시 분석지 (원본)
No ratings yet
고3 모의 3개동시 분석지 (원본)
25 pages
Anomaly Detection For Data Streams in Large-Scale Distributed Heterogeneous Computing Environments
No ratings yet
Anomaly Detection For Data Streams in Large-Scale Distributed Heterogeneous Computing Environments
11 pages
References
No ratings yet
References
6 pages
Técnicas Estadísticas para la Ciencia de Datos a través de R. Aprendizaje Supervisado: Análisis Discriminante, Árboles de Decisión, Redes Neuronales y Modelos Lineales Generalizados
From Everand
Técnicas Estadísticas para la Ciencia de Datos a través de R. Aprendizaje Supervisado: Análisis Discriminante, Árboles de Decisión, Redes Neuronales y Modelos Lineales Generalizados
César Pérez López
No ratings yet
Anomaly Detection
No ratings yet
Anomaly Detection
49 pages
Outlier Detection
No ratings yet
Outlier Detection
10 pages
Outlier Detection
No ratings yet
Outlier Detection
30 pages
Anomaly Detection and Outlier Analysis
No ratings yet
Anomaly Detection and Outlier Analysis
25 pages
Date Sheet in Semester May 2025
No ratings yet
Date Sheet in Semester May 2025
5 pages
A Survey On Outlier Detection Methods
No ratings yet
A Survey On Outlier Detection Methods
4 pages
JSREP Volume 38 Issue 183ج1 Pages 223-310
No ratings yet
JSREP Volume 38 Issue 183ج1 Pages 223-310
88 pages
Machine Learning For Anomaly Detection A Systemati
No ratings yet
Machine Learning For Anomaly Detection A Systemati
47 pages
Ecmlpkdd08 Lazarevic Dmfa
No ratings yet
Ecmlpkdd08 Lazarevic Dmfa
116 pages
4C's and Principles of Communication
100% (1)
4C's and Principles of Communication
2 pages
Teaching New Head Way Plus English Course
No ratings yet
Teaching New Head Way Plus English Course
39 pages
Unit 2 - Part A
No ratings yet
Unit 2 - Part A
51 pages
Anomaly Detection Guidebook
100% (1)
Anomaly Detection Guidebook
16 pages
2024 Northeast Michigan Kids Count Data Profiles
No ratings yet
2024 Northeast Michigan Kids Count Data Profiles
9 pages
ADII10 Analisa Outlier
No ratings yet
ADII10 Analisa Outlier
37 pages
ISAT 600 Progress Report 3
No ratings yet
ISAT 600 Progress Report 3
4 pages
Anomaly Detection: A Tutorial
No ratings yet
Anomaly Detection: A Tutorial
101 pages
Unit V Outlier 2
No ratings yet
Unit V Outlier 2
13 pages
Anomaly Detection: Lecture Notes For Chapter 9 Introduction To Data Mining, 2 Edition by Tan, Steinbach, Karpatne, Kumar
No ratings yet
Anomaly Detection: Lecture Notes For Chapter 9 Introduction To Data Mining, 2 Edition by Tan, Steinbach, Karpatne, Kumar
33 pages
Synthesis Writing Template: I. Introduction - MUST HAVE ALL THREE
No ratings yet
Synthesis Writing Template: I. Introduction - MUST HAVE ALL THREE
4 pages
Business Development Sales Vishwanath Sajjan
No ratings yet
Business Development Sales Vishwanath Sajjan
2 pages
Thermal Anomaly Detection
No ratings yet
Thermal Anomaly Detection
3 pages
Anomaly Detection
No ratings yet
Anomaly Detection
7 pages
Borges - The Garden of Forking Paths PDF
No ratings yet
Borges - The Garden of Forking Paths PDF
7 pages
Chapter8 Student
No ratings yet
Chapter8 Student
60 pages
Igilik Saya
No ratings yet
Igilik Saya
11 pages
Outlier Detection
No ratings yet
Outlier Detection
22 pages
Grade 8 PA3 Syllabus
No ratings yet
Grade 8 PA3 Syllabus
2 pages
10 - Anomaly Detection
No ratings yet
10 - Anomaly Detection
12 pages
Algorithmic Injustice. A Relational Ethics Approach
No ratings yet
Algorithmic Injustice. A Relational Ethics Approach
9 pages
Ebook Beginners Guide To Anomaly Detection 2022
No ratings yet
Ebook Beginners Guide To Anomaly Detection 2022
12 pages
Data Minning Unit 4-1
No ratings yet
Data Minning Unit 4-1
10 pages
Art Curriculum Overview
100% (1)
Art Curriculum Overview
13 pages
Module 11 (C)
No ratings yet
Module 11 (C)
4 pages
MBA Analytics For Finance 08
No ratings yet
MBA Analytics For Finance 08
9 pages
Anamoly Detection
No ratings yet
Anamoly Detection
20 pages
Karan Resume Copy-1
No ratings yet
Karan Resume Copy-1
3 pages
Chapter 3: Numerical Summary Measures
No ratings yet
Chapter 3: Numerical Summary Measures
34 pages
Outlier Detection Techniques
100% (2)
Outlier Detection Techniques
56 pages
Assessment of Course Outcomes: Object Oriented Programming Through JAVA
No ratings yet
Assessment of Course Outcomes: Object Oriented Programming Through JAVA
51 pages
Chapter 2: Tables and Graphs For Summarizing Data
No ratings yet
Chapter 2: Tables and Graphs For Summarizing Data
21 pages
Anomaly Detection: A Tutorial: Arindam Banerjee, Varun Chandola, Vipin Kumar, Jaideep Srivastava
No ratings yet
Anomaly Detection: A Tutorial: Arindam Banerjee, Varun Chandola, Vipin Kumar, Jaideep Srivastava
101 pages
NUS AMP Brochure
No ratings yet
NUS AMP Brochure
15 pages
Module 3 - Characterization
No ratings yet
Module 3 - Characterization
16 pages
Primary 6 (Grade 6) Contest Paper: Singapore and Asian Schools Math Olympiad 2020
No ratings yet
Primary 6 (Grade 6) Contest Paper: Singapore and Asian Schools Math Olympiad 2020
15 pages
Anomaly Detection 2
No ratings yet
Anomaly Detection 2
8 pages
Outlier Detection For Different Applications Review IJERTV2IS3508
No ratings yet
Outlier Detection For Different Applications Review IJERTV2IS3508
13 pages
Outlier Detection
No ratings yet
Outlier Detection
36 pages
Anomaly Detection
No ratings yet
Anomaly Detection
7 pages
JD - Great Learning - IMI New Delhi
No ratings yet
JD - Great Learning - IMI New Delhi
3 pages
Module 2 - Setting
No ratings yet
Module 2 - Setting
11 pages
Tabular Arrangement & Distribution Worksheet
No ratings yet
Tabular Arrangement & Distribution Worksheet
3 pages
Anomaly Detection Survey
No ratings yet
Anomaly Detection Survey
72 pages
Checklist For Review of Schematic Floor Plan
100% (1)
Checklist For Review of Schematic Floor Plan
2 pages
Quiz 13
No ratings yet
Quiz 13
6 pages
The Weekly Schedule Sunday Monday Tuesday Wednesday Thursday Friday Saturday
No ratings yet
The Weekly Schedule Sunday Monday Tuesday Wednesday Thursday Friday Saturday
2 pages
Review For Final Exam: New Material ONLY
No ratings yet
Review For Final Exam: New Material ONLY
4 pages
CS373 Homework 1: 1 Part I: Basic Probability and Statistics
No ratings yet
CS373 Homework 1: 1 Part I: Basic Probability and Statistics
5 pages
PC Control Using Android Over Internet
No ratings yet
PC Control Using Android Over Internet
3 pages
6anomaly Fraud Detection
No ratings yet
6anomaly Fraud Detection
5 pages
Babu Krishna-Sanjay PDF
No ratings yet
Babu Krishna-Sanjay PDF
1 page
2020006101CustomLetter PDF
No ratings yet
2020006101CustomLetter PDF
1 page
3-Day Food-Activity Log Instructions
No ratings yet
3-Day Food-Activity Log Instructions
2 pages
MS Computer Engineering Degree Requirement Worksheet: Course Title Units Prerequisite Semester
No ratings yet
MS Computer Engineering Degree Requirement Worksheet: Course Title Units Prerequisite Semester
2 pages
Outlier Mining Techniques For Uncertain Data
No ratings yet
Outlier Mining Techniques For Uncertain Data
7 pages
Advantages and Disadvantages of Labeling Children With An
No ratings yet
Advantages and Disadvantages of Labeling Children With An
4 pages
Chapter 12. Outlier Analysis
No ratings yet
Chapter 12. Outlier Analysis
4 pages
Lesson Plan Grade 10
No ratings yet
Lesson Plan Grade 10
15 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

CS37300 Data Mining & Machine Learning: Anomaly Detection

Uploaded by

CS37300 Data Mining & Machine Learning: Anomaly Detection

Uploaded by

CS37300

Data Mining & Machine Learning

Source: Osmar Zaiane, UAlberta, PKDD

Source: Lazarevic et al, ECML/PKDD’08 Tutorial

Source: Lazarevic et al, ECML/PKDD’08 Tutorial

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.