0% found this document useful (0 votes)

14 views32 pages

Class10-Introduction To ML

The document provides an overview of machine learning and data science, emphasizing data modeling, preprocessing, and analytics. It distinguishes between supervised and unsupervised learning, detailing their methodologies, applications, and examples. Key concepts include classification, regression, clustering, and association, along with the importance of data characteristics in analysis.

Uploaded by

Paladin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views32 pages

Class10-Introduction To ML

Uploaded by

Paladin

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 32

Introduction to Machine

Learning: Data Modeling

Data Science
• Multi-disciplinary field that uses scientific methods,
processes, algorithms and systems to extract
knowledge and insight from structured and
unstructured data
• Central concept is gaining insight from data

Data Modeling Inference

Data Collection (Machine
Learning)

Data Preprocessing

Data
Feature
Database Cleaning and
Representation
Cleansing
2
Data Science
• Multi-disciplinary field that uses scientific methods,
processes, algorithms and systems to extract
knowledge and insight from structured and
unstructured data
• Central concept is gaining insight from data

Data Modeling Inference

Data Collection (Machine
Learning)

Data Preprocessing

Data
Feature
Database Cleaning and
Representation
Cleansing
3
Data Preprocessing and Descriptive
Data Analytics
• Data preprocessing involve:
– Data cleaning, Data integration, Data transformation,
Data reduction
• Descriptive data analytics serves as a foundation for
data preprocessing
• It helps us to study the general characteristics of data
and identify the presence of noise or outliers
• Data characteristics:
– Central tendency of data
• Centre of the data
• Measuring mean, median and mode
– Dispersion of data
• The degree to which numerical data tend to spread
• Measuring range, quartiles, interquartile range (IQR), the
five-number summary and standard deviation
• Descriptive analytics are the backbone of reporting
Data Science
• Multi-disciplinary field that uses scientific methods,
processes, algorithms and systems to extract knowledge
and insight from structured and unstructured data
• Central concept is gaining insight from data
• Machine learning uses data to extract knowledge –
predictive analytics

Data Modeling Inference

Data Collection (Machine
Learning)

Data Preprocessing

Data
Feature
Database Cleaning and
Representation
Cleansing
5
Predictive Data Analytics
• It is used to identify the trends, correlations and
causation by learning the patterns from data
• Study and construction of algorithms that can learn
from data and make predictions on data
• It involve tasks like
– Classification: Categorical label prediction
• E.g.: predicting the presence or absence of disease or
• predicting the category of the disease according to
symptoms
– Regression: Numeric prediction
• E.g.: predicting the amount of landslide or
• predicting the amount of rainfall
– Clustering: Grouping of similar patterns
• E.g.: grouping the similar items to be sold or
• grouping the people from the same region
• Learning from data
6
Machine Learning:
Learning from Data
• 1, 2, 3, 4, 5, ?, …, 24, 25, 26, 27, ?
• 1, 3, 5, 7, 9, ?, …, 25, 27, 29, 31, ?
• 2, 3, 5, 7, 11, ?, …, 29, 31, 37, 41, ?
• 1, 4, 9, 16, 25, ?, …, 121, 144, 169, ?
• 1, 2, 4, 8, 16, 32, ?,…, 1024, 2048, 4096, ?
• 1, 1, 2, 3, 5, 8, ?, …, 55, 89, 144, 233, ?
• 1, 1, 2, 4, 7, 13, ?, 44, 81, 149, 274, 504, ?
• 3, 5, 12, 24, 41, ?, …., 201, 248, 300, 357, ?
• 1, 6, 19, 42, 59, ?, …, 95, 117, 156, 191, ?

8
• 1, 2, 3, 4, 5, 6, …, 24, 25, 26, 27, 28
• 1, 3, 5, 7, 9, 11, …, 25, 27, 29, 31, 33
• 2, 3, 5, 7, 11, 13, …, 29, 31, 37, 41, 43
• 1, 4, 9, 16, 25, 36, …, 121, 144, 169, 196
• 1, 2, 4, 8, 16, 32, 64,…, 1024, 2048, 4096, 8192
• 1, 1, 2, 3, 5, 8, 13, …, 55, 89, 144, 233, 377
• 1, 1, 2, 4, 7, 13, 24, 44, 81, 149, 274, 504, 927
• 3, 5, 12, 24, 41, 63, ….., 201, 248, 300, 357, 419
(2, 7, 12, 17, 22, 27, 32, 37, 42, 47, 52, 57, 62)
• 1, 6, 19, 42, 59, ?, …, 95, 117, 156, 191, ?

• Pattern: Any regularity or structure in data or source of

data
• Pattern Analysis: Automatic discovery of patterns in
data
9
Image Classification

Tige
r

Giraffe

Horse

Bear

Intraclass variability
10
Interclass
Scene Image Classification similarity
Tall Inside Street Highway Coast Open Mountain Forest
building city country

11
Scene Image Clustering

12
Scene Image Clustering
Residential Interiors

Mountain
s

Military Vehicles

Sacred Places

Sunsets & Sunrises

13
Machine Learning for Pattern
Recognition
• Learning: Acquiring new knowledge or modifying the existing
knowledge
• Knowledge: Familiarity with information present in data
• Learning by machines for pattern analysis: Acquisition of
knowledge from data to discover patterns in data
• Data-driven techniques for learning by machines: Learning from
examples (Training of models)
• Generalization ability of learning machines: Performance of trained
models on new (test) data
• Target of learning techniques: Good generalization ability
• Learning techniques: Estimation of parameters of models
• Learning machines and Learning techniques for pattern analysis:
– Statistical Models (Maximum likelihood)
– Artificial Neural Networks (Error correction learning)
– Kernel Methods (Learning optimal linear relationships) 14
Illustration - Data1: Representing a
Person
• A person is represented using two
attributes:
– Height
– Weight

Weight
in Kg
(x2)

Height in cm (x1)

x = [x1 x2]T
17
Illustration – Data2: Iris (Flower) Data [1]

x = [x1 x2 x3 x4]T

[1] R. A. Fisher, "The use of multiple measurements in taxonomic problems" Annual Eugenics, 7, Part II, pp. 179-
188, 1936. 18
Illustration – Data3: Years of Experience
and Salary
Years of Salary (in
experienc Rs 1000)
e (x2)
(x1)
3 30
8 57
9 64
13 72 Salary
3 36 (x2)
6 43
11 59
21 90
1 20
Years of experience (x1)
16 83

x = [x1 x2]T
19
Illustration – Data4: Environmental Data

x = [x1 x2 x3 x4]T

20
Supervised and Unsupervised
Learning
Supervised Learning

• Learning under the supervision

– Student learning from teacher
– Child learning to recognize objects/animals
• In the context of machine learning, data used for
learning (Train data) is labeled
• Labeled data: Data for which the target value is
already known

22
Labeled Data – Illustration:
Data1 - Representing a Person
• A person is represented using two
attributes: • Class (y):
– Height – Child (0)
– Weight – Adult (1)

Weight
in Kg
(x2)

Height in cm (x1)

x = [x1 x2]T
23
Labeled Data – Illustration:
Data2 - Iris (Flower) Data
• Class (y):
– Iris Setosa (1)
– Iris Versicolour (2)
– Iris Virginica (3)

x = [x1 x2 x3 x4]T

24
Labeled Data – Illustration:
Data3 - Years of Experience and Salary
Years of • Class – Raise in Salary (y):
Salary (in Raise
experienc
Rs 1000) – Yes(1)
e
(x2) (y) – No (0)
(x1)
3 30 1
8 57 0
9 64 1
13 72 1
3 36 1
6 43 0
11 59 1
21 90 1
1 20 0
16 83 0
x = [x1 x2]T
25
Labeled Data – Illustration:
Data3 - Years of Experience and Salary

Years of Salary (in • Input variable: Years of experience

experienc Rs 1000)
e (y)
• Output variable: Salary
(x)
3 30
8 57
9 64
13 72
3 36
6 43
11 59
21 90
1 20
16 83

26
Illustration – Data4: Environmental Data

• Predicting Rain (target

attribute) based on
Temperature, Humidity and
Pressure
• Input variable: Temperature,
Humidity and Pressure
• Output variable: Rain

27
Supervised Learning
• In supervised learning, each example (data sample) is
a pair consisting of an input example (typically a
vector) and a desired output value (also called
the target)
• Task of learning a function that maps an input to an
output based on example input-output pairs

• A supervised learning algorithm

– analyzes the training data and
– produces an inferred function, which can be used for
predicting the output of a new examples
• One of the scenario will be the algorithm to determine
the class labels for unseen instances
Class
Height, x1 Adult/Child Adult :Class C1 (1)
Classifier Child :Class C2 (0)
Weight, x2
28
Supervised Learning
• Supervised learning is grouped into
– Classification
– Regression
• Classification:
– Output variable is categorical
– Categorical label prediction
– Example:
• Predicting a person as adult or child (2-class)
• Predicting the raise in salary based on the year of
experience and salary (2-class)
• Identify an email as spam or not (2-class)
• Predicting the presence or absence of disease (2-class)
• Categorising the disease according to symptoms (Multi-
class)
• Categorizing the Iris flowers (Multi-class)

29
Supervised Learning

• Supervised learning is grouped into

– Classification
– Regression
• Regression:
– Output variable is real or continuous value
– Numeric prediction
– Example:
• predicting the salary based on the experience
• predicting the amount of rainfall based on atmospheric
temperature, humidity, pressure, amount of sunlight etc.

30
Unsupervised Learning
• Learning without a supervision
• In the context of machine learning, data used for
learning (Train data) is unlabeled
• Given these unlabeled data machine tries to identify
the pattern and give the response
• Example:
– A person is
represented using
two attributes: Height Weight
and Weight in Kg
– No label is given (x2)

– Machine try to learn

the patterns from the
given set and groups
them based on the Height in cm (x1)
similarity
31
Summary

• Machine learning: Learning from data

• Supervised machine learning
– Data used for learning (Train data) is labeled
– Each example (data sample) is a pair consisting of an
input example (typically a vector) and a desired output
value (also called the target)
– Task of learning a function that maps an input to an
output based on example input-output pairs
– Classification and Regression
• Unsupervised machine learning
– Data used for learning (Train data) is unlabeled
– Given these unlabeled data machine tries to identify the
pattern based on similarity
– Clustering and Association

32
Unsupervised Learning
• Unsupervised learning is grouped into
– Clustering
– Association
• Clustering:
– Partitioning the data into cohesive groups such that the
data samples in a group are similar
– Example:
• Grouping the persons based on their height and weight
• Given the customer and their purchase data:
– Grouping the customers based on the similar products
purchased

• Association:
– It is a rule-based machine learning to discover the
interesting variables in a data set
– Example:
• Given the customer and their purchase data:
– Finding the products purchased together 33
Text Books

1. J. Han and M. Kamber, Data Mining: Concepts and

Techniques, Third Edition, Morgan Kaufmann Publishers,
2011.

2. S. Theodoridis and K. Koutroumbas, Pattern Recognition,

Academic Press, 2009.

3. C. M. Bishop, Pattern Recognition and Machine Learning,

Springer, 2006.

Property File Listing July 2018
No ratings yet
Property File Listing July 2018
6,804 pages
Passport Stats 15-04-2023 0939 GMT Softdrinks
No ratings yet
Passport Stats 15-04-2023 0939 GMT Softdrinks
1 page
Software Multiples Normalizing at 6x NTM Revenue
No ratings yet
Software Multiples Normalizing at 6x NTM Revenue
16 pages
Introduction To Machine Learning
100% (1)
Introduction To Machine Learning
119 pages
5 Year Procurement Projection 30032023
No ratings yet
5 Year Procurement Projection 30032023
26 pages
Unit I 2
No ratings yet
Unit I 2
78 pages
21CSC305P ML - Unit 1-E
No ratings yet
21CSC305P ML - Unit 1-E
137 pages
An Introduction To Machine Learning
No ratings yet
An Introduction To Machine Learning
136 pages
Donalek Classif
No ratings yet
Donalek Classif
69 pages
Clustering Partitioning-Hierarchical-DensityBased
No ratings yet
Clustering Partitioning-Hierarchical-DensityBased
87 pages
Class11-PatternClassification KNN
No ratings yet
Class11-PatternClassification KNN
28 pages
Lab4 Linking
No ratings yet
Lab4 Linking
3 pages
CAT Bootcamp
No ratings yet
CAT Bootcamp
8 pages
Admin Resume Sample
100% (1)
Admin Resume Sample
6 pages
Altai Access Controller Catalog Eng 160815 1
No ratings yet
Altai Access Controller Catalog Eng 160815 1
2 pages
Presentation On vCloudPoint Solution by TechG Infotech
No ratings yet
Presentation On vCloudPoint Solution by TechG Infotech
31 pages
Class12-PatternClassification PerformanceMetric ReferenceTemplate
No ratings yet
Class12-PatternClassification PerformanceMetric ReferenceTemplate
33 pages
Topic3 Limit Continuity
No ratings yet
Topic3 Limit Continuity
9 pages
025 QHSEC SOP Manual Handling
No ratings yet
025 QHSEC SOP Manual Handling
4 pages
Introduction To Machine Learning: Jaime S. Cardoso
100% (1)
Introduction To Machine Learning: Jaime S. Cardoso
52 pages
Usermanual Em6400.v01
No ratings yet
Usermanual Em6400.v01
81 pages
01 - Introduction To Machine Learning
No ratings yet
01 - Introduction To Machine Learning
14 pages
AI Chapter 3 Part 1
No ratings yet
AI Chapter 3 Part 1
33 pages
Mlintro 4
No ratings yet
Mlintro 4
28 pages
UNit 1 Introduction To ML
No ratings yet
UNit 1 Introduction To ML
225 pages
IntroClassificationDA 2024
No ratings yet
IntroClassificationDA 2024
129 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
89 pages
Machine Learning
No ratings yet
Machine Learning
28 pages
Chapter-1 ML Intro
No ratings yet
Chapter-1 ML Intro
36 pages
Class10 13 PatternClassification 06 13oct2020
No ratings yet
Class10 13 PatternClassification 06 13oct2020
47 pages
CH 4
No ratings yet
CH 4
106 pages
Topic1-Natural Number System
No ratings yet
Topic1-Natural Number System
11 pages
Class10 14 PatternClassification - 13 24sept2019
No ratings yet
Class10 14 PatternClassification - 13 24sept2019
50 pages
1 ML M1503-Introduction - ABP
No ratings yet
1 ML M1503-Introduction - ABP
14 pages
Chapter 4 - Machine Learning
No ratings yet
Chapter 4 - Machine Learning
81 pages
Unit 1 Machine Learning
No ratings yet
Unit 1 Machine Learning
68 pages
Tutorial 1
No ratings yet
Tutorial 1
8 pages
Research Trends in Machine Learning: Muhammad Kashif Hanif
No ratings yet
Research Trends in Machine Learning: Muhammad Kashif Hanif
80 pages
Wa0007
No ratings yet
Wa0007
48 pages
Dummy Tables For QOC Assessment 1st Draft
No ratings yet
Dummy Tables For QOC Assessment 1st Draft
33 pages
Machine
No ratings yet
Machine
61 pages
Glenndal 2
No ratings yet
Glenndal 2
7 pages
Chp2-Binary Numbers and Codes (15.1.09)
No ratings yet
Chp2-Binary Numbers and Codes (15.1.09)
16 pages
C 3 Kernel 3 v2.11 June 2023
No ratings yet
C 3 Kernel 3 v2.11 June 2023
150 pages
USE Modals: Reading of Academic Texts in English II Modal Verbs
No ratings yet
USE Modals: Reading of Academic Texts in English II Modal Verbs
1 page
11 04 2019 Asea P1
No ratings yet
11 04 2019 Asea P1
40 pages
Ballistic Limit Evaluation For Impact of Pistol Projectile 9 MM Luger On Aircraft Skin Metal Plate
No ratings yet
Ballistic Limit Evaluation For Impact of Pistol Projectile 9 MM Luger On Aircraft Skin Metal Plate
10 pages
Bilal Ahmed Shaik Data Mining
No ratings yet
Bilal Ahmed Shaik Data Mining
88 pages
MLUnit - 1 Share
No ratings yet
MLUnit - 1 Share
162 pages
Unit 3
No ratings yet
Unit 3
33 pages
ML Unit-1
No ratings yet
ML Unit-1
28 pages
Bid Form
No ratings yet
Bid Form
4 pages
Machine Learning and Applications (5L)
No ratings yet
Machine Learning and Applications (5L)
185 pages
Supervised Machine Learning
No ratings yet
Supervised Machine Learning
20 pages
Directive Principles of State Policy
No ratings yet
Directive Principles of State Policy
2 pages
Hoàng Nguyễn Duy Anh- 11230008
No ratings yet
Hoàng Nguyễn Duy Anh- 11230008
10 pages
Conceptual Framework: E-Commerce Capabilities Organization Performance
No ratings yet
Conceptual Framework: E-Commerce Capabilities Organization Performance
4 pages
Introduction To ML
No ratings yet
Introduction To ML
31 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
20 pages
Impact of Bonus Issue On Market Price
No ratings yet
Impact of Bonus Issue On Market Price
70 pages
Lecture 2
No ratings yet
Lecture 2
36 pages
3 DM Classification
No ratings yet
3 DM Classification
55 pages
University Institute of Engineering Department of Computer Science and Engg
No ratings yet
University Institute of Engineering Department of Computer Science and Engg
27 pages
Unit-1 ML
No ratings yet
Unit-1 ML
19 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
24 pages
Unit Iii Classification
No ratings yet
Unit Iii Classification
57 pages
Continuous Quality Improvement Through Post-Occupancy Evaluation Feedback
No ratings yet
Continuous Quality Improvement Through Post-Occupancy Evaluation Feedback
15 pages
Lecture 01 Introducing ML 13102022 031101pm
No ratings yet
Lecture 01 Introducing ML 13102022 031101pm
36 pages
Introduction To Machine Learning
No ratings yet
Introduction To Machine Learning
24 pages
Lect3 Machine Learning
No ratings yet
Lect3 Machine Learning
27 pages
BRCGS Food Safety
No ratings yet
BRCGS Food Safety
3 pages
MAchine Learning Notes
No ratings yet
MAchine Learning Notes
6 pages
What Is NumPy
No ratings yet
What Is NumPy
5 pages
Machine Learning Tutorial PDF
No ratings yet
Machine Learning Tutorial PDF
56 pages
Machine Learning INTRO
No ratings yet
Machine Learning INTRO
12 pages
ML Unit 1
No ratings yet
ML Unit 1
9 pages
ML - Part - A
No ratings yet
ML - Part - A
10 pages
Lecture Notes 1 2 Intro Python
No ratings yet
Lecture Notes 1 2 Intro Python
13 pages
ML Lecture Notes Unit-1
No ratings yet
ML Lecture Notes Unit-1
45 pages
Lecture 1
No ratings yet
Lecture 1
36 pages
Unit 1 Notes
No ratings yet
Unit 1 Notes
20 pages
Boycott List of Israel Items
No ratings yet
Boycott List of Israel Items
3 pages
Module2 ch2
No ratings yet
Module2 ch2
36 pages
USA PCC Form Pages 2
No ratings yet
USA PCC Form Pages 2
1 page
Lecture 2 Unit 1
No ratings yet
Lecture 2 Unit 1
60 pages
What Is Machine Learning
No ratings yet
What Is Machine Learning
4 pages
Unit-1 MLT
No ratings yet
Unit-1 MLT
51 pages
Unit III - I
No ratings yet
Unit III - I
15 pages
ABB Azipod Brochure Lores
No ratings yet
ABB Azipod Brochure Lores
8 pages
Machine Learning Part: Domain Overview
No ratings yet
Machine Learning Part: Domain Overview
20 pages
Unit I MACHINE LEARNING
No ratings yet
Unit I MACHINE LEARNING
87 pages
Curriculum Reform
No ratings yet
Curriculum Reform
27 pages
105 Machine Learning Paper
No ratings yet
105 Machine Learning Paper
6 pages
Introduction to Robotics
From Everand
Introduction to Robotics
Swarnalata Verma
No ratings yet
The Secret Of Machine Learning
From Everand
The Secret Of Machine Learning
Mhd Arjunanta
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Class10-Introduction To ML

Uploaded by

Class10-Introduction To ML

Uploaded by

Introduction to Machine

Learning: Data Modeling

Data Modeling Inference

Data Modeling Inference

Data Modeling Inference

• Pattern: Any regularity or structure in data or source of

Sunsets & Sunrises

• Learning under the supervision

Years of Salary (in • Input variable: Years of experience

• Predicting Rain (target

• A supervised learning algorithm

• Supervised learning is grouped into

– Machine try to learn

• Machine learning: Learning from data

1. J. Han and M. Kamber, Data Mining: Concepts and

2. S. Theodoridis and K. Koutroumbas, Pattern Recognition,

3. C. M. Bishop, Pattern Recognition and Machine Learning,

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.