w11 ML Security
Lecture 11
Security and Privacy in ML Systems
Elias Athanasopoulos
athanasopoulos.elias@ucy.ac.cy
Outline
• Overview of Machine Learning (ML)
• Security definitions and goals
– Derive a threat model
• Training-time attacks
• Test-time (inference) attacks
• Defences
2
What is Machine Learning?
• We use computers to run algorithms for solving
common problems
– Sorting, searching, graph traversal, etc.
• Some problems cannot be solved by an algorithm
– These problems do have solutions
– They are not necessarily hard problems for which no fast algorithm is known (e.g., factoring a large integer)
– E.g., classifying a face photo as showing long or short hair
• Such problems can be effectively solved by a model
– If we learn the behaviour of a large amount of data, then we can solve new instances of the same problem
3
ML problems
• Not all problems are suitable to be solved by ML
• There is a vast number of problems with efficient ML solutions
– Pattern matching (OCR), image classification, anomaly
detection, language translation
• Candidate problems are those where past data can predict the behaviour of new data
– Essentially, the model learns from past data in order to decide about new data
– If you give the model 1 million photos of long hair and 1 million photos of short hair, it can learn to distinguish long from short hair in new photos
4
ML types
• Supervised learning
– Labeled inputs with corresponding outputs
– Map new (unseen) inputs to known outputs
– Classification (of input to categories) or regression (of a value to
a certain range)
– Object recognition in images, spam filtering, etc.
• Unsupervised learning
– Inputs are unlabeled
– Clustering of inputs according to common properties
• Reinforcement learning
– Data is sequences of actions, observations, and rewards
– The goal is to produce a policy for acting in the environment
– Winning a video game
5
ML training
• A model is a function that takes an input and some parameters and outputs a prediction for some property of interest
– The input is usually a vector of values (features)
– There are many candidate functions; learning means finding the parameters that define the one the model will use (see the sketch below)
• Once the function is learned, we can test it on new inputs to validate the performance of the model
– For supervised learning, we can hold out some samples from the training set and use them as a testing set
6
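As a concrete illustration of "finding the parameters" (a sketch that is not part of the original slides), the code below fits a tiny logistic-regression model with gradient descent. The data, learning rate, and iteration count are made up, and NumPy is assumed to be available.

```python
import numpy as np

# Toy training set: 2-D feature vectors with binary labels (made-up data).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(-1.0, 1.0, (50, 2)), rng.normal(+1.0, 1.0, (50, 2))])
y = np.concatenate([np.zeros(50), np.ones(50)])

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# The "function" is a logistic regression: f(x) = sigmoid(w . x + b).
# Learning = searching for the parameter values (w, b) that minimize the loss.
w, b, lr = np.zeros(2), 0.0, 0.1
for _ in range(500):
    p = sigmoid(X @ w + b)            # current predictions
    grad_w = X.T @ (p - y) / len(y)   # gradient of the cross-entropy loss
    grad_b = np.mean(p - y)
    w -= lr * grad_w
    b -= lr * grad_b

# Validate the learned function (for brevity, on the training data itself).
accuracy = np.mean((sigmoid(X @ w + b) > 0.5) == y)
print("parameters:", w, b, "accuracy:", accuracy)
```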
ML inference
• The model is deployed to infer predictions on
inputs unseen during training
– The values of parameters are fixed, and the model
can compute outputs based on new inputs
• The model prediction may take different forms
– The most common for classification is a vector assigning a probability to each class of the problem, which characterizes how likely the input is to belong to that class (see the sketch below)
7
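A minimal sketch of the probability-vector output described above, assuming a linear three-class model whose (made-up) parameters are already fixed; only the form of the output is the point here.

```python
import numpy as np

def softmax(z):
    z = z - np.max(z)          # for numerical stability
    e = np.exp(z)
    return e / e.sum()

# Fixed, illustrative parameters of a 3-class linear model over 4 features.
W = np.array([[ 0.2, -0.1,  0.5, 0.0],
              [ 0.1,  0.4, -0.3, 0.2],
              [-0.3,  0.2,  0.1, 0.1]])
b = np.array([0.0, 0.1, -0.1])

x = np.array([1.0, 0.5, -0.2, 0.3])   # new, unseen input
probs = softmax(W @ x + b)            # one probability per class
print("class probabilities:", probs, "prediction:", int(np.argmax(probs)))
```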
Threat model
• When we discuss security in a specific context, we define an attacker with certain capabilities and goals
• The threat model also includes an attack surface
– What attacks can be mounted at each stage of the ML pipeline?
8
Attack surface
9
Trust model
• Data owners are the owners or trustees of the data/environment within which the system is deployed
– e.g., an IT organisation deploying a face recognition authentication service
• System providers, who construct the system and algorithms
– e.g., the authentication service software vendors
• Consumers of the service the system provides
– e.g., the enterprise user
• Outsiders, who may have explicit or incidental access to the systems, or may simply be able to influence the system inputs
– e.g., other users or adversaries within the enterprise
10
Adversarial goals
CIA + Privacy
• Confidentiality
– An adversary can extract information about the model
– We assume that the model is confidential or represents
intellectual property
• Privacy
– Models are trained on data that may contain sensitive
information
• Integrity
– Attacks that modify the output of the model
– E.g., cause a false positive in a face recognition system
• Availability
– Make the model inconsistent or unusable
– E.g., make an autonomous vehicle non-operational
11
Adversarial capabilities
12
Adversarial capabilities
Training
• Attempt to learn, influence, or corrupt the
model itself
– Usually facilitated by simply accessing a
summary, a portion, or all of the training data
– Can be done through explicit data breach or by
using collected data to train another model
• Alter the training data
– Insert adversarial inputs (injection), or alter the
training data directly (modification)
• Tamper with the learning algorithm
13
Training in adversarial setting
• During learning, the attacker can pollute the data by inserting, editing, or removing points
– The intent is to modify the decision boundaries of the trained model
– This is commonly called a poisoning attack
– Such attacks very frequently target classification tasks
• In such a scenario the model remains functional, but its predictions favour the attacker's goals
14
Targeting integrity
Label manipulation
• The adversary modifies part of the training sample
– It has been shown that modifying less than 10% can be enough to reduce the model's accuracy to about 90% (see the sketch below)
• Label manipulation
– A limited attack; it has been shown to work against binary classifiers (i.e., by swapping labels)
– It is hard to quantify how many label modifications are needed to reduce the classifier's accuracy, especially for multi-class classifiers
15
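The sketch below illustrates label manipulation on a synthetic binary task: a fraction of the training labels is flipped and the test accuracy of the resulting model is measured. scikit-learn is assumed to be available; the dataset, model, and flip fractions are arbitrary illustrative choices, not the experiments behind the figures on this slide.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Synthetic binary classification task.
X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

def accuracy_with_flipped_labels(flip_fraction):
    rng = np.random.default_rng(0)
    y_poisoned = y_tr.copy()
    n_flip = int(flip_fraction * len(y_poisoned))
    idx = rng.choice(len(y_poisoned), size=n_flip, replace=False)
    y_poisoned[idx] = 1 - y_poisoned[idx]          # swap the binary labels
    model = LogisticRegression(max_iter=1000).fit(X_tr, y_poisoned)
    return model.score(X_te, y_te)

for frac in (0.0, 0.05, 0.10, 0.25):
    acc = accuracy_with_flipped_labels(frac)
    print(f"flipped {frac:.0%} of labels -> test accuracy {acc:.3f}")
```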
Targeting integrity
Input manipulation
• Direct poisoning of the learning inputs
– When training inputs are used with simple metrics (e.g., a centroid model using Euclidean distances), the poisoned data can be derived by solving a linear programming problem (a simplified sketch follows this slide)
– The attacker's main goal is to find inputs that dramatically reduce the accuracy of the classifier
• Indirect poisoning of the learning inputs
– Adversaries have no access to the pre-processed data
– They instead create "data" that misleads the classifier, e.g., a noisy polymorphic worm
16
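To make direct poisoning concrete, here is a simplified sketch (the intuition only, not the linear-programming formulation): a centroid-based anomaly detector is gradually dragged towards an attacker-chosen point by injecting poison points that the detector still accepts. All numbers are made up for illustration.

```python
import numpy as np

# Simplified centroid detector: flag x as anomalous if it lies further than
# a fixed radius from the mean of the training data.
rng = np.random.default_rng(0)
benign = rng.normal(0.0, 1.0, (200, 2))       # legitimate training points
radius = 3.0
attack_point = np.array([5.0, 0.0])           # what the attacker wants accepted

def is_anomalous(x, data):
    return np.linalg.norm(x - data.mean(axis=0)) > radius

data = benign.copy()
print("before poisoning, attack point flagged:", is_anomalous(attack_point, data))

# Poisoning: repeatedly inject points that are just inside the current radius
# but shifted towards the attack point, dragging the centroid along with them.
for _ in range(300):
    centroid = data.mean(axis=0)
    direction = attack_point - centroid
    direction /= np.linalg.norm(direction)
    poison = centroid + 0.95 * radius * direction   # accepted by the detector
    data = np.vstack([data, poison])

print("after poisoning, attack point flagged:", is_anomalous(attack_point, data))
```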
Targeting privacy and
confidentiality
• During training, the confidentiality and privacy of the data and model are impacted not by the fact that ML is used, but by the extent of the adversary's access to the system hosting them
• This is a traditional access control problem,
which falls outside the scope of our discussion
17
Adversarial capabilities
Inference
• Attacks at inference time do not tamper with the
targeted model
– Drive the model to produce adversary-selected outputs (integrity)
– Collect evidence about the model characteristics
(confidentiality and privacy)
• White box attacks
– Adversary knows the model architecture, the model
parameters, training data, or a combination of these
• Black box attacks
– No knowledge about the model
– The adversary can only submit inputs and observe outputs
18
Inferring in adversarial setting
• Adversaries may also attack a deployed ML model at inference time
– An attacker may target an intrusion detection system whose rules were learned and then fixed
– The attacker is interested in evading detection at runtime
• White-box attackers have access to the model internals
– Architecture, parameters
• Black-box adversaries are limited to interacting with
the model as an oracle
– Submitting inputs and observing the model’s predictions
19
Inferring in adversarial setting
White-box adversaries
• Varying degrees of access to the model
– How? An ML model trained in a data centre can
be bundled in a mobile app
• Integrity
– The training process can no longer be modified, so the attacker can only perturb new inputs
20
Direct manipulation of model
inputs
• Create adversarial examples by modifying inputs so that the model misclassifies them
– The challenging part is computing the adversarial examples (see the sketch below)
• Why does it work?
– Models often extrapolate linearly from the limited subspace covered by the training data
– Algorithms can exploit this regularity to direct the search toward prospective adversarial regions
21
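A minimal white-box sketch of computing an adversarial example, using a fast-gradient-sign-style step against a logistic regression whose weights are made up for illustration. This is one standard crafting technique, shown here only to make "perturbing the input along the loss gradient" concrete.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# White-box setting: the attacker knows the (made-up) model parameters.
w = np.array([1.5, -2.0, 0.5, 1.0])
b = 0.1

x = np.array([0.5, -0.3, 0.2, 0.4])   # legitimate input, true class 1
y = 1

# Gradient of the cross-entropy loss with respect to the *input*.
p = sigmoid(w @ x + b)
grad_x = (p - y) * w

# Fast-gradient-sign-style step: a small perturbation that increases the loss.
epsilon = 0.5
x_adv = x + epsilon * np.sign(grad_x)

print("clean prediction      :", sigmoid(w @ x + b))       # ~0.88 -> class 1
print("adversarial prediction:", sigmoid(w @ x_adv + b))   # ~0.37 -> class 0
```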
Indirect manipulation of model
inputs
• So far we have assumed that the attacker can create an input and feed it directly to the model
– E.g., create a malware sample or a spam e-mail that causes the model to misclassify it
• Some ML systems operate in the physical world
– Robots navigate through obstacles
– Can we perturb such inputs (i.e., "physical objects")?
– It has been shown that adversarially perturbed images, printed and then photographed with a smartphone, are still misclassified when fed to an ML model
22
Example
23
Beyond classification
• Although research has focused on
classification, adversarial example algorithms
extend to other settings
– E.g., reinforcement learning: the adversary perturbs a frame in a video game to force the agent to take a wrong action
24
Privacy and confidentiality
• Confidentiality attacks in the white-box threat
model are trivial
– The adversary already has access to the model
parameters
• Targeting the privacy of data used in an ML system usually amounts to recovering information about the training data
– The simplest attack against data is a membership test, i.e., determining whether a particular input was used in the training dataset of a model (see the sketch below)
25
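The sketch below shows a very crude membership test based on prediction confidence: an overfitted model tends to be more confident on its training points than on unseen ones. scikit-learn is assumed; the model, threshold, and data are arbitrary, and published membership-inference attacks are considerably more sophisticated.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Target model that overfits its training set.
X, y = make_classification(n_samples=1000, n_features=20, random_state=0)
X_in, X_out, y_in, y_out = train_test_split(X, y, test_size=0.5, random_state=0)
target = RandomForestClassifier(n_estimators=50, random_state=0).fit(X_in, y_in)

def membership_guess(x, threshold=0.9):
    # Guess "member" when the model is very confident about its prediction.
    confidence = target.predict_proba(x.reshape(1, -1)).max()
    return confidence >= threshold

in_rate = np.mean([membership_guess(x) for x in X_in])
out_rate = np.mean([membership_guess(x) for x in X_out])
print(f"guessed as members: training points {in_rate:.2f} vs unseen points {out_rate:.2f}")
```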
Inferring in adversarial setting
Black-box adversaries
• Computing adversarial examples is feasible when the internals of the model are known
• What if the attacker can only interact with the model by sending queries and observing the results?
• In this setup, the ML model acts as an oracle
26
Black-box setting
Integrity
• We define a cost function for estimating the number of queries needed to make the ML model misclassify an input
• The cost is associated with the modifications needed to turn a legitimate input x into a malicious input x*
• The goal is to find the least amount of modification, i.e., to minimize the cost (see the sketch below)
27
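A minimal black-box sketch of reaching a misclassified input x* that stays close to x using only oracle queries: take any input the oracle labels differently and binary-search along the segment between the two, reporting the distance moved as the cost. scikit-learn is assumed, and the oracle, data, and search strategy are illustrative choices rather than a specific published attack.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

# The "oracle": a trained model the attacker can only query for labels.
X, y = make_classification(n_samples=500, n_features=10, random_state=1)
oracle = LogisticRegression(max_iter=1000).fit(X, y)

def query(v):
    return int(oracle.predict(v.reshape(1, -1))[0])

x = X[0]                    # legitimate input the attacker starts from
label = query(x)

# Find any input the oracle labels differently, then binary-search along the
# segment between the two inputs; each iteration costs one oracle query.
x_other = next(v for v in X if query(v) != label)
lo, hi = 0.0, 1.0
for _ in range(30):
    mid = (lo + hi) / 2
    if query(x + mid * (x_other - x)) == label:
        lo = mid
    else:
        hi = mid

x_star = x + hi * (x_other - x)      # just on the misclassified side
print("oracle label flipped:", query(x_star) != label)
print("cost (L2 distance from x to x*):", np.linalg.norm(x_star - x))
```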
Direct manipulation of model
inputs
• The modifications needed to transform a legitimate input into an adversarial one can be computed with several approaches
– Genetic algorithms, training another ML model, etc.
• Sometimes the ML model that acts as an oracle can give more information per answered query
– E.g., class probabilities instead of just the class label
• Adversarial example transferability
– Adversarial examples that are misclassified by one model are likely to be misclassified by another
– We can therefore train our own substitute ML model and craft adversarial inputs against it (see the sketch below)
28
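A sketch of the transferability idea: label some data by querying the black-box oracle, train a local substitute model on those labels, craft an adversarial example against the substitute (where gradients and weights are available), and check whether it also fools the oracle. scikit-learn is assumed; the models, data, and the deliberately large step size are illustrative choices.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.neural_network import MLPClassifier

# Black-box target ("oracle") that the attacker can only query.
X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
oracle = MLPClassifier(hidden_layer_sizes=(32,), max_iter=500,
                       random_state=0).fit(X, y)

# 1. Label the attacker's own data by querying the oracle, then train a
#    local substitute model on those labels.
X_sub = X[:500] + np.random.default_rng(0).normal(0.0, 0.1, X[:500].shape)
y_sub = oracle.predict(X_sub)
substitute = LogisticRegression(max_iter=1000).fit(X_sub, y_sub)

# 2. Craft an adversarial example against the *substitute*, whose weights we
#    know, with a fast-gradient-sign-style step (large step for illustration).
x = X[0]
w = substitute.coef_[0]
step = 1.5 * np.sign(w)
x_adv = x - step if substitute.predict([x])[0] == 1 else x + step

# 3. Check whether the adversarial example transfers to the black-box oracle.
print("oracle on clean input      :", oracle.predict([x])[0])
print("oracle on adversarial input:", oracle.predict([x_adv])[0])
```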
Privacy and confidentiality
• Membership attacks
– Testing whether a specific point was part of the training dataset
• Training data extraction
– Model inversion enables adversaries to extract training data from model predictions
– E.g., for a medicine dosage prediction task, access to the model plus information about a patient's stable medicine dosage can help recover the patient's genomic information
• Model extraction
– Extract the parameters of a model from observations of its predictions (see the sketch below)
– Some ML models are confidential (proprietary)
29
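A simplified model-extraction sketch for the special case where the confidential model is a logistic regression that returns class probabilities: each query yields one linear equation in the unknown parameters, so d+1 well-chosen queries recover the weights and bias. scikit-learn is assumed; this equation-solving style of extraction is only an illustration, not a general-purpose method.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

# Confidential target model; the attacker only observes predicted probabilities.
X, y = make_classification(n_samples=500, n_features=5, random_state=0)
target = LogisticRegression(max_iter=1000).fit(X, y)

def query_proba(v):
    return target.predict_proba(v.reshape(1, -1))[0, 1]   # P(class = 1)

# A logistic regression outputs sigmoid(w . x + b), so every query gives one
# linear equation logit(p) = w . x + b. Probing the origin and the d unit
# vectors yields d + 1 equations that determine w and b exactly.
d = X.shape[1]
probes = np.vstack([np.zeros(d), np.eye(d)])
probs = np.array([query_proba(q) for q in probes])
logits = np.log(probs / (1.0 - probs))

b_hat = logits[0]            # logit at the origin is just the bias
w_hat = logits[1:] - b_hat   # logit at unit vector e_i is w_i + b

print("weights recovered:", np.allclose(w_hat, target.coef_[0], atol=1e-6))
print("bias recovered   :", np.allclose(b_hat, target.intercept_[0], atol=1e-6))
```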
Defenses
• Defending against training-time attacks
– Several algorithms have been proposed to rule out poisoning samples, based on the fact that they typically lie outside the expected input distribution (see the sketch below)
– Some works propose obfuscation, which is not very attractive
• Defending against inference-time attacks
– Such attacks rely on the adversary being able to find small perturbations that lead to significant changes in the model's output
– Any defence that tampers with adversarial example crafting heuristics, but does not mitigate the underlying erroneous model predictions, can be evaded
• Defending against larger perturbations
– Defending against adversarial examples will almost certainly require improving the ability of models to be uncertain when predicting far from their training subspace
30
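The sketch below shows the intuition behind ruling out poisoning samples that lie outside the expected input distribution: points far from a robust estimate of the data's centre are discarded before training. The data, distance measure, and 95% threshold are arbitrary illustrative choices, not a specific published defence.

```python
import numpy as np

# Clean training data plus a small batch of attacker-injected points.
rng = np.random.default_rng(0)
clean = rng.normal(0.0, 1.0, (500, 2))
poison = rng.normal(6.0, 0.5, (25, 2))
X = np.vstack([clean, poison])

# Sanitisation: drop the points furthest from a robust centre estimate.
centre = np.median(X, axis=0)
dist = np.linalg.norm(X - centre, axis=1)
threshold = np.percentile(dist, 95)          # keep the 95% closest points
X_sanitised = X[dist <= threshold]

print("points removed by sanitisation:", len(X) - len(X_sanitised))
```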
Learning and inferring with
privacy
• The most promising defence so far is differential privacy
– Recall that differential privacy is based on adding noise
– This noise must somehow also be injected into the ML pipeline
• Training
– At training time, random noise may be injected into the data, into the cost minimized by the learning algorithm, or into the learned parameter values
– Noise reduces accuracy, which can be critical in certain applications
• Inference
– The model's behaviour may also be randomized at inference time by introducing noise into its predictions (see the sketch below)
– This degrades the accuracy of predictions, since the amount of noise introduced increases with the number of inference queries answered by the ML model
31
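A crude sketch of the inference-time idea only: Laplace noise is added to the class probabilities before they are returned to the client. The noise scale is arbitrary here, whereas a differentially private deployment would calibrate it to a privacy budget; scikit-learn and NumPy are assumed.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=1000, n_features=10, random_state=0)
model = LogisticRegression(max_iter=1000).fit(X, y)
rng = np.random.default_rng(0)

def noisy_predict_proba(x, scale=0.05):
    # Randomise the prediction by adding Laplace noise to the probabilities
    # (a crude output-perturbation scheme; the scale is not calibrated here).
    p = model.predict_proba(x.reshape(1, -1))[0]
    p = p + rng.laplace(0.0, scale, size=p.shape)
    p = np.clip(p, 1e-6, None)
    return p / p.sum()                  # renormalise to a probability vector

print("clean :", model.predict_proba(X[0].reshape(1, -1))[0])
print("noisy :", noisy_predict_proba(X[0]))
```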
Fairness and accountability in ML
• Models predict values based on data and in
some domains (e.g., banking, healthcare)
these predictions may be critical
• The European General Data Protection Regulation (GDPR) requires that companies provide an explanation of their predictions if they are made using sensitive or private data
32
Fairness
• Predictions from ML models should not be
biased or cause discrimination
– Training data can cause bias
– The learning algorithm can also cause bias
• Fairness can be connected with privacy
– Adversarial example algorithms can estimate how representative of a class a particular input is, which has led to the identification of racial biases in popular image datasets
33
Accountability
• Techniques used for accountability and
transparency are likely to yield improved
attack techniques because they increase the
adversary’s understanding of how the model’s
decisions are made
• However, they also contribute to a better understanding of the impact of training data on the model learned by the ML algorithm, which is beneficial to privacy-preserving ML
34
References
• SoK: Security and Privacy in Machine Learning.
Nicolas Papernot, Patrick McDaniel, Arunesh
Sinha, and Michael P. Wellman. In IEEE
European Symposium on Security and Privacy,
2018.
35