On The Safety of Machine Learning
Abstract
Machine learning algorithms increasingly influence our decisions and interact with us in all parts
of our daily lives. Therefore, just as we consider the safety of power plants, highways, and a variety
of other engineered socio-technical systems, we must also take into account the safety of systems
involving machine learning. Heretofore, the definition of safety has not been formalized in a machine
learning context. In this paper, we do so by defining machine learning safety in terms of risk, epistemic
uncertainty, and the harm incurred by unwanted outcomes. We then use this definition to examine
safety in a range of applications in cyber-physical systems, decision sciences, and data products. We
find that the foundational principle of modern statistical machine learning, empirical risk minimization,
is not always a sufficient objective. Finally, we discuss how four different categories of strategies for achieving safety in engineering, namely inherently safe design, safety reserves, safe fail, and procedural safeguards, can be mapped to a machine learning context. We then discuss example techniques that can
be adopted in each category, such as considering interpretability and causality of predictive models,
objective functions beyond expected prediction accuracy, human involvement for labeling difficult or
rare examples, and user experience design of software and open data.
I. INTRODUCTION
In recent years, machine learning algorithms have started influencing every part of our lives,
including health and wellness, law and order, commerce, entertainment, finance, human capital
management, communication, transportation, and philanthropy. As the algorithms, the data on
which they are trained, and the models they produce are getting more powerful and more
ingrained in society, questions about safety must be examined. It may be argued that machine
learning systems are simply tools, that they will soon have a general intelligence surpassing human abilities, or that they are something in between. But from all perspectives, they are technological
components of larger socio-technical systems that may have to be engineered with safety in
mind [1].
Safety is a commonly used term across engineering disciplines connoting the absence of
failures or conditions that render a system dangerous [2]. Safety is a notion that is domain-
specific, cf. safe food and water, safe vehicles and highways, safe medical treatments, safe
toys, safe neighborhoods, and safe industrial plants. Each of these domains has specific design
principles and regulations that are applicable only to them.
There are some loose notions of safety for machine learning, but they are primarily of the
“I know it when I see it” variety or are very application-specific; to the best of our knowledge
[3], there is no precise, non-application-specific, first-principles definition of safety for machine
learning. The main contribution of this paper is to provide exactly such a definition. To do so,
we build upon a universal domain-agnostic definition of safety in the engineering literature [4],
[5].
In [4], [5] and numerous references therein, Möller et al. propose a decision-theoretic defi-
nition of safety that applies to a broad set of domains and systems. They define safety to be the
reduction or minimization of risk and epistemic uncertainty associated with unwanted outcomes
that are severe enough to be seen as harmful. The key points in this definition are: i) the cost of
unwanted outcomes has to be sufficiently high in some human sense for events to be harmful,
and ii) safety involves reducing both the probability of expected harms and the possibility of
unexpected harms.
We define safety in machine learning in the same way, as the minimization of both risk and
uncertainty of harms, and devote Section II to fleshing out the details of this definition. As such,
formulations of machine learning for achieving safety that we describe in Section III must have
both risk and uncertainty minimization in their objective functions either explicitly, implicitly via
constraints, or through socio-technical components beyond the core machine learning algorithm.
The harmful cost regime is the part of the space that requires the dual objectives of risk
and uncertainty minimization; the non-harmful cost regime does not require the uncertainty
minimization objective.
As background before getting to those sections, we briefly describe harms, risk, and uncertainty
without specialization to machine learning. A system yields an outcome based on its state and
the inputs it receives. An outcome event may be desired or undesired. Single events and sets
of events have associated costs that can be measured and quantified by society. For example,
a numeric level of morbidity can be the cost of an outcome. An undesired outcome is only a
harm if its cost exceeds some threshold. Unwanted events of small severity are not counted as
safety issues. Risk is the expected value of the cost. Epistemic uncertainty results from the lack
of knowledge that could be obtained in principle, but may be practically intractable to gather
[6]. Harmful outcomes often occur in regimes and operating conditions that are unexpected
or undetermined. With risk, we do not know what the outcome will be, but its distribution is
known, and we can calculate the expectation of its cost. With uncertainty, we still do not know
what the outcome will be, but in contrast to risk, its probability distribution is also unknown
(or only partially known). Some decision theorists argue that all uncertainty can be captured
probabilistically, but we maintain the distinction between risk and uncertainty [5].
The first contribution of this work is to critically examine the foundational statistical machine
learning principles of empirical risk minimization and structural risk minimization [7] from the
perspective of safety. We discuss how they do not deal with epistemic uncertainty. Further, these
principles rely on arguments involving average losses and laws of large numbers, which may not
necessarily be fully applicable when considering safety. Moreover, the loss functions involved
in these principles are abstract measures of distance between true and predicted values rather
than application-specific quantities measuring the possibility of outcomes such as loss of life or
loss of quality of life that can be judged harmful or not [8].
A discussion of safety would be incomplete without a discussion of strategies to increase
the safety of socio-technical systems with machine learning components. Four categories of
approaches have been identified for promoting safety in general [4]: inherently safe design,
safety reserves, safe fail, and procedural safeguards. As a second contribution, we discuss these
approaches specifically for machine learning algorithms and especially to mitigate epistemic
uncertainty. Through this contribution, we can recommend strategies to engineer safer machine
learning methods and set an agenda for further machine learning safety research.
The third contribution of this paper is examining the definition of and strategies for safety
in specific machine learning applications. Today, machine learning technologies are used in a
variety of settings, including cyber-physical systems, decision sciences, and data products. By
cyber-physical systems, we mean engineered systems that integrate computational algorithms
and physical components, e.g. surgical robots, self-driving cars, and the smart grid [9]. By
decision sciences, we mean the use of algorithms to aid people in making important decisions
and informing strategy, e.g. prison parole, medical treatment, and loan approval [10]. By data
products, we mean the use of algorithms to automate informational products, e.g. web advertising
placement, media recommendation, and spam filtering [10]. These settings vary widely in terms
of their interaction with people, the scale of data, the time scale of operation and consequence,
and the cost magnitude of consequences. A further contribution is a discussion on how to even
understand and quantify the desirability and undesirability of outcomes along with their costs. To
complement simply eliciting such knowledge directly from people [11], we suggest a data-driven
approach for characterizing harms that are particularly relevant for cyber-physical systems with
large state spaces of outcomes.
Overall, the purpose of this paper is to introduce a common language and framework for
understanding, evaluating, and designing machine learning systems that involve society and
technology. Our goal is to set forth a fundamental organizing and unifying principle that carries
through to abstract theoretical formulations of machine learning as well as to concrete real-
world applications of machine learning. Thus it provides practitioners working at any level of
abstraction a principled way to reason about the space of socio-technical solutions.
The remainder of the paper is organized in the following manner. In Section II, after intro-
ducing the standard notation and concept of statistical machine learning, we discuss what harm,
risk, and epistemic uncertainty mean for machine learning. In Section III, we discuss specific
strategies for achieving safety in machine learning. Section IV dives into example applications
in cyber-physical systems, decision sciences, and data products. Section V concludes the paper.
II. SAFETY
In this section, after briefly introducing statistical machine learning notation, we examine how
machine learning applications fit with the conception of safety given above.
A. Notation
In what follows, we use standard notation to describe concepts from empirical risk min-
imization [7]. Given joint random variables X ∈ X (features) and Y ∈ Y (labels) with
probability density function f_{X,Y}(x, y), a function h ∈ H with h : X → Y, and a loss
function L : Y × Y → R, the risk R(h) is defined as the expected value of loss:
R(h) = E[L(h(X), Y )] = \int_{\mathcal{X}} \int_{\mathcal{Y}} L(h(x), y) f_{X,Y}(x, y) \, dy \, dx.
The loss function L typically measures the discrepancy between the value predicted for y
using h(x) and y itself, for example (h(x) − y)^2 in regression problems. We would like to learn
the function h that minimizes the risk.
In the machine learning context, we do not have access to the probability density f_{X,Y}, but rather to a training set of samples drawn i.i.d. from the joint distribution (X, Y ): {(x_1, y_1), . . . , (x_m, y_m)}, and the goal is to learn h such that the empirical risk R_m^{emp}(h) is minimized. The empirical risk is given by:

R_m^{emp}(h) = \frac{1}{m} \sum_{i=1}^{m} L(h(x_i), y_i).
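As a concrete illustration, the following minimal Python sketch (our own illustration; the one-parameter linear hypothesis class and the synthetic data are assumptions, not part of the formulation above) computes the empirical risk under squared loss and selects the empirical risk minimizer by a simple grid search.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic training data: y = 2x + noise (illustrative only).
m = 200
x = rng.uniform(-1.0, 1.0, size=m)
y = 2.0 * x + 0.1 * rng.standard_normal(m)

def squared_loss(y_pred, y_true):
    """Loss L(h(x), y) = (h(x) - y)^2."""
    return (y_pred - y_true) ** 2

def empirical_risk(w, x, y):
    """R_m^emp(h_w) = (1/m) * sum_i L(h_w(x_i), y_i) for h_w(x) = w * x."""
    return np.mean(squared_loss(w * x, y))

# Empirical risk minimization over the one-parameter hypothesis class h_w(x) = w * x.
candidate_w = np.linspace(-5.0, 5.0, 1001)
risks = np.array([empirical_risk(w, x, y) for w in candidate_w])
w_star = candidate_w[np.argmin(risks)]

print(f"ERM solution: w = {w_star:.3f}, empirical risk = {risks.min():.4f}")
```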
B. Harmful Costs
Analyzing safety requires us first to examine whether immediate human costs of outcomes
exceed some severity threshold to be harmful. Unlike other domains mentioned in the intro-
duction, such as safe industrial plants and safe toys, we have a great advantage when working
with machine learning systems because the optimization formulation explicitly includes the loss
function L. The domain of L is Y × Y and the output is an abstract quantity representing
prediction error. In real-world applications, the value of the loss function may be endowed with
some human cost and that human cost may imply a loss function that also includes X in the
domain. Moreover, the cost may be severe enough to be harmful and thus a safety issue in some
parts of the domain and not in others.
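As a hedged illustration of this point, the sketch below (the cost function, its multiplier, and the severity threshold are all hypothetical) contrasts the abstract loss with an application-specific human cost that depends on the input x and applies a severity threshold to decide which outcomes count as safety issues.

```python
import numpy as np

HARM_THRESHOLD = 10.0  # Illustrative severity threshold; in practice set by the domain.

def human_cost(x, y_pred, y_true):
    """Application-specific cost that depends on the input x as well as on the
    prediction error, unlike the abstract loss L(h(x), y). Purely illustrative."""
    base_error = abs(y_pred - y_true)
    # Hypothetical example: errors made for a vulnerable subpopulation (x[0] == 1)
    # carry a much larger human cost than the same numerical error elsewhere.
    severity_multiplier = 20.0 if x[0] == 1 else 1.0
    return severity_multiplier * base_error

def is_harmful(cost):
    """Only outcomes whose cost exceeds the severity threshold count as safety issues."""
    return cost > HARM_THRESHOLD

# Two outcomes with the same abstract loss |h(x) - y| = 1.0 land on opposite
# sides of the harm threshold because of where they fall in the feature space.
for x in (np.array([0.0, 3.2]), np.array([1.0, 3.2])):
    c = human_cost(x, y_pred=4.0, y_true=5.0)
    print(f"x[0]={x[0]:.0f}: cost={c:.1f}, harmful={is_harmful(c)}")
```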
In many decision science applications, undesired outcomes are truly harmful in a human
sense and their effect is felt in near-real time. They are safety issues. Moreover, the space of
outcomes is often binary or of small cardinality and it is often self-evident which outcomes are
undesired. However, loss functions are not always monotonic in the correctness of predictions and
depend on whose perspective is in the objective. The space of outcomes for the machine learning
components of typical cyber-physical systems applications is so vast that it is near-impossible
to enumerate all of the outcomes, let alone elicit costs for them. Nevertheless, it is clear that
outcomes leading to accidents have high human cost in real time and require the consideration
of safety. In order to get more nuanced characterizations of the cost severity of outcomes, a data-
driven approach is prudent [12]. The quality of service implications of unwanted outcomes in
data product applications are not typically safety hazards because they do not have an immediate
severe human cost. Undesired outcomes may only hypothetically lead to human consequences.
In practice, often the acceptable levels of safety and accident rates are defined by the society
and the application domain. For example, the difference in acceptable accident rates and costs
in motor vehicles (hundreds of thousands of fatalities per year) versus commercial aircraft (tens
of fatalities per year) shows the subjectivity of the public’s acceptance of safety [13].
The risk minimization approach to machine learning has many strengths, which is evident
by its successful application in various domains. We benefit from this explicit optimization
formulation in the machine learning domain by automatically reducing the probability of harms,
which is not always the case in other domains. However, this standard formulation does not
capture the issues related to the uncertainty that are also relevant for safety.
First, although it is assumed that the training samples {(x_1, y_1), . . . , (x_m, y_m)} are drawn from
the true underlying probability distribution of (X, Y ), that may not always be the case. Further,
it may be that the distribution the samples actually come from cannot be known, precluding
the use of covariate shift [14] and domain adaptation techniques [15]. This is one form of
epistemic uncertainty that is quite relevant to safety because training on a dataset from a different
distribution can cause much harm.
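One common diagnostic for this kind of mismatch, consistent with but not prescribed by the discussion above, is to train a classifier to distinguish training features from operational features; the sketch below (synthetic data with an artificial shift) flags a possible mismatch when the discriminator performs well above chance.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)

# Hypothetical feature matrices: the training data and the data seen in operation.
X_train = rng.normal(loc=0.0, scale=1.0, size=(1000, 5))
X_operational = rng.normal(loc=0.5, scale=1.0, size=(1000, 5))  # shifted distribution

# Label each row by its origin and try to tell the two sources apart.
X_all = np.vstack([X_train, X_operational])
origin = np.concatenate([np.zeros(len(X_train)), np.ones(len(X_operational))])

X_fit, X_eval, o_fit, o_eval = train_test_split(X_all, origin, test_size=0.5, random_state=0)
clf = LogisticRegression(max_iter=1000).fit(X_fit, o_fit)
auc = roc_auc_score(o_eval, clf.predict_proba(X_eval)[:, 1])

# AUC near 0.5 suggests the two samples are indistinguishable; AUC well above 0.5
# is evidence that the training set does not represent the operational distribution.
print(f"Train-vs-operational discriminator AUC: {auc:.3f}")
```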
Also, it may be that the training samples do come from the true, but unknown, underlying
distribution, but are absent from large parts of the X × Y space due to small probability density
there. Here the learned function h will be completely dependent on an inductive bias encoded
through H rather than the uncertain true distribution, which could introduce a safety hazard.
The statistical learning theory analysis utilizes laws of large numbers to study the effect of finite training data and the convergence of R_m^{emp}(h) to R(h). However, when considering safety,
we should also be cognizant that in practice, a machine learning system only encounters a finite
number of test samples and the actual operational risk is an empirical quantity on the test set.
Thus the operational risk may be much larger than the actual risk for small cardinality test sets,
even if h is risk-optimal. This uncertainty caused by the instantiation of the test set can have
large safety implications on individual test samples.
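The simulation sketch below (a toy 0-1 loss setting of our own construction) illustrates this: even for a predictor whose true risk is fixed at 5%, the operational risk realized on a small test set can be several times larger.

```python
import numpy as np

rng = np.random.default_rng(0)

# A classifier whose true (population) risk under 0-1 loss is 5%.
TRUE_RISK = 0.05

def operational_risk(n_test):
    """Empirical 0-1 risk on one realized test set of size n_test."""
    errors = rng.random(n_test) < TRUE_RISK
    return errors.mean()

for n_test in (10, 100, 10_000):
    risks = np.array([operational_risk(n_test) for _ in range(10_000)])
    worst = np.quantile(risks, 0.99)
    print(f"n_test={n_test:>6}: 99th-percentile operational risk = {worst:.3f} "
          f"(true risk = {TRUE_RISK})")
```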
Applications operating at scale, with large training sets, large testing sets, and the ability to explore the feature space, have little epistemic uncertainty. In other applications, more often than not, there is uncertainty about whether the training samples are representative of the testing samples, and only a few predictions are made. Moreover, in applications such
as cyber-physical systems, very large outcome spaces prevent even mild coverage of the space
through training samples.
III. STRATEGIES FOR ACHIEVING SAFETY
As discussed, safety and strategies for achieving it are often investigated on an application-by-
application basis. For example, setting the minimum thickness of vessels and removing flammable
materials from a chemical plant are ways of achieving safety. By analyzing such strategies across
domains, [4] has identified four main categories of approaches to achieve safety.
First, inherently safe design is the exclusion of a potential hazard from the system (instead
of controlling the hazard). For example, excluding hydrogen from the buoyant material of a
dirigible airship makes it safe. (Another possible safety measure would be to introduce apparatus
to prevent the hydrogen from igniting.)
A second strategy for achieving safety is through multiplicative or additive reserves, known
as safety factors and safety margins, respectively. In mechanical systems, a safety factor is a
ratio between the maximal load that does not lead to failure and the load for which the system
was designed. Similarly, the safety margin is the difference between the two.
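In symbols, with L_fail denoting the maximal load that does not lead to failure and L_design the design load,

\text{safety factor} = \frac{L_{\text{fail}}}{L_{\text{design}}}, \qquad \text{safety margin} = L_{\text{fail}} - L_{\text{design}}.

One hedged way to carry the same idea over to learning (our own reading, not a definition from [4]) is to require the risk under any plausible shifted test distribution P in an uncertainty set \mathcal{P} to stay within a reserve of the nominal risk, for example \sup_{P \in \mathcal{P}} R_P(h) \le \alpha \, R(h) for a safety factor \alpha > 1.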
The third general category of safety measures is ‘safe fail,’ which implies that a system remains
safe when it fails in its intended operation. Examples are electrical fuses, so-called dead man’s
switches on trains, and safety valves on boilers.
Finally, the fourth strategy for achieving safety is given the name procedural safeguards. This
strategy includes measures beyond ones designed into the core functionality of the system, such
as audits, training, posted warnings, and so on.
In this section, we discuss each of these strategies with specific approaches that extend machine
learning formulations beyond risk minimization for safety.
1) Inherently Safe Design: In the machine learning context, we would like robustness against
the uncertainty of the training set not being sampled from the test distribution. The training set
may have various biases that are unknown to the user and that will not be present during the test
phase or may contain patterns that are undesired and might lead to harmful outcomes. Modern
techniques such as extreme gradient boosting and deep neural networks may exploit these biases
and achieve high accuracy, but they may fail in making safe predictions due to unknown shifts
in the data domain or inferring incorrect patterns or harmful rules [16].
These models are so complex that it is very difficult to understand how they will react to
such shifts and whether they will produce harmful outcomes as a result. Two related ways to
introduce inherently safe design are by insisting on models that can be interpreted by people
and by excluding features that are not causally related to the outcome [17]–[20]. By examining
interpretable models, features or functions capturing quirks in the data can be noted and excluded,
thereby avoiding related harm. Similarly, by carefully selecting variables that are causally related
to the outcome, phenomena that are not a part of the true ‘physics’ of the system can be
excluded, and associated harm be avoided. We note that post hoc interpretation and repair of complex uninterpretable models, though appealing for other reasons, do not assure safety via inherently safe design because the interpretation is not the decision rule that is actually used in making predictions.
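A minimal sketch of these two steps appears below (synthetic data; the feature names and the choice of a shallow decision tree are illustrative assumptions, not a prescription): features not believed to be causally related to the outcome are excluded before fitting, and the model class is restricted to one whose decision rule can be read and audited directly.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier, export_text

rng = np.random.default_rng(0)

# Hypothetical training data: two features believed to be causal, plus one feature
# suspected to reflect a quirk of data collection rather than the true 'physics'.
n = 2000
causal = rng.normal(size=(n, 2))
quirk = rng.normal(size=(n, 1))
y = (causal[:, 0] + 0.5 * causal[:, 1] + 0.1 * rng.standard_normal(n) > 0).astype(int)

feature_names = ["causal_1", "causal_2", "quirk"]
X = np.hstack([causal, quirk])

# Inherently safe design, step 1: exclude features not believed to be causal.
keep = [0, 1]
X_safe = X[:, keep]

# Step 2: insist on a model simple enough to be read and audited by people.
tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X_safe, y)
print(export_text(tree, feature_names=[feature_names[i] for i in keep]))
```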
Neither interpretability nor causality of models is properly captured within the standard risk
minimization formulation of machine learning. Extra regularization or constraints on H, beyond
those implied by structural risk minimization, are needed to learn inherently safe models. That
might lead to a loss in accuracy when measured through standard metrics based on the training and testing data probability distributions, but safety will be enhanced by the reduction in
epistemic uncertainty and undesired bias. Both interpretability and causality may be incorporated
into a single learned model, e.g. [21], and causality may be used to induce interpretability,
e.g. [22]. In applications with very large outcome spaces such as those employing reinforcement
learning, it is shown that appropriate aggregation of states in outcome policies can lead to
the decision boundary is fairly meaningless and the typical trigger for the reject option should
be avoided [31]. For a rare combination of features in a test sample [32], a safe fail mechanism
is to always go for manual examination.
Both of these manual intervention options are suitable for applications with sufficiently long
time scales. When working on the scale of milliseconds, only options similar to dead man’s
switches that stop operations in a reasonable manner are applicable.
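The sketch below illustrates the rarity-triggered safe fail mechanism described above (the distance-to-training-set score, the 99th-percentile cutoff, and the synthetic data are illustrative assumptions): test samples with rare feature combinations are routed to manual examination rather than predicted automatically.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.neighbors import NearestNeighbors

rng = np.random.default_rng(0)

# Hypothetical training data and a simple classifier.
X_train = rng.normal(size=(1000, 4))
y_train = (X_train[:, 0] + X_train[:, 1] > 0).astype(int)
clf = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# Score rarity by average distance to the nearest training samples; samples rarer
# than almost all of the training data are deferred to a person.
nn = NearestNeighbors(n_neighbors=5).fit(X_train)
train_dist = nn.kneighbors(X_train)[0].mean(axis=1)
rarity_cutoff = np.quantile(train_dist, 0.99)  # illustrative threshold

def predict_or_defer(x):
    """Return a predicted label, or defer when the feature combination is rare."""
    x = x.reshape(1, -1)
    if nn.kneighbors(x)[0].mean() > rarity_cutoff:
        return "defer to manual examination"
    return int(clf.predict(x)[0])

print(predict_or_defer(np.array([1.5, 1.0, 0.0, 0.0])))   # typical sample
print(predict_or_defer(np.array([8.0, 8.0, 8.0, 8.0])))   # rare combination of features
```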
4) Procedural Safeguards: In addition to general procedural safeguards that carry over from
other domains, two directions in machine learning that can be used for increasing safety within
this category are user experience design and openness.
In decision science applications especially, non-specialists are often the operators of machine
learning systems. Defining the training data set and setting up evaluation procedures, among
other things, have certain subtleties that can cause harm during operation if done incorrectly.
User experience design can be used to guide and warn novice and experienced practitioners to
set up machine learning systems properly and thereby increase safety.
These days most modern machine learning algorithms are open source, which allows for the
possibility of public audit. Safety hazards and potential harms can be discovered through
examination of source code. However, open source software is not sufficient, because the behavior
of machine learning systems is driven by data as much as it is driven by software implementations
of algorithms. Open data refers to data that can be freely used, reused, and redistributed by
anyone. Opening data is a procedural safeguard for increasing safety that is increasingly being
adopted by the community [33]–[35].
IV. EXAMPLE APPLICATIONS
In this section, we further detail safety in machine learning systems by providing examples
from cyber-physical systems, decision sciences, and data products.
A. Cyber-Physical Systems
Cyber-physical systems must preserve safety while maintaining their expected performance. They continuously interact with the physical world and human operators in real time. In order to adapt to the
constantly changing and uncertain environment, they need to take into account not only the
current application but also the operator’s preferences, intent, and past behavior [36].
Autonomous machine learning and artificial intelligence techniques have been applied to
several decision-making and control problems in cyber-physical systems. Here we discuss two
examples where unexpected harmful events with epistemic uncertainty might impact human lives
in real-time.
1) Surgical Robots: Robotically-assisted surgical systems are a typical example of human-
in-the-loop cyber-physical systems. Surgical robots consist of a teleoperation console operated
by a surgeon, an embedded system hosting the automated robot control, and the physical
robotic actuators and sensors. The robot control system receives the surgeon’s commands issued
using the teleoperation console and translates the surgeon’s hand, wrist, and finger movements
into precisely engineered movements of miniaturized surgical instruments inside the patient's body.
Recent research shows an increasing interest in the use of machine learning algorithms for
modeling surgical skills, workflow, and environment and integration of this knowledge into
control and automation of surgical robots [37]. Machine learning techniques have been used
for detection and classification of surgical motions for automated surgical skill evaluation [38]–
[40] and for automating portions of repetitive and time-consuming surgical tasks (e.g., knot-tying, suturing) [40], [41].
In autonomous robotic surgery, a machine learning enabled surgical robot continuously esti-
mates the state of the environment (e.g., length or thickness of soft tissues under surgery) based
on the measurements from sensors (e.g., image data or force signals) and generates a plan for
executing actions (e.g., moving the robotic instruments along a trajectory). The mapping function
from the perception of the environment to the robotic actions is considered a surgical skill, which
the robot learns, through evaluation of its own actions or from observing the actions of expert
surgeons. The quality of the learned surgical skills can be assessed using cost functions that are
either automatically learned or are manually defined by surgeons [37].
Given the uncertainty and large variability in the operator actions and behavior, organ/tissue
movements and dynamics, and possibility of incidental failures in the robotic system and instru-
ments, predicting all possible system states and outcomes and assessing their associated costs
is very challenging. As mentioned in Section II-B, due to the very large outcome space, it
is not straightforward to elicit costs of all different outcomes and characterize which tasks or
actions are costly enough to represent safety issues. For example, there have been ongoing
reports of safety incidents during use of surgical robots that negatively impact patients by
causing procedure interruptions or minor injuries. These incidents happen despite existing safe
fail mechanisms included in the system and often result from a combination of different causal
factors and unexpected conditions, including malfunctions of surgical instruments, actions taken
by the surgeon, and the patient’s medical history [12].
There are also practical limitations in learning optimal and safe surgical trajectories and
workflows due to epistemic uncertainty in such environments. The training data often consists
of samples collected from a select set of surgical tasks (e.g., elementary suturing gestures)
performed by well-trained surgeons, which might not represent the variety of actions and tasks
performed during a real procedure. Previous work shows that surgeon’s expertise level, surgery
type, and medical history have a significant impact on the possibility of complications and errors
occurring during surgery. Further, automated algorithms should be able to cope with uncertainty
and unpredictable events and guarantee patient safety just as expert surgeons do in such scenarios
[37].
One solution for dealing with these uncertainties is to assess the robustness of the system
in the presence of unwanted and rare hazardous events (e.g., failures in control system, noisy
sensor measurements, or incorrect commands sent by novice operators) by simulating such events
in virtual environments [42] and quantifying the possibility of making safe decisions by the
learning algorithm. This approach is an example of procedural safeguards (Section III-4). Such
a simulated assessment also serves to highlight the situations requiring safe fail strategies, such as converting the procedure to non-robotic techniques, rescheduling it to a later time, or restarting the system, and can thereby help refine the system. The costs of unwanted outcomes and safe fail strategies
to cope with them can also be characterized based on past data. For example, we mined the
FDA’s Manufacturer and User Facility Device Experience (MAUDE) database, a large database
containing 14 years worth of adverse events, to obtain such characterizations on the causes and
severity of safety incidents and recovery actions taken by the surgical team. Such analysis helps
focus development of machine learning algorithms containing safety strategies on regimes with
harmful outcomes and avoid concern for safety strategies in regimes with non-harmful outcomes.
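The following sketch illustrates the simulation-based assessment procedure in generic form (a stand-in classifier and fault model, not the actual surgical robot setting): rare hazardous events such as noisy or dropped sensor channels are injected, and the fraction of decisions that remain unchanged is reported.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Stand-in for a learned perception/decision component; the real surgical setting
# is far richer, and only the assessment procedure itself is illustrated here.
X_train = rng.normal(size=(2000, 6))
y_train = (X_train[:, 0] - X_train[:, 1] > 0).astype(int)
model = LogisticRegression(max_iter=1000).fit(X_train, y_train)

def simulate_hazard(x, rng):
    """Inject a rare hazardous event into a sensor reading: heavy noise on one
    channel or a dropped (zeroed) channel. Both failure modes are illustrative."""
    x = x.copy()
    channel = rng.integers(x.shape[0])
    if rng.random() < 0.5:
        x[channel] += rng.normal(scale=5.0)   # noisy measurement
    else:
        x[channel] = 0.0                      # sensor dropout
    return x

# Robustness assessment: how often does the decision stay the same under injected faults?
X_eval = rng.normal(size=(500, 6))
nominal = model.predict(X_eval)
perturbed = np.array([model.predict(simulate_hazard(x, rng).reshape(1, -1))[0] for x in X_eval])
print(f"Fraction of decisions unchanged under simulated sensor faults: {np.mean(nominal == perturbed):.2f}")
```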
Another solution currently adopted in practice is through supervisory control of automated
surgical tasks instead of fully autonomous surgery. For example, if the robot generates a geo-
metrically optimized suture plan based on sensor data or surgeon input, it should still be tracked
and updated in real time because of possible tissue motion and deformation during surgery [41].
This is an example of examining interpretable models to avoid possible harm (as discussed
in Section III-1). An example of adopting safety reserves (Section III-2) in robotic surgery is
robust optimization of preoperative planning to minimize the uncertainty at the task level while
maximizing the dexterity [43].
2) Self-Driving Cars: Self-driving cars are autonomous cyber-physical systems capable of
making intelligent navigation decisions in real-time without any human input. They combine a
range of sensor data from laser range finders and radars with video and GPS data to generate a
detailed 3D map of the environment and estimate their position. The control system of the car uses
this information to determine the optimal path to the destination and sends the relevant commands
to actuators that control the steering, braking, and throttle. Machine learning algorithms are used
in the control system of self-driving cars to model, identify, and track the dynamic environment,
including the road conditions and moving objects (e.g., other cars and pedestrians).
Although automated driving systems are expected to eliminate human driver errors and reduce
the possibility of crashes, there are several sources of uncertainty and failure that might lead to
potential safety hazards in these systems. Unreliable or noisy sensor signals (e.g., GPS data or
video signals in bad weather conditions), limitations of computer vision systems, and unexpected
changes in the environment (e.g., unknown driving scenes or unexpected accidents on the road)
can adversely affect the ability of control system in learning and understanding the environment
and making safe decisions [44]. For example, a self-driving car (in auto-pilot mode) recently
collided with a truck after failing to apply the brakes, leading to the death of the car's driver. This was the first known fatality in over 130 million miles of testing the automated driving system. The accident occurred under an extremely rare combination of circumstances: the height of the truck, its white color against a bright sky, and the positioning of the vehicles across the road [45].
The importance of epistemic uncertainty or “uncertainty on uncertainty” in these AI-assisted
systems has been recently recognized, and there are ongoing research efforts towards quantifying
the robustness of self-driving cars to events that are rare (e.g., distance to a bicycle running on
an expected trajectory) or not present in the training data (e.g., unexpected trajectories of moving
objects) [46]. Systems that recognize such rare events trigger safe fail mechanisms.
To the best of our knowledge, there is no self-driving car system with an inherently safe
design that utilizes, e.g., interpretable models [47]. Fail-safe mechanisms that, upon detection of failures or low-confidence predictions, stop the autonomous control software and switch to a backup system or a degraded level of autonomy (e.g., full control by the driver) are being considered for self-driving cars [48].
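A minimal sketch of such a confidence-gated fallback pattern is shown below (the threshold, the notion of perception confidence, and the command names are placeholders, not a validated automotive design).

```python
def control_step(perception_confidence, actuator_ok, autonomous_command, fallback_command):
    """One step of a confidence-gated fallback: if the learned component reports low
    confidence or a failure is detected, hand control to the backup (or the driver)."""
    CONFIDENCE_FLOOR = 0.9  # illustrative threshold
    if not actuator_ok or perception_confidence < CONFIDENCE_FLOOR:
        return fallback_command  # degrade gracefully instead of acting on a doubtful prediction
    return autonomous_command

# A low-confidence perception output triggers the fallback command.
print(control_step(0.62, True, autonomous_command="steer_left", fallback_command="driver_takeover"))
print(control_step(0.97, True, autonomous_command="steer_left", fallback_command="driver_takeover"))
```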
B. Decision Sciences
In decision sciences applications, people are in the loop in a different way than in cyber-
physical systems, but in the loop nonetheless. Decisions are made about people and are made
by people using machine learning-based tools for support. Many emerging application domains
are now shifting to data-driven decision making due to a greater capture of information digitally
and the desire to be more scientific rather than relying on (fallible) gut instinct [49]. These
applications present many safety-related challenges.
1) Predicting Voluntary Resignation: We recently studied the problem of predicting which
IBM employees will voluntarily resign from the company in the next six months based on human
resources and compensation data, which required us to develop a classification algorithm to be
placed within a larger decision-making system involving human decision makers [50]. There are
several sources of epistemic uncertainty in this problem. First, the way to construct a training set
in the problem is to look at the historical set of employees and treat employees that voluntarily
resigned as positive samples and employees still in the workforce as negative samples. However,
since the prediction problem is to predict resignation in the next six months, our set of negative
samples will necessarily include employees who should be labeled positively because they will
be resigning soon [51].
Another uncertainty is related to quirks or vagaries in the data that are predictive but will
not generalize. In this problem, a few predictive features were related to stipulations in employees' contracts to remain with IBM for a fixed duration after their company was acquired, but such a
pattern would not remain true going forward. Another issue is unique feature vectors: if the data
contains an employee in Australia who has gone 17 years without being promoted and no other
similar employees, then there is huge uncertainty in that part of feature space, and inductive bias
must be completely relied upon.
In the solution created for this problem, the inherently safe design principle of interpretability
(Section III-1) was insisted upon and was what led to the discovery about the acquired company.
Specifically, C5.0 decision trees were used with the rule set option, and the project directly
motivated the study of an optimization approach for learning classification rules [52]. The reason
for conducting the project was to take actions such as salary increases to retain employees at risk
of resigning, and for this, the other inherently safe design principle of causality is important. Rare
samples such as the Australian employee led to the safe fail mechanism of manual inspection.
2) Loan Approval: As another example in the decision sciences that we have studied, let us
consider the decision to approve loans for solar panels given to the rural poor in India based on
data in application forms [53]. The epistemic uncertainty related to the training set not being
representative of the true test distribution repeats here and can be addressed by safety strategies similar to those discussed in the previous examples.
Loan approval is an example illustrating that loss functions are not always monotonic in
the correctness of predictions and depend on perspective. The applicant would like an approval
decision regardless of their features indicating ability to repay, the lender would like approval
only in cases in which applicant features indicate likely repayment, and society would like there
to be fairness or equitability in the system so that protected groups, such as defined by gender
and religion, are not discriminated against. The lender perspective is consistent with the typical
choice of the loss function, but the others are not.
An interesting additional issue, in this case, relates to the human cost function from society’s
perspective including X. One of the attributes available in the problem was the surname of the
applicant; in this part of India, the surname is a strong indicator of religion and caste. The
use of this variable as a feature improved classification accuracy by a couple of percentage
points, but resulted in worse fairness: the true cost in the problem from society’s perspective.
Simply dropping the attribute as a feature does not ensure fairness because other features may
be correlated, but a safety margin on the accuracy of the groups makes the system fairer.
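The sketch below illustrates the kind of check implied here (synthetic data; the correlated proxy feature and the tolerance are illustrative assumptions): the protected attribute is dropped, yet group membership remains partially recoverable through a correlated feature, so per-group accuracy is measured explicitly and required to stay within a safety margin.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)

# Hypothetical loan data: 'group' is a protected attribute (e.g., inferred from surname);
# one retained feature is correlated with it, so dropping 'group' alone is not enough.
n = 5000
group = rng.integers(0, 2, size=n)
correlated = group + 0.5 * rng.standard_normal(n)
income = rng.standard_normal(n)
y = (income + 0.3 * rng.standard_normal(n) > 0).astype(int)  # repayment label

X = np.column_stack([income, correlated])   # protected attribute itself excluded
clf = LogisticRegression(max_iter=1000).fit(X, y)
pred = clf.predict(X)

# Safety margin check on per-group accuracy: require the gap to stay below a bound.
ACCURACY_GAP_MARGIN = 0.02  # illustrative tolerance
acc = {g: np.mean(pred[group == g] == y[group == g]) for g in (0, 1)}
gap = abs(acc[0] - acc[1])
print(f"accuracy by group: {acc}, gap = {gap:.3f}, within margin: {gap <= ACCURACY_GAP_MARGIN}")
```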
C. Data Products
With data products applications, the first question to consider is whether immediate costs are
large enough for them to be considered safety issues. One may argue that an algorithm showing
biased or misguided advertisements or a spam filter not allowing an important email to pass could eventually lead to harm; e.g., by being shown an ad for a lower-paying job rather than a
higher-paying one, a person may hypothetically end up with a lower quality of life at some point
in the future. Here the cost function does depend on X because misclassifying certain emails
is more costly than others. However, we do not view such a delayed and only hypothetical
consequence as a safety issue.
Moreover, in typical data products applications, one can use billions of data points as train-
ing, perform large-scale A/B testing, and evaluate average performance on millions or billions
of clicks. Therefore, uncertainty is not at the forefront, and neither are the safety strategies.
For example, the procedural safeguard of opening data is more common in decision science
applications such as those sponsored or run by governments than in data products applications
where the data is often the key value proposition.
V. CONCLUSION
Machine learning systems are already embedded in many functions of society. The prognosis is
for broad adoption to only increase across all areas of life. With this prevailing trend, researchers,
engineers, and ethicists have started discussing the topic of safety in machine learning. In this
paper, we contribute to this discussion starting from a very basic definition of safety in terms of
harm, risk, and uncertainty and building upon it in the machine learning context. We identify that
the minimization of epistemic uncertainty is missing from standard modes of machine learning
developed around statistical risk minimization and that it needs to be included when considering
safety.
We discuss several strategies for increasing safety in machine learning; the list is not comprehensive and the strategies are far from fully developed. This paper can be seen as laying the foundations
for a research agenda motivated by safety within which further strategies can be developed and
existing strategies can be fleshed out. In some respects, the research community has taken risk
minimization close to the limits of what is achievable. Safety, especially epistemic uncertainty
minimization, represents a direction that offers new and exciting problems to pursue, many
of which are being pursued already. As it is said in the Sanskrit literature, ahiṃsā paramo dharmaḥ (non-harm is the ultimate direction). Moreover, not only is non-harm the first ethical
duty, many of the safety issues for machine learning we have discussed in this paper are starting
to enter legal obligations as well. For example, the European Union has recently adopted a set of
comprehensive regulations for data protection, which include prohibiting algorithms that make
any “decision based solely on automated processing, including profiling” and that significantly affect a data subject or produce legal effects concerning him or her. This regulation, which will take effect in 2018, is anticipated to restrict a wide range of machine learning algorithms currently used in,
e.g., recommendation systems, credit and insurance risk assessments, and social networks [54].
We present example applications where machine learning algorithms are increasingly used
and discuss the aspects of epistemic uncertainty, harmful outcomes, and potential strategies for
achieving safety for each application. In some applications such as cyber-physical systems and
decision sciences, machine learning algorithms are used to support control and decision making
in safety-critical settings with considerable costs and direct harmful impact on people’s lives,
such as injury or loss of life. In other applications, machine learning based predictions are only
used in less critical settings for automated informational products. Applications with higher costs
of unwanted outcomes tend also to be those with higher uncertainty, and the ones with less severe outcomes tend to be those with smaller uncertainty.
VI. ACKNOWLEDGEMENTS
REFERENCES
[1] A. Conn, “The AI wars: The battle of the human minds to keep artificial intelligence safe,”
http://futureoflife.org/2015/12/17/the-ai-wars-the-battle-of-the-human-minds-to-keep-artificial-intelligence-safe, Dec.
2015.
[2] T. Ferrell, “Engineering safety-critical systems in the 21st century,” 2010.
[3] K. R. Varshney, “Engineering safety in machine learning,” in Proc. Inf. Theory Appl. Workshop, La Jolla, CA, Feb. 2016.
[4] N. Möller and S. O. Hansson, “Principles of engineering safety: Risk and uncertainty reduction,” Reliab. Eng. Syst. Safe.,
vol. 93, no. 6, pp. 798–805, Jun. 2008.
[5] N. Möller, “The concepts of risk and safety,” in Handbook of Risk Theory, S. Roeser, R. Hillerbrand, P. Sandin, and
M. Peterson, Eds. Dordrecht, Netherlands: Springer, 2012, pp. 55–85.
[6] R. Senge, S. Bösner, K. Dembczynski, J. Haasenritter, O. Hirsch, N. Donner-Banzhoff, and E. Hüllermeier, “Reliable
classification: Learning classifiers that distinguish aleatoric and epistemic uncertainty,” Inf. Sci., vol. 255, pp. 16–29, Jan.
2014.
[7] V. Vapnik, “Principles of risk minimization for learning theory,” in Adv. Neur. Inf. Process. Syst. 4, 1992, pp. 831–838.
[8] K. L. Wagstaff, “Machine learning that matters,” in Proc. Int. Conf. Mach. Learn., Edinburgh, United Kingdom, Jun.–Jul.
2012, pp. 529–536.
[9] H. Alemzadeh, “Data-driven resiliency assessment of medical cyber-physical systems,” Ph.D. dissertation, Univ. Illinois,
Urbana-Champaign, Urbana, IL, 2016.
[10] J. Stanley and D. Tunkelang, “Doing data science right — your most common questions answered,”
http://firstround.com/review/doing-data-science-right-your-most-common-questions-answered, 2016.
[11] A. Olteanu, K. Talamadupula, and K. R. Varshney, “The limits of abstract evaluation metrics: The case of hate speech
detection,” in Proc. ACM Web Sci. Conf., Troy, NY, Jun. 2017, pp. 405–406.
[12] H. Alemzadeh, J. Raman, N. Leveson, Z. Kalbarczyk, and R. K. Iyer, “Adverse events in robotic surgery: A retrospective
study of 14 years of FDA data,” PLoS ONE, vol. 11, no. 4, pp. 1–20, Apr. 2016.
[13] J. Knight, Fundamentals of Dependable Computing for Software Engineers. CRC Press, 2012.
[14] H. Shimodaira, “Improving predictive inference under covariate shift by weighting the log-likelihood function,” Journal
of statistical planning and inference, vol. 90, no. 2, pp. 227–244, 2000.
[15] H. Daume III and D. Marcu, “Domain adaptation for statistical classifiers,” Journal of Artificial Intelligence Research,
vol. 26, pp. 101–126, 2006.
[16] R. Caruana, Y. Lou, J. Gehrke, P. Koch, M. Sturm, and N. Elhadad, “Intelligible models for healthcare: Predicting pneumonia
risk and hospital 30-day readmission,” in Proc. ACM SIGKDD Conf. Knowl. Discov. Data Min., Sydney, Australia, Aug.
2015, pp. 1721–1730.
[17] A. A. Freitas, “Comprehensible classification models – a position paper,” SIGKDD Explorations, vol. 15, no. 1, pp. 1–10,
Jun. 2013.
[18] C. Rudin, “Algorithms for interpretable machine learning,” in Proc. ACM SIGKDD Conf. Knowl. Discov. Data Min., New
York, NY, Aug. 2014, p. 1519.
[19] S. Athey and G. W. Imbens, “Machine learning methods for estimating heterogeneous causal effects,”
http://arxiv.org/pdf/1504.01132.pdf, Jul. 2015.
[20] M. Welling, “Are ML and statistics complementary?” in IMS-ISBA Meeting on ‘Data Science in the Next 50 Years’, Dec.
2015.
[21] F. Wang and C. Rudin, “Causal falling rule lists,” http://arxiv.org/pdf/1510.05189.pdf, Oct. 2015.
[22] A. Chakarov, A. Nori, S. Rajamani, S. Sen, and D. Vijaykeerthy, “Debugging machine learning tasks,”
http://arxiv.org/pdf/1603.07292.pdf, Mar. 2016.
[23] M. Petrik and R. Luss, “Interpretable policies for dynamic product recommendations,” in Proc. Conf. Uncertainty Artif.
Intell., Jersey City, NJ, Jun. 2016, p. 74.
[24] F. Provost and T. Fawcett, “Robust classification for imprecise environments,” Mach. Learn., vol. 42, no. 3, pp. 203–231,
Mar. 2001.
[25] M. A. Davenport, R. G. Baraniuk, and C. D. Scott, “Tuning support vector machines for minimax and Neyman-Pearson
classification,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 32, no. 10, pp. 1888–1898, Oct. 2010.
[26] S. Hajian and J. Domingo-Ferrer, “A methodology for direct and indirect discrimination prevention in data mining,” IEEE
Transactions on knowledge and data engineering, vol. 25, no. 7, pp. 1445–1459, Jul. 2013.
[27] M. Feldman, S. A. Friedler, J. Moeller, C. Scheidegger, and S. Venkatasubramanian, “Certifying and removing disparate
impact,” in Proc. ACM SIGKDD Conf. Knowl. Discov. Data Min., Sydney, Australia, Aug. 2015, pp. 259–268.
[28] S. Barocas and A. D. Selbst, “Big data’s disparate impact,” California Law Rev., vol. 104, 2016.
[29] The U.S. EEOC, “Uniform guidelines on employee selection procedures,” 1979.
[30] K. R. Varshney, R. J. Prenger, T. L. Marlatt, B. Y. Chen, and W. G. Hanley, “Practical ensemble classification error bounds
for different operating points,” IEEE Transactions on Knowledge and Data Engineering, vol. 25, no. 11, pp. 2590–2601,
Nov. 2013.
[31] J. Attenberg, P. Ipeirotis, and F. Provost, “Beat the machine: Challenging humans to find a predictive model’s “unknown
unknowns”,” ACM J. Data Inf. Qual., vol. 6, no. 1, p. 1, Mar. 2015.
[32] G. M. Weiss, “Mining with rarity: A unifying framework,” SIGKDD Explor. Newsletter, vol. 6, no. 1, pp. 7–19, Jun. 2004.
[33] A. Sahuguet, J. Krauss, L. Palacios, and D. Sangokoya, “Open civic data: Of the people, by the people, for the people,”
Bull. Tech. Comm. Data Eng., vol. 37, no. 4, pp. 15–26, Dec. 2014.
[34] E. Shaw, “Improving service and communication with open data: A history and how-to,” Ash Center, Harvard Kennedy
School, Tech. Rep., Jun. 2015.
[35] S. Kapoor, A. Mojsilović, J. N. Strattner, and K. R. Varshney, “From open data ecosystems to systems of innovation: A
journey to realize the promise of open data,” in Proc. Data for Good Exchange Conf., New York, NY, Sep. 2015.
[36] G. Schirner, D. Erdogmus, K. Chowdhury, and T. Padir, “The future of human-in-the-loop cyber-physical systems,”
Computer, no. 1, pp. 36–45, 2013.
[37] Y. Kassahun, B. Yu, A. T. Tibebu, D. Stoyanov, S. Giannarou, J. H. Metzen, and E. Vander Poorten, “Surgical robotics
beyond enhanced dexterity instrumentation: a survey of machine learning techniques and their role in intelligent and
autonomous surgical actions,” International Journal of Computer Assisted Radiology and Surgery, vol. 11, no. 4, pp.
553–568, 2016.
[38] H. C. Lin, I. Shafran, T. E. Murphy, A. M. Okamura, D. D. Yuh, and G. D. Hager, Automatic Detection and Segmentation
of Robot-Assisted Surgical Motions. Berlin, Heidelberg: Springer Berlin Heidelberg, 2005, pp. 802–810.
[39] H. C. Lin, I. Shafran, D. Yuh, and G. D. Hager, “Towards automatic skill evaluation: Detection and segmentation of
robot-assisted surgical motions,” Computer Aided Surgery, vol. 11, no. 5, pp. 220–230, 2006.
[40] C. E. Reiley, E. Plaku, and G. D. Hager, “Motion generation of robotic surgical tasks: Learning from expert demonstrations,”
in 2010 Annual International Conference of the IEEE Engineering in Medicine and Biology, Aug 2010, pp. 967–970.
[41] A. Shademan, R. S. Decker, J. D. Opfermann, S. Leonard, A. Krieger, and P. C. W. Kim, “Supervised autonomous robotic
soft tissue surgery,” Science Translational Medicine, vol. 8, no. 337, p. 337ra64, 2016.
[42] H. Alemzadeh, D. Chen, A. Lewis, Z. Kalbarczyk, J. Raman, N. Leveson, and R. K. Iyer, “Systems-theoretic safety
assessment of robotic telesurgical systems,” in Proc. Int. Conf. Comput. Safety Reliability Secur., 2015, pp. 213–227.
[43] H. Azimian, M. D. Naish, B. Kiaii, and R. V. Patel, “A chance-constrained programming approach to preoperative planning
of robotic cardiac surgery under task-level uncertainty,” IEEE Trans. Biomed. Health Inf., vol. 19, no. 2, pp. 612–1898,
Mar. 2015.
[44] S. Rayej, “How do self-driving cars work?” http://robohub.org/how-do-self-driving-cars-work/, 2014.
[45] J. Lowy, “Driver killed in self-driving car accident for first time,” http://www.pbs.org/newshour/rundown/driver-killed-in-
self-driving-car-accident-for-first-time, 2016.
[46] J. Duchi, P. Glynn, and R. Johari, “Uncertainty on uncertainty, robustness, and simulation,” SAIL-Toyota Center for AI
Research, Stanford University, Tech. Rep., Jan. 2016.
[47] Y. Zhu and V. Janapa Reddi, “Cognitive computing safety: The new horizon for reliability,” IEEE Micro, forthcoming.
[48] P. Koopman and M. Wagner, “Challenges in autonomous vehicle testing and validation,” SAE International Journal of
Transportation Safety, vol. 4, no. 2016-01-0128, pp. 15–24, 2016.
[49] E. Brynjolfsson, L. Hitt, and H. Kim, “Strength in numbers: How does data-driven decision-making affect firm
performance?” in Proc. Int. Conf. Inf. Syst., Shanghai, China, Dec. 2011, p. 13.
[50] M. Singh, K. R. Varshney, J. Wang, A. Mojsilović, A. R. Gill, P. I. Faur, and R. Ezry, “An analytics approach for proactively
combating voluntary attrition of employees,” in Proc. IEEE Int. Conf. Data Min. Workshops, Brussels, Belgium, Dec. 2012,
pp. 317–323.
[51] D. Wei and K. R. Varshney, “Robust binary hypothesis testing under contaminated likelihoods,” in Proc. IEEE Int. Conf.
Acoust. Speech Signal Process., Brisbane, Australia, Apr. 2015, pp. 3407–3411.
[52] D. M. Malioutov and K. R. Varshney, “Exact rule learning via Boolean compressed sensing,” in Proc. Int. Conf. Mach.
Learn., Atlanta, GA, Jun. 2013, pp. 765–773.
[53] H. Gerard, K. Rao, M. Simithraaratchy, K. R. Varshney, K. Kabra, and G. P. Needham, “Predictive modeling of customer
repayment for sustainable pay-as-you-go solar power in rural India,” in Proc. Data for Good Exchange Conf., New York,
NY, Sep. 2015.
[54] B. Goodman and S. Flaxman, “European Union regulations on algorithmic decision-making and a ’right to explanation’,”
in Proc. ICML Workshop Human Interpretability, New York, NY, Jun. 2016, pp. 26–30.