Artificial Intelligence: Learning and Limitations
Innovation Center
Fundação Dom Cabral
Avenida Princesa Diana, 760, Alphaville, Lagoa dos Ingleses, BRAZIL.
Received: June 5, 2020. Revised: July 17, 2020. Accepted: July 21, 2020. Published: July 22, 2020.
Abstract: Artificial Intelligence (AI) is a new technology with enormous potential to change the world as we know it.
It finds applications in many fields of human activity, including services, industry, education, social networks, and
transportation, among others. However, there is little discussion about the accuracy and reliability of this technology,
which has been used in situations where human life depends on its decision-making process. That process results from
training, one of the stages of development. The learning process of an AI system, which may be based on Artificial
Neural Networks, is known to produce an error between the predicted value and the real value. This error can compromise
the application and is most critical where the user's safety is a major issue. In this article, we discuss the main
technologies used in AI, their development history, considerations about Artificial Neural Networks, and the failures
arising from the training processes and hardware used. Three types of errors are discussed: Adversarial Examples, Soft
Errors, and Errors due to the Lack of Appropriate Training. A case study associated with the third type of error is
discussed, and actions based on Design of Experiments are proposed. The objective is to change the way AI models are
trained by adding rare conditions, improving their ability to forecast with greater accuracy in any situation.
Keywords: Artificial Intelligence; Artificial Neural Networks; Deep Learning; Machine Learning; Adversarial Examples;
Soft Errors.
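[Figure: timeline of AI and machine learning techniques, from the 1940's to 2017 (axis marks: 1940's, 1960's, 1960's-1970's, 1980's, late 1990's, 2006, 2009, 2012, 2014, 2015, 2016, 2017). Legible labels: K-Means clustering, Bayesian network, decision tree, non-linear SVM, machine learning, Deep Forest.]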
inaccurate information and their ability to generalize [12]. The ability of this technology to learn and process information classifies it as a form of Artificial Intelligence [13]. Its most notable feature is that it can be applied to a wide variety of problems, many of which are extremely complex or lack more sophisticated theoretical models [14]. Figure 3 illustrates a scheme of a biological neuron with its three major functional units: dendrite, cell body, and axon. The cell body has a nucleus that contains information about hereditary characteristics and a plasma that holds the molecular equipment used by the neuron to communicate with other neurons [15].

analogous to the biological neuron shown in Figure 3. It establishes a mapping between input activities (stimuli) and the output signal. The perceptron scheme presents n biological neurons, with signals of intensity x and synaptic strength w, feeding a neuron that has a minimum required stimulus value (threshold) of b, alongside the equivalent artificial neuron system. The Artificial Neural Network and the biological network both learn by adjusting the magnitude of the synaptic weights, or strengths [15].

2.4. Deep Learning and Machine Learning

In a simple definition, deep learning, a branch of machine learning, refers to the use of an artificial neural network with multiple layers of hidden nodes between input and output, as shown in Figure 5, where deep architectures are built from multiple levels of nonlinear operations [17].
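As a rough illustration of the two ideas described here, the sketch below implements a single perceptron-style neuron (inputs x, weights w, threshold b) and then stacks layers of nonlinear units between input and output. The function names, the step and tanh activations, and every numeric value are assumptions chosen for illustration; they are not taken from the article or from references [15] and [17].

```python
import math


def perceptron(x, w, b):
    """Fire (return 1) when the weighted stimulus reaches the threshold b."""
    stimulus = sum(xi * wi for xi, wi in zip(x, w))
    return 1 if stimulus >= b else 0


def dense_layer(x, weights, biases):
    """One layer: a weighted sum per node followed by a nonlinear (tanh) activation."""
    return [
        math.tanh(sum(xi * wi for xi, wi in zip(x, row)) + bias)
        for row, bias in zip(weights, biases)
    ]


def forward(x, layers):
    """Pass the input through every layer in order: input -> hidden layers -> output."""
    for weights, biases in layers:
        x = dense_layer(x, weights, biases)
    return x


# Hypothetical weights and threshold that make the neuron behave like a logical AND.
AND_WEIGHTS = [1.0, 1.0]
AND_THRESHOLD = 1.5

# Hypothetical toy network: 2 inputs -> 3 hidden nodes -> 2 hidden nodes -> 1 output.
TOY_NETWORK = [
    ([[0.5, -0.3], [0.8, 0.2], [-0.6, 0.9]], [0.0, 0.1, -0.1]),
    ([[0.4, -0.7, 0.2], [0.3, 0.5, -0.8]], [0.05, -0.05]),
    ([[1.0, -1.0]], [0.0]),
]

print(perceptron([1, 1], AND_WEIGHTS, AND_THRESHOLD))  # 1: stimulus 2.0 reaches 1.5
print(perceptron([1, 0], AND_WEIGHTS, AND_THRESHOLD))  # 0: stimulus 1.0 stays below 1.5
print(forward([1.0, 0.5], TOY_NETWORK))  # single output value in the open interval (-1, 1)
```

In this sketch, learning would correspond to adjusting the numbers stored in `AND_WEIGHTS` or `TOY_NETWORK`; the training procedure itself is not shown.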
Attackers can generate adversarial commands against automatic speech recognition models in voice-controlled systems such as Apple Siri, Amazon Alexa, and Microsoft Cortana [18]. The cause of these adversarial examples was a mystery, and speculative explanations suggested that it was the extreme nonlinearity of deep neural networks, perhaps combined with insufficient model averaging and insufficient regularization of the purely supervised learning problem.

A variety of intriguing properties of artificial neural networks and related models have been demonstrated [21]. But the consequences of errors increase dramatically when technology companies start using deep learning algorithms in applications such as two-ton machines moving at high speed on highways. A wrong decision made by an autonomous artificial intelligence can lead the car to collide with the guardrail or another vehicle, or to run over pedestrians or cyclists [22].

Figure 6: An adversarial example constructed by modifying an image of a panda in such a way that a machine learning model classifies it as a gibbon [20].

3.2. Soft Errors

The ever-increasing miniaturization of semiconductors has led to major advances in mobile, cloud, and network computing. However, it has also made electronic devices less reliable and microprocessors more susceptible to transient errors. These intermittent faults do not cause permanent damage but may result in incorrect program execution by altering signal transfers or stored values. These transient faults are also called soft errors. As technology continues to scale, industry experts project that the problem of soft errors will become increasingly important [23]. Applications based on deep learning neural networks are widely used in high-performance computing systems and data centers.
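To make the effect of an altered stored value concrete, the sketch below simulates a soft error in software by inverting a single bit of a weight stored as a 32-bit IEEE 754 float, and then compares the output of a weighted sum before and after the flip. The helper names, the weights, and the inputs are hypothetical; real soft errors occur in hardware, and this is not the fault-injection method of [23] or [24].

```python
import struct


def flip_bit(value, bit):
    """Return value (stored as float32) with one bit of its IEEE 754 encoding inverted."""
    (as_int,) = struct.unpack("<I", struct.pack("<f", value))
    (flipped,) = struct.unpack("<f", struct.pack("<I", as_int ^ (1 << bit)))
    return flipped


def neuron_output(x, w):
    """Weighted sum of the inputs, standing in for one neuron's pre-activation."""
    return sum(xi * wi for xi, wi in zip(x, w))


# Hypothetical inputs and weights, chosen to be exactly representable in float32.
inputs = [1.0, 2.0, 3.0]
weights = [0.5, 0.25, 0.125]

clean = neuron_output(inputs, weights)  # 0.5 + 0.5 + 0.375 = 1.375

# Flip the least significant mantissa bit of the first weight: a tiny perturbation.
low_bit_weights = [flip_bit(weights[0], 0)] + weights[1:]
low_bit_output = neuron_output(inputs, low_bit_weights)

# Flip the most significant exponent bit (bit 30): the weight becomes astronomically large.
high_bit_weights = [flip_bit(weights[0], 30)] + weights[1:]
high_bit_output = neuron_output(inputs, high_bit_weights)

print("clean output:        ", clean)
print("mantissa bit flipped:", low_bit_output)
print("exponent bit flipped:", high_bit_output)
```

Under these assumptions, the same kind of fault is either negligible or catastrophic depending on which bit is hit, which is one reason the impact of a soft error on a network's prediction is hard to bound in advance.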
histogram correspond to the rare conditions in which there was insufficient neural network training to model an important feature. As a result, the error found in this region is significantly higher than in the central region of the histogram, as can be seen in Figure 9. In this case, the estimated TS, represented by the blue curve as a function of the sample, presents greater error in regions that coincide with the highest normalized amplitudes (corresponding to the maximum and minimum) of the measured TS, represented by the red curve.

Figure 10: Histogram of the measured values of the Tensile Strength [5].

It is proposed that the planning stage of the Artificial Neural Network model for the applications discussed, whether autonomous cars or predictive models for mechanical properties, include an experimental design phase (Design of Experiments, DOE), in which data from the tails of the histogram are intentionally generated and therefore provide greater learning on specific and rare situations. For example, in a recent car accident, the autonomous driving system was unable to take any action to avoid a collision with an overturned truck and, at the same time, ignored a pedestrian [25]. Extreme but possible situations, such as an overturned truck or similar objects lying on the road, could be included in the training data so that the car learns that, in such situations, braking is one of the possible measures to avoid an accident. This appears not to have been the case in this specific example. For the practical example discussed in section 4, it suffices to design the DOE such that the maximum and minimum regions of the input variables have enough occurrences in the training data. To achieve this, the process must be modified at its origin, thus forcing the occurrence of these data [5].

area, however the Artificial Neural Networks stand out and are the basis of so-called Deep Learning. These networks simulate the learning process of the human brain and learn through training with historical data. Technologies used in artificial intelligence tools have been found to be subject to errors, known as adversarial examples and soft errors, which are critical in uses where safety is a primary issue and lives may depend on decisions made by algorithms. An example is the advent of autonomous cars, which are subject to both adversarial examples and soft errors. The lack of sufficient data on extreme situations is also critical for training, prediction, and subsequent action in uses where safety is critical. A real case of building an artificial intelligence for an industrial application was discussed, and possible measures to reduce the prediction error obtained were pointed out. This article is expected to contribute to the technical and professional growth of its readers.

6. References

[1] B. J. Copeland. (2019). Artificial intelligence. https://www.britannica.com/technology/artificial-intelligence;

[2] Danny Y., & Dan T. (2016). Tesla driver dies in first fatal crash while using autopilot mode. https://www.theguardian.com/technology/2016/jun/30/tesla-autopilot-death-self-driving-car-elon-musk;

[3] Chen, A. (2018). IBM’s Watson gave unsafe recommendations for treating cancer. https://www.theverge.com/2018/7/26/17619382/ibms-watson-cancer-ai-healthcare-science;

[4] Vincent, J. (2016). Twitter taught Microsoft’s AI chatbot to be a racist asshole in less than a day. https://www.theverge.com/2016/3/24/11297050/tay-microsoft-chatbot-racist;

[5] Oliveira, A. P. Modelo de Previsão de propriedades mecânicas de perfis estruturais laminados a quente: uma abordagem em redes neurais artificiais. (Dissertação, Mestrado em Engenharia Metalúrgica e de Minas). Biblioteca Digital da Universidade Federal de Minas Gerais, UFMG, 2008. http://hdl.handle.net/1843/MAPO-7RLKBJ;
[8] Shahin, M. A. State-of-the-art review of some artificial intelligence applications in pile foundations. Geoscience Frontiers 7, 2014, 33-44. http://dx.doi.org/10.1016/j.gsf.2014.10.002;

[9] Muhammad Shahbaz, Syed A. Taqvi, Adrian Chun Minh Loy, Abrar Inayat, Fahim Uddin, Awais Bokhari, Salman Raza Naqvi. Artificial neural network approach for the steam gasification of palm oil waste using bottom ash and CaO. Renewable Energy 132, 2019, 243-254;

[10] Cao, C.; Liu, F.; Tan, H.; Song, D.; Shu, W.; Li, W.; Zhou, Y.; Bo, X.; Xie, Z. Deep Learning and Its Applications in Biomedicine. Genomics, Proteomics & Bioinformatics, 16, 2018, 17–32;

[11] Chia-Yen, L., & Tsung-Lun, T. Data science framework for variable selection, metrology prediction, and process control in TFT-LCD manufacturing. Robotics and Computer Integrated Manufacturing, 55, 2018, 76–87. https://doi.org/10.1016/j.rcim.2018.07.013;

[12] Tu, J. V. Advantages and Disadvantages of Using Artificial Neural Networks versus Logistic Regression for Predicting Medical Outcomes. Journal of Clinical Epidemiology, 49, 11, 1996, 1125-1231; https://doi.org/10.1016/S0895-4356(96)00002-9;

[13] Korczak, P., Dyja H., Labuda E. Using Neural Networks Models for Predicting Mechanical Properties after Plate Rolling Processes. Journal of Materials Processing Technology, Poland, vol.80, n.81, 1998, 481-486;

[14] Dyja H., & Korczak P. The thermal-mechanical and microstructural model for the FEM simulation of hot plate rolling. Journal of Materials Processing Technology, 92-93, 1999, 463-467. https://doi.org/10.1016/S0924-0136(99)00215-0;

[15] I.A. Basheer, M. Hajmeer. Artificial Neural Networks: Fundamentals, Computing, Design and Application, Journal of Microbiological Methods, vol.43, 2000, 3-31;

[16] Castrounis. (2016). Artificial Intelligence, Deep Learning, and Neural Networks, Explained. https://www.kdnuggets.com/2016/10/artificial-intelligence-deep-learning-neural-networks-explained.html;

[17] Milad Zafar Nezhad, Najibesadat Sadati, Kai Yang, Dongxiao Zhu. A Deep Active Survival Analysis approach for precision treatment recommendations: Application of prostate cancer. Expert Systems with Applications 115, 2018, 16–26;

defenses for deep learning. arXiv preprint arXiv:1712.07107, 2018;

[19] Mahmood Sharif, Sruti Bhagavatula, Lujo Bauer, Michael K. Reiter. Accessorize to a crime: Real and stealthy attacks on state-of-the-art face recognition. In Proceedings of the 23rd ACM SIGSAC Conference on Computer and Communications Security, October 2016. DOI: http://dx.doi.org/10.1145/2976749.2978392;

[20] Goodfellow, Ian; Papernot, Nicolas; Huang, Sandy; Duan, Yan; Abbeel, Pieter; Clark, Jack. (2017). “Attacking Machine Learning with Adversarial Examples.” OpenAI. https://blog.openai.com/adversarial-example-research/;

[21] Ian J. Goodfellow, Jonathon Shlens, and Christian Szegedy. Explaining and harnessing adversarial examples. 2015. arXiv:1412.6572;

[22] Hsu, J. (2017). A New Way to Find Bugs in Self-Driving AI Could Save Lives. https://spectrum.ieee.org/tech-talk/robotics/artificial-intelligence/better-bug-hunts-in-selfdriving-car-ai-could-save-lives;

[23] Q. Shi, H. Omar, and O. Khan, “Exploiting the tradeoff between program accuracy and soft-error resiliency overhead for machine learning workloads,” CoRR, vol. abs/1707.02589, 2017. [Online]. Available: http://arxiv.org/abs/1707.02589;

[24] Li, G., Hari, S. K. S., Sullivan, M., Tsai, T., Pattabiraman, K., Emer, J., & Keckler, S. W. (2017). Understanding error propagation in deep learning neural network (DNN) accelerators and applications. Paper presented at: International Conference for High Performance Computing, Networking, Storage and Analysis, Denver, Colorado. https://doi.org/10.1145/3126908.3126964;

[25] Templeton B. (2020). Tesla in Taiwan Crashes Directly into Overturned Truck, Ignores Pedestrian, With Autopilot On. https://www.forbes.com/sites/bradtempleton/2020/06/02/tesla-in-taiwan-crashes-directly-into-overturned-truck-ignores-pedestrian-with-autopilot-on/#5ad97d7758e5.

Creative Commons Attribution License 4.0 (Attribution 4.0 International, CC BY 4.0)
This article is published under the terms of the Creative Commons Attribution License 4.0
https://creativecommons.org/licenses/by/4.0/deed.en_US