Lecture 16 Introduction To AI
Lecture 16 Introduction To AI
Instructor:
Dr. Navid Asadi
Presenter:
Shayan (Sean) Taheri
Florida Institute for Cybersecurity (FICS) Research
Electrical and Computer Engineering Department
University of Florida (UF)
1
Lecture Plan
▪ The Arguments about Questioning the Thinking Ability of Machines? Given that the
nervous system is not a discrete-state machine, you cannot mimic the behavior of nervous
system with a discrete-state machine (Continuity in the Nervous System).
▪ Machines with thinking capabilities: Ctesibius of Alexandria - Water Clock with a Regulator
and Thermostat of Wiener - Controller of the Environment Temperature.
▪ How about having a machine capable of having human feelings and thoughts, and
Automated device for playing chess
performing actions? and performing arts .
▪ Turing Test:
✓ A machine can be described as thinking machine if it passes the Turing Test.
This test evaluates the intelligence.
✓ If a human agent is engaged in two isolated dialogues (connected by
teletype), one with a computer, and the other with another human.
✓ The human agent cannot reliably identify which dialogue is with the
computer due to its kind of intelligence.
✓ A human communicates with a computer via a teletype. If the human cannot
tell he is talking to a computer or another human, it passes the test.
▪ How does the machine with thinking ability perform and how to test it?
Text
?
Cognition
Text
All Rights Reserved 7
Artificial Intelligence (AI): Initial Appearances
▪ John McCarthy: “We propose that a two-month, ten man study of Artificial
Intelligence carried out during the summer of 1956 […]”
✓ The study is to proceed on the basis of conjecture that every aspect of learning
or any other feature of intelligence can in principle be so precisely described
that a machine can be made to simulate it. […]
✓ It may be speculated that a large part of human thought consists of manipulating
words according to rules of reasoning and rules of conjecture.
▪ The first generation of AI researchers made these predictions about their work:
✓ 1958, H. A. Simon and Allen Newell: "within ten years a digital computer will
John McCarthy: American Computer
be the world's chess champion" and "within ten years a digital computer will Scientist
discover and prove an important new mathematical theorem.“
✓ 1965, H. A. Simon: "machines will be capable, within twenty years, of doing
any work a man can do.“
✓ 1967, Marvin Minsky: "Within a generation ... the problem of creating 'artificial
intelligence' will substantially be solved.“
✓ 1970, Marvin Minsky (in Life Magazine): "In from three to eight years we will
have a machine with the general intelligence of an average human being."
▪ When did the Artificial Intelligence get appeared since the time of the artificial
mind theory and later on the Turing Machine idea (1950)?
All Rights Reserved 8
AI: How to Define AI?
▪ The term got coined by John McCarthy in 1956 when a group of when scientists began exploring how computers could
solve problems on their own..
▪ Def. 1 by David Marr: “AI is the study of complex information processing problems that often have their roots in some
aspects of biological information processing. The goal of the subject is to identify solvable and interesting information
processing problems, and solve them.”
▪ Def. 3 by Alan Turing: “Actions that are indistinguishable from a human’s ones.”
▪ How to simply define AI? A machine with the ability to perform cognitive functions such as perceiving, learning,
reasoning and solve problems are deemed to hold an artificial intelligence. The benchmark for AI is the human level
concerning reasoning, speech, and vision.
▪ How to well define AI? We can define intelligence as the computational part of the ability to achieve goals in the
world. Varying kinds and degrees of intelligence occur in humans, many animals and some machines. It is the
capacity to learn and solve problems in particular tacking novel problems, act rationally, and act like humans.
▪ The purpose of having AI can be described as better understanding of the human thinking and how to improve it.
▪ The root of AI is found in Computer Science and Engineering, Philosophy, Mathematics, Cognitive Science and
Psychology, Neural Science, and Linguistic.
▪ AI Levels:
✓ Narrow AI: A artificial intelligence is said to be narrow when the machine can perform a specific task better than a
human. The current research of AI is here now.
✓ General AI: An artificial intelligence reaches the general state when it can perform any intellectual task with the
same accuracy level as a human would.
✓ Strong AI: An AI is strong when it can beat humans in many tasks.
▪ Major Goals:
✓ Understand the principles that make intelligence possible (in humans, animals, and artificial agents).
✓ Developing intelligent machines or agents (no matter whether they operate as humans or not).
✓ Formalizing knowledge and mechanizing reasoning in all areas of human endeavor.
✓ Making the working with computers as easy as working with people.
✓ Developing human-machine systems that exploit the complementariness of human and automated reasoning.
All Rights Reserved 10
AI: Different Views on AI
▪ Philosophy, Ethics, and Religion:
✓ What is intelligence?
✓ Is there any formal expression?
✓ How to define mind as a machine with internal operations?
▪ Engineering:
✓ Can we build intelligent devices and systems?
✓ Autonomous and semi-autonomous for replicating human capabilities,
improving performance, and so forth.
▪ How should scientists from different areas of science view AI and what
technical elements (i.e. models, libraries, and etc.) should we have
inside AI?
▪ State-Based Models:
✓ Solutions are defined as a sequence of steps.
✓ Model a task as a graph of states and a solution as a path in the graph.
✓ A state captures all of the relevant information about the past in order to act in
the future.
✓ Apps: Navigation and Games.
✓ Options: Tree Search (Breadth-first search, Depth-first search, and Iterative
deepening), Graph search (Dynamic programming)
▪ Parametric, Reflex-Based Models:
✓ Given a set of <Input, Output> pairs of training data, learn a set of parameters
that will map input to output for future data.
✓ Apps: Classification and Regression.
✓ Options: Artificial Neural Networks (ANN), Decision Trees, Support Vector
Machines, Regression, Principal Component Analysis, K-Means Clustering,
and K-Nearest Neighbor.
▪ The signal strength is high in Excited state and is low in Inhibited state.
▪ When the input exceeds a threshold the neuron sends an electrical spike
that travels throughout the body, gets to the axon, and reaches to next
neuron(s).
▪ “In our view, people are smarter than today’s computers because the brain employs a basic computational architecture
that is more suited to deal with a central aspect of the natural information processing tasks that people are so good at.”
▪ Assumption: Mental phenomena can be described by interconnected networks of simple and often uniform units.
▪ Can we build a functional brain using computers?
x1
w1 Y1
PE: Processing Element
x2 w2 Neuron (or PE) f (S )
. S =
n
X iW
Y
. Y2
. i =1
i
.
. Summation
Transfer
.
Function
wn Yn
xn
n
Bi:
X0= +1 Yi = f (ni) = f (Xi.Wj + Bi)
i=1
Bias Weight
▪ How can the AI models like ANNs create intelligent agents and the AI systems?
▪ Characteristic Features:
✓ Knowledge Representation and Reasoning. ✓ Exploration of huge search spaces.
✓ Transparency and Explanations. ✓ Use of heuristics.
✓ Ability to Communicate. ✓ Reasoning with incomplete or conflicting data.
✓ Use of huge amounts of knowledge. ✓ Ability to learn and adapt.
▪ Representation: All AI problems require some form of representation for input data.
✓ The input data can be: chess board, maze, text, object, room, sound, and visual
scene.
✓ A major part of AI is representing the problem space so as to allow efficient search Perception Action
Cognition
for the best solution(s).
✓ Sometimes the representation is the output. An example is discovering patterns.
▪ Thinking: What do you do once you have a representation? This requires a goal and objective.
✓ Rational Behavior: Choosing actions that maximize goal achievements given available information.
✓ The thinking data can be: the best move (for chess board), the shortest path (for maze), the semantic parsing (for
text), the recognition (for object, speech, biometric, and etc.), the localization (for room), and the navigation (for
visual scene).
✓ What is the strategy for multiple agents?
▪ Reasoning: It can be thought of as constructing an accurate world/real model.
✓ Rational Inference: What can be logically inferred giving available information?
✓ When information is uncertain: Most of the facts are not concrete and are not known with certainty.
✓ Probabilistic Inference: How do we give the proper weight to each observation?
Thought
Human-like Thinking Systems Rational Thinking Systems
Behavior
Human Rational
▪ Learning: What if your world is changing? How do we maintain an accurate model? Requiring a learnable model.
✓ Adapting internal representation so that is as accurate as possible.
✓ Adapting the models of other agents.
▪ Output: The output action can also be complex. The representation and complexity of output data is important.
✓ The output data can be: next move, text, label, actuator, and movement.
✓ Sample output can be from a simple chess move to a motor sequence to grasp an object.
▪ Search: Finding an “optimal” sequence of states between initial state and final state. How to find the best model states?
▪ Humans are not always rational. Rational means doing the right thing. The right thing is defined as expecting to
maximizing the goal achievements, given the available information.
▪ The perception and the motor skills are the most important part of intelligence.
All Rights Reserved 25
AI: Intelligent Systems – Mechanism and Properties
▪ Mechanism:
✓ The stimulus must be translated into an internal representation.
✓ The representation is manipulated by cognitive processes to derive new
internal representations.
✓ These representations in turn are translated into action.
✓ The agent includes different elements, including feature extraction,
(machine/deep) learning model, decision making unit, sensors, and
actuators.
▪ What is Learning?
✓ It denotes changes in the system that are adaptive in the sense that they enable the system to do the same tasks or
tasks drawn from the same population more effectively the next time.
✓ It is about making useful changes in our minds.
✓ We construct or modify representations of what is being experienced.
✓ A computer system learns if it improves its performance at some task through experience.
✓ It denotes the way people and computers interact:
a. Acquire, discover, and organize knowledge by building, modifying, and organizing internal representations
of some external reality.
b. Acquire skills by gradually improving their motor or cognitive skills through repeated practice, sometimes
involving little or no conscious thought.
✓ It results in changes in the agent or mind that improve its competence and/or efficiency:
❖ Competence: A system is improving its competence if it learns to solve a broader class of problems, and to
make fewer mistakes in problem solving.
❖ Efficiency: A system is improving its efficiency, if it learns to solve the problems from its area of competence
faster or by using fewer resources.
▪ After understanding artificial (shallow) neural network and machine learning, how to define deep neural
network and deep learning?
▪ What are the important techniques in machine/deep learning for the purpose of improving performance and
overcoming possible limitations?
All Rights Reserved 30
AI: Deep Learning and Deep Neural Networks - 1
▪ Deep Neural Network (DNN): Referring to a feedforward neural network with more than one hidden layer.
✓ DNNs try to learn representation by using a hierarchy of multiple layers.
✓ Multi-class Classification: The posterior probability of each class can be estimated using an output softmax layer.
✓ Pre-Training: Initializing all the weights especially when the amount of training data is limited and when no
constraints are imposed on the weights.
✓ Restricted Boltzmann Machine (RBM): Used as a building block for pre-training. All the weights from this
block can be used as initialization for one layer.
▪ Convolutional Neural Networks (CNNs):
✓ Among the oldest deep neural network architectures.
✓ Popular among different applications like handwriting recognition.
✓ A specialized kind of neural network.
✓ Usage for processing data with a known grid-like topology.
❖ Example Data: Time-series data (i.e. a one-dimensional grid with taking samples at intervals) and Image Data
(ie.. A two-dimensional grid of pixels).
✓ Utilization of convolution that is a specialized kind of linear operation.
✓ The convolution operation is used in place of general matrix multiplication in at least one layer.
✓ Convolution layer leverages three important ideas to improve the AI system: Sparse Interactions, Parameter
Sharing, and Equivariant Representations.
✓ Convolution layer allows for working with inputs of variable size.
✓ Making the kernel smaller than the input.
▪ Why Image Classification/Recognition and Cybersecurity applications are important for the UF FICS Research
and their ongoing projects?
(L) Deep Leaning Generative model can learn and predict the turbulence wake behind a cylinder with different geometry and shape.
(R) Time-dependent generative models can learn and predict the time-dependent heat diffusion by just giving the boundary condition!
▪ AI Material Discovery:
✓ Accelerate material discovery using AI tools.
✓ Since it is prohibitively expensive to use experimental and computational tools to search for novel material for
energy applications, the search process can be greatly accelerated by using deep learning technology.
✓ Developing and applying these tools to find optimal materials for energy and health applications.