SlideShare a Scribd company logo
2
Most read
11
Most read
12
Most read
1
Artificial Intelligence
Chapter 2: Intelligent Agents
Andreas Zell
After the Textbook: Artificial IntelligenceAfter the Textbook: Artificial Intelligence,
A Modern Approach
by Stuart Russel and Peter Norvig (3rd Edition)
• Rational agent: one that behaves as well as
possible
This behaviour depends on the environment:
2. Intelligent Agents
• This behaviour depends on the environment:
• Some environments are more difficult than
others
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 2
2
• An agent is anything that can be viewed as
perceiving its environment through sensors and
acting upon that environment through actuators
2.1 Agents and Environments
g p g
• Human agent:
• Sensors: eyes, ears,
and other organs,
• Actuators: hands, legs,
mouth other body partsmouth, other body parts
• Robotic agent:
• Sensors: cameras, sonars, laser range finders;
• Actuators: various motors, gripper
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 3
• Percept: an agent’s perceptual inputs at any
time.
Percept sequence: history of percepts
2.1 Agents and Environments
• Percept sequence: history of percepts
• An agent’s actions may depend on its whole
history
• Agent function: mapping of percept sequence to
action: : *f P A
• Agent program: the internal implementation of
the abstract mathematical agent function
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 4
3
Example Vacuum Cleaner Miniworld
Simple agent function:
Percept sequence Action
[A, clean] right
• Only two locations: A, B
• Floor clean or dirty
• Percept: [A, clean]
• Actions: right, left, suck
[ , c ea ] g t
[A, dirty] suck
[B, clean] left
[B, dirty] suck
[A, clean], [A, clean]right
[A, clean], [A, dirty] suck
[A, clean], [B, clean]leftg
[A, clean], [B, dirty] suck
[A, dirty], [A, clean] right
…
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 5
• A rational agent is “one that does the right thing”.
• …but what is the right thing?
f
2.2 The Concept of Rationality
• Need performance measures.
• Performance measure: objective criterion for
success of an agent’s behaviour
• E.g., performance measure of the vacuum-
cleaner agent could be amount of dirt cleaned
up, amount of time taken, amount of electricity
consumed, cleanliness of floor afterwards, …
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 6
4
• Rationality here depends on four things:
1. The performance measure that defines the criterion
of success
2.2 The Concept of Rationality
2. The agent’s prior knowledge of the environment
3. The actions that the agent can perform
4. The agent’s percept sequence to date.
• Rational agent: For each possible percept
sequence, a rational agent should select an
action that is expected to maximize its
performance measure, given the evidence
provided by the percept sequence and whatever
built-in knowledge the agent has.
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 7
• Rationality is distinct from omniscience (all-
knowing with infinite knowledge)
• Agents can perform actions in order to modify
Omniscience, Learning, Autonomy
Agents can perform actions in order to modify
future percepts so as to obtain useful information
(information gathering, exploration)
• A rational agent should not only gather
information, but also learn as much as possible
from what it perceives
A t i t if it b h i i• An agent is autonomous if its behavior is
determined by its own experience (with ability to
learn and adapt). Rational agents should be
autonomous.
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 8
5
• Specifying the task environment:
• PEAS: Performance measure, Environment,
Actuators Sensors
2.3 The Nature of Environments
Actuators, Sensors
• Example agent: Automated taxi driver:
• Performance measure: Safe, fast, legal, comfortable
trip, maximize profits
• Environment: Roads, other traffic, pedestrians,
customerscustomers
• Actuators: Steering wheel, accelerator, brake, signal,
horn
• Sensors: Cameras, sonar, speedometer, GPS,
odometer, engine sensors, keyboard
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 9
• Example Agent: Medical diagnosis system
• Performance measure: Healthy patient, minimize
costs lawsuits
Specifying the Task Environment
costs, lawsuits
• Environment: Patient, hospital, staff
• Actuators: Screen display (questions, tests,
diagnoses, treatments, referrals)
• Sensors: Keyboard (entry of symptoms, findings,
patient's answers)
• Example Agent: Part-picking robot
• Example Agent: Interactive English tutor
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 10
6
• Fully observable (vs. partially observable): An
agent's sensors give it access to the complete state
of the environment at each point in time.
Properties of task environments
• Deterministic (vs. stochastic): The next state of the
environment is completely determined by the current
state and the action executed by the agent. (If the
environment is deterministic except for the actions of
other agents, then the environment is strategic)
• Episodic (vs. sequential): The agent's experience is
divided into atomic "episodes" (each episode
consists of the agent perceiving and then performing
a single action), and the choice of action in each
episode depends only on the episode itself.
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 11
• Static (vs. dynamic): The environment is
unchanged while an agent is deliberating. (The
environment is semidynamic if the environment
Properties of task environments
environment is semidynamic if the environment
itself does not change with the passage of time
but the agent's performance score does)
• Discrete (vs. continuous): A limited number of
distinct, clearly defined percepts and actions.
• Single agent (vs multiagent): An agent• Single agent (vs. multiagent): An agent
operating by itself in an environment.
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 12
7
Task Environment Observ-
able
Deter-
ministic
Episodic Static Discrete Agents
Crossword puzzle Fully Det. Sequ. Static Discrete Single
Chess with a clock Fully Strateg Sequ Semi Discrete Multi
Examples of Task Environments
Chess with a clock Fully Strateg. Sequ. Semi Discrete Multi
Poker Partially Stoch. Sequ. Static Discrete Multi
Backgammon Fully Stoch. Sequ. Static Discrete Multi
Taxi driving Partially Stoch. Sequ. Dynam. Contin. Multi
Medical Diagnosis Partially Stoch. Sequ. Dynam. Contin. Single
Image analysis Fully Det. Sequ. Semi Contin. Single
Part-picking robot Partially Stoch. Sequ. Dynam. Contin. Singlep g y q y g
Refinery controller Partially Stoch. Sequ. Dynam. Contin. Single
English tutor Partially Stoch. Sequ. Dynam. Discrete Multi
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 13
• Agent architecture: computing device with
physical sensors and actuators
Agent = architecture + program
2.4 The Structure of Agents
• Agent = architecture + program
• Four basic types in order of increasing
generality:
• Simple reflex agents
• Model-based reflex agents
• Goal-based agents
• Utility-based agents
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 14
8
Simple Reflex Agents
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 15
• It keeps track of the current state of the world,
using an internal model. It then chooses an
action in the same way as the reflex agent
Model-based reflex agents
action in the same way as the reflex agent.
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 16
9
Model-based reflex agents
and action
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 17
• A model-based, goal-based agent. It keeps track
of the world state and of a set of goals, and
chooses an action that will (eventually) lead to
Model-based, goal-based agent
chooses an action that will (eventually) lead to
achievement
of its goals.
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 18
10
• Goals alone are not enough to achieve high-
quality behaviour in most environments
Some action sequences are better (cheaper
Utility-based Agents
• Some action sequences are better (cheaper,
quicker, more reliable, …) than others.
• A utility function maps a state (or a sequence of
states) onto a real number, which describes the
associated degree of happiness of the agent
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 19
• It uses a model of the world, along with a utility
function. Then it chooses the action that leads to
the best expected utility
Utility-based Agents
the best expected utility,
• Expected utility
is computed by
averaging over
all possible
outcome states,
weighted by theweighted by the
probability of the
outcome.
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 20
11
• Programming intelligent machines by hand is
extremely time consuming
Better method: program a learning algorithm and
Learning Agents
• Better method: program a learning algorithm and
then teach machines what they should learn
• Example: Supervised learning of artificial neural
networks (ANNs) or support vector machines (SVMs)
• Or better still, let these machines learn by
themselves (under some constraints)themselves (under some constraints)
• Reinforcement learning of artificial neural networks
• Learning is nowadays the big issue in AI!
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 21
• A general learning agent, consisting of 4
components:
• Learning
Learning Agents
• Learning
element
• Performance
element
• Critic
• Problem
generator
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 22
12
• Learning element: is responsible for making
improvements, uses feedback from the critic on
how the agent is doing and determines how the
Learning Agents
how the agent is doing and determines how the
performance element should be modified
• Performance element: is responsible for
selecting external actions
• Critic: tells the learning element how well the
agent is doing with respect to a performanceagent is doing with respect to a performance
standard (the percept itself does not say so).
• Problem generator: is responsible for suggesting
actions that lead to new information
Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 23

More Related Content

PDF
Ai 03 solving_problems_by_searching
PDF
Matrix Factorization In Recommender Systems
PPTX
MatplotLib.pptx
PPTX
NAAC CSE PRESENTATION
PDF
How AI Enhances & Accelerates Diabetic Retinopathy Detection
PPTX
Knowledge representation In Artificial Intelligence
PPTX
Artificial intelligence- Logic Agents
PDF
Stuart russell and peter norvig artificial intelligence - a modern approach...
Ai 03 solving_problems_by_searching
Matrix Factorization In Recommender Systems
MatplotLib.pptx
NAAC CSE PRESENTATION
How AI Enhances & Accelerates Diabetic Retinopathy Detection
Knowledge representation In Artificial Intelligence
Artificial intelligence- Logic Agents
Stuart russell and peter norvig artificial intelligence - a modern approach...

What's hot (20)

PPTX
Intelligent agent
PPTX
First order logic
PPT
Artificial Intelligence Chapter two agents
PPT
Knowledge Representation & Reasoning
PDF
Lecture 2 agent and environment
PDF
Multi-agent systems
PPTX
Local search algorithm
PPTX
Intelligence Agent - Artificial Intelligent (AI)
PPTX
Agents and environments
PPT
AI Lecture 7 (uncertainty)
PPTX
Informed and Uninformed search Strategies
ZIP
Ai Slides
PPT
Knowledge Representation in Artificial intelligence
PPTX
Artificial Intelligence- TicTacToe game
PPTX
Knowledge representation in AI
PPTX
Intelligent agents
PPTX
AI: Logic in AI
PDF
I. Mini-Max Algorithm in AI
PPTX
Natural Language Processing
PPTX
AI: Learning in AI
Intelligent agent
First order logic
Artificial Intelligence Chapter two agents
Knowledge Representation & Reasoning
Lecture 2 agent and environment
Multi-agent systems
Local search algorithm
Intelligence Agent - Artificial Intelligent (AI)
Agents and environments
AI Lecture 7 (uncertainty)
Informed and Uninformed search Strategies
Ai Slides
Knowledge Representation in Artificial intelligence
Artificial Intelligence- TicTacToe game
Knowledge representation in AI
Intelligent agents
AI: Logic in AI
I. Mini-Max Algorithm in AI
Natural Language Processing
AI: Learning in AI
Ad

Viewers also liked (9)

PDF
Swiching
PPTX
PPTX
Ch 4 software engineering
PPT
Ian Sommerville, Software Engineering, 9th EditionCh 8
PPT
PPT
Ian Sommerville, Software Engineering, 9th Edition Ch2
PDF
Ai 01 introduction
DOCX
Swe notes
PPTX
Ian Sommerville, Software Engineering, 9th Edition Ch 4
Swiching
Ch 4 software engineering
Ian Sommerville, Software Engineering, 9th EditionCh 8
Ian Sommerville, Software Engineering, 9th Edition Ch2
Ai 01 introduction
Swe notes
Ian Sommerville, Software Engineering, 9th Edition Ch 4
Ad

Similar to Ai 02 intelligent_agents(1) (20)

PDF
Lec 2 agents
PPTX
AI Basic.pptx
PPTX
m2-agents.pptx
PPT
Unit 1.ppt
PPT
M2 agents
PPTX
Unit-1.pptx
PDF
Agents1
PPT
Jarrar.lecture notes.aai.2011s.ch2.intelligentagents
PPT
Jarrar.lecture notes.aai.2011s.ch2.intelligentagents
PDF
agents.pdf
PPTX
AI: Artificial Agents on the Go and its types
PPT
Slide01 - Intelligent Agents.ppt
PPTX
Lecture 1 about the Agents in AI & .pptx
PPTX
AIML Unit1 ppt for the betterment and help of students
PPT
Lecture1
PPTX
Unit 2-KSK.pptx Introduction to Data Structures and Algorithms
PPT
Elective(Intellegent agent )__cha.Two.ppt
PPTX
2. Intelligent_Agents_ShgfutydtfxcfdxdfL.pptx
PPT
2.IntelligentAgents.ppt
Lec 2 agents
AI Basic.pptx
m2-agents.pptx
Unit 1.ppt
M2 agents
Unit-1.pptx
Agents1
Jarrar.lecture notes.aai.2011s.ch2.intelligentagents
Jarrar.lecture notes.aai.2011s.ch2.intelligentagents
agents.pdf
AI: Artificial Agents on the Go and its types
Slide01 - Intelligent Agents.ppt
Lecture 1 about the Agents in AI & .pptx
AIML Unit1 ppt for the betterment and help of students
Lecture1
Unit 2-KSK.pptx Introduction to Data Structures and Algorithms
Elective(Intellegent agent )__cha.Two.ppt
2. Intelligent_Agents_ShgfutydtfxcfdxdfL.pptx
2.IntelligentAgents.ppt

More from Mohammed Romi (13)

PDF
PPTX
PDF
Ch19 network layer-logical add
PDF
PPTX
PPTX
PPTX
PPTX
PPTX
PPT
Angel6 e05
PPTX
Chapter02 graphics-programming
PPT
Ian Sommerville, Software Engineering, 9th Edition Ch1
PPT
Ian Sommerville, Software Engineering, 9th Edition Ch 23
Ch19 network layer-logical add
Angel6 e05
Chapter02 graphics-programming
Ian Sommerville, Software Engineering, 9th Edition Ch1
Ian Sommerville, Software Engineering, 9th Edition Ch 23

Recently uploaded (20)

PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPTX
Cell Structure & Organelles in detailed.
PDF
01-Introduction-to-Information-Management.pdf
PPTX
Week 4 Term 3 Study Techniques revisited.pptx
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
Anesthesia in Laparoscopic Surgery in India
PPTX
PPH.pptx obstetrics and gynecology in nursing
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PPTX
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
PPTX
master seminar digital applications in india
PPTX
Cell Types and Its function , kingdom of life
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
STATICS OF THE RIGID BODIES Hibbelers.pdf
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Cell Structure & Organelles in detailed.
01-Introduction-to-Information-Management.pdf
Week 4 Term 3 Study Techniques revisited.pptx
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
VCE English Exam - Section C Student Revision Booklet
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
Module 4: Burden of Disease Tutorial Slides S2 2025
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Microbial disease of the cardiovascular and lymphatic systems
Anesthesia in Laparoscopic Surgery in India
PPH.pptx obstetrics and gynecology in nursing
Renaissance Architecture: A Journey from Faith to Humanism
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
master seminar digital applications in india
Cell Types and Its function , kingdom of life
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...

Ai 02 intelligent_agents(1)

  • 1. 1 Artificial Intelligence Chapter 2: Intelligent Agents Andreas Zell After the Textbook: Artificial IntelligenceAfter the Textbook: Artificial Intelligence, A Modern Approach by Stuart Russel and Peter Norvig (3rd Edition) • Rational agent: one that behaves as well as possible This behaviour depends on the environment: 2. Intelligent Agents • This behaviour depends on the environment: • Some environments are more difficult than others Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 2
  • 2. 2 • An agent is anything that can be viewed as perceiving its environment through sensors and acting upon that environment through actuators 2.1 Agents and Environments g p g • Human agent: • Sensors: eyes, ears, and other organs, • Actuators: hands, legs, mouth other body partsmouth, other body parts • Robotic agent: • Sensors: cameras, sonars, laser range finders; • Actuators: various motors, gripper Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 3 • Percept: an agent’s perceptual inputs at any time. Percept sequence: history of percepts 2.1 Agents and Environments • Percept sequence: history of percepts • An agent’s actions may depend on its whole history • Agent function: mapping of percept sequence to action: : *f P A • Agent program: the internal implementation of the abstract mathematical agent function Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 4
  • 3. 3 Example Vacuum Cleaner Miniworld Simple agent function: Percept sequence Action [A, clean] right • Only two locations: A, B • Floor clean or dirty • Percept: [A, clean] • Actions: right, left, suck [ , c ea ] g t [A, dirty] suck [B, clean] left [B, dirty] suck [A, clean], [A, clean]right [A, clean], [A, dirty] suck [A, clean], [B, clean]leftg [A, clean], [B, dirty] suck [A, dirty], [A, clean] right … Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 5 • A rational agent is “one that does the right thing”. • …but what is the right thing? f 2.2 The Concept of Rationality • Need performance measures. • Performance measure: objective criterion for success of an agent’s behaviour • E.g., performance measure of the vacuum- cleaner agent could be amount of dirt cleaned up, amount of time taken, amount of electricity consumed, cleanliness of floor afterwards, … Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 6
  • 4. 4 • Rationality here depends on four things: 1. The performance measure that defines the criterion of success 2.2 The Concept of Rationality 2. The agent’s prior knowledge of the environment 3. The actions that the agent can perform 4. The agent’s percept sequence to date. • Rational agent: For each possible percept sequence, a rational agent should select an action that is expected to maximize its performance measure, given the evidence provided by the percept sequence and whatever built-in knowledge the agent has. Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 7 • Rationality is distinct from omniscience (all- knowing with infinite knowledge) • Agents can perform actions in order to modify Omniscience, Learning, Autonomy Agents can perform actions in order to modify future percepts so as to obtain useful information (information gathering, exploration) • A rational agent should not only gather information, but also learn as much as possible from what it perceives A t i t if it b h i i• An agent is autonomous if its behavior is determined by its own experience (with ability to learn and adapt). Rational agents should be autonomous. Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 8
  • 5. 5 • Specifying the task environment: • PEAS: Performance measure, Environment, Actuators Sensors 2.3 The Nature of Environments Actuators, Sensors • Example agent: Automated taxi driver: • Performance measure: Safe, fast, legal, comfortable trip, maximize profits • Environment: Roads, other traffic, pedestrians, customerscustomers • Actuators: Steering wheel, accelerator, brake, signal, horn • Sensors: Cameras, sonar, speedometer, GPS, odometer, engine sensors, keyboard Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 9 • Example Agent: Medical diagnosis system • Performance measure: Healthy patient, minimize costs lawsuits Specifying the Task Environment costs, lawsuits • Environment: Patient, hospital, staff • Actuators: Screen display (questions, tests, diagnoses, treatments, referrals) • Sensors: Keyboard (entry of symptoms, findings, patient's answers) • Example Agent: Part-picking robot • Example Agent: Interactive English tutor Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 10
  • 6. 6 • Fully observable (vs. partially observable): An agent's sensors give it access to the complete state of the environment at each point in time. Properties of task environments • Deterministic (vs. stochastic): The next state of the environment is completely determined by the current state and the action executed by the agent. (If the environment is deterministic except for the actions of other agents, then the environment is strategic) • Episodic (vs. sequential): The agent's experience is divided into atomic "episodes" (each episode consists of the agent perceiving and then performing a single action), and the choice of action in each episode depends only on the episode itself. Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 11 • Static (vs. dynamic): The environment is unchanged while an agent is deliberating. (The environment is semidynamic if the environment Properties of task environments environment is semidynamic if the environment itself does not change with the passage of time but the agent's performance score does) • Discrete (vs. continuous): A limited number of distinct, clearly defined percepts and actions. • Single agent (vs multiagent): An agent• Single agent (vs. multiagent): An agent operating by itself in an environment. Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 12
  • 7. 7 Task Environment Observ- able Deter- ministic Episodic Static Discrete Agents Crossword puzzle Fully Det. Sequ. Static Discrete Single Chess with a clock Fully Strateg Sequ Semi Discrete Multi Examples of Task Environments Chess with a clock Fully Strateg. Sequ. Semi Discrete Multi Poker Partially Stoch. Sequ. Static Discrete Multi Backgammon Fully Stoch. Sequ. Static Discrete Multi Taxi driving Partially Stoch. Sequ. Dynam. Contin. Multi Medical Diagnosis Partially Stoch. Sequ. Dynam. Contin. Single Image analysis Fully Det. Sequ. Semi Contin. Single Part-picking robot Partially Stoch. Sequ. Dynam. Contin. Singlep g y q y g Refinery controller Partially Stoch. Sequ. Dynam. Contin. Single English tutor Partially Stoch. Sequ. Dynam. Discrete Multi Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 13 • Agent architecture: computing device with physical sensors and actuators Agent = architecture + program 2.4 The Structure of Agents • Agent = architecture + program • Four basic types in order of increasing generality: • Simple reflex agents • Model-based reflex agents • Goal-based agents • Utility-based agents Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 14
  • 8. 8 Simple Reflex Agents Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 15 • It keeps track of the current state of the world, using an internal model. It then chooses an action in the same way as the reflex agent Model-based reflex agents action in the same way as the reflex agent. Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 16
  • 9. 9 Model-based reflex agents and action Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 17 • A model-based, goal-based agent. It keeps track of the world state and of a set of goals, and chooses an action that will (eventually) lead to Model-based, goal-based agent chooses an action that will (eventually) lead to achievement of its goals. Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 18
  • 10. 10 • Goals alone are not enough to achieve high- quality behaviour in most environments Some action sequences are better (cheaper Utility-based Agents • Some action sequences are better (cheaper, quicker, more reliable, …) than others. • A utility function maps a state (or a sequence of states) onto a real number, which describes the associated degree of happiness of the agent Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 19 • It uses a model of the world, along with a utility function. Then it chooses the action that leads to the best expected utility Utility-based Agents the best expected utility, • Expected utility is computed by averaging over all possible outcome states, weighted by theweighted by the probability of the outcome. Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 20
  • 11. 11 • Programming intelligent machines by hand is extremely time consuming Better method: program a learning algorithm and Learning Agents • Better method: program a learning algorithm and then teach machines what they should learn • Example: Supervised learning of artificial neural networks (ANNs) or support vector machines (SVMs) • Or better still, let these machines learn by themselves (under some constraints)themselves (under some constraints) • Reinforcement learning of artificial neural networks • Learning is nowadays the big issue in AI! Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 21 • A general learning agent, consisting of 4 components: • Learning Learning Agents • Learning element • Performance element • Critic • Problem generator Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 22
  • 12. 12 • Learning element: is responsible for making improvements, uses feedback from the critic on how the agent is doing and determines how the Learning Agents how the agent is doing and determines how the performance element should be modified • Performance element: is responsible for selecting external actions • Critic: tells the learning element how well the agent is doing with respect to a performanceagent is doing with respect to a performance standard (the percept itself does not say so). • Problem generator: is responsible for suggesting actions that lead to new information Zell: Artificial Intelligence (after Russel/Norvig, 3rd Ed.) 23