Intelligent Agents

Intelligent Agents
Chapter 2
Modified by Vali Derhami
1/34
Outline
• Agents and environments
• Rationality
• PEAS (Performance measure,
Environment, Actuators, Sensors)
• Environment types
• Agent types
2/34
Agents
• An agent is anything that can be viewed as perceiving its
environment through sensors and acting upon that
environment through actuators
•
• Human agent: eyes, ears, and other organs for sensors;
hands,legs, mouth, and other body parts for actuators
• Robotic agent: cameras and infrared range finders for
sensors;various motors for actuators
•
• General assumption: agent can perceive its own action
3/34
Agents (cont.)
• Percept sequence is the complete history of
everything the agent has ever perceived.
4/34
Agents and environments
• Agent's behavior is described by the agent function that maps any
given percept sequence to an action.
•
‫ در حالیکه برنامه عامل پیاده‬. ‫تابع عامل یک شرح ریاضی انتزاع گونه است‬
.‫سازی کامل آن است که در داخل سیستم فیزیکی اجرا می شود‬
• The agent function maps from percept histories to
actions:
• [f: P*  A]
•
 Tabular representation: a large table for most agents
• The agent program runs on the physical architecture to
produce f
5/34
• agent = architecture + program
Vacuum-cleaner world
• Percepts: location and contents, e.g.,
[A,Dirty]
• Actions: Left, Right, Suck, NoOp
•
6/34
A vacuum-cleaner agent
7/34
Rational agents
• An agent should "do the right thing", based on what it can perceive
and the actions it can perform. The right action is the one that will
cause the agent to be most successful
•
• Performance measure: An objective criterion for success of an
agent's behavior
• E.g., performance measure of a vacuum-cleaner agent could be
amount of dirt cleaned up, amount of time taken, amount of
electricity consumed, amount of noise generated, etc.
• Note: Design performance measures according to what
one actually wants in the environment, rather than
according to how one thinks the agent should behave.
8/34
What is rational
• The performance measure that defines the
criterion of success.
• The agent's prior knowledge of the
environment.
• The actions that the agent can perform.
• The agent's percept sequence to date.
9/34
Rational agents
• Rational Agent: For each possible percept
sequence, a rational agent should select
an action that is expected to maximize its
performance measure, given the evidence
provided by the percept sequence and
whatever built-in knowledge the agent has.
•
10/34
Omniscience, learning, and
autonomy
Rationality is distinct from omniscience •
(all-knowing with infinite knowledge)
‫• عقالنیت بیشینه میکند راندمانی که امید می رود در حالیکه کامل‬
.‫( راندمان واقعی را بیشینه میکند‬perfection) ‫بودن‬
• Agents can perform actions in order to
modify future percepts so as to obtain
useful information (information gathering,
exploration)
• An agent is autonomous if its behavior is
determined by its own experience (with
11/34
ability to learn and adapt)
Successful agents split the task of
computing the agent function into three
different periods:
 During designing: some of the computation is
done by its designers;
 During deliberating (thinking) on its next
action, the agent does more computation;
 During learning from experience: it does even
more computation to decide how to modify its
behavior.
12/34
Task Environment (PEAS)
• PEAS: Performance measure,
Environment, Actuators, Sensors
• Consider, e.g., the task of designing an
automated taxi driver:
•
– Performance measure
– Environment
– Actuators
– Sensors
–
13/34
PEAS
• Consider, e.g., the task of designing an
automated taxi driver:
•
14/34
PEAS
• Agent: Medical diagnosis system
 Performance measure: Healthy patient,
minimize costs, lawsuits
 Environment: Patient, hospital, staff
 Actuators: Screen display (questions, tests,
diagnoses, treatments, referrals)
 Sensors: Keyboard (entry of symptoms,
findings, patient's answers)
15/34
PEAS
• Agent: Part-picking robot
 Performance measure: Percentage of parts in
correct bins
 Environment: Conveyor belt with parts, bins
 Actuators: Jointed arm and hand
 Sensors: Camera, joint angle sensors
16/34
PEAS
• Agent: Interactive English tutor
 Performance measure: Maximize student's
score on test
 Environment: Set of students, testing agency
 Actuators: Screen display (exercises,
suggestions, corrections)
 Sensors: Keyboard entry
17/34
Environment types
• Fully observable (vs. partially observable): An agent's
sensors give it access to the complete state of the
environment at each point in time.
• Deterministic (vs. stochastic): The next state of the
environment is completely determined by the current
state and the action executed by the agent. (If the
environment is deterministic except for the actions of
other agents, then the environment is strategic)
•
‫(محیطی که کامال مشاهده پذیر نباشد یا قطعی نباشد نایقین‬uncertaion)‫گفته می شود‬
• Episodic (vs. sequential): The agent's experience is
divided into atomic "episodes" (each episode consists of
the agent perceiving and then performing a single
action), and the choice of action in each episode
depends only on the episode itself.
18/34
•
Environment types
• Static (vs. dynamic): The environment is
unchanged while an agent is deliberating. (The
environment is semidynamic if the environment
itself does not change with the passage of time
but the agent's performance score does)
• Discrete (vs. continuous): A limited number of
distinct, clearly defined percepts and actions.
• Single agent (vs. multiagent): An agent
operating by itself in an environment.
•
19/34
Environment types
Fully observable
Deterministic
Episodic
Static
Discrete
Single agent
Chess with
a clock
Yes
Strategic
No
Semi
Yes
No
Chess without
a clock
Yes
Strategic
No
Yes
Yes
No
Taxi driving Image
Analysis
No
Yes
No
Yes
No
Yes
No
Semi
No
No
No
Yes
• The environment type largely determines the agent design
•
• The real world is (of course) partially observable, stochastic, sequential,
dynamic, continuous, multi-agent
•
20/34
Structure of Agents
• The job of AI: Design the agent program
that implements the agent function
mapping percepts to actions.
• Agent= Architecture +Program
• Program has to be appropriate for the
architecture
21/34
Agent functions and programs
• Agent programs: they take the current percept
as input from the sensors and return an action to
the actuators.
 Difference between agent program and agent
function:
o Agent program: takes the current percept as input
o Agent function: takes the entire percept history.
• Aim: find a way to implement the rational agent
function concisely
•
22/34
Table-lookup agent
Drawbacks:
Huge table
Take a long time to build the table
No autonomy
Even with learning, need a long time to learn the table entries
23/34
Agent types
Four basic types in order of increasing
generality:
• Simple reflex agents
• Model-based reflex agents
• Goal-based agents
• Utility-based agents
25/34
Simple reflex agents
26/34
Vacuum agent
27/34
Simple reflex agents
28/34
Model-based reflex agents
29/34
Model-based reflex agents
30/34
Goal-based agents
31/34
Utility-based agents
32/34
Learning agents
33/34
Learning agents
Four conceptual components
• Learning element: making improvements.
• Performance element: selecting external actions.
• Critic: tells the learning element how well the
agent is doing with respect to a fixed performance
standard
• Problem generator: suggesting actions that will
lead to new and informative experiences.
34/34
‫تمرین دوم از وب سایت برداشته شود‪.‬‬
‫‪35/34‬‬