
Artificial Intelligence

Intelligent Agents
Agents and environments
• Agent: An agent is anything that can be viewed as:
– perceiving its environment through sensors and
– acting upon that environment through actuators.

• An agent program runs in cycles of: (1) perceive, (2) think (deliberate), and (3) act.
• Agent = Architecture + Program
• The two have to work in harmony.
• E.g., you can’t ask the agent to run if you did not provide it with legs/wheels.
Example agents and environments
• Human agent:
– Sensors: eyes, ears, and other organs.
– Actuators: hands, legs, mouth, and other body parts.
• Robotic agent:
– Sensors: Cameras and infrared rangefinders.
– Actuators: Various motors.
• Agents everywhere!
– Thermostat
– Cell phone
– Vacuum cleaner
– Robot
– Alexa Echo
– Self-driving car
– Human
– etc.
Vacuum cleaner
(Two-square vacuum world: rooms A and B)

• Percepts: location and contents e.g., [A, Dirty]


• Actions: Left, Right, Suck, NoOp
• Agent function: mapping from percepts to actions.
• We can let the table maintain not only the current percept but also the sequence of percepts, e.g., [A, dirty], [A, clean].

Percept Action
[A, clean] Right
[A, dirty] Suck
[B, clean] Left
[B, dirty] Suck
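
As a concrete illustration, here is a minimal Python sketch of a table-driven vacuum agent. The table and the percept/action names follow the slide; the function and variable names are only illustrative.

```python
# Minimal sketch of a table-driven vacuum agent (names are illustrative).
# The lookup table maps the current percept to an action, as in the slide.

TABLE = {
    ("A", "clean"): "Right",
    ("A", "dirty"): "Suck",
    ("B", "clean"): "Left",
    ("B", "dirty"): "Suck",
}

def table_driven_agent(percept):
    """percept is a (location, status) pair, e.g., ('A', 'dirty')."""
    return TABLE.get(percept, "NoOp")

print(table_driven_agent(("A", "dirty")))   # -> Suck
print(table_driven_agent(("B", "clean")))   # -> Left
```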
Well-behaved agents
Rational Agent:

“For each possible percept sequence, a rational agent should select an action that is expected to maximize its performance measure, given the evidence provided by the percept sequence and whatever built-in knowledge the agent has.”
Rationality
• Rationality is relative to a performance measure.

• Judge rationality based on:


– The performance measure that defines the criterion of success.
– The agent’s prior knowledge of the environment.
– The possible actions that the agent can perform.
– The agent’s percept sequence to date.
PEAS
• When we define a rational agent, we group these properties under PEAS, the problem specification for the task environment.
• The rational agent we want to design for this task environment is the solution.

• PEAS stands for:
– Performance
– Environment
– Actuators
– Sensors
These are the problem specifications for the task environment that the rational agent is meant to solve.
PEAS
What is PEAS for a self-driving car?

• Performance: safety, time, legal driving, comfort.
• Environment: roads, other cars, pedestrians, road signs.
• Actuators: steering, accelerator, brake, signal, horn.
• Sensors: camera, sonar, GPS, speedometer, odometer, accelerometer, engine sensors, keyboard.
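
If we want to make a PEAS specification explicit in code, a plain record type is enough. This is only an illustrative sketch (the class and field names are assumptions, not a standard API).

```python
# Illustrative only: a PEAS task-environment specification as a data class.
from dataclasses import dataclass

@dataclass
class PEAS:
    performance: list[str]
    environment: list[str]
    actuators: list[str]
    sensors: list[str]

self_driving_car = PEAS(
    performance=["safety", "time", "legal driving", "comfort"],
    environment=["roads", "other cars", "pedestrians", "road signs"],
    actuators=["steering", "accelerator", "brake", "signal", "horn"],
    sensors=["camera", "sonar", "GPS", "speedometer", "odometer",
             "accelerometer", "engine sensors", "keyboard"],
)
print(self_driving_car.actuators)
```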
PEAS
How about a vacuum cleaner?

iRobot Roomba series

• Performance: cleanliness, efficiency (distance traveled to clean), battery life, security.
• Environment: room, table, wood floor, carpet, different obstacles.
• Actuators: wheels, different brushes, vacuum extractor.
• Sensors: camera, dirt detection sensor, cliff sensor, bump sensors, infrared wall sensors.
Environment types
• Fully observable (vs. partially observable): An agent’s sensors give it
access to the complete state of the environment at each point in time.

• Deterministic (vs. stochastic): The next state of the environment is completely determined by the current state and the action executed by the agent. (If the environment is deterministic except for the actions of other agents, then the environment is strategic.) (E.g., a car can’t determine the traffic conditions.)

• Episodic (vs. sequential): The agent’s experience is divided into atomic “episodes” (each episode consists of the agent perceiving and then performing a single action), and the choice of action in each episode depends only on the episode itself. An AI that looks at radiology images to determine whether there is a disease is an example of an episodic environment: one image has nothing to do with the next.
Environment types
• Static (vs. dynamic): The environment is unchanged while an agent is deliberating. (The environment is semi-dynamic if the environment itself does not change with the passage of time but the agent’s performance score does.)

• Discrete (vs. continuous): A limited number of distinct, clearly defined percepts and actions. E.g., chess is an example of a discrete environment, while a self-driving car/drone evolves in a continuous one.

• Single agent (vs. multi-agent): An agent operating by itself in an environment.

• Known (vs. unknown): The designer of the agent may or may not have knowledge about the environment’s makeup. If the environment is unknown, the agent will need to learn how it works in order to decide. (E.g., a Roomba that doesn’t know the room needs sensors to decide what to do.)
Environment types

Deterministic: The next state of the environment is completely determined by the current state and the action executed by the agent.
Static: The environment is unchanged while the agent is deliberating.
Discrete: A limited number of distinct, clearly defined percepts and actions.
Agent types
• Four basic types in order of increasing generality:

– Simple reflex agents

– Model-based reflex agents

– Goal-based agents

– Utility-based agents

• All of which can be generalized into learning agents that can improve
their performance and generate better actions.
Simple reflex agents
• Simple reflex agents select an action based on the current percept only, ignoring the percept history.
• Simple but limited.

• Can only work if the environment is fully observable, that is, the correct action can be chosen based on the current percept only.

[Diagram: simple reflex agent. Sensors tell the agent “what the world is like now”; condition-action rules determine “what action I should do now”; actuators act on the environment.]
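
The structure in the diagram can be sketched in a few lines of Python. This is only an illustrative skeleton, assuming a simple rule format and helper names of our own choosing, not a fixed API.

```python
# Illustrative skeleton of a simple reflex agent: match the current percept
# against condition-action rules and return the first action that applies.

def interpret_input(percept):
    # In this sketch the percept is already the state description.
    return percept

def simple_reflex_agent(percept, rules):
    state = interpret_input(percept)          # "what the world is like now"
    for condition, action in rules:           # condition-action rules
        if condition(state):
            return action                     # "what action I should do now"
    return "NoOp"
```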
Vacuum (reflex) agent
(Two-square vacuum world: rooms A and B)

• Let’s write the algorithm for the Vacuum cleaner...


• Percepts: location and content (location sensor, dirt sensor).
• Actions: Left, Right, Suck, NoOp

Percept Action
[A, clean] Right
[A, dirty] Suck
[B, clean] Left
[B, dirty] Suck

What if the vacuum agent is deprived of its location sensor?

One option: let the vacuum agent guess where it is and decide based on that guess, e.g., flip a coin.
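
A minimal Python sketch of the reflex vacuum agent from the table above, plus a randomized variant for the case where the location sensor is missing (the coin flip just mentioned). Names are illustrative.

```python
import random

# Reflex vacuum agent: decide from the current percept (location, status) only.
def reflex_vacuum_agent(location, status):
    if status == "dirty":
        return "Suck"
    return "Right" if location == "A" else "Left"

# Variant without a location sensor: suck if dirty, otherwise flip a coin.
def randomized_vacuum_agent(status):
    if status == "dirty":
        return "Suck"
    return random.choice(["Left", "Right"])

print(reflex_vacuum_agent("A", "dirty"))    # -> Suck
print(randomized_vacuum_agent("clean"))     # -> Left or Right
```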
Model-based reflex agents
• Handle partial observability by keeping track of the part of the world the agent can’t see now.
• Internal state depending on the percept history (best guess).
• Model of the world based on (1) how the world evolves independently from the agent, and (2) how the agent’s actions affect the world.

[Diagram: model-based reflex agent. Sensors update an internal State using “how the world evolves” and “what my actions do”; the agent infers “what the world is like now”; condition-action rules determine “what action I should do now”; actuators act on the environment.]
Model-based reflex agents

• If patrolling and no enemy in sight, then patrol the predefined path
• If patrolling and enemy in sight, switch mode from patrolling to chasing
• If chasing and enemy in sight, move towards the enemy
• If chasing and enemy not in sight, move towards the enemy’s last location
• If at the enemy’s last location and enemy not in sight and the cooldown timer has not started, then start the cooldown timer
• If at the enemy’s last location and enemy not in sight and the cooldown timer has expired, then change mode to returning to post
• If returning to post and enemy in sight, then set mode to chasing
• If returning to post and the patrolling path is reached, then set mode to patrolling
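
These condition-action rules amount to a small state machine. The following Python sketch is one possible encoding; the mode names and percept keys (e.g., enemy_in_sight) are assumptions made for illustration.

```python
# Illustrative state machine for the patrolling guard. The percept is a dict of
# boolean observations; the internal mode is what makes this a model-based
# reflex agent rather than a simple reflex agent.

def guard_step(mode, percept, cooldown_expired):
    """Return (new_mode, action) for one perceive-think-act cycle."""
    if mode == "patrolling":
        if percept["enemy_in_sight"]:
            return "chasing", "move_towards_enemy"
        return "patrolling", "patrol_predefined_path"

    if mode == "chasing":
        if percept["enemy_in_sight"]:
            return "chasing", "move_towards_enemy"
        if not percept["at_enemy_last_location"]:
            return "chasing", "move_towards_enemy_last_location"
        if cooldown_expired:
            return "returning_to_post", "move_towards_post"
        return "chasing", "start_or_wait_on_cooldown_timer"

    if mode == "returning_to_post":
        if percept["enemy_in_sight"]:
            return "chasing", "move_towards_enemy"
        if percept["reached_patrolling_path"]:
            return "patrolling", "patrol_predefined_path"
        return "returning_to_post", "move_towards_post"

    return mode, "NoOp"
```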

• So what if we want to make our AI guards somewhat smarter?


• What if we want them to figure out what to do themselves, by understanding their situation and possible paths of action?
Goal-based agents
This might be needed if our guard needs to choose between multiple possible
responses like:

• Continue Patrolling
• Investigate Sounds
• Chase Intruder
• Stay in sight of other guards
• Go sound the Alarm

Now, instead of having predefined condition-action rules for choosing the actions, our guard should choose the action that best aligns with its goal.

If the goal is Capture Intruder, then the guard will always give precedence to Chase Intruder and Investigate Sounds.
Goal-based agents
• Knowing the current state of the environment is not enough. The agent
needs some goal information.
• Agent program combines the goal information with the environment
model to choose the actions that achieve that goal.
• Consider the future with “What will happen if I do A?”
• Flexible as knowledge supporting the decisions is explicitly represented
and can be modified.

[Diagram: goal-based agent. Sensors and the internal State, together with “how the world evolves” and “what my actions do”, give “what the world is like now” and “what it will be like if I do action A”; the Goals then determine “what action I should do now”; actuators act on the environment.]
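
A minimal sketch of the “What will happen if I do A?” idea, assuming we have a transition model and a goal test. The function and parameter names are illustrative, not a standard API.

```python
# Illustrative goal-based agent: pick an action whose predicted outcome
# satisfies the goal, using a model of "what my actions do".

def goal_based_agent(state, actions, transition_model, goal_test):
    for action in actions:
        predicted = transition_model(state, action)  # "what it will be like if I do A"
        if goal_test(predicted):
            return action
    return "NoOp"

# Tiny usage example on an abstract two-cell world.
actions = ["Left", "Right"]
model = lambda s, a: "B" if a == "Right" else "A"
print(goal_based_agent("A", actions, model, lambda s: s == "B"))   # -> Right
```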
Utility-based agents
• Sometimes achieving the desired goal is not enough. We may look for a quicker, safer, cheaper trip to reach a destination.
• Ideally we want an agent that considers all possibilities and chooses the most useful one.
• Agent happiness should be taken into consideration. We call it utility.
• A utility function is the agent’s performance measure.
• Because of the uncertainty in the world, a utility-based agent chooses the action that maximizes the expected utility. E.g., tree-traversing agents.
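
A hedged sketch of expected-utility maximization: for each action we average the utility of the possible outcomes, weighted by their probabilities. The outcome model and utility function here are illustrative assumptions.

```python
# Illustrative expected-utility maximization. outcome_model(state, action)
# returns a list of (probability, next_state) pairs; utility scores a state.

def expected_utility(state, action, outcome_model, utility):
    return sum(p * utility(s) for p, s in outcome_model(state, action))

def utility_based_agent(state, actions, outcome_model, utility):
    return max(actions,
               key=lambda a: expected_utility(state, a, outcome_model, utility))

# Tiny usage example: "slow" is certain (utility 5); "fast" is risky
# (utility 10 with probability 0.8, but -20 with probability 0.2).
model = lambda s, a: [(1.0, 5)] if a == "slow" else [(0.8, 10), (0.2, -20)]
print(utility_based_agent(None, ["slow", "fast"], model, lambda s: s))  # -> slow
```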
Learning agents
• Programming agents by hand can be very tedious. “Some more expeditious method seems desirable.” (Alan Turing, 1950)
• A learning agent is a tool in AI that is capable of learning from its experiences. It starts with some basic knowledge and is then able to act and adapt autonomously, through learning, to improve its own performance.
• Four conceptual components:
– Learning element: responsible for making improvements.
– Performance element: responsible for selecting external actions. It is what we have considered as the agent so far.
– Critic: tells how well the agent is doing w.r.t. a fixed performance standard.
– Problem generator: allows the agent to explore.
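
One way to picture how the four components interact, as a hedged Python sketch in the vacuum world. The component interfaces are assumptions made for illustration only.

```python
import random

# Illustrative sketch of a learning agent's components for the vacuum world.

def critic(percept, action):
    """Performance standard: sucking dirt is good, anything else on dirt is bad."""
    location, status = percept
    if status == "dirty":
        return 1 if action == "Suck" else -1
    return 0

def learning_agent_step(percept, rules, explore_prob=0.1):
    # Performance element: choose an action from the current condition-action rules.
    action = rules.get(percept, "NoOp")

    # Problem generator: occasionally try something new to explore.
    if random.random() < explore_prob:
        action = random.choice(["Left", "Right", "Suck", "NoOp"])

    # Learning element: if the critic says we did badly on dirt, learn to Suck.
    if critic(percept, action) < 0:
        rules[percept] = "Suck"

    return action, rules
```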
Agent’s organization
a) Atomic Representation: Each state of the world is a black box that has no internal structure. E.g., finding a driving route, where each state is a city.

AI algorithms: search, games, Markov decision processes, hidden Markov models, etc.
Agent’s organization
b) Factored Representation: Each state has some attribute-value properties. E.g., GPS location, amount of gas in the tank.

AI algorithms: constraint satisfaction and Bayesian networks.
Agent’s organization
c) Structured Representation: Relationships between the objects of a state can be explicitly expressed. E.g., a pedestrian crossing the street.

AI algorithms: first-order logic, knowledge-based learning, natural language understanding.
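
To make the three representations concrete, here is a small hedged sketch showing how the same kind of driving state might look in each form. The exact fields and classes are illustrative assumptions.

```python
from dataclasses import dataclass

# Atomic: the state is an opaque label with no internal structure.
atomic_state = "CityA"

# Factored: the state is a set of attribute-value pairs.
factored_state = {"gps": (46.18, 21.31), "fuel_liters": 23.5, "speed_kmh": 80}

# Structured: objects and the relationships between them are explicit.
@dataclass
class Pedestrian:
    position: tuple

@dataclass
class Street:
    name: str

@dataclass
class CrossingEvent:
    who: Pedestrian
    where: Street          # relationship: the pedestrian is crossing this street

structured_state = CrossingEvent(Pedestrian((10, 2)), Street("Main St"))
print(structured_state)
```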
Intelligent agents
• The concept of an intelligent agent is central in AI.
• AI aims to design intelligent agents that are useful, reactive, autonomous, and even social and proactive.
• An agent perceives its environment through percepts and acts through actuators.
• A performance measure evaluates the behavior of the agent.
• An agent that acts to maximize its expected performance measure is called a rational agent.
• PEAS: A task environment specification that includes Performance measure, Environment, Actuators and Sensors.
• Agent = Architecture + Program
Intelligent agents
• Four types of agents: reflex agents, model-based agents, goal-based agents, and utility-based agents.
• Agents can improve their performance through learning.
• This is a high-level presentation of agent programs.
• State representations: atomic, factored, structured, in increasing order of expressive power.

Credit: Courtesy Percy Liang


Credit
• Artificial Intelligence, A Modern Approach. Stuart Russell and Peter
Norvig. Third Edition. Pearson Education.
• edX course ColumbiaX: CSMM.101x Artificial Intelligence (AI)

http://aima.cs.berkeley.edu/
Reading

Artificial Intelligence: A Modern Approach, 3rd Edition, Russell & Norvig, Chapters 2, 3.1–3.4.
