What is a rational agent?
For each possible percept history, select an action that is expected to maximize its performance measure, given the evidence by the percept history and whatever built-in knowledge the agent has.
What does PEAS stand for?
Performance Measure, Environment, Actuators, Sensors
What is a fully observable environment type?
sensors detect all relevant properties of the world for the current action
What is a single-agent environment type?
only one agent, no cooperation and no competition between agents
What is a deterministic environment type?
next state determined by the current state and the performed action
What is an episodic environment type?
agent’s experience is divided into “atomic” parts (independent from each other, which means current actions don’t affect future action)
In episodic environments, the choice of action only depends on the current episode
What is a static environment type?
the world does not change during the reasoning time of the agent
What is a semi-dynamic environment type?
static, but the performance score decreases with deliberation time
What’s a discrete environment type?
world properties have discrete values, e.g. time, number of possible states,...
(Doesn’t go on infinitely)
What’s a known environment type?
state of knowledge about the “laws of physics” of the environment
What agent types exist?
simple reflex agents
reflex agents with state (model-based reflex agent)
goal-based agents
utility-based agents
What does a simple reflex agent look like?
no memory
No sequences of percepts
Looping possible
What does the Model-based reflex agent look like?
memory -> maintain/ update world state
reason about unobservable parts
deal with uncertainty, implicit goals
What does a goal based agent look like?
model the world, goals, and actions & their effects explicitly
more flexible, better maintainable
Search & Planning for single/sequence of action/s to achieve a goal
What does a utility based agent look like?
assess goals with a utility function
resolve conflicting goals
use expected utility for decision
Last changed2 years ago