Simulation-based search – Buffl

Buffl

JH

von J H.

So far

complete search for single player (BFS, DFS)
complete search for multi-player games (alpha-beta pruning)

Problem:

what if the game is too large to search completly?

Idea:

heurisitic as evaluation function for non-terminal states

Monte-Carlo Search

random plays until terminal state is reached
heuristics for a node: average reward of random plays

Monte-Carlo Search

Too optimistic

Monte-Carlo Search

Too pessimistic

Monte-Carlo Search

Pros and Cons

Pros

easy to implement
low memory
no game specific knowledge

Cons

does not terminate
wrong assumption: every one plays randomly
not correct result (minimax always computes the best move)

Monte Carlo Tree Search

Author

J H.

Informationen

Zuletzt geändert
vor 3 Jahren

© 2023 Buffl GmbH