
Problem Solving & Search

Beyond Classical Search


Informatics Engineering Study Program
School of Electrical Engineering and Informatics

Institute of Technology Bandung


Contents
 Review
 Beyond Classical Search

IF3170 / [1][2][3] / 02-Sep-2019
Review
 What is AI → 4 approaches
 Intelligent Agent
 Task Environment: Remember PEAS
 6 Environment Types
 Problem Solving Agent
 4 components to be defined: initial state,
operator/successor function, goal test, path cost
 Solving by Searching:
 Informed Search
 Uninformed Search
 Beyond Classical Search → TODAY

Local Search

Local search algorithms
 In many optimization problems, the path to the goal is
irrelevant; the goal state itself is the solution

 State space = set of "complete" configurations

 Find configuration satisfying constraints, e.g., n-queens

 In such cases, we can use local search algorithms

 Keep a single "current" state, try to improve it

Hill-climbing search
 "Like climbing Everest in thick fog with amnesia"

Hill-climbing search
 Problem: depending on initial state, can get stuck in local
maxima

Hill-climbing search: 8-queens problem

Each number indicates h if we move a queen in its corresponding column

 h = number of pairs of queens that are attacking each other, either directly or indirectly (h = 17 for the above state)
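The move described above (slide one queen within its column to the neighbor with the lowest h) can be sketched as a minimal steepest-descent hill climber. The state encoding (one queen per column) follows the slide; the helper names are illustrative, not the textbook pseudocode verbatim:

```python
import random

# Hill-climbing sketch for n-queens.  State: board[c] = row of the queen in
# column c.  h = number of attacking pairs; each step moves one queen within
# its column to the neighbor with lowest h, stopping at a local minimum.

def attacking_pairs(board):
    """h: pairs of queens attacking each other (same row or same diagonal)."""
    n = len(board)
    return sum(1 for c1 in range(n) for c2 in range(c1 + 1, n)
               if board[c1] == board[c2] or abs(board[c1] - board[c2]) == c2 - c1)

def hill_climb(board):
    while True:
        h = attacking_pairs(board)
        best_h, best_move = h, None
        for col in range(len(board)):            # try every single-queen move
            for row in range(len(board)):
                if row != board[col]:
                    neighbor = board[:]
                    neighbor[col] = row
                    nh = attacking_pairs(neighbor)
                    if nh < best_h:
                        best_h, best_move = nh, (col, row)
        if best_move is None:                    # local minimum (possibly h > 0)
            return board, h
        board[best_move[0]] = best_move[1]

random.seed(0)
board, h = hill_climb([random.randrange(8) for _ in range(8)])
```

Depending on the random start, the climber may halt at a local minimum with h > 0, which is exactly the failure mode the next slide illustrates.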

Hill-climbing search: 8-queens problem

• A local minimum with h = 1

Gradient Descent

• Assume we have some cost function C(x_1, ..., x_n) that we want to minimize over the continuous variables x_1, ..., x_n

1. Compute the gradient: ∂C(x_1, ..., x_n)/∂x_i for all i

2. Take a small step downhill in the direction of the gradient:

   x_i → x'_i = x_i − λ ∂C(x_1, ..., x_n)/∂x_i for all i

3. Check if C(x_1, ..., x'_i, ..., x_n) < C(x_1, ..., x_i, ..., x_n)

4. If true then accept the move, if not reject it.

5. Repeat.
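A minimal sketch of steps 1-5, assuming a hand-coded gradient and a fixed step size λ (`lam` below); the cost function in the example is illustrative:

```python
# Gradient-descent sketch for the steps above: step downhill along the
# gradient, keep the move only if the cost actually decreases.

def gradient_descent(C, grad_C, x, lam=0.1, steps=100):
    for _ in range(steps):
        g = grad_C(x)                                    # 1. compute the gradient
        x_new = [xi - lam * gi for xi, gi in zip(x, g)]  # 2. small step downhill
        if C(x_new) < C(x):                              # 3./4. accept only
            x = x_new                                    #      improving moves
        else:
            break                                        # no improvement: stop
    return x

# Example: C(x, y) = x^2 + y^2, minimum at (0, 0)
C = lambda v: v[0] ** 2 + v[1] ** 2
grad_C = lambda v: [2 * v[0], 2 * v[1]]
x_min = gradient_descent(C, grad_C, [3.0, 4.0])
```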
Simulated Annealing Search

Simulated annealing search
 Idea: escape local maxima by allowing some "bad"
moves but gradually decrease their frequency
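A minimal sketch of this idea, assuming a geometric cooling schedule (the textbook pseudocode uses a general schedule(t)); a bad move of size Δ < 0 is accepted with probability e^(Δ/T), so such moves become rarer as T falls:

```python
import math, random

# Simulated-annealing sketch.  Good moves are always accepted; bad moves are
# accepted with probability exp(delta / T), which shrinks as T cools.

def simulated_annealing(value, neighbor, state, T=1.0, cooling=0.995, T_min=1e-3):
    while T > T_min:
        nxt = neighbor(state)
        delta = value(nxt) - value(state)        # > 0 means an improving move
        if delta > 0 or random.random() < math.exp(delta / T):
            state = nxt
        T *= cooling                             # geometric cooling (assumed)
    return state

# Example: maximize value(x) = -(x - 3)^2 over the integers
random.seed(1)
best = simulated_annealing(lambda x: -(x - 3) ** 2,
                           lambda x: x + random.choice([-1, 1]),
                           state=0)
```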

Properties of simulated annealing search
 One can prove: If T decreases slowly enough, then
simulated annealing search will find a global optimum
with probability approaching 1

 Widely used in VLSI layout, airline scheduling, etc.

Genetic algorithms
 A successor state is generated by combining two parent states

 Start with k randomly generated states (population)

 A state is represented as a string over a finite alphabet (often a string of 0s and 1s)

 Evaluation function (fitness function): higher values for better states

 Produce the next generation of states by selection, crossover, and mutation
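The bullets above can be sketched as a minimal GA with fitness-proportionate selection, one-point crossover, and per-gene mutation; the population size, generation count, and mutation rate are illustrative assumptions:

```python
import random

# Genetic-algorithm sketch: selection proportional to fitness, one-point
# crossover, and per-gene mutation over a finite alphabet.

def genetic_algorithm(fitness, alphabet, length, k=20, generations=200,
                      mutation_rate=0.05):
    population = [[random.choice(alphabet) for _ in range(length)]
                  for _ in range(k)]
    for _ in range(generations):
        weights = [fitness(ind) for ind in population]
        next_gen = []
        for _ in range(k):
            p1, p2 = random.choices(population, weights=weights, k=2)  # selection
            cut = random.randrange(1, length)                          # crossover
            child = p1[:cut] + p2[cut:]
            child = [random.choice(alphabet) if random.random() < mutation_rate
                     else g for g in child]                            # mutation
            next_gen.append(child)
        population = next_gen
    return max(population, key=fitness)

# Example: evolve the all-ones string (fitness = number of 1s, +1 so the
# selection weights are never all zero)
random.seed(0)
best = genetic_algorithm(lambda s: sum(s) + 1, [0, 1], length=12)
```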

Genetic algorithms

 Fitness function: number of non-attacking pairs of queens (min = 0, max = 8 × 7/2 = 28)

 Probability of being chosen for reproduction is proportional to fitness, e.g., 24/(24+23+20+11) = 31%, 23/(24+23+20+11) = 29%, etc.

Genetic algorithms: Cross-Over

32752411 + 24748552 → 32748552 (crossover point after the third digit)
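A one-line crossover operator reproduces the slide's example (the first three digits of parent 1 joined with the rest of parent 2):

```python
# One-point crossover on string-encoded states.

def crossover(parent1, parent2, cut):
    return parent1[:cut] + parent2[cut:]

child = crossover("32752411", "24748552", cut=3)  # → "32748552"
```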

Adversarial Search (Games)

Outline
 Optimal decisions
 α-β pruning
 Imperfect, real-time decisions

Games vs. search problems
 "Unpredictable" opponent → specifying a move for every possible opponent reply
 Time limits → unlikely to find the goal, must approximate
 Task: find the optimal move → with a time limit, a search algorithm can still choose a good move
 In a normal search problem, the optimal solution would be a sequence of actions leading to a goal state, i.e., a terminal state that is a win
 In adversarial search, the opponent has a say in it

Game tree (2-player, deterministic, turns)
• It is MAX’s job to search for a good move
• The term search tree refers to a tree superimposed on the full game tree:
• it examines enough nodes to allow a player to determine what move to make

Minimax
 Perfect play for deterministic games
 Idea: choose move to position with highest minimax value
= best achievable payoff against best play

 E.g., 2-ply game:

Minimax algorithm
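The minimax value can be computed with a short recursion; here a game tree is represented as nested lists, with leaves holding utilities for MAX (the leaf values of the classic 2-ply example are assumed):

```python
# Minimax sketch: inner nodes are lists of children, leaves are utilities
# for MAX.  MAX takes the max over children, MIN the min.

def minimax(node, maximizing):
    if not isinstance(node, list):          # leaf: return its utility
        return node
    values = [minimax(child, not maximizing) for child in node]
    return max(values) if maximizing else min(values)

# 2-ply example: MAX chooses among three MIN nodes
tree = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
best = minimax(tree, maximizing=True)       # → 3
```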

Properties of minimax
 Complete? Yes (if tree is finite)
 Optimal? Yes (against an optimal opponent)
 Time complexity? O(b^m)
 Space complexity? O(bm) (depth-first exploration)

 For chess, b ≈ 35, m ≈ 100 for "reasonable" games

→ exact solution completely infeasible

α-β pruning example

Properties of α-β
 Pruning does not affect final result

 Good move ordering improves effectiveness of pruning

 With "perfect ordering," time complexity = O(b^(m/2))

→ doubles the depth of search achievable in the same time
→ See the previous example (from MIN's point of view, the order was worst-value first, so all branches {14, 5, 2} had to be explored)

Why is it called α-β?
 α is the value of the best
(i.e., highest-value)
choice found so far at
any choice point along
the path for max
 If v is worse than α, max
will avoid it
→ prune that branch

 Define β similarly for min

The α-β algorithm
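A minimal alpha-beta sketch over the same nested-list tree representation used for minimax; a branch is abandoned as soon as v falls outside the (α, β) window:

```python
import math

# Alpha-beta sketch: identical result to minimax, but prunes branches the
# opponent would never allow.

def alphabeta(node, alpha, beta, maximizing):
    if not isinstance(node, list):          # leaf: return its utility
        return node
    if maximizing:
        v = -math.inf
        for child in node:
            v = max(v, alphabeta(child, alpha, beta, False))
            alpha = max(alpha, v)
            if alpha >= beta:               # MIN will never allow this branch
                break
        return v
    else:
        v = math.inf
        for child in node:
            v = min(v, alphabeta(child, alpha, beta, True))
            beta = min(beta, v)
            if alpha >= beta:               # MAX will never allow this branch
                break
        return v

tree = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
value = alphabeta(tree, -math.inf, math.inf, True)   # → 3, same as minimax
```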

Algorithm illustration with α-β value

Resource limits
Suppose we have 100 seconds and can explore 10^4 nodes/sec
→ 10^6 nodes per move

Standard approach:

 cutoff test:
e.g., a depth limit (perhaps with quiescence search, whose main purpose is to get a good value out of an imperfect evaluation function)
 evaluation function
= estimated desirability of position
Evaluation functions
 For chess, typically linear weighted sum of features
Eval(s) = w1 f1(s) + w2 f2(s) + … + wn fn(s)

 e.g., w1 = 9 with
f1(s) = (number of white queens) – (number of black queens),
etc.
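A sketch of such a weighted sum; the feature functions and the state encoding are illustrative assumptions (material difference with standard-ish weights), not a real chess evaluator:

```python
# Weighted-sum evaluation sketch: Eval(s) = w1*f1(s) + ... + wn*fn(s).

def evaluate(state, weights, features):
    return sum(w * f(state) for w, f in zip(weights, features))

# Example: f1 = queen difference (weight 9), f2 = pawn difference (weight 1)
state = {"white_queens": 1, "black_queens": 0, "white_pawns": 6, "black_pawns": 8}
features = [lambda s: s["white_queens"] - s["black_queens"],
            lambda s: s["white_pawns"] - s["black_pawns"]]
value = evaluate(state, [9, 1], features)   # 9*1 + 1*(-2) = 7
```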

Cutting off search
MinimaxCutoff is identical to MinimaxValue except
1. Terminal? is replaced by Cutoff?
2. Utility is replaced by Eval
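Both replacements can be shown in a depth-limited minimax sketch over the nested-list tree used earlier; the depth-0 estimate here (mean of the remaining subtree's leaves) is purely illustrative:

```python
# Depth-limited minimax: Terminal? becomes a depth cutoff (Cutoff?) and the
# leaf Utility becomes an Eval estimate.

def minimax_cutoff(node, depth, maximizing, evaluate):
    if not isinstance(node, list):      # true terminal: exact utility
        return node
    if depth == 0:                      # Cutoff? replaces Terminal?
        return evaluate(node)           # Eval replaces Utility
    values = [minimax_cutoff(c, depth - 1, not maximizing, evaluate)
              for c in node]
    return max(values) if maximizing else min(values)

# Example: cut off at depth 1, estimating each unexplored MIN node by the
# mean of its leaves
tree = [[3, 12, 8], [2, 4, 6], [14, 5, 2]]
est = lambda node: sum(node) / len(node)
value = minimax_cutoff(tree, depth=1, maximizing=True, evaluate=est)
```

With enough depth the cutoff never fires and the exact minimax value is recovered.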

Does it work in practice?


b^m = 10^6, b = 35 → m ≈ 4

4-ply lookahead is a hopeless chess player!


 4-ply ≈ human novice
 8-ply ≈ typical PC, human master
 12-ply ≈ Deep Blue, Kasparov

Deterministic games in practice
 Checkers: Chinook ended 40-year-reign of human world champion Marion
Tinsley in 1994. Used a precomputed endgame database defining perfect
play for all positions involving 8 or fewer pieces on the board, a total of 444
billion positions.

 Chess: Deep Blue defeated human world champion Garry Kasparov in a six-game match in 1997. Deep Blue searches 200 million positions per second, uses very sophisticated evaluation, and undisclosed methods for extending some lines of search up to 40 ply.

 Othello: human champions refuse to compete against computers, which are too good.

 Go: human champions refuse to compete against computers, which are too bad. In Go, b > 300, so most programs use pattern knowledge bases to suggest plausible moves.

Summary
 Games are fun to work on!
 They illustrate several important points about AI
 perfection is unattainable → must approximate
 good idea to think about what to think about

References
[1] Lathrop, R., Local Search Algorithm, lecture notes for the Introduction to Artificial Intelligence course, Computer Science Department, University of California, USA: http://www.ics.uci.edu/~rickl/courses/cs-271/2013-wq-cs271/2013-wq-cs271-lecture-slides/2013wq271-05-LocalSearch.ppt
[2] Moore, A. W., Iterative Improvement Search, Statistical Data Mining Tutorials, School of Computer Science, Carnegie Mellon University, USA: www.autonlab.org/tutorials/hillclimb02.pdf
[3] Hwee Tou Ng's (Singapore) course slides for AIMA: http://aima.eecs.berkeley.edu/slides-ppt/
[4] AISpace2, tools for learning and understanding AI: https://aispace2.github.io/AISpace2/index.html
THANK YOU
