
Heuristic Search

Chapter 3
Outline

• Generate-and-test
• Hill climbing
• Best-first search
• Problem reduction
• Constraint satisfaction
• Means-ends analysis

2
Generate-and-Test

Algorithm
1. Generate a possible solution.
2. Test to see if this is actually a solution.
3. Quit if a solution has been found.
Otherwise, return to step 1.

3
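The three steps above can be sketched directly. The toy problem and candidate generator below are assumptions for illustration:

```python
import itertools

def generate_and_test(candidates, is_solution):
    """Step 1: generate a candidate; step 2: test it; step 3: quit on success."""
    for candidate in candidates:
        if is_solution(candidate):
            return candidate
    return None  # search space exhausted without finding a solution

# Toy problem (assumed): find a digit pair whose sum is 7 and product is 12.
solution = generate_and_test(
    itertools.product(range(10), repeat=2),
    lambda p: p[0] + p[1] == 7 and p[0] * p[1] == 12)
print(solution)  # (3, 4)
```

Exhaustive generation like this is exactly the "British Museum" behaviour criticised on the next slide: every candidate is enumerated until one passes the test.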
Generate-and-Test

• Acceptable for simple problems.


• Inefficient for problems with a large search space.

• Also known as British Museum algorithm.

• A depth-first search procedure, since a complete solution must be
generated before it can be tested.

4
Generate-and-Test

• Exhaustive generate-and-test.
• Heuristic generate-and-test: do not consider paths that seem
unlikely to lead to a solution.
• Plan generate-test:
– Create a list of candidates.
– Apply generate-and-test to that list.

5
Generate-and-Test

Example: coloured blocks


“Given four 6-sided cubes, with each side of each cube painted
one of four colours, arrange the cubes in a row such that on all
four sides of the row one block face of each colour is showing.”

6
Generate-and-Test

Example: coloured blocks

Heuristic: if there are more red faces than faces of other colours,
then, when placing a block with several red faces, use as few of
them as possible as outside faces.

7
8
Hill Climbing

• Searching for a goal state = Climbing to the top of a hill

9
10
Hill Climbing

• Generate-and-test + direction to move.


• Heuristic function to estimate how close a given state is to
a goal state.

11
Simple Hill Climbing

Algorithm
1. Evaluate the initial state.
2. Loop until a solution is found or there are no new operators
left to be applied:
– Select and apply a new operator.
– Evaluate the new state:
goal → quit
better than current state → new current state
not better than current state → continue in the loop

12
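The loop above can be sketched as follows; the toy landscape and move set are assumptions for illustration:

```python
def simple_hill_climbing(initial, successors, value):
    """Loop: take the FIRST successor that is better than the current state;
    stop when no operator yields an improvement."""
    current = initial
    while True:
        for s in successors(current):
            if value(s) > value(current):   # better than current -> new current
                current = s
                break
        else:                               # no operator improved things: stop
            return current

# Toy landscape (assumed): maximise -(x - 3)^2 over integers, moving by +/-1.
best = simple_hill_climbing(0, lambda x: [x - 1, x + 1],
                            lambda x: -(x - 3) ** 2)
print(best)  # 3
```

Note that the first improving successor is taken immediately; the steepest-ascent variant a few slides later examines all successors before moving.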
E.g., for the 8-puzzle:
Heuristics
• h1(n) = number of misplaced tiles
• h2(n) = total Manhattan distance
(i.e., no. of squares from desired location of each tile)

• h1(S) = 4
• h2(S) = 1 + 2 + 1 + 1 = 5
(for the example state S shown in the original figure)
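Both heuristics are easy to compute. The goal layout and example state S below are assumptions (the slide's own figure is not reproduced here), constructed so that h1(S) = 4 and h2(S) = 5:

```python
# Assumed goal layout: tiles 1..8 in order, with 0 marking the blank.
GOAL = ((1, 2, 3),
        (4, 5, 6),
        (7, 8, 0))

def misplaced(state):
    """h1: number of tiles (blank excluded) not on their goal square."""
    return sum(1 for i in range(3) for j in range(3)
               if state[i][j] != 0 and state[i][j] != GOAL[i][j])

def manhattan(state):
    """h2: sum over tiles of row distance + column distance to the goal square."""
    goal_pos = {GOAL[i][j]: (i, j) for i in range(3) for j in range(3)}
    total = 0
    for i in range(3):
        for j in range(3):
            tile = state[i][j]
            if tile != 0:
                gi, gj = goal_pos[tile]
                total += abs(i - gi) + abs(j - gj)
    return total

# Example state S (constructed here): 4 misplaced tiles, distances 1+1+2+1.
S = ((2, 3, 1),
     (4, 5, 6),
     (7, 0, 8))
print(misplaced(S), manhattan(S))  # 4 5
```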



Steepest-Ascent Hill Climbing (Gradient
Search)
• Considers all the moves from the current state.
• Selects the best one as the next state.

• In contrast to the basic method, in which the first state that is
better than the current state is selected.

15
Steepest-Ascent Hill Climbing (Gradient
Search)
Algorithm
1. Evaluate the initial state.
2. Loop until a solution is found or a complete iteration produces
no change to current state:
– SUCC = a state such that any possible successor of the
current state will be better than SUCC (initially, the worst state).
– For each operator that applies to the current state, evaluate
the new state:
goal → quit
better than SUCC → set SUCC to this state
– If SUCC is better than the current state → set the current
state to SUCC.
16
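A sketch of the loop above; the 2-D toy landscape and move set are assumptions for illustration:

```python
def steepest_ascent(initial, successors, value):
    """Evaluate ALL successors of the current state and move to the best one
    (SUCC); stop when even the best successor is no better than current."""
    current = initial
    while True:
        succ = max(successors(current), key=value)   # best move this iteration
        if value(succ) <= value(current):            # no change: stop
            return current
        current = succ

# Toy landscape (assumed): maximise a quadratic peaked at (2, 1),
# moving one unit at a time in the four grid directions.
moves = ((1, 0), (-1, 0), (0, 1), (0, -1))
best = steepest_ascent(
    (0, 0),
    lambda s: [(s[0] + dx, s[1] + dy) for dx, dy in moves],
    lambda s: -(s[0] - 2) ** 2 - (s[1] - 1) ** 2)
print(best)  # (2, 1)
```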
Hill Climbing: Disadvantages

Local maximum
A state that is better than all of its neighbours, but not
better than some other states far away.

17
Hill Climbing: Disadvantages

Plateau
A flat area of the search space in which all neighbouring
states have the same value.

18
Hill Climbing: Disadvantages

Ridge
The orientation of the high region, compared to the set
of available moves, makes it impossible to climb up.
However, two moves executed serially may increase
the height.

19
Hill Climbing: Soln

Ways Out
• Backtrack to some earlier node and try going in a different
direction.
• Make a big jump to try to get into a new section of the space.
• Move in several directions at once.

20
Hill Climbing: Soln

• Hill climbing is a local method:


Decides what to do next by looking only at the
“immediate” consequences of its choices.
• Global information might be encoded in heuristic
functions.

21
Hill Climbing: Disadvantages

Start: a single stack, A on D on C on B (B on the table).
Goal: a single stack, D on C on B on A (A on the table).
Blocks World

22
Hill Climbing: Disadvantages

Start: A on D on C on B (score 0)
Goal: D on C on B on A (score 4)
Blocks World
Local heuristic:
+1 for each block that is resting on the thing it is supposed to be
resting on.
−1 for each block that is resting on a wrong thing.
23
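The local heuristic can be scored with a small function. Representing a state as a mapping from each block to what it rests on is an assumption of this sketch:

```python
# Goal: D on C on B on A, with A on the table.
GOAL = {"A": "table", "B": "A", "C": "B", "D": "C"}

def local_h(state):
    """+1 per block resting on the correct thing, -1 per block on a wrong thing."""
    return sum(1 if state[b] == GOAL[b] else -1 for b in state)

# Start state from the slide: A on D on C on B, with B on the table.
# C and D rest on the right blocks (+2); A and B do not (-2).
start = {"B": "table", "C": "B", "D": "C", "A": "D"}
print(local_h(start), local_h(GOAL))  # 0 4
```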
Hill Climbing: Disadvantages

Moving A to the table takes the start state (score 0) to a new
current state, D on C on B with A on the table (score 2).

24
Hill Climbing: Disadvantages

From the state D on C on B, A on the table (score 2), all three
successors score 0:
• A back on D (the start state again)
• D moved onto A (C on B; D on A)
• D moved to the table (C on B; A and D on the table)
No successor beats the current score, so hill climbing halts short
of the goal.

25
Hill Climbing: Disadvantages

Start: A on D on C on B (score −6)
Goal: D on C on B on A (score +6)
Blocks World
Global heuristic:
For each block that has the correct support structure: +1 to
every block in the support structure.
For each block that has a wrong support structure: −1 to
every block in the support structure.
26
Hill Climbing: Disadvantages

D 3
C
B A

A 6
2
D
C C D C 1

B B A B A D

27
Hill Climbing: Conclusion

• Can be very inefficient in a large, rough problem space.

• A global heuristic may buy power at the price of computational
complexity.

• Often useful when combined with other methods that get it
started in the right general neighbourhood.

28
Simulated Annealing

• A variation of hill climbing in which, at the beginning of


the process, some downhill moves may be made.

• To do enough exploration of the whole space early on, so


that the final solution is relatively insensitive to the
starting state.
• Lowering the chances of getting caught at a local maximum, a
plateau, or a ridge.

29
Simulated Annealing

• Minimizes an objective function rather than maximizing one.

30
Simulated Annealing

Physical Annealing
• Physical substances are melted and then gradually cooled
until some solid state is reached.
• The goal is to produce a minimal-energy state.
• Annealing schedule: if the temperature is lowered
sufficiently slowly, then the goal will be attained.
• Nevertheless, there is some probability of a transition to a
higher energy state: p = e^(−ΔE/kT).

31
Simulated Annealing

Algorithm
1. Evaluate the initial state.
2. Loop until a solution is found or there are no new operators left to
be applied:
– Set T according to an annealing schedule.
– Select and apply a new operator.
– Evaluate the new state:
goal → quit
ΔE = Val(current state) − Val(new state)
ΔE < 0 → new current state
else → new current state with probability e^(−ΔE/kT)
32
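A sketch of the loop above; the cooling schedule, toy objective, and random move generator are assumptions for illustration:

```python
import math
import random

def simulated_annealing(initial, successor, value, schedule, k=1.0):
    """Accept every uphill move; accept a downhill move with probability
    e^(-dE / kT), which shrinks as the temperature T is lowered."""
    current = initial
    for T in schedule:                      # T set by the annealing schedule
        new = successor(current)
        dE = value(current) - value(new)    # dE > 0 means the move is downhill
        if dE < 0 or random.random() < math.exp(-dE / (k * T)):
            current = new
    return current

# Toy run (assumed): maximise -(x - 5)^2 over integers with random +/-1 moves.
random.seed(0)
cooling = [10.0 / (1 + t) for t in range(500)]   # simple cooling schedule
final = simulated_annealing(0, lambda x: x + random.choice((-1, 1)),
                            lambda x: -(x - 5) ** 2, cooling)
print(final)
```

Early on, with T large, downhill moves are accepted often (exploration); by the end the acceptance probability is effectively zero and the loop behaves like ordinary hill climbing.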
Simulated Annealing

• Major components of annealing schedule


– Initial value to be used for the temperature.
– Criteria to decide when the temperature should be reduced.
– Amount by which the temperature is to be reduced.
– When to quit.

33
Best-First Search

• From depth-first search: not all competing branches have to be
expanded.

• From breadth-first search: the search does not get trapped on
dead-end paths.
– Combining the two: follow a single path at a time, but
switch paths whenever some competing path looks more
promising than the current one.

34
Best-First Search

• Select the most promising of the nodes generated so far
– by applying an appropriate heuristic function to each of them.
• Expand the chosen node by using the rules to generate its
successors.
• If one of them is a solution, then quit;
else the most promising node is selected and the process
continues.

35
Best-First Search vs Steepest-Ascent Hill
Climbing
• In hill climbing, one move is selected and all others are
rejected, while in best-first search one move is selected but
the others are kept around and can be revisited.
• In best-first search, the best available state is selected even
if its value is lower than that of the current state, in contrast
to the hill-climbing algorithm.

36
Best-First Search

Step 1: A
Step 2: expand A → B (3), C (5), D (1)
Step 3: expand D → E (4), F (6)
Step 4: expand B → G (6), H (5)
Step 5: expand E → I (2), J (1)
37
Best-First Search

• OPEN: nodes that have been generated, but have not yet been
examined.
This is organized as a priority queue.

• CLOSED: nodes that have already been examined.


Whenever a new node is generated, check whether it has
been generated before.

38
Best-First Search

Algorithm
1. OPEN = {initial state}.
2. Loop until a goal is found or there are no nodes left in OPEN:
– Pick the best node in OPEN.
– Generate its successors.
– For each successor:
if new → evaluate it, add it to OPEN, record its parent
if generated before → change parent, update successors

39
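The algorithm above can be sketched with OPEN as a priority queue keyed on heuristic value; the "generated before" bookkeeping is simplified away, and the graph, h values, and goal node J below are assumptions for illustration:

```python
import heapq

def best_first_search(start, successors, h, is_goal):
    """Best-first search: always expand the most promising node on OPEN
    (lowest h), keeping the others around so the search can jump back."""
    open_heap = [(h(start), start)]      # OPEN as a priority queue
    parent = {start: None}
    closed = set()                       # CLOSED: already-examined nodes
    while open_heap:
        _, node = heapq.heappop(open_heap)
        if is_goal(node):
            path = []
            while node is not None:      # trace parent pointers back
                path.append(node)
                node = parent[node]
            return path[::-1]
        if node in closed:
            continue
        closed.add(node)
        for s in successors(node):
            if s not in parent:          # new node: record parent, add to OPEN
                parent[s] = node
                heapq.heappush(open_heap, (h(s), s))
    return None

# Toy graph and heuristic values (assumed), with J taken as the goal.
graph = {"A": ["B", "C", "D"], "D": ["E", "F"], "B": ["G", "H"], "E": ["I", "J"]}
hval = {"A": 9, "B": 3, "C": 5, "D": 1, "E": 4, "F": 6,
        "G": 6, "H": 5, "I": 2, "J": 1}
path = best_first_search("A", lambda n: graph.get(n, []), hval.get,
                         lambda n: n == "J")
print(path)  # ['A', 'D', 'E', 'J']
```

Note how the search expands D first (h = 1), then jumps back to B (h = 3) when D's children look worse, exactly the path-switching behaviour described above.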
Best-First Search

• Greedy search:
h(n) = estimated cost of the cheapest path from node
n to a goal state.
Neither optimal nor complete

• Uniform-cost search:
g(n) = cost of the cheapest path from the initial state
to node n.
Optimal and complete, but very inefficient

42
Best-First Search

• Algorithm A* (Hart et al., 1968):

f(n) = g(n) + h(n)

h(n) = cost of the cheapest path from node n to a


goal state.
g(n) = cost of the cheapest path from the initial state
to node n.

43
Best-First Search

• Algorithm A*:

f*(n) = g*(n) + h*(n)

h*(n) (heuristic factor) = estimate of h(n).

g*(n) (depth factor) = approximation of g(n) found by A* so far.

44
Algorithm A* (with any h on search
Graph)
• Input: an implicit search graph problem with cost on the arcs
• Output: the minimal cost path from start node to a goal node.
– 1. Put the start node s on OPEN.
– 2. If OPEN is empty, exit with failure
– 3. Remove from OPEN and place on CLOSED a node n having minimum f.
– 4. If n is a goal node exit successfully with a solution path obtained by tracing back the
pointers from n to s.
– 5. Otherwise, expand n generating its children and directing pointers from each child
node to n.
• For every child node n’ do
– evaluate h(n’) and compute f(n’) = g(n’) + h(n’) = g(n) + c(n,n’) + h(n’)
– If n’ is already on OPEN or CLOSED, compare its new f with the old f and
attach the lowest f to n’.
– Put n’ with its f value in the right order in OPEN.
– 6. Go to step 2.

45
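A compact version of steps 1 to 6; the small weighted graph and admissible h values below are assumptions for illustration:

```python
import heapq

def a_star(start, goal, neighbors, cost, h):
    """A*: repeatedly move the OPEN node with minimum f = g + h to CLOSED;
    when a cheaper path to a child is found, keep the lower f (step 5)."""
    g = {start: 0}
    parent = {start: None}
    open_heap = [(h(start), start)]
    closed = set()
    while open_heap:
        _, n = heapq.heappop(open_heap)
        if n == goal:                        # step 4: trace pointers back to start
            path = []
            while n is not None:
                path.append(n)
                n = parent[n]
            return path[::-1], g[goal]
        if n in closed:
            continue
        closed.add(n)
        for m in neighbors(n):               # step 5: expand n
            new_g = g[n] + cost(n, m)
            if m not in g or new_g < g[m]:   # attach the lowest f to m
                g[m] = new_g
                parent[m] = n
                heapq.heappush(open_heap, (new_g + h(m), m))
    return None, float("inf")

# Small weighted graph and admissible heuristic (assumed for illustration).
edges = {("S", "A"): 1, ("S", "B"): 4, ("A", "B"): 2, ("A", "C"): 5, ("B", "C"): 1}
nbrs = {"S": ["A", "B"], "A": ["B", "C"], "B": ["C"], "C": []}
hv = {"S": 3, "A": 2, "B": 1, "C": 0}
path, total = a_star("S", "C", lambda n: nbrs[n],
                     lambda a, b: edges[(a, b)], hv.get)
print(path, total)  # ['S', 'A', 'B', 'C'] 4
```

Here the direct edge A to C (cost 5) is beaten by the detour through B (total cost 4), which A* finds by updating C's g and f when the cheaper path appears.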
Problem Reduction

Goal: Acquire TV set

Either: Goal: Steal TV set
Or (an AND arc): Goal: Earn some money and Goal: Buy TV set

AND-OR Graphs

Algorithm AO* (Martelli & Montanari 1973, Nilsson 1980)

46
AND-OR Graph

47
Problem Reduction

48
Problem Reduction: AO*

(Figure: successive snapshots of AO* on an AND-OR graph. Node A is
expanded to B and to the AND arc through C and D; as E, F, G, and H
are generated below them, the cost estimates at A, B, C, and D are
revised upward, and the most promising partial solution graph
changes accordingly.)
49
Constraint Satisfaction

• Many AI problems can be viewed as problems of


constraint satisfaction.

Cryptarithmetic puzzle:

   SEND
 + MORE
 ------
  MONEY

50
Constraint Satisfaction

• As compared with a straightforward search procedure,


viewing a problem as one of constraint satisfaction can
reduce substantially the amount of search.

51
Constraint Satisfaction

• Operates in a space of constraint sets.

• Initial state contains the original constraints given in the


problem.

• A goal state is any state that has been constrained


“enough”.

52
Constraint Satisfaction

Two-step process:
1. Constraints are discovered and propagated as far as
possible.
2. If there is still not a solution, then search begins, adding
new constraints.

53
Initial state:
• No two letters have the same value.
• The sum of the digits must be as shown:

     SEND
   + MORE
   ------
    MONEY

Constraint propagation yields:
M = 1
S = 8 or 9
O = 0
N = E + 1
C2 = 1
N + R > 8
E ≠ 9

Guess E = 2:
N = 3
R = 8 or 9
2 + D = Y or 2 + D = 10 + Y

Case C1 = 0:
2 + D = Y
N + R = 10 + E
R = 9
S = 8

Case C1 = 1:
2 + D = 10 + Y
D = 8 + Y
D = 8 or 9
D = 8 → Y = 0
D = 9 → Y = 1
54
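For comparison, plain generate-and-test also cracks the puzzle once the first two propagated facts (M = 1 and O = 0) are plugged in; the brute-force baseline below is a sketch of that, not the propagation procedure itself:

```python
from itertools import permutations

def solve():
    """Brute-force SEND + MORE = MONEY with M = 1 and O = 0 plugged in."""
    a = {"M": 1, "O": 0}
    rest = [d for d in range(10) if d not in (0, 1)]    # digits still available
    for digits in permutations(rest, 6):                # 8P6 = 20160 candidates
        a.update(zip("SENDRY", digits))                 # all letters distinct
        send = int("".join(str(a[c]) for c in "SEND"))
        more = int("".join(str(a[c]) for c in "MORE"))
        money = int("".join(str(a[c]) for c in "MONEY"))
        if send + more == money:
            return send, more, money
    return None

print(solve())  # (9567, 1085, 10652)
```

Constraint propagation reaches the same unique assignment (S=9, E=5, N=6, D=7, R=8, Y=2) while examining only a handful of alternatives, which is the point of the comparison on slide 51.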
Constraint Satisfaction

Two kinds of rules:


1. Rules that define valid constraint propagation.
2. Rules that suggest guesses when necessary.

55
