Consistent heuristic: Difference between revisions
Move confusing equality from proof into its own line |
made proof simpler and more intuitive and added simple example of why the converse isn't true |
||
Line 1: | Line 1: | ||
In the study of [[shortest path problem|path-finding problems]] in [[artificial intelligence]], a [[heuristic function]] is said to be '''consistent''', or '''monotone''', if its estimate is always less than or equal to the estimated distance from any neighbouring vertex to the goal, plus the cost of reaching that neighbour. |
In the study of [[shortest path problem|path-finding problems]] in [[artificial intelligence]], a [[heuristic function]] is said to be '''consistent''', or '''monotone''', if its estimate is always less than or equal to the estimated distance from any neighbouring vertex to the goal, plus the cost of reaching that neighbour. |
||
Formally, for every node ''N'' and each [[Successor_(graph_theory)#Direction|successor]] ''P'' of ''N'', the estimated cost of reaching the goal from ''N'' is no greater than the step cost of getting to ''P'' plus the estimated cost of reaching the goal from ''P''. That is: |
Formally, for every node ''N'' and each [[Successor_(graph_theory)#Direction|successor]] ''P'' of ''N'', the estimated cost of reaching the goal from ''N'' is no greater than the step cost of getting to ''P'' plus the estimated cost of reaching the goal from ''P''. That is: |
||
Line 11: | Line 11: | ||
:* c(N,P) is the cost of reaching node P from N |
:* c(N,P) is the cost of reaching node P from N |
||
Informally, every node ''i'' will give an estimate that, accounting for the cost to reach the next node, is always lesser than the estimate at node ''i+1''. |
|||
⚫ | A consistent heuristic is also [[admissible heuristic|admissible]], i.e. it never overestimates the cost of reaching the goal (the [[Converse (logic)|converse]], however, is not always true). This is proved by induction |
||
:<math>h(N_{m+1}) \leq c(N_{m+1}, N_m) + h(N_m) \leq c(N_{m+1}, N_m) + h^*(N_m)</math> |
|||
⚫ | |||
By definition of <math>h^*(n)</math>, |
|||
Let <math>h(N_{0}) = 0</math> be the estimated cost for the goal node. This implies that the base condition is trivially true as 0 ≤ 0. Since the heuristic is consistent, <math>h(N_{i+1}) \leq c(N_{i+1}, N_{i}) + h(N_{i}) = c(N_{i+1}, N_{i}) + c(N_{i}, N_{i-1}) + h(N_{i-1}) = c(N_{i+1}, N_{i}) + c(N_{i}, N_{i-1}) + ... + c(N_{1}, N_{0}) + h(N_{0})</math>. Since the terms in our equation are equal to the true cost <math>\sum_{i=1}^n c(N_{i}, N_{i-1})</math>, any consistent heuristic is also admissible. |
|||
:<math>c(N_{m+1}, N_m) + h^*(N_m) = h^*(N_{m+1})</math>, |
|||
making <math>h(n)</math> admissible. (<math>N_{m+1}</math> is any node whose best path to the goal, of length m+1, goes through some immediate child <math>N_{m}</math> whose best path to the goal is of length m.) |
|||
The converse is clearly not true as we can always construct a heuristic that is always below the true cost but is nevertheless inconsistent by, for instance, increasing the heuristic estimate from the farthest node as we get closer and, when the estimate <math>h(N_{i})</math> becomes at most the true cost <math>h^*(N_{i})</math>, we make <math>h(N_{i-1}) = h(N_{i}) - c(N_{i}, N_{i-1})</math>. |
|||
==Consequences of monotonicity== |
==Consequences of monotonicity== |
Revision as of 21:44, 13 May 2021
In the study of path-finding problems in artificial intelligence, a heuristic function is said to be consistent, or monotone, if its estimate is always less than or equal to the estimated distance from any neighbouring vertex to the goal, plus the cost of reaching that neighbour.
Formally, for every node N and each successor P of N, the estimated cost of reaching the goal from N is no greater than the step cost of getting to P plus the estimated cost of reaching the goal from P. That is:
- and
where
- h is the consistent heuristic function
- N is any node in the graph
- P is any descendant of N
- G is any goal node
- c(N,P) is the cost of reaching node P from N
Informally, every node i will give an estimate that, accounting for the cost to reach the next node, is always lesser than the estimate at node i+1.
A consistent heuristic is also admissible, i.e. it never overestimates the cost of reaching the goal (the converse, however, is not always true). This is proved by induction.
Let be the estimated cost for the goal node. This implies that the base condition is trivially true as 0 ≤ 0. Since the heuristic is consistent, . Since the terms in our equation are equal to the true cost , any consistent heuristic is also admissible.
The converse is clearly not true as we can always construct a heuristic that is always below the true cost but is nevertheless inconsistent by, for instance, increasing the heuristic estimate from the farthest node as we get closer and, when the estimate becomes at most the true cost , we make .
Consequences of monotonicity
Consistent heuristics are called monotone because the estimated final cost of a partial solution, is monotonically non-decreasing along the best path to the goal, where is the cost of the best path from start node to . It's necessary and sufficient for a heuristic to obey the triangle inequality in order to be consistent.[1]
In the A* search algorithm, using a consistent heuristic means that once a node is expanded, the cost by which it was reached is the lowest possible, under the same conditions that Dijkstra's algorithm requires in solving the shortest path problem (no negative cost cycles). In fact, if the search graph is given cost for a consistent , then A* is equivalent to best-first search on that graph using Dijkstra's algorithm.[2] In the unusual event that an admissible heuristic is not consistent, a node will need repeated expansion every time a new best (so-far) cost is achieved for it.
If the given heuristic is admissible but not consistent, one can artificially force the heuristic values along a path to be monotonically non-decreasing by using
as the heuristic value for instead of , where is the node immediately preceding on the path and . This idea is due to László Mérō[3] and is now known as pathmax. Contrary to common belief, pathmax does not turn an admissible heuristic into a consistent heuristic. For example, if A* uses pathmax and a heuristic that is admissible but not consistent, it is not guaranteed to have an optimal path to a node when it is first expanded.[4]
See also
References
- ^ Pearl, Judea (1984). Heuristics: Intelligent Search Strategies for Computer Problem Solving. Addison-Wesley. ISBN 0-201-05594-5.
- ^ Edelkamp, Stefan; Schrödl, Stefan (2012). Heuristic Search: Theory and Applications. Morgan Kaufmann. ISBN 978-0-12-372512-7.
- ^ Mérō, László (1984). "A Heuristic Search Algorithm with Modifiable Estimate". Artificial Intelligence. 23: 13–27. doi:10.1016/0004-3702(84)90003-1.
- ^ Holte, Robert (2005). "Common Misconceptions Concerning Heuristic Search". Proceedings of the Third Annual Symposium on Combinatorial Search (SoCS).