Real-Time Planning as Decision-Making under Uncertainty

Andrew Mitchell; Wheeler Ruml; Fabian Spaniol; Jorg Hoffmann; Marek Petrik

doi:10.1609/aaai.v33i01.33012338

Authors

Andrew Mitchell University of New Hampshire
Wheeler Ruml University of New Hampshire
Fabian Spaniol Saarland University
Jorg Hoffmann Saarland University
Marek Petrik University of New Hampshire

DOI:

https://doi.org/10.1609/aaai.v33i01.33012338

Abstract

In real-time planning, an agent must select the next action to take within a fixed time bound. Many popular real-time heuristic search methods approach this by expanding nodes using time-limited A* and selecting the action leading toward the frontier node with the lowest f value. In this paper, we reconsider real-time planning as a problem of decision-making under uncertainty. We propose treating heuristic values as uncertain evidence and we explore several backup methods for aggregating this evidence. We then propose a novel lookahead strategy that expands nodes to minimize risk, the expected regret in case a non-optimal action is chosen. We evaluate these methods in a simple synthetic benchmark and the sliding tile puzzle and find that they outperform previous methods. This work illustrates how uncertainty can arise even when solving deterministic planning problems, due to the inherent ignorance of time-limited search algorithms about those portions of the state space that they have not computed, and how an agent can benefit from explicitly metareasoning about this uncertainty.

Real-Time Planning as Decision-Making under Uncertainty

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription