Seiji Yamada, Tokyo Institute of Technology
This paper describes a novel method to interleave planning with execution in a dynamic environment. Although, in such planning, it is very important to control deliberation: to determine the timing for interleaving them, few research has been done. To cope with this problem, we propose a method to determine the interleave timing with the success probability, SP, that a plan will be successfully executed in an environment. We also developed a method to compute it efficiently with Bayesian networks and implemented SIP system. The system stops planning when the locally optimal plan’s SP falls below an execution threshold, and executes the plan. Since SP depends on dynamics of an environment, a system does reactive behavior in a very dynamic environment, and becomes deliberative in a static one. We made experiments in Tileworld by changing dynamics and observation costs. As a result, we found the optimal threshold between reactivity and deliberation in some problem classes. Furthermore we found out the optimal threshold is robust against the change of dynamics and observation cost, and one of classes in which SIP works well is that the dynamics itself changes.