Index

Title: An analysis of sensor-based task planning

Author(s): Olawsky, D. and Krebsbach, K. and Gini, M.

Source: Tech. Report 95-51, Dept of Comp. Sci., Univ. of MN (on-line version)

Year: 1995

Critique:

This paper addresses issues in planning under incomplete information. Rather that attempting to model the uncertainty as a partially observable Markov decision process the planner accepts that the information is incomplete, and attempts to plan around the missing information. If that is not possible, the planner can either use a default value for the missing information or it can add specific sensing operations to acquire the missing information.

The planning system is an interleaved plan/execute system called BUMP (Basic University of Minnesota Planner). The interleaving is used to acquire additional information during plan execution. The planning domain is Tool Box World, which provides the planner with both incomplete information, and a wold which defeats locally greedy planning algorithms. The world consists of a set of 'tool boxes' which can be opened, closed, and bolted shut by the simulated robot. These tool boxes contain wrenches and bolts of specific, non-interchangeable, sizes. The task is to lock specific tool boxes with specific sized bolts. Incomplete information is provided by limiting the knowledge of where specific wrenches and bolts are located.

A critical component for any comparison is the metric. This paper establishes two plan quality criteria which will be used to compare planning strategies:

These criteria are considered under multiple planning strategies, including changes to goal ordering, when to sense, and what to sense. So each planning strategy consists of a policy for each of these areas. These planning strategies are evaluated according to the rates of failures, and the nature of the failures, either a premature action, which must be undone to complete the plan, or failures due to bad sensor data (either incorrectly sensed or invalid default values).

The paper presents a detailed mathematical analysis of the costs associated with actions and for recovering from bad data, the expected number of premature actions over the problem space, and from these the expected cost of using a specific policy. In addition, the cost of recovering from bad data is calculated, and a O(b log b) (where b is the number of boxes) algorithm for finding a near-optimal policy is presented. The algorithm is near-optimal because an approximation to the actual cost of recovering from bad data is used.


Index
jpg3u@Virginia.edu
Last modified: Wed Mar 10 16:23:33 1999