Robust Combination of Local Controllers (2001)

Carlos Guestrin, and Dirk Ormoneit

Abstract -- Finding solutions to high dimensional Markov Decision Processes (MDPs) is a difficult problem, especially in the presence of uncertainty or if the actions and time measurements are continuous. Frequently this difficulty can be alleviated by the availability of problem-specific knowledge. For example, it may be relatively easy to design controllers that are good locally, though having no global guarantees. We propose a nonparametric method to combine these local controllers to obtain globally good solutions. We apply this formulation to two types of problems: motion planning (stochastic shortest path problems) and discounted-cost MDPs. For motion planning, we argue that only considering the expected cost of a path may be overly simplistic in the presence of uncertainty. We propose an alternative: finding the minimum cost path, subject to the constraint that the robot must reach the goal with high probability. For this problem, we prove that a polynomial number of samples is sufficient to obtain a high probability path. For discounted MDPs, we consider various problem formulations that explicitly deal with model uncertainty. We provide empirical evidence of the usefulness of these approaches using the control of a robot arm.

download information

Carlos Guestrin and Dirk Ormoneit (2001). "Robust Combination of Local Controllers." The seventeenth Conference on Uncertainty in Artificial Intelligence (UAI-01) (pp. 178 - 185).   ps  

bibtex citation

@inproceedings{Guestrin+al:2001e,
   author = {Carlos Guestrin and Dirk Ormoneit},
   title = {Robust Combination of Local Controllers},
   year = {2001},
   booktitle = "The seventeenth Conference on Uncertainty in Artificial Intelligence (UAI-01)",
   address = "Seattle, Washington",
   month = {August},
   pages = {178 -- 185},
}