Context Specific Multiagent Coordination and Planning with Factored MDPs (2002)

Carlos Guestrin, Shobha Venkataraman, and Daphne Koller

Abstract -- We present an algorithm for coordinated decision making in cooperative multiagent settings, where the agents' value function can be represented as a sum of context-specific value rules. The task of finding an optimal joint action in this setting leads to an algorithm where the coordination structure between agents depends on the current state of the system and even on the actual numerical values assigned to the value rules. We apply this framework to the task of multiagent planning in dynamic systems, showing how a joint value function of the associated Markov Decision Process can be approximated as a set of value rules using an efficient linear programming algorithm. The agents then apply the coordination graph algorithm at each iteration of the process to decide on the highest-value joint action, potentially leading to a different coordination pattern at each step of the plan.

download information

Carlos Guestrin, Shobha Venkataraman, and Daphne Koller (2002). "Context Specific Multiagent Coordination and Planning with Factored MDPs." The Eighteenth National Conference on Artificial Intelligence (AAAI-2002) (pp. 253-259).     ps.gz

bibtex citation

@inproceedings{Guestrin+al:2002d,
   author = {Carlos Guestrin and Shobha Venkataraman and Daphne Koller},
   title = {Context Specific Multiagent Coordination and Planning with Factored {MDP}s},
   year = {2002},
   month = {July},
   booktitle = {The Eighteenth National Conference on Artificial Intelligence (AAAI-2002)},
   address = {Edmonton, Canada},
   pages = {253--259},
}