Interactive dynamic influence diagrams (I-DIDs) are recognized graphical models for sequential multiagent decision making under uncertainty. They represent the problem of how a subject agent acts in a common setting shared with other agents who may act in sophisticated ways. The difficulty in solving I-DIDs is mainly due to an exponentially growing space of candidate models ascribed to other agents over time. in order to minimize the model space, the previous I-DID techniques prune behaviorally equivalent models. In this paper, we challenge the minimal set of models and propose a value equivalence approach to further compress the model space. The new method reduces the space by additionally pruning behaviorally distinct models that result in the same expected value of the subject agent’s optimal policy. To achieve this, we propose to learn the value from available data particularly in practical applications of real-time strategy games. We demonstrate the performance of the new technique in two problem domains.
|Title of host publication||Proceedings of the 2016 International Conference on Autonomous Agents & Multiagent Systems|
|Publication status||Published - 9 May 2016|
|Event||15th International Conference on Autonomous Agents and Multiagent Systems - Grand Copthorne Waterfront Hotel, Singapore|
Duration: 9 May 2016 → 13 May 2016
Conference number: 15
|Conference||15th International Conference on Autonomous Agents and Multiagent Systems|
|Period||9/05/16 → 13/05/16|