Approximating Value Equivalence in Interactive Dynamic Influence Diagrams Using Behavioral Coverage

Ross Conroy, Yifeng Zeng, Jing Tang

Research output: Contribution to conferencePaper

12 Downloads (Pure)

Abstract

Interactive dynamic influence diagrams (I-DIDs) provide an explicit way of modeling how a subject agent solves decision making problems in the presence of other agents in a common setting. To optimize its decisions, the subject agent needs to predict the other agents’ behavior, that is generally obtained by solving their candidate models. This becomes extremely difficult since the model space may be rather large, and grows when the other agents act and observe over the time. A recent proposal for solving I-DIDs lies in a concept of value equivalence (VE) that shows potential advances on significantly reducing the model space. In this paper, we establish a principled framework to implement the VE techniques and propose an approximate method to compute VE of candidate models. The development offers ample opportunity of exploiting VE to further improve the scalability of IDID solutions. We theoretically analyze properties of the approximate techniques and show empirical results in multiple problem domains.
Original languageEnglish
Publication statusPublished - 9 Jul 2016

Fingerprint Dive into the research topics of 'Approximating Value Equivalence in Interactive Dynamic Influence Diagrams Using Behavioral Coverage'. Together they form a unique fingerprint.

  • Cite this