TY - JOUR
T1 - Can bounded and self-interested agents be teammates? Application to planning in ad hoc teams
AU - Chandrasekaran, Muthukumaran
AU - Doshi, Prashant
AU - Zeng, Yifeng
AU - Chen, Yingke
PY - 2016/11/23
Y1 - 2016/11/23
N2 - Planning for ad hoc teamwork is challenging because it involves agents collaborating
without any prior coordination or communication. The focus is on principled methods
for a single agent to cooperate with others. This motivates investigating the ad hoc teamwork
problem in the context of self-interested decision-making frameworks. Agents engaged in
individual decision making in multiagent settings face the task of having to reason about
other agents’ actions, which may in turn involve reasoning about others. An established
approximation that operationalizes this approach is to bound the infinite nesting from below
by introducing level 0 models. For the purposes of this study, individual, self-interested decision
making in multiagent settings is modeled using interactive dynamic influence diagrams
(I-DID). These are graphical models with the benefit that they naturally offer a factored representation
of the problem, allowing agents to ascribe dynamic models to others and reason
about them. We demonstrate that an implication of bounded, finitely-nested reasoning by a
self-interested agent is that we may not obtain optimal team solutions in cooperative settings,
if it is part of a team. We address this limitation by including models at level 0 whose solutions
involve reinforcement learning. We show how the learning is integrated into planning
in the context of I-DIDs. This facilitates optimal teammate behavior, and we demonstrate its
applicability to ad hoc teamwork on several problem domains and configurations
AB - Planning for ad hoc teamwork is challenging because it involves agents collaborating
without any prior coordination or communication. The focus is on principled methods
for a single agent to cooperate with others. This motivates investigating the ad hoc teamwork
problem in the context of self-interested decision-making frameworks. Agents engaged in
individual decision making in multiagent settings face the task of having to reason about
other agents’ actions, which may in turn involve reasoning about others. An established
approximation that operationalizes this approach is to bound the infinite nesting from below
by introducing level 0 models. For the purposes of this study, individual, self-interested decision
making in multiagent settings is modeled using interactive dynamic influence diagrams
(I-DID). These are graphical models with the benefit that they naturally offer a factored representation
of the problem, allowing agents to ascribe dynamic models to others and reason
about them. We demonstrate that an implication of bounded, finitely-nested reasoning by a
self-interested agent is that we may not obtain optimal team solutions in cooperative settings,
if it is part of a team. We address this limitation by including models at level 0 whose solutions
involve reinforcement learning. We show how the learning is integrated into planning
in the context of I-DIDs. This facilitates optimal teammate behavior, and we demonstrate its
applicability to ad hoc teamwork on several problem domains and configurations
U2 - 10.1007/s10458-016-9354-4
DO - 10.1007/s10458-016-9354-4
M3 - Article
SN - 1573-7454
SP - -
JO - Autonomous Agents and Multi-Agent Systems
JF - Autonomous Agents and Multi-Agent Systems
ER -