Conroy, Ross, Zeng, Yifeng and Tang, Jing (2016) Approximating value equivalence in interactive dynamic influence diagrams using behavioral coverage. In: IJCAI'16: Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence. AAAI Press/International Joint Conferences on Artificial Intelligence, pp. 201-207. ISBN 9781577357704
|
Text
2016_Approximating_Value_Equivalence_in_Interactive_Dynamic_Influence_Diagrams_Using_Behavioral_Coverage.pdf - Accepted Version Download (464kB) | Preview |
Abstract
Interactive dynamic influence diagrams (I-DIDs) provide an explicit way of modeling how a subject agent solves decision making problems in the presence of other agents in a common setting. To optimize its decisions, the subject agent needs to predict the other agents' behavior, that is generally obtained by solving their candidate models. This becomes extremely difficult since the model space may be rather large, and grows when the other agents act and observe over the time. A recent proposal for solving I-DIDs lies in a concept of value equivalence (VE) that shows potential advances on significantly reducing the model space. In this paper, we establish a principled framework to implement the VE techniques and propose an approximate method to compute VE of candidate models. The development offers ample opportunity of exploiting VE to further improve the scalability of I-DID solutions. We theoretically analyze properties of the approximate techniques and show empirical results in multiple problem domains.
Item Type: | Book Section |
---|---|
Subjects: | G400 Computer Science G900 Others in Mathematical and Computing Sciences |
Department: | Faculties > Engineering and Environment > Computer and Information Sciences |
Depositing User: | Rachel Branson |
Date Deposited: | 26 Oct 2020 14:13 |
Last Modified: | 31 Jul 2021 13:15 |
URI: | http://nrl.northumbria.ac.uk/id/eprint/44592 |
Downloads
Downloads per month over past year