Active Player Modeling in the Iterated Prisoner’s DilemmaReportar como inadecuado

Active Player Modeling in the Iterated Prisoner’s Dilemma - Descarga este documento en PDF. Documentación en PDF para descargar gratis. Disponible también para leer online.

Computational Intelligence and Neuroscience - Volume 2016 2016, Article ID 7420984, 13 pages -

Research ArticleDepartment of Computer Science and Engineering, Sejong University, 209 Neungdong-ro, Gwangjin-gu, Seoul 05006, Republic of Korea

Received 12 November 2015; Revised 14 January 2016; Accepted 20 January 2016

Academic Editor: Reinoud Maex

Copyright © 2016 Hyunsoo Park and Kyung-Joong Kim. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


The iterated prisoner’s dilemma IPD is well known within the domain of game theory. Although it is relatively simple, it can also elucidate important problems related to cooperation and trust. Generally, players can predict their opponents’ actions when they are able to build a precise model of their behavior based on their game playing experience. However, it is difficult to make such predictions based on a limited number of games. The creation of a precise model requires the use of not only an appropriate learning algorithm and framework but also a good dataset. Active learning approaches have recently been introduced to machine learning communities. The approach can usually produce informative datasets with relatively little effort. Therefore, we have proposed an active modeling technique to predict the behavior of IPD players. The proposed method can model the opponent player’s behavior while taking advantage of interactive game environments. This experiment used twelve representative types of players as opponents, and an observer used an active modeling algorithm to model these opponents. This observer actively collected data and modeled the opponent’s behavior online. Most of our data showed that the observer was able to build, through direct actions, a more accurate model of an opponent’s behavior than when the data were collected through random actions.

Autor: Hyunsoo Park and Kyung-Joong Kim



Documentos relacionados