Reinforcement Learning with Foregone Payoff Information in Normal Form GamesWorking Paper (2016)
This paper studies the reinforcement learning of Erev and Roth with foregone payoff information in normal form games: players observe not only the realised payoffs but also the ones which they could have obtained if they had chosen the other actions. We provide conditions under which the reinforcement learning process converges to a mixed action profile at which each action is chosen with a probability proportional to its expected payoff. In pure coordination games, the mixed action profile corresponds to the mixed Nash equilibrium.
- Reinforcement learning,
- foregone payoff information,
- normal form games
Citation InformationNaoki Funai. "Reinforcement Learning with Foregone Payoff Information in Normal Form Games" Working Paper (2016)
Available at: http://works.bepress.com/funai_naoki/18/