Tackling Sparse Rewards in Real-Time Games with Statistical Forward Planning Methods

  • Raluca D. Gaina Queen Mary University of London
  • Simon M. Lucas Queen Mary University of London
  • Diego Pérez-Liébana Queen Mary University of London

Abstract

One of the issues general AI game players are required to deal with is the different reward systems in the variety of games they are expected to be able to play at a high level. Some games may present plentiful rewards which the agents can use to guide their search for the best solution, whereas others feature sparse reward landscapes that provide little information to the agents. The work presented in this paper focuses on the latter case, which most agents struggle with. Thus, modifications are proposed for two algorithms, Monte Carlo Tree Search and Rolling Horizon Evolutionary Algorithms, aiming at improving performance in this type of games while maintaining overall win rate across those where rewards are plentiful. Results show that longer rollouts and individual lengths, either fixed or responsive to changes in fitness landscape features, lead to a boost of performance in the games during testing without being detrimental to non-sparse reward scenarios.

Published
2019-07-17
Section
AAAI Technical Track: Game Playing and Interactive Entertainment