Bootstrap Estimated Uncertainty of the Environment Model for Model-Based Reinforcement Learning

Wenzhen Huang; Junge Zhang; Kaiqi Huang

doi:10.1609/aaai.v33i01.33013870

Authors

Wenzhen Huang Chinese Academy of Sciences
Junge Zhang Chinese Academy of Sciences
Kaiqi Huang Chinese Academy of Sciences

DOI:

https://doi.org/10.1609/aaai.v33i01.33013870

Abstract

Model-based reinforcement learning (RL) methods attempt to learn a dynamics model that simulates the real environment and use the model to make better decisions. However, the learned environment simulator usually contains some model error, which interferes with decision making and reduces performance. We propose a bootstrapped model-based RL method that bootstraps the modules at each depth of the planning tree. This method quantifies the uncertainty of the environment model on different state-action pairs and leads the agent to explore the pairs with higher uncertainty, reducing potential model errors. Moreover, we sample target values from their bootstrap distribution to connect the uncertainties at the current and subsequent time steps, and we introduce a prior mechanism to improve exploration efficiency. Experimental results demonstrate that our method efficiently decreases model error and outperforms TreeQN and other state-of-the-art methods on multiple Atari games.
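
The general idea of estimating model uncertainty with a bootstrap can be illustrated with a toy sketch. This is not the authors' implementation, which bootstraps neural modules inside a planning tree; it only assumes that each bootstrap "head" is fit on a resample of the observed transitions and that disagreement between heads serves as the uncertainty signal. All names here (`fit_mean_model`, `ensemble_uncertainty`, the transition tuple layout) are hypothetical.

```python
import random
import statistics

def bootstrap_resample(data, rng):
    """Draw len(data) items from data with replacement."""
    return [rng.choice(data) for _ in data]

def fit_mean_model(transitions):
    """Toy 'model': mean observed outcome per (state, action) pair."""
    outcomes = {}
    for state, action, outcome in transitions:
        outcomes.setdefault((state, action), []).append(outcome)
    return {pair: statistics.fmean(vals) for pair, vals in outcomes.items()}

def ensemble_uncertainty(transitions, pairs, n_heads=10, seed=0):
    """Standard deviation of head predictions for each (state, action) pair."""
    rng = random.Random(seed)
    heads = [fit_mean_model(bootstrap_resample(transitions, rng))
             for _ in range(n_heads)]
    uncertainty = {}
    for pair in pairs:
        preds = [head[pair] for head in heads if pair in head]
        # A pair seen by fewer than two heads is treated as maximally uncertain.
        uncertainty[pair] = statistics.pstdev(preds) if len(preds) > 1 else float("inf")
    return uncertainty

def most_uncertain_pair(uncertainty):
    """Exploration target: the pair on which the heads disagree most."""
    return max(uncertainty, key=uncertainty.get)
```

In this sketch, a pair whose observed outcomes are consistent gets near-zero disagreement, while a pair with noisy or sparse data gets a larger value and is preferred for exploration.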

Published

2019-07-17

How to Cite

Huang, W., Zhang, J., & Huang, K. (2019). Bootstrap Estimated Uncertainty of the Environment Model for Model-Based Reinforcement Learning. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 3870-3877. https://doi.org/10.1609/aaai.v33i01.33013870

Issue

Vol. 33 No. 01: AAAI-19, IAAI-19, EAAI-20

Section

AAAI Technical Track: Machine Learning
