Performance Guarantees for Homomorphisms beyond Markov Decision Processes

Sultan Javed Majeed; Marcus Hutter

doi:10.1609/aaai.v33i01.33017659

Performance Guarantees for Homomorphisms beyond Markov Decision Processes

Authors

Sultan Javed Majeed Australian National University
Marcus Hutter Australian National University

DOI:

https://doi.org/10.1609/aaai.v33i01.33017659

Abstract

Most real-world problems have huge state and/or action spaces. Therefore, a naive application of existing tabular solution methods is not tractable on such problems. Nonetheless, these solution methods are quite useful if an agent has access to a relatively small state-action space homomorphism of the true environment and near-optimal performance is guaranteed by the map. A plethora of research is focused on the case when the homomorphism is a Markovian representation of the underlying process. However, we show that nearoptimal performance is sometimes guaranteed even if the homomorphism is non-Markovian.

Downloads

Published

2019-07-17

How to Cite

Majeed, S. J., & Hutter, M. (2019). Performance Guarantees for Homomorphisms beyond Markov Decision Processes. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 7659-7666. https://doi.org/10.1609/aaai.v33i01.33017659

Download Citation

Issue

Vol. 33 No. 01: AAAI-19, IAAI-19, EAAI-20

Section

AAAI Technical Track: Planning, Routing, and Scheduling

Performance Guarantees for Homomorphisms beyond Markov Decision Processes

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription