Towards a Simple Approach to Multi-step Model-based Reinforcement Learning

Kavosh Asadi,Evan Cater,Dipendra Misra,Michael L. Littman

arXiv: Learning（2018）

引用 23|浏览78

暂无评分

摘要

When environmental interaction is expensive, model-based reinforcement learning offers a solution by planning ahead and avoiding costly mistakes. Model-based agents typically learn a single-step transition model. In this paper, we propose a multi-step model that predicts the outcome of an action sequence with variable length. We show that this model is easy to learn, and that the model can make policy-conditional predictions. We report preliminary results that show a clear advantage for the multi-step model compared to its one-step counterpart.

查看译文

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要