Learning Simultaneous Navigation and Construction in Grid Worlds

Wenyu Han, Haoran Wu, Eisuke Hirota, Alexander Gao, Lerrel Pinto, Ludovic Righetti, Chen Feng

ICLR 2023 (2023)

Abstract

We propose to study a new learning task, mobile construction, to enable an agent to build designed structures in 1/2/3D grid worlds while navigating in the same evolving environments. Unlike existing robot learning tasks such as visual navigation and object manipulation, this task is challenging because of the interdependence between accurate localization and strategic construction planning. In pursuit of generic and adaptive solutions to this partially observable Markov decision process (POMDP) based on deep reinforcement learning (RL), we design a Deep Recurrent Q-Network (DRQN) with explicit recurrent position estimation in this dynamic grid world. Our extensive experiments show that pre-training this position estimation module before Q-learning can significantly improve the construction performance measured by the intersection-over-union score, achieving the best results in our benchmark of various baselines including model-free and model-based RL, a handcrafted SLAM-based policy, and human players.
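
The abstract measures construction performance with an intersection-over-union (IoU) score. As a minimal sketch of how such a score could be computed, the example below compares a built structure against a design plan, assuming both are binary occupancy grids of the same shape. The function name `grid_iou`, the 2D list representation, and the convention that two empty grids score 1.0 are illustrative assumptions, not details taken from the paper.

```python
from typing import Sequence

Grid = Sequence[Sequence[int]]


def grid_iou(built: Grid, target: Grid) -> float:
    """Intersection-over-union between two binary occupancy grids.

    Hypothetical helper: a cell counts as filled when its value is truthy.
    """
    same_rows = len(built) == len(target)
    if not same_rows or any(len(b) != len(t) for b, t in zip(built, target)):
        raise ValueError("grids must have the same shape")

    intersection = 0
    union = 0
    for row_built, row_target in zip(built, target):
        for b, t in zip(row_built, row_target):
            filled_b, filled_t = bool(b), bool(t)
            intersection += filled_b and filled_t  # filled in both grids
            union += filled_b or filled_t          # filled in at least one

    # Assumed convention: two completely empty grids match perfectly.
    return intersection / union if union else 1.0


# Design plan and an imperfect build on a 2x3 grid.
target = [[1, 1, 0],
          [0, 1, 0]]
built = [[1, 0, 0],
         [0, 1, 1]]

score = grid_iou(built, target)
print(f"IoU = {score:.2f}")
```

Here two cells are filled in both grids and four cells are filled in at least one, so the score is 2/4. The same cell-wise counting would extend to 1D or 3D grids by flattening them first.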

Keywords

Navigation, Localization, Construction, Deep reinforcement learning, Representation learning
