Reinforcement Learning with Generalizable Gaussian Splatting
arxiv(2024)
摘要
An excellent representation is crucial for reinforcement learning (RL)
performance, especially in vision-based reinforcement learning tasks. The
quality of the environment representation directly influences the achievement
of the learning task. Previous vision-based RL typically uses explicit or
implicit ways to represent environments, such as images, points, voxels, and
neural radiance fields. However, these representations contain several
drawbacks. They cannot either describe complex local geometries or generalize
well to unseen scenes, or require precise foreground masks. Moreover, these
implicit neural representations are akin to a “black box", significantly
hindering interpretability. 3D Gaussian Splatting (3DGS), with its explicit
scene representation and differentiable rendering nature, is considered a
revolutionary change for reconstruction and representation methods. In this
paper, we propose a novel Generalizable Gaussian Splatting framework to be the
representation of RL tasks, called GSRL. Through validation in the RoboMimic
environment, our method achieves better results than other baselines in
multiple tasks, improving the performance by 10
baselines on the hardest task. This work is the first attempt to leverage
generalizable 3DGS as a representation for RL.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要