StruMonoNet: Structure-Aware Monocular 3D Prediction

2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021)

Abstract
Monocular 3D prediction is one of the fundamental problems in 3D vision. Recent deep learning-based approaches have brought exciting progress on this problem. However, existing approaches have predominantly focused on end-to-end depth and normal prediction, which does not fully utilize the geometric structures of the underlying 3D environment. This paper introduces StruMonoNet, which detects and enforces planar structure to enhance pixel-wise predictions. StruMonoNet innovates by leveraging a hybrid representation that combines visual features with a surfel representation for plane prediction. This formulation combines the power of visual feature learning with the flexibility of geometric representations in incorporating geometric relations. As a result, StruMonoNet can detect relations between planes, such as adjacent, perpendicular, and parallel planes, all of which are beneficial for dense 3D prediction. Experimental results show that StruMonoNet considerably outperforms state-of-the-art approaches on NYUv2 and ScanNet.
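The abstract does not say how relations between planes are detected. As a purely illustrative sketch, and not the StruMonoNet method, the parallel and perpendicular relations it mentions can be tested from two plane normals by thresholding the angle between them. The function names `unit` and `plane_relation` and the tolerance `tol_deg` are hypothetical, introduced only for this example. Adjacency is omitted because it depends on the spatial extent of each plane, not only on its normal.

```python
import math


def unit(v):
    """Return v scaled to unit length (hypothetical helper)."""
    norm = math.sqrt(sum(c * c for c in v))
    if norm == 0.0:
        raise ValueError("zero-length normal")
    return tuple(c / norm for c in v)


def plane_relation(n1, n2, tol_deg=5.0):
    """Classify two planes by the angle between their normals.

    tol_deg is an assumed tolerance in degrees, not a value from the paper.
    """
    a, b = unit(n1), unit(n2)
    # The sign of a normal is arbitrary, so compare using |cos(angle)|.
    cos_angle = min(1.0, abs(sum(x * y for x, y in zip(a, b))))
    angle = math.degrees(math.acos(cos_angle))  # lies in [0, 90]
    if angle <= tol_deg:
        return "parallel"
    if angle >= 90.0 - tol_deg:
        return "perpendicular"
    return "other"


floor = (0.0, 0.0, 1.0)
ceiling = (0.0, 0.0, -2.0)   # opposite direction, different length
wall = (1.0, 0.0, 0.0)
ramp = (0.0, 1.0, 1.0)       # 45 degrees from the floor normal

print(plane_relation(floor, ceiling))
print(plane_relation(floor, wall))
print(plane_relation(floor, ramp))
```

Taking the absolute value of the cosine makes the test independent of normal orientation, so a floor and a ceiling with opposite-facing normals still count as parallel.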
Keywords
geometric relations, adjacent planes, perpendicular planes, parallel planes, dense 3D prediction, structure-aware monocular 3D prediction, deep learning-based approaches, exciting progress, end-to-end depth, normal predictions, 3D environment, planar structure, pixel-wise predictions, hybrid representation, surfel representation, plane prediction, visual feature learning, geometric representations, StruMonoNet