Image-Guided Point Cloud Completion with Multi-modal Fusion Transformers.

Zhaowen Li,Shujin Lin,Fan Zhou

2023 IEEE International Conference on Systems, Man, and Cybernetics (SMC)(2023)

引用 0|浏览0
暂无评分
摘要
The task of image-guided point cloud completion aims to leverage information from images to address uncertainty issues in the completion inference of point clouds. The key challenge in this setting lies in how to effectively combine features extracted from both modalities. Due to the large domain discrepancy between the image and point cloud, existing methods that use cross-modal attention to directly fuse features have increased attention on redundant information and noise from different modalities, resulting in poor feature fusion performance. Hence, by introducing multi-modal fusion transformers that use bottleneck tokens, we enabled point cloud feature to learn image feature through information bridges, leading to improved point cloud completion performance. Our method can not only benefit from RGB images, but also from sketches with less feature information but more emphasis on edge information. Extensive experiments demonstrate that our proposed method enhances the quality of point cloud completion and outperforms other state-of-the-art methods.
更多
查看译文
关键词
Point Cloud,Point Cloud Completion,Multimodal Transformer,Image Features,Redundant Information,RGB Images,Feature Fusion,Edge Information,Cloud Features,Point Cloud Features,Domain Discrepancy,Decoding,Local Information,Qualitative Results,Attention Mechanism,Image Information,Global Information,Fusion Process,Ablation Experiments,Point Cloud Data,Input Point Cloud,Feature Compression,Edge Extraction
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要