Enhancing Monocular 3-D Object Detection Through Data Augmentation Strategies

IEEE Transactions on Instrumentation and Measurement (2024)

Abstract
Data augmentation is a crucial component of machine learning. In 2-D object detection tasks, it can significantly enhance detector performance without increasing the inference cost. Data augmentation methods such as random translation and random resizing have become standard practice for 2-D object detectors. However, in monocular 3-D object detection tasks, the data augmentation methods used in 2-D object detection cannot be applied directly because object positions are represented differently. In this study, we propose a method to migrate 2-D object detection data augmentation methods to monocular 3-D object detection while preserving coordinate and size cues. In addition, we address the sampling bias problem that data augmentation introduces in this process. We introduce an unbiased sampling (UB) strategy and several new augmentation methods specifically designed for monocular 3-D object detection. Our proposed method achieves 20.47% AP3D (IoU = 0.7, car, moderate) on the KITTI dataset at a speed of 45 FPS on an RTX 2080Ti GPU, outperforming all previous monocular methods. The source code is available at: https://github.com/jiayisong/DA3D.
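To illustrate why 2-D augmentations need care in the monocular 3-D setting, the following is a minimal sketch based on the standard pinhole camera model. It is not the method from the paper or the DA3D repository. It only shows that resizing and translating an image is geometrically equivalent to changing the camera intrinsics, so 3-D labels stay consistent only if the intrinsics (or the labels) are updated together with the image. The function names `project` and `augment_intrinsics` and the numeric values of `K` are assumptions chosen for illustration.

```python
import numpy as np

def project(K, pts_cam):
    """Project Nx3 camera-frame points to Nx2 pixel coordinates (pinhole model)."""
    uvw = pts_cam @ K.T
    return uvw[:, :2] / uvw[:, 2:3]

def augment_intrinsics(K, scale=1.0, shift=(0.0, 0.0)):
    """Intrinsics of an image resized by `scale`, then translated by `shift` pixels."""
    A = np.array([[scale, 0.0, shift[0]],
                  [0.0, scale, shift[1]],
                  [0.0, 0.0, 1.0]])
    return A @ K

# Illustrative intrinsics and 3-D object centers (hypothetical values,
# not read from any KITTI calibration file).
K = np.array([[720.0, 0.0, 620.0],
              [0.0, 720.0, 190.0],
              [0.0, 0.0, 1.0]])
pts = np.array([[1.5, 1.0, 20.0],
                [-3.0, 1.2, 35.0]])
scale, shift = 1.25, (40.0, -10.0)

# Where the projected centers land after applying the 2-D augmentation to the image.
uv = project(K, pts)
uv_aug_image = uv * scale + np.array(shift)

# The same pixels, obtained by projecting the unchanged 3-D points
# with the updated intrinsics.
uv_aug_camera = project(augment_intrinsics(K, scale, shift), pts)

print(np.allclose(uv_aug_image, uv_aug_camera))
```

If the image is resized or shifted while the original intrinsics are kept, the unchanged 3-D labels no longer project onto the objects in the augmented image, which is one way to see why the 2-D recipe cannot be copied over unchanged.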
Keywords
Three-dimensional displays, Object detection, Data augmentation, Task analysis, Pipelines, Cameras, Detectors, Autonomous driving, data augmentation, deep learning, monocular 3-D object detection