Triangular Chain Closed-Loop Detection Network for Dense Pedestrian Detection

IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT(2024)

引用 0|浏览8
暂无评分
摘要
Pedestrian detection has become an important topic in applications such as automatic driver assistance systems for automobiles and pedestrian tracking in surveillance systems, and many powerful object detectors have been widely used in smart sensing instruments. In realistic scenarios, pedestrians in image data are prone to overlap, and detection of fully bracketed boxes may still tend to be false positives in crowded scenes. In addition, low-level parameters shared among features during detection can cause mutual cancellation, resulting in a pair or set of head-enveloping boxes or body-enveloping boxes returning incorrect results. To address the above problems, we propose a triangular chain closed-loop detection network to improve detection in the case of body overlap. We propose a shared parameter elimination module to eliminate the interaction of shared low-level parameters, which has the advantage of improving the feature representation of occluded pedestrians and increasing feature utilization. Because the head bounding box detection encounters fewer occlusions in the occlusion case, the detection capability is better. Therefore, we propose a bidirectional matching module and a chain linking module to enhance the detection capability of the full bounding box using the head bounding box. These modules can better distinguish pedestrians in our network by focusing on individual region features on the pedestrian body, and then learn more representative pedestrian features by minimizing the vector similarity of the whole body, visible region, and head features in space. Our model has been extensively experimented on two challenging dense pedestrian datasets, CrowdHuman and Citypersons. Compared with the experimental results, our method achieves the best performance, especially on heavily occluded subsets, compared with o other popular existing technical methods. This method achieved good results on the CrowdHuman dataset, with an averaged precision (AP) improvement of 1.19% compared with our baseline CrowdDetection method. Especially, using Faster R-CNN as the framework and incorporating our proposed modules, MR-2 of the reasonable set in the CityPersons dataset was reduced by 0.91.
更多
查看译文
关键词
Feature pyramid network,pedestrian detection,triangular chain closed-loop detection
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要