Urban street scene analysis using lightweight multi-level multi-path feature aggregation network

MULTIAGENT AND GRID SYSTEMS(2021)

引用 2|浏览0
暂无评分
摘要
Urban street scene analysis is an important problem in computer vision with many off-line models achieving outstanding semantic segmentation results. However, it is an ongoing challenge for the research community to develop and optimize the deep neural architecture with real-time low computing requirements whilst maintaining good performance. Balancing between model complexity and performance has been a major hurdle with many models dropping too much accuracy for a slight reduction in model size and unable to handle high-resolution input images. The study aims to address this issue with a novel model, named M2FANet, that provides a much better balance between model's efficiency and accuracy for scene segmentation than other alternatives. The proposed optimised backbone helps to increase model's efficiency whereas, suggested Multi-level Multi-path (M2) feature aggregation approach enhances model's performance in the real-time environment. By exploiting multi-feature scaling technique, M2FANet produces state-of-the-art results in resource-constrained situations by handling full input resolution. On the Cityscapes benchmark data set, the proposed model produces 68.5% and 68.3% class accuracy on validation and test sets respectively, whilst having only 1.3 million parameters. Compared with all real-time models of less than 5 million parameters, the proposed model is the most competitive in both performance and real-time capability.
更多
查看译文
关键词
DCNN, semantic segmentation, encoder-decoder, feature map, dilated convolution
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要