Multi-Orientation Scene Text Detection with Adaptive Clustering

IEEE Transactions on Pattern Analysis and Machine Intelligence(2015)

引用 300|浏览133
暂无评分
摘要
Text detection in natural scene images is an important prerequisite for many content-based image analysis tasks, while most current research efforts only focus on horizontal or near horizontal scene text. In this paper, first we present a unified distance metric learning framework for adaptive hierarchical clustering, which can simultaneously learn similarity weights (to adaptively combine different feature similarities) and the clustering threshold (to automatically determine the number of clusters). Then, we propose an effective multi-orientation scene text detection system, which constructs text candidates by grouping characters based on this adaptive clustering. Our text candidates construction method consists of several sequential coarseto- fine grouping steps: morphology-based grouping via single-link clustering, orientation-based grouping via divisive hierarchical clustering, and projection-based grouping also via divisive clustering. The effectiveness of our proposed system is evaluated on several public scene text databases, e.g., ICDAR Robust Reading Competition datasets (2011 and 2013), MSRA-TD500 and NEOCR. Specifically, on the multi-orientation text dataset MSRA-TD500, the f measure of our system is 71%, much better than the state-of-the-art performance. We also construct and release a practical challenging multi-orientation scene text dataset (USTB-SV1K), which is available at http://prir.ustb.edu.cn/TexStar/MOMV-text-detection/.
更多
查看译文
关键词
Scene text detection, multi-orientation, adaptive hierarchical clustering, coarse-to-fine grouping
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要