A Novel Image Text Extraction Method Based on K-Means Clustering

Portland, OR(2008)

引用 65|浏览0
暂无评分
摘要
Texts in web pages, images and videos contain important clues for information indexing and retrieval. Most existing text extraction methods depend on the language type and text appearance. In this paper, a novel and universal method of image text extraction is proposed. A coarse-to-fine text location method is implemented. Firstly, a multi-scale approach is adopted to locate texts with different font sizes. Secondly, projection profiles are used in location refinement step. Color-based k-means clustering is adopted in text segmentation. Compared to grayscale image which is used in most existing methods, color image is more suitable for segmentation based on clustering. It treats corner-points, edge-points and other points equally so that it solves the problem of handling multilingual text. It is demonstrated in experimental results that best performance is obtained when k is 3. Comparative experimental results on a large number of images show that our method is accurate and robust in various conditions.
更多
查看译文
关键词
color-based k-means clustering,text appearance,novel image,universal method,text segmentation,image text extraction,color image,existing method,coarse-to-fine text location method,k-means clustering,existing text extraction method,text extraction method,multilingual text,grayscale image,color,indexing,data mining,gray scale,web pages,image segmentation,robustness,k means clustering,image retrieval,information retrieval,text analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要