首页 | 官方网站   微博 | 高级检索  
     

一种基于连通分量的文本区域定位方法
引用本文:姚金良,翁璐斌,王小华.一种基于连通分量的文本区域定位方法[J].模式识别与人工智能,2012,25(2):325-331.
作者姓名:姚金良  翁璐斌  王小华
作者单位:1. 杭州电子科技大学计算机学院 杭州310018
2. 中国科学院自动化研究所综合信息系统研究中心 北京100190
基金项目:国家自然科学基金项目(No.61005067);浙江省科技厅重大专项项目(No.2010C11049)资助
摘    要:文本区域定位对复杂背景图像中的字符识别和检索具有重要意义.已有方法取得高的定位准确率和召回率,但效率较低,难以应用于实际的系统中.文中提出一种基于连通分量过滤和K-means聚类的文本区域定位方法.该方法首先对图像进行自适应分割,对字符颜色层提取连通分量.然后提取连通分量的特征,并用Adaboost分类器过滤非字符连通分量.最后,对候选的字符连通分量根据其位置和颜色层进行K-means聚类来定位文本区域.实验结果显示该方法具有与当前方法相当的准确率和召回率,同时具有较低的计算复杂度.

关 键 词:文本定位  Adaboost  K-means聚类  文档图像识别

A Text Region Location Method Based on Connected Component
YAO Jin-Liang , WENG Lu-Bin , WANG Xiao-Hua.A Text Region Location Method Based on Connected Component[J].Pattern Recognition and Artificial Intelligence,2012,25(2):325-331.
Authors:YAO Jin-Liang  WENG Lu-Bin  WANG Xiao-Hua
Affiliation:1(School of Computer Science and Technology,Hangzhou Dianzi University,Hangzhou 310018) 2(Integrated Information System Research Center,Institute of Automation,Chinese Academy of Sciences,Beijing 100190)
Abstract:Text region location is important to text recognition and retrieval in images of complex background.The existing methods with precision and recall rate have high computational complexity.These methods are unpractical real environment.A text region location method is proposed based on component filtering and K-means clustering.Firstly,the input image is segmented into three layers by an adaptive image segmentation method,and the components are extracted from the character layers.Then,the features of the component are obtained,and Adaboost classifier is used to filter non-character components.The candidates of character components are grouped into text regions by K-means clustering based on the position and layer of the component.The experimental results demonstrate that the precision and the recall rate of the proposed approach is almost the same that of as the other methods,and the proposed method has lower computational complexity.
Keywords:Text Location  Adaboost  K-means Clustering  Document Image Recognition
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号