首页 | 官方网站   微博 | 高级检索  
     

中文日期词的分割与识别
引用本文:张重阳,徐勇,娄震,杨静宇.中文日期词的分割与识别[J].计算机研究与发展,2007,44(12):2085-2091.
作者姓名:张重阳  徐勇  娄震  杨静宇
作者单位:1. 南京理工大学计算机科学与技术系,南京,210094;中创软件工程股份有限公司,济南,250014
2. 哈尔滨工业大学深圳研究生院生物计算研究中心,深圳,518055
3. 南京理工大学计算机科学与技术系,南京,210094
基金项目:国家自然科学基金 , 电子信息产业发展基金
摘    要:非限定性手写汉字串的分割与识别是当前字符识别领域中的一个难点问题.针对手写日期的特点,提出了整词识别和定长汉字串分割识别相结合的组合识别方法.整词识别将字符串作为一个整体进行识别,无需复杂的字符串分割过程.在定长汉字串分割过程中,首先通过识别来预测汉字串的长度,然后通过投影和轮廓分析确定候选分割线,最后通过识别选取最优分割路径.这两种分割识别方法通过规则进行组合,大大提高了系统的性能.在真实票据图像上的实验表明了该方法的有效性,分割识别正确率达到了93.3%.

关 键 词:文档处理  手写体汉字串识别  字符串分割  轮廓  字符识别  中文  分割与识别  String  Chinese  Recognition  识别正确率  有效性  方法  实验  图像  真实票据  性能  系统  规则  路径  最优分割  选取  分割线  轮廓分析  投影
收稿时间:2006-10-17
修稿时间:2007-08-21

Segmentation and Recognition of Handwritten Chinese Day String
Zhang Chongyang,Xu Yong,Lou Zhen,Yang Jingyu.Segmentation and Recognition of Handwritten Chinese Day String[J].Journal of Computer Research and Development,2007,44(12):2085-2091.
Authors:Zhang Chongyang  Xu Yong  Lou Zhen  Yang Jingyu
Abstract:Segmentation and recognition of off-line handwritten Chinese character string is a difficult task in the research field of character recognition. A standard way for character string recognition is to segment a string into isolate character, then compos their recognition results into words or strings. The purpose of segmentation is to reduce the pattern classes which are to be sent to the recognition engines. However, recognition failure caused by segmentation line missing, non character patterns and unreliable recognition scores. To recognize the Chinese day strings on check images, a rule based method is proposed. It recognizes date strings by combining a holistic method and a segmentation-recognition based method. The holistic method recognizes the whole string as a single character without segmentation. The segmentation- recognition based method first finds as much candidate segmentation lines as possible by projection and structure analysis. Then, it reduces segmentation lines by a predicted string length. Finally, the best recognition result is selected by recognition scores. Experiments have been done on 5569 real life check images collected from Chinese bank. The experiment results demonstrate the efficiency of the proposed method. The string recognition rate has achieved 93.3 % on the 1932 test strings.
Keywords:document processing  handwritten Chinese string recognition  string segmentation  contour  character recognition
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号