首页 | 官方网站   微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 125 毫秒
1.
手写文本识别方法主要应用于文本输入技术,对人机交互领域的发展起关键作用。针对多数在线输入法无法识别中英文混合手写识别的问题,提出一种在线中英文混合手写文本识别方法。通过对文本笔画进行基于水平相对位置、垂直重叠率、面积重叠率规则的整合以及连笔切分,得到一系列字符片段,同时利用笔画个数、宽高比、中心偏离、平滑度等几何特征和识别置信度,对字符片段进行中英文分类。在此基础上,根据分类结果并结合自然语言模型的路径评价及动态规划搜索算法,分别对候选的中、英文字符片段进行合并处理,得到待识别的中、英文字符序列,并将其分别送入卷积神经网络的中、英文识别模型中,得到手写文本识别结果。实验结果表明,在线手写中英文混合文本识别正确率达93.67%,不仅能切分在线手写中文文本行,而且对包含字符连笔的在线手写中英文文本行也有较好的切分效果。  相似文献   

2.
字符串识别通过最优路径搜索得到字符切分和字符识别结果.本文将字符同步和时间同步两种搜索模式应用于手写字符串识别系统,比较两种模式下使用不同准则函数和搜索算法的系统性能.同时,提出一种改进的路径评价准则,在此准则下可用动态规划算法进行最优路径的搜索.在联机手写日文字符串识别中的实验结果表明.对于无词典驱动的字符串识别系统,时间同步搜索的效率高于字符同步搜索.利用本文所提出的路径评价准则,可得到与归一化准则相当的切分和识别准确率,但搜索时间大为减少.  相似文献   

3.
本文论述了基于大词汇量词典的日文邮件地址手写体字符串的识别系统,所用词典包含了lll,349个地址短语。在识别过程中,文本行图像与词典入口进行匹配,以获得可靠的分割和合理的地址短语。在预分割中,文本行图像通过连接组件分析和以边缘轮廓线分析为基础的粘连模式分裂被分割为原始的段。词典匹配中,连续的段动态地合并成候选字符模式。一个精确的字符分类器嵌入在词典匹配中,以此从动态分类集中选择候选模式匹配的字符。在词典匹配中,采用了一种Beam搜索策略来荻取实时识别的效果。在测试3589封实际邮件的实验中,本文提出的方法正确率达到了83.86%,而错误率小于l%。  相似文献   

4.
一种视频中字符的集成型切分与识别算法   总被引:3,自引:0,他引:3  
杨武夷  张树武 《自动化学报》2010,36(10):1468-1476
视频文本行图像识别的技术难点主要来源于两个方面: 1)粘连字符的切分与识别问题; 2)复杂背景中字符的切分与识别问题. 为了能够同时切分和识别这两种情况中的字符, 提出了一种集成型的字符切分与识别算法. 该集成型算法首先对文本行图像二值化, 基于二值化的文本行图像的水平投影估计文本行高度. 其次根据字符笔划粘连的程度, 基于图像分析或字符识别对二值图像中的宽连通域进行切分. 然后基于字符识别组合连通域得到候选识别结果, 最后根据候选识别结果构造词图, 基于语言模型从词图中选出字符识别结果. 实验表明该集成型算法大大降低了粘连字符及复杂背景中字符的识别错误率.  相似文献   

5.
离线手写汉字的识别仍然是模式识别中的一个最困难的问题,而特征提取是解决这个问题的关键.本文提出一种基于多尺度小波分解的离线手写汉字的特征提取方法.通过表示为灰度图像的手写汉字的多尺度小波分解,能在不同尺度下抽取字符的特征.在较大的尺度下,抽取字符少量的结构特征,可用于在巨大的汉字候选类集合中进行字符的粗归类;在较小的尺度下,抽取字符的细节特征,可用于在较小的汉字候选类集合中进行字符的细归类(识别).这样一种从粗到细的策略,既减少了匹配的时间,又保持了识别的精度.  相似文献   

6.
针对手写英文识别中易混字符的识别问题,提出一种结合多维特征和候选项以区分易混字符的识别方法.利用卷积神经网络(convolutional neural networks,CNN)对手写英文字符进行识别,根据初始字符识别信息确定易混字符的类别;利用多维特征,设计针对不同类别易混字符的识别规则;由易混字符和其相连字符组成候选项单词,结合语料库以及字符间构成关系,最终对易混字符进行识别判断.实验结果表明,该方法在解决了易混字符的识别问题后,识别手写英文字符的平均准确率达到98.67%,具有一定应用价值.  相似文献   

7.
粘连断裂字符行的切分识别,是很多OCR 实际应用中存在的主要困难之一. 本文针对粘连断裂的印刷体数字行,提出了一种基于Viterbi 算法的切分识别方案,该方案采用两次切分识别的层次型结构. 在第二次切分识别过程中,首先,在候选切分点区域,结合灰度图像与二值轮廓信息,采用基于Viterbi 算法搜索的非直线路径进行切分,得到有效的切分路径;然后,结合分类器输出的可信度,采用Viterbi 算法来合并前面得到的候选切分图像块,进行动态切分与识别. 实际的金融票据识别系统实验表明,本文提出的印刷体数字行切分识别方法能够较好的克服字符行的粘连与断裂情况,提高了识别系统的识别率和鲁棒性.  相似文献   

8.
基于多通道融合的连续手写识别纠错方法   总被引:1,自引:0,他引:1  
敖翔  王绪刚  戴国忠  王宏安 《软件学报》2007,18(9):2162-2173
在基于识别的界面中,用户的满意度不但由识别准确度决定,而且还受识别错误的纠正过程的影响.提出一种基于多通道融合的连续手写笔迹识别错误的纠正方法.该方法允许用户通过口述书写内容纠正手写识别中的字符提取和识别的错误.该纠错方法的核心是一种多通道融合算法.该算法通过利用语音输入约束最优手写识别结果的搜索,可纠正手写字符的切分错和识别错.实验评估结果表明,该融合算法能够有效纠正错误,计算效率高.与另外两种手写识别错误纠正方法相比,该方法具有更高的纠错效率.  相似文献   

9.
在字符识别系统中,字符的有效分割是识别的关键。针对手写汉字字间距及字内距无规则可循,字符间极易发生粘连、交错等现象,提出一种多步分割方法。该方法首先利用Viterbi算法将原字符串切分成互不连通的分割块,使非粘连汉字、交错汉字得到正确分割;对于其中宽度较大存在粘连字符的分割块,从候选分割点入手,用非线性分割路径将粘连部分分开;最后再应用A*算法找到全局最佳分割位置,使过分割的字符得到完整合并。实验结果表明,该方法对于手写汉字的分割是可行、有效的。  相似文献   

10.
概括了目前数字字符识别中常用的切分方法,并对于影响手写数字识别精确性的切分这一关键步骤,提出了一种新颖的解决思路,使得可以适用于各种不同的书写方式和习惯,解决了目前绝大多数识别系统不能解决的问题,极大地拓宽了手写数字字符识别的应用范围;且该方法同样适用于其他字符的切分识别中。  相似文献   

11.
This paper describes a handwritten character string recognition system for Japanese mail address reading on a very large vocabulary. The address phrases are recognized as a whole because there is no extra space between words. The lexicon contains 111,349 address phrases, which are stored in a trie structure. In recognition, the text line image is matched with the lexicon entries (phrases) to obtain reliable segmentation and retrieve valid address phrases. The paper first introduces some effective techniques for text line image preprocessing and presegmentation. In presegmentation, the text line image is separated into primitive segments by connected component analysis and touching pattern splitting based on contour shape analysis. In lexicon matching, consecutive segments are dynamically combined into candidate character patterns. An accurate character classifier is embedded in lexicon matching to select characters matched with a candidate pattern from a dynamic category set. A beam search strategy is used to control the lexicon matching so as to achieve real-time recognition. In experiments on 3,589 live mail images, the proposed method achieved correct rate of 83.68 percent while the error rate is less than 1 percent.  相似文献   

12.
13.
In this paper, a structural method of recognising Arabic handwritten characters is proposed. The major problem in cursive text recognition is the segmentation into characters or into representative strokes. When we segment the cursive portions of words, we take into account the contextual properties of the Arabic grammar and the junction segments connecting the characters to each other along the writing line. The problem of overlapping characters is resolved with a contour-following algorithm associated with the labelling of the detected contours. In the recognition phase, the characters are gathered into ten families of candidate characters with similar shapes. Then a heterarchical analysis follows that checks the pattern via goal-directed feedback control.  相似文献   

14.
非限定性手写汉字串的分割与识别是当前字符识别领域中的一个难点问题.针对手写日期的特点,提出了整词识别和定长汉字串分割识别相结合的组合识别方法.整词识别将字符串作为一个整体进行识别,无需复杂的字符串分割过程.在定长汉字串分割过程中,首先通过识别来预测汉字串的长度,然后通过投影和轮廓分析确定候选分割线,最后通过识别选取最优分割路径.这两种分割识别方法通过规则进行组合,大大提高了系统的性能.在真实票据图像上的实验表明了该方法的有效性,分割识别正确率达到了93.3%.  相似文献   

15.
一种基于字符HMM模型级联的手写体西文单词识别方法   总被引:3,自引:0,他引:3  
提出了一种识别西文单词的级联HMM方法,在字符HMM模型基础上按照统计语法将各模型依概率连接,它扩展了HMM的模式描述方式,允许在级联模型上表征状态的跳跃、转移和驻留等,通过共享字符模型来描述级联状态转移概率,可以更加可靠地刻画手写体单的行为特点,采用面向在的Viterbi算法,在完整的单词采样序列输入后直接识别,无需做字符的分割和标注,从而避免了在字典中为每个单词建立模型而导致的识别不同步问题,用WE-1单词样本库进行试验,级联模型法的第1侯选识别经为89.26%,带有连字模型的HMM法的第1候选识别率为82.34%,降低错误识别率达39.18%。  相似文献   

16.
Correct segmentation of handwritten Chinese characters is crucial to their successful recognition. However, due to many difficulties involved, little work has been reported in this area. In this paper, a two-stage approach is presented to segment unconstrained handwritten Chinese characters. A handwritten Chinese character string is first coarsely segmented according to the background skeleton and vertical projection after a proper image preprocessing. With several geometric features, all possible segmentation paths are evaluated by using the fuzzy decision rules learned from examples. As a result, unsuitable segmentation paths are discarded. In the fine segmentation stage that follows, the strokes that may contain segmentation points are first identified. The feature points are then extracted from candidate strokes and taken as segmentation point candidates through each of which a segmentation path may be formed. The geometric features similar to the coarse segmentation stage are used and corresponding fuzzy decision rules are generated to evaluate fine segmentation paths. Experimental results on 1000 Chinese character strings from postal mail show that our approach can achieve a reasonable good overall accuracy in segmenting unconstrained handwritten Chinese characters.  相似文献   

17.
手写数字串的分割与字符识别密切相关.采用基于识别的分割方法,在分割过程中引入识别机制识别分割碎片,将识别结果经过差值运算后置为每个识别对象的识别可信度,利用动态规划找到最佳分割路径.在训练分类器时,使用反例样本估计分类器参数,得到了性能良好的分类器.实验数据表明,利用正例和反例样本结合训练的分类器比只经过正例样本训练的分类器的识别率要高很多.  相似文献   

18.
傅立叶变换在粘连文字图像切分中的应用   总被引:3,自引:0,他引:3  
朱小燕  王松 《计算机学报》1999,22(12):1246-1252
对于已具有相当识别率的手写体文字识别系统来说切分算法已成为一个关键技术之一,它的正确率对系统性能有着极大影响。该文主要对文字图像的傅立叶变换的性质进行了讨论,提出了消除交换中笔画宽度影响的算法。在此基础上建立了基于傅立叶变换的单/多字图像的判定的基本准则以及基于此准则的粘连文字判别算法。实验表明该算法的粘连文字判断正确率达到96%。为粘连文字的正确切分开辟了新的途径。  相似文献   

19.
An off-line handwritten word recognition system is described. Images of handwritten words are matched to lexicons of candidate strings. A word image is segmented into primitives. The best match between sequences of unions of primitives and a lexicon string is found using dynamic programming. Neural networks assign match scores between characters and segments. Two particularly unique features are that neural networks assign confidence that pairs of segments are compatible with character confidence assignments and that this confidence is integrated into the dynamic programming. Experimental results are provided on data from the U.S. Postal Service.  相似文献   

20.
A stroke-based approach to extract skeletons and structural features for handwritten Chinese character recognition is proposed. We first determine stroke directions based on the directional run-length information of binary character patterns. According to the stroke directions and their adjacent relationships, we split strokes into stroke and fork segments, and then extract the skeletons of the stroke segments called skeleton segments. After all skeleton segments are extracted, fork segments are processed to find the fork points and fork degrees. Skeleton segments that touch a fork segment are connected at the fork point, and all connected skeleton segments form the character skeleton. According to the extracted skeletons and fork points, we can extract primitive strokes and stroke direction maps for recognition. A simple classifier based on the stroke direction map is presented to recognize regular and rotated characters to verify the ability of the proposed feature extraction for handwritten Chinese character recognition. Several experiments are carried out, and the experimental results show that the proposed approach can easily and effectively extract skeletons and structural features, and works well for handwritten Chinese character recognition.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号