首页 | 官方网站   微博 | 高级检索  
     

基于聚类和MRF模型的场景文字提取方法
引用本文:章天则,赵宇明.基于聚类和MRF模型的场景文字提取方法[J].计算机工程,2011,37(21):176-178,181.
作者姓名:章天则  赵宇明
作者单位:上海交通大学电子信息与电气工程学院,上海,200240
摘    要:提出一种从自然场景中提取文本区域的方法。该方法包括候选文本区域的提取,以及候选区域是否为文字区域的判定。候选文字区域的提取,主要利用图像的纹理特征和HSL颜色空间信息,通过改进的模糊C均值聚类函数,结合拉普拉斯掩膜与计算最大梯度差来实现。由连通域边缘密度信息、形状信息的马尔科夫随机场模型,判定候选文字区域是否为文字区域。经ICDAR2003数据库测试结果表明,该方法具有较高的精确度。

关 键 词:模糊C均值聚类  HSL颜色空间  拉普拉斯掩膜  最大梯度差  马尔科夫随机场模型
收稿时间:2011-03-11

Scene Text Extraction Method Based on Clustering and MRF Model
ZHANG Tian-ze,ZHAO Yu-ming.Scene Text Extraction Method Based on Clustering and MRF Model[J].Computer Engineering,2011,37(21):176-178,181.
Authors:ZHANG Tian-ze  ZHAO Yu-ming
Affiliation:(School of Electronic Information and Electrical Engineering,Shanghai Jiaotong University,Shanghai 200240,China)
Abstract:This paper proposes a method for extracting text regions from natural scene images.This method includes two parts,text region candidates extraction and candidate regions further classification of text region or non-text region.The text region candidates are extracted through a modified fuzzy C-means clustering algorithm combined with Laplacian mask and maximum gradient difference value,which involves texture features and HSL color space information.The candidate regions are checked by edge density information and shape information of the connected components based on Markov Random Field(MRF) model.The proposed method achieves reasonable accuracy for text extraction from examples of the ICDAR 2003 database.
Keywords:fuzzy C-means clustering  HSL color space  Laplacian mask  maximum gradient difference  Markov Random Field(MRF) model
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号