首页 | 官方网站   微博 | 高级检索  
     

文档图像基准生成系统
引用本文:李明威,夏勇. 文档图像基准生成系统[J]. 计算机应用与软件, 2012, 29(6): 76-80
作者姓名:李明威  夏勇
作者单位:哈尔滨工业大学计算机科学与技术学院 黑龙江哈尔滨150001
基金项目:国家博士后基金项目(20090450994);黑龙江省博士后基金项目(LBH-Z09150);中央高校基本科研业务费专项(HIT.NSRIF.2009152);山东省优秀中青年科学家科研奖励基金项目(BS2011DX002)
摘    要:为生成含噪声的扫描文档图像的基准标引信息,系统首先基于无噪声的PDF文档抽取理想化标引信息,采用透视变换模型,将其与含噪声文档图像进行配准,最终生成含噪声图像的基准标引信息,将其用于测试文字识别、检索的精度.系统还基于几种经典的图像退化模型,批量产生了含不同噪声类型的文档图像.经实验表明,该系统标引信息精度高,图像退化结果与实际噪声效果接近.

关 键 词:文档图像  基准生成  退化模型  透视变换模型

BASE GENERATION SYSTEM OF DOCUMENT IMAGE
Li Mingwei , Xia Yong. BASE GENERATION SYSTEM OF DOCUMENT IMAGE[J]. Computer Applications and Software, 2012, 29(6): 76-80
Authors:Li Mingwei    Xia Yong
Affiliation:Li Mingwei Xia Yong(School of Computer Science and Technology,Harbin Institute of Technology,Harbin 150001,Heilongjiang,China)
Abstract:For the generation of base indexing information of scanned document image with noise,this system first extracts idealised indexing information based on noise-free PDF document,then registers them with the document image with noise using perspective transformation model and finally generates the base indexing information of the document image with noise.These information data are applied to test the accuracies of text recognition and retrieval.Furthermore,based on some typical different image degradation models,the system has generated the document images with different noise types in batch.Experiments show that the indexing information in this system has high accuracy,the results of image degradation are close to practical noise effect.
Keywords:Document image Generation of base Degradation model Perspective transformation model
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号