首页 | 官方网站   微博 | 高级检索  
     

一种基于多模态模型的随机子空间分类集成算法
引用本文:叶云龙,杨明.一种基于多模态模型的随机子空间分类集成算法[J].南京师范大学学报,2009,9(4):57-62,72.
作者姓名:叶云龙  杨明
作者单位:南京师范大学计算机科学与技术学院,江苏南京210097
基金项目:国家自然科学基金,江苏省自然科学基金 
摘    要:分类是当前机器学习的重要研究内容之一,已取得了一定的进展.现有的文本分类方法大多基于VSM模型,而VSM未能有效地利用隐含在文本中的结构信息.同时,VSM下的样本空间常常是高维的,单一的降维策略可能会丢失有用信息.为改进现有算法的不足,提出了一种基于多模态模型的随机子空间分类集成算法MMRFSEn,有效地利用文本中的结构信息(单词分布位置的均值和标准差),且各基分类器是由随机选择的子空间构建而成.实验结果表明,该方法是有效可行的.

关 键 词:多模态  随机子空间  分类器集成

A Multi-modality-based Random Subspace Classifier Ensemble Algorithm
Ye Yunlong,Yang Ming.A Multi-modality-based Random Subspace Classifier Ensemble Algorithm[J].Journal of Nanjing Nor Univ: Eng and Technol,2009,9(4):57-62,72.
Authors:Ye Yunlong  Yang Ming
Affiliation:(School of Computer Science and Technology, Nanjing Normal University, Nanjing 210097, China)
Abstract:Text Classification is an important machine learning research, in which some progress has been made. Most of the existing classification methods are based on Vector Space Model(VSM) , but VSM does not effectively utilize the structure information hidden in the text samples. At the same time, VSM vectors are often high-dimensional, merelv using dimensionality reduction strategy may lead to the loss of the useful information. To uvercome the shortcomings of the existing algorithms, we propose an algorithm called Multi-modality-based Random Feature subspaee classifier Ensemble (MMRFSEn) . which can eft)etively use the structure information hidden in the text such as the words' s average localion and standard deviation, and meanwhile each single classifier is constructed by a randomly selected subspace. The experimental results show that the newly developed method is effective and feasible.
Keywords:Multi-modality  random subspace  classifier ensemble
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号