首页 | 官方网站   微博 | 高级检索  
     

面向专利技术主题分析的WI-LDA模型研究
引用本文:吴红,伊惠芳,马永新,李昌.面向专利技术主题分析的WI-LDA模型研究[J].图书情报工作,2018,62(17):68-74.
作者姓名:吴红  伊惠芳  马永新  李昌
作者单位:山东理工大学科技信息研究所 淄博 255049
基金项目:本文系国家社会科学基金项目"高校图书馆深度嵌入专利运营研究"(项目编号:16BTQ029)研究成果之一。
摘    要:目的/意义] 改善现有LDA专利技术主题分析存在的辨识度低、可解释性弱和界限划分模糊问题,对于把握技术热点、追踪技术前沿具有重要意义。方法/过程] 将国际分类号IPC引入LDA专利主题分析中,将其作为技术词的语境,以<词/词组,分类号>二元组的WI (Word IPC)结构进行训练,构建WI-LDA模型,实现对专利文献主题的识别和分析。结果/结论] 通过中国石墨烯领域的实证研究及与传统LDA模型的对比研究证明,WI-LDA模型泛化能力较强,在专利技术主题分析上能有效降低主题的辨识难度,增加主题的可解释性,使文本主题划分更加清晰。

关 键 词:WI-LDA  主题模型  专利技术主题  石墨烯  
收稿时间:2018-02-08

WI-LDA: Technical Topic Analysis in Patents
Wu Hong,Yi Huifang,Ma Yongxin,Li Chang.WI-LDA: Technical Topic Analysis in Patents[J].Library and Information Service,2018,62(17):68-74.
Authors:Wu Hong  Yi Huifang  Ma Yongxin  Li Chang
Affiliation:Science and Technology Information Research Institute, Shandong University of Technology, Zibo 255049
Abstract:Purpose/significance] It is of great significance to improve the existing problems of technical topic analysis in patents based on the LDA, which are low recognition, weak interpretability and fuzzy boundary division,to hold the technical hot spots and track the technological frontier. Method/process] The international patent classification is introduced into the topic analysis in patents based on the LDA, and used as the language content of technical terms. The structure of WI (Word IPC) is trained to construct the WI-LDA model to achieve the identification and analysis of the subject of patent documents. Result/conclusion] The case study of graphene field in Chinese patents and comparative study with traditional LDA models prove that the generalization ability of the WI-LDA model is strong, and the WI-LDA model can effectively reduce the difficulty of identification technical topic analysis in patents, increase the interpretability of topics and make the topic classification clearer.
Keywords:WI-LDA  topic model  technical topic in patents  graphene  
点击此处可从《图书情报工作》浏览原始摘要信息
点击此处可从《图书情报工作》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号