
Patent recommendation research based on topic model and text similarity calculation
Cite this article: AI Chu-han, JIANG Di, WU Jian-de. Patent recommendation research based on topic model and text similarity calculation[J]. Information Technology (信息技术), 2020(4): 65-70.
Authors: AI Chu-han, JIANG Di, WU Jian-de
Affiliations: School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China; Institute of Intellectual Property Development, Kunming University of Science and Technology, Kunming 650500, China; Computing Center, Kunming University of Science and Technology, Kunming 650500, China
Abstract: How to make use of the vast number of existing patents and find the ones that interest a given user, so that they can be recommended, is a problem many patent databases urgently need to solve. Starting from the titles and abstracts of patent texts, this paper proposes a patent recommendation method based on text mining. First, a bag-of-words model converts the patent text into data that a computer can process. Second, a text clustering algorithm divides the patent data set into domains. Third, term frequency-inverse document frequency (TF-IDF) feature weighting is combined with cosine similarity to select suitable inventors to whom patents are recommended. Finally, the proposed method is validated and analyzed on a data set of patents from China's logistics industry. The experimental results show that the text-mining-based approach can deliver personalized patent recommendations to inventors.

Keywords: patent recommendation; clustering algorithm; text mining; text similarity
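
The TF-IDF weighting and cosine similarity step named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the smoothed IDF formula, the whitespace tokenization, and the toy inventor profiles are all assumptions made for illustration. The paper works on Chinese patent titles and abstracts, which would need word segmentation first, and the clustering step that precedes this one is omitted here.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one sparse TF-IDF vector (dict: term -> weight) per tokenized document."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    doc_freq = Counter(term for doc in docs for term in set(doc))
    # Smoothed IDF (an assumption): terms found in every document keep a positive weight.
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1.0 for t, c in doc_freq.items()}
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        vectors.append({t: (c / total) * idf[t] for t, c in counts.items()})
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors; 0.0 if either vector is empty."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def rank_inventors(new_patent, inventor_profiles):
    """Rank inventors by similarity between a new patent and each inventor's text profile."""
    names = list(inventor_profiles)
    docs = [new_patent] + [inventor_profiles[name] for name in names]
    vecs = tfidf_vectors(docs)
    scores = {name: cosine(vecs[0], vec) for name, vec in zip(names, vecs[1:])}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Hypothetical profiles: the concatenated title/abstract tokens of each inventor's patents.
profiles = {
    "inventor_a": "warehouse shelf robot sorting conveyor".split(),
    "inventor_b": "cold chain container temperature sensor".split(),
}
new_patent = "automatic sorting robot for warehouse conveyor".split()
ranking = rank_inventors(new_patent, profiles)
print(ranking)
```

In this toy data the new patent shares several terms with `inventor_a` and none with `inventor_b`, so `inventor_a` is ranked first. In a pipeline like the one the abstract describes, the candidate inventors would presumably be restricted to the cluster (domain) the new patent falls into before this ranking is applied.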
This article is indexed in the VIP (维普) database, among others.