A Phrase Topic Model Based on Distributed Representation |
| |
Authors: | Jialin Ma, Jieyi Cheng, Lin Zhang, Lei Zhou, Bolun Chen |
| |
Affiliation: | 1. Jiangsu Internet of Things and Mobile Internet Technology Engineering Laboratory, Huaiyin Institute of Technology, Huai’an, 223003, China. 2. University of Fribourg, Fribourg, 1700, Switzerland. |
| |
Abstract: | Traditional topic models have been widely used to extract semantic topics from electronic documents. However, the topic words they produce are often poor in readability and consistency, so that only domain experts can guess their meaning. In fact, phrases are the main unit people use to express semantics. This paper presents Distributed Representation-Phrase Latent Dirichlet Allocation (DRPhrase LDA), a phrase topic model that enhances the semantic information of phrases via distributed representation. The experimental results show that the topics acquired by our model are more readable and consistent than those of other similar topic models. |
| |
Keywords: | Phrase topic model, LDA, distributed representation, Gibbs sampling |
|
| Click here to view the original abstract information from Computers, Materials & Continua (English edition) |