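A music recommendation system based on multi-modal fusion (基于多模态的音乐推荐系统)

Citation:	GONG Zhi, SHAO Xi. A music recommendation system based on multi-modal fusion [J]. Journal of Nanjing Institute of Meteorology (南京气象学院学报), 2019, 11(1): 68-76.

Authors:	GONG Zhi (龚志), SHAO Xi (邵曦)

Affiliation:	College of Telecommunications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210003 (both authors)

Funding:	National Natural Science Foundation of China (70573025)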

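Abstract:	Recommendation based on traditional collaborative filtering often ignores the low-level features of the music itself. This paper proposes a multi-modal music recommendation system that fuses the audio features of a track with its lyric information and uses the fused features to supplement collaborative filtering. It focuses on the extraction of audio features and lyric information, using the LDA topic model to reduce the dimensionality of the lyric features. For multi-modal fusion, it uses early fusion by feature concatenation (EFFC) and compares the fused results with single-modality results. For recommendation, a user interest model is built from the multi-modal features and passed through an LSTM neural network to filter and refine the user group used for collaborative recommendation. The results show that the system reduces the sum of squared errors (SSE) of the recommendations from 2.009 for the traditional approach to 0.3886, which verifies the effectiveness of the method.
Keywords:	music recommendation; collaborative filtering; LDA topic model; multi-modal fusion; LSTM neural network
Received:	2018/4/27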
A music recommendation system based on multi-modal fusion

GONG Zhi and SHAO Xi. A music recommendation system based on multi-modal fusion [J]. Journal of Nanjing Institute of Meteorology, 2019, 11(1): 68-76.

Authors:	GONG Zhi and SHAO Xi

Affiliation:	College of Telecommunications and Information Engineering, Nanjing University of Posts and Telecommunications, Nanjing 210003 (both authors)

Abstract:	Traditional collaborative filtering often overlooks the low-level features of the music itself. This study proposes a multi-modal music recommendation system that fuses audio features with lyric information and uses the fused features to supplement collaborative filtering. It primarily discusses the extraction of audio features and lyric information, and uses the LDA topic model to reduce the dimensionality of the lyric features. For the multi-modal fusion problem, it uses early fusion by feature concatenation (EFFC) and compares the multi-modal results with single-modality results. For recommendation, a user interest model is built from the multi-modal features and passed through an LSTM network to filter and optimize the user group used for collaborative recommendation. The results show that the multi-modal system reduces the sum of squared errors (SSE) of the recommendations from 2.009 for the traditional approach to 0.3886, verifying the effectiveness of the method.

Keywords:	music recommendation; collaborative filtering; LDA topic model; multi-modal fusion; LSTM networks
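
The abstract names two ideas that are easy to illustrate: early fusion by feature concatenation and the SSE metric. The following is a minimal Python sketch of both, not the authors' implementation. The function names, the per-modality L2 normalization step, and the toy values are all assumptions introduced here for illustration; the abstract does not say how the features are scaled before fusion.

```python
from math import sqrt


def l2_normalize(vec):
    """Scale a vector to unit Euclidean length (zero vectors are returned unchanged)."""
    norm = sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def early_fusion(audio_features, lyric_topics):
    """Early fusion by concatenation: join the two modality vectors into one.

    Normalizing each modality first is an assumption made here so that
    neither modality dominates purely because of its scale.
    """
    return l2_normalize(audio_features) + l2_normalize(lyric_topics)


def sse(predicted, actual):
    """Sum of squared errors between predicted and observed values."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    return sum((p - a) ** 2 for p, a in zip(predicted, actual))


# Hypothetical toy values, not data from the paper.
audio = [3.0, 4.0]        # stand-in for two audio descriptors
topics = [0.6, 0.8, 0.0]  # stand-in for a 3-topic LDA representation of the lyrics
fused = early_fusion(audio, topics)  # one 5-dimensional feature vector
error = sse([1.0, 2.0], [1.0, 4.0])  # squared differences summed over both items
```

In this sketch the fused vector simply has as many dimensions as the two inputs combined, which is why reducing the lyric representation to a small number of topics keeps the fused feature compact.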
