Cooperative spectrum allocation algorithm based on reinforcement learning

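Cite this article:	LI Guanxiong, LI Guilin. Cooperative spectrum allocation algorithm based on reinforcement learning[J]. Chinese Journal of Radio Science, 2022, 37(1): 8-14. DOI: 10.12265/j.cjors.2021016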

Authors:	LI Guanxiong, LI Guilin

Affiliation:	1. School of Microelectronics, Tianjin University, Tianjin 300072

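Abstract:	To solve the spectrum allocation problem in cognitive radio networks, a cooperative reinforcement learning spectrum allocation algorithm based on user quality of experience is proposed. Secondary users in the cognitive network are modeled as reinforcement learning agents, and a cooperation mechanism is introduced among the secondary users so that newly joined users can draw on the reinforcement learning experience of other users and reach the best spectrum allocation scheme faster. A price game factor between primary and secondary users is also introduced into the spectrum allocation process, allowing the primary...
Keywords:	cognitive radio; spectrum allocation; reinforcement learning; mean opinion score; price game
Received:	2021-01-04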
Cooperative spectrum allocation algorithm based on reinforcement learning

LI Guanxiong,LI Guilin. Cooperative spectrum allocation algorithm based on reinforcement learning[J]. Chinese Journal of Radio Science, 2022, 37(1): 8-14. DOI: 10.12265/j.cjors.2021016

Authors:	LI Guanxiong, LI Guilin

Affiliation:	1. School of Microelectronics, Tianjin University, Tianjin 300072, China; 2. School of Electrical Information Engineering, Dalian Jiaotong University, Dalian 116021, China

Abstract:	To solve the problem of spectrum allocation in cognitive radio networks, we propose a cooperative reinforcement learning spectrum allocation algorithm based on user quality of experience. The algorithm models the secondary users in the cognitive network as reinforcement learning agents and introduces a cooperation mechanism among them: newly joined users can draw on the reinforcement learning experience of other users and thus reach the best spectrum allocation scheme faster. In addition, a price game between the primary users and the secondary users is introduced into the spectrum allocation process. Primary users may price their licensed spectrum according to their own situation, and the impact of different spectrum prices on the payoff of the secondary users is studied, which brings the algorithm closer to real scenarios. For system evaluation, the mean opinion score (MOS) model is used to present the service quality of system users intuitively. Simulation results show that the algorithm effectively improves user service quality and system communication performance, and provides an effective solution to spectrum allocation among users.

Keywords:	cognitive radio; spectrum allocation; reinforcement learning; mean opinion score; price game theory
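
The cooperation idea described in the abstract (a newly joined secondary user reusing the learning experience of existing users) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the stateless Q-learning update, the averaging rule in `share_experience`, the `QUALITY` and `PRICES` tables, and the reward defined as a MOS-like quality score minus the price set by the primary user.

```python
import random

# Hypothetical per-channel values, for illustration only.
QUALITY = [3.0, 4.5, 4.0]   # MOS-like quality score of each channel
PRICES = [0.5, 1.0, 2.0]    # price charged by the primary user per channel


def reward_fn(channel):
    """Assumed payoff of a secondary user: quality minus spectrum price."""
    return QUALITY[channel] - PRICES[channel]


def new_q_table(n_channels):
    """One Q-value per channel, all starting at zero."""
    return [0.0] * n_channels


def best_channel(q):
    """Index of the channel with the highest Q-value."""
    return max(range(len(q)), key=lambda c: q[c])


def choose_channel(q, epsilon, rng):
    """Epsilon-greedy selection: explore with probability epsilon."""
    if rng.random() < epsilon:
        return rng.randrange(len(q))
    return best_channel(q)


def train(q, steps, rng, alpha=0.1, epsilon=0.1):
    """Stateless Q-learning: move Q[channel] toward the observed reward."""
    for _ in range(steps):
        c = choose_channel(q, epsilon, rng)
        q[c] += alpha * (reward_fn(c) - q[c])
    return q


def share_experience(peer_tables):
    """Assumed cooperation rule: a newcomer starts from the
    element-wise average of its peers' Q-tables."""
    n = len(peer_tables[0])
    return [sum(t[c] for t in peer_tables) / len(peer_tables) for c in range(n)]


rng = random.Random(0)
# Two existing secondary users learn on their own.
veterans = [train(new_q_table(3), 5000, rng) for _ in range(2)]
# A newcomer inherits their averaged experience before taking any step.
newcomer = share_experience(veterans)
print("newcomer's preferred channel:", best_channel(newcomer))
```

In this sketch the newcomer already prefers the channel its peers found most profitable before it has explored anything, which is the speed-up the abstract attributes to cooperation. Raising an entry of `PRICES` lowers the payoff of that channel, a simple stand-in for studying how spectrum prices affect secondary users.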
This article is indexed in databases including VIP (维普) and Wanfang Data (万方数据).