首页 | 官方网站   微博 | 高级检索  
     

基于强化学习的合作频谱分配算法
引用本文:李冠雄,李桂林. 基于强化学习的合作频谱分配算法[J]. 电波科学学报, 2022, 37(1): 8-14. DOI: 10.12265/j.cjors.2021016
作者姓名:李冠雄  李桂林
作者单位:1.天津大学微电子学院,天津 300072
摘    要:为了解决认知无线电网络中的频谱分配问题,提出了一种基于用户体验质量的合作强化学习频谱分配算法,将认知网络中的次用户模拟为强化学习中的智能体,并在次用户间引入合作机制,新加入用户可以吸收借鉴其他用户的强化学习经验,能够以更快的速度获得最佳的频谱分配方案;并且在频谱分配过程中引入了主用户和次用户之间的价格博弈因素,允许主用...

关 键 词:认知无线电  频谱分配  强化学习  平均意见得分  价格博弈
收稿时间:2021-01-04

Cooperative spectrum allocation algorithm based on reinforcement learning
LI Guanxiong,LI Guilin. Cooperative spectrum allocation algorithm based on reinforcement learning[J]. Chinese Journal of Radio Science, 2022, 37(1): 8-14. DOI: 10.12265/j.cjors.2021016
Authors:LI Guanxiong  LI Guilin
Affiliation:1.School of Microelectronics, Tianjin University, Tianjin 300072, China2.School of Electrical Information Engineering, Dalian Jiaotong University, Dalian 116021, China
Abstract:In order to solve the problem of spectrum allocation in cognitive radio networks, we propose a cooperative reinforcement learning spectrum allocation algorithm based on user experience quality. which simulates the secondary users in the cognitive network as agents in the reinforcement learning, and introduces a cooperation mechanism between the secondary users. New users can absorb and learn from the reinforcement learning experience of other users, and obtain the best spectrum allocation plan at a faster speed. In addition, the price game factor between the primary user and the secondary user is introduced in the spectrum allocation process, allowing the primary user to price the authorized spectrum according to their own situation, and the impact of different spectrum prices on the income of the secondary user is studied, making the algorithm closer to the real scene. In terms of system evaluation, the average opinion score model is used to visually display the service quality of system users. Simulation results show that the algorithm can effectively improve user service quality and system communication performance, and provides an effective solution for understanding the spectrum allocation among users.
Keywords:cognitive radio  spectrum allocation  reinforcement learning  mean opinion score  price game theory
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《电波科学学报》浏览原始摘要信息
点击此处可从《电波科学学报》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号