智能体对手建模研究进展

doi:10.11996/JG.j.2095-302X.2021050703

图学学报 ›› 2021, Vol. 42 ›› Issue (5): 703-711.DOI: 10.11996/JG.j.2095-302X.2021050703

智能体对手建模研究进展

大连理工大学计算机科学与技术学院，辽宁大连 116024

出版日期:2021-10-31 发布日期:2021-11-03
基金资助:
中国科协青年人才托举工程(2018QNRC001)；国家自然科学基金项目(61702075，31370778，61425002，61772100，61751203)

Research progress of opponent modeling for agent

School of Computer Science and Technology, Dalian University of Technology, Dalian Liaoning 116024, China

Online:2021-10-31 Published:2021-11-03
Supported by:
Young Elite Scientists Sponsorship Program by CAST (2018QNRC001); National Natural Science Foundation of China (61702075, 31370778, 61425002, 61772100, 61751203)

摘要/Abstract

摘要： 智能体是人工智能领域的一个核心术语。近年来，智能体技术在自动无人驾驶、机器人系统、电子商务、传感网络、智能游戏等方面得到了广泛研究与应用。随着系统复杂性的增加，关于智能体的研究重心由对单个智能体的研究转变为智能体间交互的研究。多个智能体交互场景中，智能体对其他智能体决策行为的推理能力是非常重要的一个方面，通常可以通过构建参与交互的其他智能体的模型，即对手建模来实现。对手建模有助于对其他智能体的动作、目标、信念等进行推理、分析和预测，进而实现决策优化。为此，重点关注智能体对手建模研究，展开介绍关于智能体动作预测、偏好预测、信念预测、类型预测等方面的对手建模技术，对其中的优缺点进行讨论和分析，并对手建模技术当前面临的一些开放问题进行总结，探讨未来可能的研究和发展方向。

Abstract: Agent is a core term in the field of artificial intelligence. In recent years, agent technology has been widely studied and applied in such fields as autonomous driving, robot system, e-commerce, sensor network, and intelligent games. With the increase of system complexity, the research focus on agent technology has been shifted from single agent to interactions between agents. In scenarios with multiple interactive agents, an important direction is to reason out other agents’ decisions and behaviors, which can be realized through the modeling of other agents involved in the interaction, that is, opponent modeling. Opponent modeling is conducive to reasoning, analyzing, and predicting other agents’ actions, targets, and beliefs, thus optimizing one’s decision-making. This paper mainly focused on the research on opponent modeling of agents, and introduced the opponent modeling technology in agent action prediction, preference prediction, belief prediction, and type prediction. In addition, their advantages and disadvantages were discussed, some current open problems were summarized, and the possible future research directions were presented.

Key words: decision intelligence, opponent modeling, game theory, agent systems, AlphaGo

中图分类号:

TP 391

刘婵娟, 赵天昊, 刘睿康, 张强. 智能体对手建模研究进展[J]. 图学学报, 2021, 42(5): 703-711.

LIU Chan-juan, ZHAO Tian-hao, LIU Rui-kang, ZHANG Qiang . Research progress of opponent modeling for agent[J]. Journal of Graphics, 2021, 42(5): 703-711.

[1]	东辉, 陈鑫凯, 孙浩, 姚立纲. 基于改进 YOLOv4 和图像处理的蔬菜田杂草检测[J]. 图学学报, 2022, 43(4): 559-569.
[2]	张盾, 黄志开, 王欢, 吴义鹏, 王颖, 邹家豪. 基于多尺度特征实现超参进化的野生菌分类研究与应用[J]. 图学学报, 2022, 43(4): 580-589.
[3]	陈昭俊, 储珺, 曾伦杰. 基于动态加权类别平衡损失的多类别口罩佩戴检测[J]. 图学学报, 2022, 43(4): 590-598.
[4]	李海鹏, 徐丹, 付宇婷, 柳雁安, 张婷婷. 基于 FPFH 特征提取的散乱点云精简算法[J]. 图学学报, 2022, 43(4): 599-607.
[5]	刘玉珍, 李楠, 陶志勇. 基于环查询和通道注意力的点云分类与分割[J]. 图学学报, 2022, 43(4): 616-623.
[6]	刘世龙, 马智亮. 基于结构光相机的钢筋骨架整体点云获取算法[J]. 图学学报, 2022, 43(4): 633-640.
[7]	彭国琴, 张浩, 徐丹. 基于域自适应的云南重彩画无监督情感识别[J]. 图学学报, 2022, 43(4): 641-650.
[8]	刘南杉, 裴云强, 蒋皓, 韩永国, 吴亚东, 王赋攀, 易思恒. 基于VD-MobileNet 网络的 WebAR生活垃圾分类信息可视化方法[J]. 图学学报, 2022, 43(4): 667-676.
[9]	蔡兴泉, 霍宇晴, 李发建, 孙海燕. 面向太极拳学习的人体姿态估计及相似度计算[J]. 图学学报, 2022, 43(4): 695-706.
[10]	陈主昕, 杨沁七, 陈瑞, 张严辞, 刘艳丽, 吴志红. 基于虚拟光源的实时半透明材质渲染[J]. 图学学报, 2022, 43(4): 707-714.
[11]	刘佳, 张晶晶, 杨胜强, 乔志杰. 百叶轮抛磨叶片微结构区域识别及路径拼接方法研究 [J]. 图学学报, 2022, 43(4): 715-720.
[12]	李晓英, 余亚平. 基于多模态感官体验的儿童音画交互设计研究[J]. 图学学报, 2022, 43(4): 736-743.
[13]	邓壮林, 张绍兵, 成苗, 何莲. 多模态硬币图像单应性矩阵预测[J]. 图学学报, 2022, 43(3): 361-369.
[14]	王素琴, 任琪, 石敏, 朱登明. 基于异常检测的产品表面缺陷检测与分割[J]. 图学学报, 2022, 43(3): 377-386.
[15]	方洪波, 万广, 陈忠辉, 黄以卫, 张文勇, 谢本亮. 基于改进 YOLOv5s 的离线手写数学符号识别[J]. 图学学报, 2022, 43(3): 387-395.

智能体对手建模研究进展

Research progress of opponent modeling for agent

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价