
Journal of Graphics ›› 2021, Vol. 42 ›› Issue (5): 703-711. DOI: 10.11996/JG.j.2095-302X.2021050703

• Review •

Research progress of opponent modeling for agent

  1. School of Computer Science and Technology, Dalian University of Technology, Dalian, Liaoning 116024, China
  • Online: 2021-10-31 Published: 2021-11-03
  • Supported by:
    Young Elite Scientists Sponsorship Program by CAST (2018QNRC001); National Natural Science Foundation of China (61702075, 31370778, 61425002, 61772100, 61751203)

Abstract: The agent is a core concept in artificial intelligence. In recent years, agent technology has been widely studied and applied in autonomous driving, robotic systems, e-commerce, sensor networks, and intelligent games. As systems grow more complex, the focus of agent research has shifted from single agents to the interactions among agents. In scenarios where multiple agents interact, an agent's ability to reason about the decisions and behaviors of other agents is essential. This ability is usually achieved by building models of the other agents involved in the interaction, that is, by opponent modeling. Opponent modeling helps an agent reason about, analyze, and predict other agents' actions, goals, and beliefs, and thereby optimize its own decisions. This paper focuses on opponent modeling for agents and reviews techniques for action prediction, preference prediction, belief prediction, and type prediction. It discusses their advantages and disadvantages, summarizes open problems that opponent modeling currently faces, and explores possible directions for future research and development.

Key words: decision intelligence, opponent modeling, game theory, agent systems, AlphaGo 
