基于知识迁移和双向异步序列的对话生成模型 Dialogue generation model based on knowledge transfer and two-direction asynchronous sequence期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于知识迁移和双向异步序列的对话生成模型

引用本文：	王勇超,曹钰,杨玉辉,许端清.基于知识迁移和双向异步序列的对话生成模型[J].浙江大学学报(自然科学版 ),2022,56(3):520-530.

作者姓名：	王勇超曹钰杨玉辉许端清

作者单位：	1. 浙江大学信息技术中心，浙江杭州 3100272. 浙江大学计算机科学与技术学院，浙江杭州 310027

基金项目：	国家重点研发计划资助项目（2020YFC1523101, 2019YFC1521304）；浙江省重点研发计划资助项目（2021C03140）；宁波市2021科技创新重大专项（20211ZDYF020028）

摘要：	针对端到端的对话生成模型普遍存在无意义安全回复和大量重复词汇的问题，和将外部知识引入对话系统的挑战，提出基于知识迁移和双向异步序列的对话生成模型.将知识库中的外部知识融合到对话生成模型并显式地生成在回复语句中；使用预训练的知识库问答模型获取输入语句的知识表达、候选知识表达以及关键字；搭建2个编码器?解码器结构，通过双向异步解码将关键字显式地生成在对话回复中；编、解码阶段均引入预训练模型的知识理解和知识表达能力，提升对话生成对知识信息的捕捉能力.提出重复检测惩罚机制，通过赋予惩罚权重的方式减少对话生成中的重复词汇.实验结果表明，所提模型在自动评估和人工评价指标上均优于已有的对话生成方法.
关键词：	对话生成知识实体知识库问答双向异步生成序列到序列模型
Dialogue generation model based on knowledge transfer and two-direction asynchronous sequence

Yong-chao WANG,Yu CAO,Yu-hui YANG,Duan-qing XU.Dialogue generation model based on knowledge transfer and two-direction asynchronous sequence[J].Journal of Zhejiang University(Engineering Science),2022,56(3):520-530.

Authors:	Yong-chao WANG Yu CAO Yu-hui YANG Duan-qing XU

Abstract:	A dialogue generation model based on knowledge transfer and two-direction asynchronous sequence generation was proposed, aiming to the generally meaningless safe replies and the problem of a large number of repetitive words in view of the end-to-end dialogue generation models, and the challenge of introducing external knowledge into the dialogue system. The external knowledge in the knowledge base was fused into the dialogue generation model and explicitly generated in the reply sentences. A pre-trained model based on the question and answering of the knowledge base was used to obtain the knowledge expressions of the input sentences, the knowledge expressions of the candidate answers, and keywords. The keywords were then used in the reply. Two encoder-decoder structure models were proposed, and the keywords were generated explicitly in the dialogue reply by two-direction asynchronous generation. The knowledge expressions and understanding capabilities of the pre-trained model were introduced to capture knowledge information to dialog generation at the encoding and decoding stages. A repetitive detection-penalty mechanism was proposed to reduce the repeated words problem by giving weight to punish the repetitive words. Experimental results show that the model outperforms better than existing methods in both automatic evaluation and manual evaluation indicators.

Keywords:	dialogue generation knowledge entity knowledge base question and answer two-direction asynchronous generation sequence-to-sequence model

	点击此处可从《浙江大学学报(自然科学版 )》浏览原始摘要信息
	点击此处可从《浙江大学学报(自然科学版 )》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏