Title Generating Rational Commonsense Knowledge-Aware Dialogue Responses With Channel-Aware Knowledge Fusing Network
Authors Wu, Sixing
Li, Ying
Zhang, Dawei
Wu, Zhonghai
Affiliation Peking Univ, Sch Comp, Beijing 100871, Peoples R China
Peking Univ, Natl Engn Res Ctr Software Engn, Beijing 100871, Peoples R China
Peking Univ, Sch Software & Microelect, Beijing 100871, Peoples R China
Issue Date 2022
Publisher IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING
Abstract Dialogues systems endow machines with the ability to converse with humans using natural language. Nonetheless, previous Seq2Seq-based generative dialogue systems often generate safe but meaningless responses, such as 'I don't know' or 'I think so'. To this end, researchers proposed to infuse external knowledge into dialogue generation, and such knowledge-enhanced methods have achieved remarkable improvements in the open-domain dialogue systems. External knowledge is an exogenous input, where the estrangement inevitably exists between knowledge and dialogue context. Although previous knowledge-enhanced works can already use commonsense knowledge to generate informative responses, they always use knowledge in a single-channel paradigm, which is hard to accurately handle different data-flows and then tends to generate irrational dialogue responses. Thus, they tend to be confused and generate strange responses when infusing the knowledge into dialogue generation, such as 'I just ate a basketball,' dramatically degrading the user experience. To address this problem, this paper proposes a novel Channel-Aware Knowledge Fusing Network (CAKF). Rather than following the traditional single-channel paradigm, CAKF employs three unique channels to handle different data-flows more clearly and rationally: a base channel serves like a vanilla Seq2Seq decoder; a context channel to utilize the contextual information, and a knowledge channel to infuse commonsense knowledge into the dialogue generation. Above such three channels, a Sequential Manager is built to maintain the global sequential decision state, aggregate the local data-flows, and make the final prediction. Experiments on two open-released datasets (a Chinese Weibo and an English Reddit) demonstrated the superior performance of this work against various state-of-the-art approaches.
URI http://hdl.handle.net/20.500.11897/657644
ISSN 2329-9290
DOI 10.1109/TASLP.2022.3199649
Indexed EI
SCI(E)
Appears in Collections: 软件工程国家工程研究中心
软件与微电子学院

Files in This Work
There are no files associated with this item.

Web of Science®


0

Checked on Last Week

Scopus®



Checked on Current Time

百度学术™


0

Checked on Current Time

Google Scholar™





License: See PKU IR operational policies.