论文标题
非自动回归神经对话世代
Non-Autoregressive Neural Dialogue Generation
论文作者
论文摘要
Maximum Mutual information (MMI), which models the bidirectional dependency between responses ($y$) and contexts ($x$), i.e., the forward probability $\log p(y|x)$ and the backward probability $\log p(x|y)$, has been widely used as the objective in the \sts model to address the dull-response issue in open-domain dialog generation.不幸的是,在\ sts模型的框架下,从$ \ log P(y | x) + \ log p(x | y)$直接解码是不可行的,因为第二部分(即$ p(x | y)$)需要对目标生成完成,然后才能计算出$ y $的搜索空间。从经验上讲,首先给定$ p(y | x)$的n-最佳列表,然后使用$ p(x | y)$来重新列表n-bess列表,这不可避免地会导致非全球优化的解决方案。在本文中,我们建议使用非自动回旋(非AR)生成模型来解决这个非全球最优问题。由于目标令牌是在非AR生成中独立生成的,因此可以在生成时立即计算每个目标单词的$ P(x | y)$,并且不必等待整个序列的完成。这自然可以解码非全球最佳问题。实验结果表明,所提出的非AR策略会产生更多样化,相干和适当的反应,从而在BLEU评分和人类评估中产生实质性提高。
Maximum Mutual information (MMI), which models the bidirectional dependency between responses ($y$) and contexts ($x$), i.e., the forward probability $\log p(y|x)$ and the backward probability $\log p(x|y)$, has been widely used as the objective in the \sts model to address the dull-response issue in open-domain dialog generation. Unfortunately, under the framework of the \sts model, direct decoding from $\log p(y|x) + \log p(x|y)$ is infeasible since the second part (i.e., $p(x|y)$) requires the completion of target generation before it can be computed, and the search space for $y$ is enormous. Empirically, an N-best list is first generated given $p(y|x)$, and $p(x|y)$ is then used to rerank the N-best list, which inevitably results in non-globally-optimal solutions. In this paper, we propose to use non-autoregressive (non-AR) generation model to address this non-global optimality issue. Since target tokens are generated independently in non-AR generation, $p(x|y)$ for each target word can be computed as soon as it's generated, and does not have to wait for the completion of the whole sequence. This naturally resolves the non-global optimal issue in decoding. Experimental results demonstrate that the proposed non-AR strategy produces more diverse, coherent, and appropriate responses, yielding substantive gains in BLEU scores and in human evaluations.