非自动回归神经对话世代

论文标题

非自动回归神经对话世代

Non-Autoregressive Neural Dialogue Generation

论文作者

Han, Qinghong, Meng, Yuxian, Wu, Fei, Li, Jiwei

论文摘要

Maximum Mutual information (MMI), which models the bidirectional dependency between responses ($y$) and contexts ($x$), i.e., the forward probability $\log p(y|x)$ and the backward probability $\log p(x|y)$, has been widely used as the objective in the \sts model to address the dull-response issue in open-domain dialog generation.不幸的是，在\ sts模型的框架下，从$ \ log P（y | x） + \ log p（x | y）$直接解码是不可行的，因为第二部分（即$ p（x | y）$）需要对目标生成完成，然后才能计算出$ y $的搜索空间。从经验上讲，首先给定$ p（y | x）$的n-最佳列表，然后使用$ p（x | y）$来重新列表n-bess列表，这不可避免地会导致非全球优化的解决方案。在本文中，我们建议使用非自动回旋（非AR）生成模型来解决这个非全球最优问题。由于目标令牌是在非AR生成中独立生成的，因此可以在生成时立即计算每个目标单词的$ P（x | y）$，并且不必等待整个序列的完成。这自然可以解码非全球最佳问题。实验结果表明，所提出的非AR策略会产生更多样化，相干和适当的反应，从而在BLEU评分和人类评估中产生实质性提高。

Maximum Mutual information (MMI), which models the bidirectional dependency between responses ($y$) and contexts ($x$), i.e., the forward probability $\log p(y|x)$ and the backward probability $\log p(x|y)$, has been widely used as the objective in the \sts model to address the dull-response issue in open-domain dialog generation. Unfortunately, under the framework of the \sts model, direct decoding from $\log p(y|x) + \log p(x|y)$ is infeasible since the second part (i.e., $p(x|y)$) requires the completion of target generation before it can be computed, and the search space for $y$ is enormous. Empirically, an N-best list is first generated given $p(y|x)$, and $p(x|y)$ is then used to rerank the N-best list, which inevitably results in non-globally-optimal solutions. In this paper, we propose to use non-autoregressive (non-AR) generation model to address this non-global optimality issue. Since target tokens are generated independently in non-AR generation, $p(x|y)$ for each target word can be computed as soon as it's generated, and does not have to wait for the completion of the whole sequence. This naturally resolves the non-global optimal issue in decoding. Experimental results demonstrate that the proposed non-AR strategy produces more diverse, coherent, and appropriate responses, yielding substantive gains in BLEU scores and in human evaluations.

下载PDF全文

下载文献需遵守相关版权规定

论文标题