对比度对准何时会改善多到许多神经机器的翻译？

论文标题

对比度对准何时会改善多到许多神经机器的翻译？

When do Contrastive Word Alignments Improve Many-to-many Neural Machine Translation?

论文作者

Mao, Zhuoyuan, Chu, Chenhui, Dabre, Raj, Song, Haiyue, Wan, Zhen, Kurohashi, Sadao

论文摘要

事实证明，单词一致性可以使多到多的神经机器翻译（NMT）受益。但是，在以前的方法中，使用高质量的基础双语词典进行预编辑，这对于大多数语言对而言是不可用的。同时，对比目标可以隐式地使用自动学习的单词对齐，这在许多一对多的NMT中尚未探讨。这项工作提出了一个单词级的对比目标，以利用多到许多NMT的单词对齐方式。经验结果表明，这将导致几对语言对的0.8 BLEU增长。分析表明，在多对多的NMT中，编码器的句子检索与翻译质量高度相关，这解释了何时拟议的方法会影响翻译。这激发了许多对许多NMT的探索，以改善编码器的句子检索表现。

Word alignment has proven to benefit many-to-many neural machine translation (NMT). However, high-quality ground-truth bilingual dictionaries were used for pre-editing in previous methods, which are unavailable for most language pairs. Meanwhile, the contrastive objective can implicitly utilize automatically learned word alignment, which has not been explored in many-to-many NMT. This work proposes a word-level contrastive objective to leverage word alignments for many-to-many NMT. Empirical results show that this leads to 0.8 BLEU gains for several language pairs. Analyses reveal that in many-to-many NMT, the encoder's sentence retrieval performance highly correlates with the translation quality, which explains when the proposed method impacts translation. This motivates future exploration for many-to-many NMT to improve the encoder's sentence retrieval performance.

下载PDF全文

下载文献需遵守相关版权规定

论文标题