Semeval-2022任务2：使用多语言预审前的语言模型检测多字表达式的惯用性

论文标题

Semeval-2022任务2：使用多语言预审前的语言模型检测多字表达式的惯用性

HiJoNLP at SemEval-2022 Task 2: Detecting Idiomaticity of Multiword Expressions using Multilingual Pretrained Language Models

论文作者

Tan, Minghuan

论文摘要

本文介绍了一种仅从MWE在多语言审计的语言模型上的情境化表示中检测惯用性的方法。我们的实验发现，较大的模型通常在惯用性检测中更有效。但是，使用较高的模型可能不能保证更好的性能。在多语言场景中，不同语言的融合不一致，丰富的资源语言比其他语言具有很大的优势。

This paper describes an approach to detect idiomaticity only from the contextualized representation of a MWE over multilingual pretrained language models. Our experiments find that larger models are usually more effective in idiomaticity detection. However, using a higher layer of the model may not guarantee a better performance. In multilingual scenarios, the convergence of different languages are not consistent and rich-resource languages have big advantages over other languages.

下载PDF全文

下载文献需遵守相关版权规定

论文标题