论文标题
Semeval-2022任务2:使用多语言预审前的语言模型检测多字表达式的惯用性
HiJoNLP at SemEval-2022 Task 2: Detecting Idiomaticity of Multiword Expressions using Multilingual Pretrained Language Models
论文作者
论文摘要
本文介绍了一种仅从MWE在多语言审计的语言模型上的情境化表示中检测惯用性的方法。我们的实验发现,较大的模型通常在惯用性检测中更有效。但是,使用较高的模型可能不能保证更好的性能。在多语言场景中,不同语言的融合不一致,丰富的资源语言比其他语言具有很大的优势。
This paper describes an approach to detect idiomaticity only from the contextualized representation of a MWE over multilingual pretrained language models. Our experiments find that larger models are usually more effective in idiomaticity detection. However, using a higher layer of the model may not guarantee a better performance. In multilingual scenarios, the convergence of different languages are not consistent and rich-resource languages have big advantages over other languages.