Paper Title

Emergent Communication: Generalization and Overfitting in Lewis Games

Authors

Mathieu Rita, Corentin Tallec, Paul Michel, Jean-Bastien Grill, Olivier Pietquin, Emmanuel Dupoux, Florian Strub

Abstract

Lewis signaling games are a class of simple communication games for simulating the emergence of language. In these games, two agents must agree on a communication protocol in order to solve a cooperative task. Previous work has shown that agents trained to play this game with reinforcement learning tend to develop languages that display undesirable properties from a linguistic point of view (lack of generalization, lack of compositionality, etc.). In this paper, we aim to provide a better understanding of this phenomenon by analytically studying the learning problem in Lewis games. As a core contribution, we demonstrate that the standard objective in Lewis games can be decomposed into two components: a co-adaptation loss and an information loss. This decomposition enables us to surface two potential sources of overfitting, which we show may undermine the emergence of a structured communication protocol. In particular, when we control for overfitting on the co-adaptation loss, we recover desired properties in the emergent languages: they are more compositional and generalize better.
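
The abstract does not state the decomposition explicitly, so the following is only an illustrative sketch under stated assumptions: a reconstruction-style Lewis game in which the listener is scored by cross-entropy. In that setting, the expected loss splits exactly into a conditional-entropy term that depends only on the speaker and a KL term that measures how far the listener is from the best possible listener for that speaker. Labeling these two terms as the "information loss" and the "co-adaptation loss" is an assumption here, and the toy distributions and variable names are hypothetical rather than taken from the paper.

```python
import math

# Hypothetical toy game (not from the paper): 4 equiprobable inputs, 2 messages.
p_x = [0.25, 0.25, 0.25, 0.25]

# Speaker policy pi(m | x): rows are inputs, columns are messages.
speaker = [
    [0.9, 0.1],
    [0.8, 0.2],
    [0.2, 0.8],
    [0.1, 0.9],
]

# Listener guess rho(x | m): rows are messages, columns are inputs.
listener = [
    [0.4, 0.4, 0.1, 0.1],
    [0.1, 0.2, 0.3, 0.4],
]

n_x, n_m = len(p_x), len(listener)
pairs = [(x, m) for x in range(n_x) for m in range(n_m)]

# Joint p(x, m) induced by the speaker, and the message marginal p(m).
joint = [[p_x[x] * speaker[x][m] for m in range(n_m)] for x in range(n_x)]
p_m = [sum(joint[x][m] for x in range(n_x)) for m in range(n_m)]

# Bayesian posterior p(x | m): the optimal listener for this speaker.
posterior = [[joint[x][m] / p_m[m] for x in range(n_x)] for m in range(n_m)]

# Assumed standard objective: expected cross-entropy of the listener's guess.
total_loss = -sum(joint[x][m] * math.log(listener[m][x]) for x, m in pairs)

# Assumed "information" term: conditional entropy H(X | M), speaker only.
info_loss = -sum(joint[x][m] * math.log(posterior[m][x]) for x, m in pairs)

# Assumed "co-adaptation" term: expected KL(posterior || listener) over messages.
coadapt_loss = sum(
    joint[x][m] * math.log(posterior[m][x] / listener[m][x]) for x, m in pairs
)

print(f"total         = {total_loss:.6f}")
print(f"info + coadapt = {info_loss + coadapt_loss:.6f}")
```

In this sketch the two printed values agree, because the cross-entropy identity holds for any speaker and listener with strictly positive probabilities. The KL term vanishes only when the listener matches the posterior, which gives one intuition for why a listener can overfit to a particular speaker independently of how informative the messages are.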
