论文标题
概括新兴的沟通
Generalizing Emergent Communication
论文作者
论文摘要
我们将最近开发的Babyai网格世界平台转换为发件人/接收器设置,以检验以下假设:建立的深入强化学习技术足以激励广义代理之间的基础离散通信协议的出现。这与以前采用直通估计或专门电感偏见的实验相反。我们的结果表明,通过提供适当的环境激励措施,确实可以避免这些。此外,他们表明,沟通激励更抽象的语义之间的较长间隔。在某些情况下,与整体剂相比,传达的代理更快地适应了新环境,展示了紧急交流在转移学习和总体上的潜力。
We converted the recently developed BabyAI grid world platform to a sender/receiver setup in order to test the hypothesis that established deep reinforcement learning techniques are sufficient to incentivize the emergence of a grounded discrete communication protocol between generalized agents. This is in contrast to previous experiments that employed straight-through estimation or specialized inductive biases. Our results show that these can indeed be avoided, by instead providing proper environmental incentives. Moreover, they show that a longer interval between communications incentivized more abstract semantics. In some cases, the communicating agents adapted to new environments more quickly than a monolithic agent, showcasing the potential of emergent communication for transfer learning and generalization in general.