Paper Title
Emergent Communication with World Models
Paper Authors
Paper Abstract
We introduce Language World Models, a class of language-conditional generative models that interpret natural language messages by predicting latent codes of future observations. This provides a visual grounding of the message, similar to an enhanced observation of the world, which may include objects outside the listening agent's field of view. We incorporate this "observation" into a persistent memory state and allow the listening agent's policy to condition on it, akin to the relationship between memory and controller in a World Model. We show that this improves effective communication and task success in 2D gridworld speaker-listener navigation tasks. In addition, we develop two losses framed specifically for our model-based formulation to promote positive signalling and positive listening. Finally, because messages are interpreted in a generative model, we can visualize the model's beliefs to gain insight into how the communication channel is utilized.
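The data flow the abstract describes (a message is interpreted as a predicted latent code, that code is folded into a persistent memory state, and the listener's policy conditions on the memory) can be illustrated with a toy sketch. This is not the authors' architecture: the class name `ToyLanguageWorldModel`, the dimensions, the token-id message format, and the untrained random weights are all assumptions made purely for illustration, and the two losses mentioned in the abstract are not modelled.

```python
import math
import random

# All sizes below are hypothetical and chosen only to keep the sketch small.
LATENT_DIM = 4    # size of the predicted latent code
MEMORY_DIM = 4    # size of the persistent memory state
VOCAB_SIZE = 5    # number of distinct message tokens
NUM_ACTIONS = 3   # size of the listener's action set


def make_matrix(rows, cols, rng):
    """Small random weight matrix standing in for learned parameters."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Plain matrix-vector product on nested lists."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class ToyLanguageWorldModel:
    """Illustrative stand-in: message -> latent code -> memory -> action."""

    def __init__(self, seed=0):
        rng = random.Random(seed)
        self.token_embedding = make_matrix(VOCAB_SIZE, LATENT_DIM, rng)
        self.w_memory = make_matrix(MEMORY_DIM, MEMORY_DIM, rng)
        self.w_latent = make_matrix(MEMORY_DIM, LATENT_DIM, rng)
        self.w_policy = make_matrix(NUM_ACTIONS, MEMORY_DIM, rng)
        self.memory = [0.0] * MEMORY_DIM  # persists across messages

    def interpret(self, message):
        """Map a message (list of token ids) to a predicted latent code."""
        code = [0.0] * LATENT_DIM
        for token in message:
            code = [c + e for c, e in zip(code, self.token_embedding[token])]
        return [math.tanh(c) for c in code]

    def update_memory(self, latent_code):
        """Fold the predicted 'observation' into the persistent memory."""
        recurrent = matvec(self.w_memory, self.memory)
        injected = matvec(self.w_latent, latent_code)
        self.memory = [math.tanh(r + i) for r, i in zip(recurrent, injected)]
        return self.memory

    def act(self):
        """Greedy policy that conditions only on the memory state."""
        logits = matvec(self.w_policy, self.memory)
        return max(range(NUM_ACTIONS), key=lambda a: logits[a])


# Usage: one listener step driven by a two-token message.
listener = ToyLanguageWorldModel(seed=0)
latent = listener.interpret([1, 3])
memory = listener.update_memory(latent)
action = listener.act()
```

In this sketch the policy never sees the message directly; it only reads the memory state, which mirrors the memory/controller separation the abstract compares to a World Model.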