论文标题
求职者:利用大型语言模型提示以下数据有效的体现指令
Prompter: Utilizing Large Language Model Prompting for a Data Efficient Embodied Instruction Following
论文作者
论文摘要
体现教学以下(EIF)研究如何控制自动移动操纵机器人,以完成由自然语言说明描述的长马操作任务。尽管对模拟器进行了许多对EIF的研究,但该领域的最终目标是在现实生活中部署代理商。这就是为什么最近的方法端到端训练模型并采用模块化方法的原因之一,而模块化方法不需要昂贵的专家操作数据。但是,由于仍在将模块化思想导入EIF的早期,因此在EIF任务中寻找有效的模块远非结论。在本文中,我们建议使用从两个外部来源获得的知识扩展模块化设计。首先,我们表明将部署机器人的物理约束嵌入到模块设计中是非常有效的。我们的设计还允许相同的模块化系统跨不同配置的机器人使用,并具有最小的修改。其次,我们表明,基于里程碑的对象搜索先前由需要专用数据集的训练有素的模型实施,可以用实施取代,该实施促使鉴定的大型语言模型用于地标 - 对象关系,从而消除了收集专用培训数据的需求。我们提出的提示者在阿尔弗雷德基准测试中获得41.53 \%和45.32 \%,分别具有高级指令和逐步指示,大大优于5.46 \%\%和9.91 \%。
Embodied Instruction Following (EIF) studies how autonomous mobile manipulation robots should be controlled to accomplish long-horizon tasks described by natural language instructions. While much research on EIF is conducted in simulators, the ultimate goal of the field is to deploy the agents in real life. This is one of the reasons why recent methods have moved away from training models end-to-end and take modular approaches, which do not need the costly expert operation data. However, as it is still in the early days of importing modular ideas to EIF, a search for modules effective in the EIF task is still far from a conclusion. In this paper, we propose to extend the modular design using knowledge obtained from two external sources. First, we show that embedding the physical constraints of the deployed robots into the module design is highly effective. Our design also allows the same modular system to work across robots of different configurations with minimal modifications. Second, we show that the landmark-based object search, previously implemented by a trained model requiring a dedicated set of data, can be replaced by an implementation that prompts pretrained large language models for landmark-object relationships, eliminating the need for collecting dedicated training data. Our proposed Prompter achieves 41.53\% and 45.32\% on the ALFRED benchmark with high-level instructions only and step-by-step instructions, respectively, significantly outperforming the previous state of the art by 5.46\% and 9.91\%.