论文标题
语言是一种认知工具,可以想象在好奇心驱动的探索中的目标
Language as a Cognitive Tool to Imagine Goals in Curiosity-Driven Exploration
论文作者
论文摘要
发展机器学习研究人造代理如何模拟儿童学习技能的开放式曲目的方式。这样的代理需要创建和表示目标,选择要追求的目标并学会实现目标。最近的方法考虑了使用状态的生成模型固定和手工定义或学习的目标空间。该有限的代理是在已知效应分布中采样目标。我们认为,可以想象分发目标的能力是实现创意发现和开放式学习的关键。孩子们通过利用语言的组成性作为一种工具来想象他们从未有过的结果的描述,以将它们作为目标作为目标的描述。我们介绍Imagine,一种内在动机的深层增强学习体系结构,该体系结构对这种能力进行了建模。像孩子一样,这种想象力的代理人受益于提供语言描述的社交同伴的指导。为了利用目标想象力,代理商必须能够利用这些描述来解释其想象中的分布目标。通过模块化使这种概括成为可能:学到的目标奖励功能与依靠深度集,通心的注意力和以对象为中心表示的政策之间的分解。我们介绍了操场环境,并研究这种目标想象力如何改善对缺乏这种能力的代理商的概括和探索。此外,我们确定了目标想象的特性,从而使这些结果能够实现这些结果并研究模块化和社会互动的影响。
Developmental machine learning studies how artificial agents can model the way children learn open-ended repertoires of skills. Such agents need to create and represent goals, select which ones to pursue and learn to achieve them. Recent approaches have considered goal spaces that were either fixed and hand-defined or learned using generative models of states. This limited agents to sample goals within the distribution of known effects. We argue that the ability to imagine out-of-distribution goals is key to enable creative discoveries and open-ended learning. Children do so by leveraging the compositionality of language as a tool to imagine descriptions of outcomes they never experienced before, targeting them as goals during play. We introduce IMAGINE, an intrinsically motivated deep reinforcement learning architecture that models this ability. Such imaginative agents, like children, benefit from the guidance of a social peer who provides language descriptions. To take advantage of goal imagination, agents must be able to leverage these descriptions to interpret their imagined out-of-distribution goals. This generalization is made possible by modularity: a decomposition between learned goal-achievement reward function and policy relying on deep sets, gated attention and object-centered representations. We introduce the Playground environment and study how this form of goal imagination improves generalization and exploration over agents lacking this capacity. In addition, we identify the properties of goal imagination that enable these results and study the impacts of modularity and social interactions.