论文标题
从部分观察的图像中学习视觉计划模型
Learning Visual Planning Models from Partially Observed Images
论文作者
论文摘要
在古典计划中,人们对计划模型学习的关注越来越多。但是,大多数现有方法都集中在符号表示中的结构化数据中学习计划模型。在实际情况下,通常很难获得此类结构化数据。尽管已经从完全观察到的非结构化数据(例如图像)中开发了许多方法来学习计划模型,但在许多情况下,原始观察通常是不完整的。在本文中,我们提供了一个新颖的框架,即\ atype {recplan},用于从部分观察到的原始图像痕迹中学习过渡模型。更具体地说,通过在跟踪中考虑前期和后续图像,我们了解原始观察的潜在状态表示,然后基于此类表示形式构建过渡模型。此外,我们提出了一种基于神经网络的方法来学习一种启发式模型,该模型估计了给定目标观察的距离。基于学习的过渡模型和启发式模型,我们为图像实施了经典的计划者。我们从经验上表明,我们的方法比以不完整的观察结果在环境中学习视觉计划模型的最新方法更有效。
There has been increasing attention on planning model learning in classical planning. Most existing approaches, however, focus on learning planning models from structured data in symbolic representations. It is often difficult to obtain such structured data in real-world scenarios. Although a number of approaches have been developed for learning planning models from fully observed unstructured data (e.g., images), in many scenarios raw observations are often incomplete. In this paper, we provide a novel framework, \aType{Recplan}, for learning a transition model from partially observed raw image traces. More specifically, by considering the preceding and subsequent images in a trace, we learn the latent state representations of raw observations and then build a transition model based on such representations. Additionally, we propose a neural-network-based approach to learn a heuristic model that estimates the distance toward a given goal observation. Based on the learned transition model and heuristic model, we implement a classical planner for images. We exhibit empirically that our approach is more effective than a state-of-the-art approach of learning visual planning models in the environment with incomplete observations.