部分可观测时空混沌系统的无模型预测

论文标题

部分可观测时空混沌系统的无模型预测

Bimanual rope manipulation skill synthesis through context dependent correction policy learning from human demonstration

论文作者

Akbulut, T. Baturhan, Girgin, G. Tuba C., Mehrabi, Arash, Asada, Minoru, Ugur, Emre, Oztop, Erhan

论文摘要

从示范中学习（LFD）提供了一种方便的手段，可以在机器人固有的坐标中获得演示时为机器人提供灵巧的技能。但是，长期且复杂的技能中复杂错误的问题减少了其广泛的部署。由于大多数这样的复杂技能由组合的较小运动组成，因此考虑到目标技能是一系列紧凑的运动原语似乎是合理的。在这里，需要解决的问题是确保电动机以允许成功执行后续原始性执行的状态结束。在这项研究中，我们通过提议学习明确的校正政策来关注这个问题，当时未达到原始人之间的预期过渡状态。校正策略本身是通过使用最先进的运动原始学习架构，条件神经运动原语（CNMP）来学习的。然后，学识渊博的校正政策能够以背景方式产生多种运动轨迹。在模拟中使用桌面设置显示了所提出的系统而不是学习完整任务的优点，其中必须以两个步骤将对象推到走廊。然后，通过为上身的类人机器人配备在3D空间中的条上打结的技巧，显示了所提出的方法在现实世界中进行双重打结的适用性。该实验表明，即使面对校正案例不属于人类示范集的一部分，机器人也可以执行成功的打结。

Learning from demonstration (LfD) provides a convenient means to equip robots with dexterous skills when demonstration can be obtained in robot intrinsic coordinates. However, the problem of compounding errors in long and complex skills reduces its wide deployment. Since most such complex skills are composed of smaller movements that are combined, considering the target skill as a sequence of compact motor primitives seems reasonable. Here the problem that needs to be tackled is to ensure that a motor primitive ends in a state that allows the successful execution of the subsequent primitive. In this study, we focus on this problem by proposing to learn an explicit correction policy when the expected transition state between primitives is not achieved. The correction policy is itself learned via behavior cloning by the use of a state-of-the-art movement primitive learning architecture, Conditional Neural Motor Primitives (CNMPs). The learned correction policy is then able to produce diverse movement trajectories in a context dependent way. The advantage of the proposed system over learning the complete task as a single action is shown with a table-top setup in simulation, where an object has to be pushed through a corridor in two steps. Then, the applicability of the proposed method to bi-manual knotting in the real world is shown by equipping an upper-body humanoid robot with the skill of making knots over a bar in 3D space. The experiments show that the robot can perform successful knotting even when the faced correction cases are not part of the human demonstration set.

下载PDF全文

下载文献需遵守相关版权规定

论文标题