论文标题

示范中的残留学习:调整DMP进行接触丰富的操作

Residual Learning from Demonstration: Adapting DMPs for Contact-rich Manipulation

论文作者

Davchev, Todor, Luck, Kevin Sebastian, Burke, Michael, Meier, Franziska, Schaal, Stefan, Ramamoorthy, Subramanian

论文摘要

涉及接触和摩擦的操作技巧是许多机器人任务所固有的。使用插入钉钉的电动机原始词,我们研究机器人如何学习此类技能。动态运动原语(DMP)是通过行为克隆(BC)提取此类政策的流行方式,但在插入的背景下可能会挣扎。诸如残留学习之类的政策适应策略可以帮助在接触丰富的操纵下改善政策的整体绩效。但是,尚不清楚如何使用DMP最好地做到这一点。结果,我们考虑了调整DMP配方的几种可能方法,并提出了``从演示中的残留学习''(RLFD),该框架将DMP与增强学习(RL)结合在一起以学习残留校正政策。我们的评估表明,直接在任务空间中应用残差学习并在机器人的全部姿势上运行可以显着改善DMP的整体性能。我们表明,RLFD为关节解决方案提供了一种温和的,可以改善DMPS \ rb {的任务成功和概括,并可以通过几次射击任务适应来转移到不同的几何和摩擦。提出的框架对一组任务进行了评估。模拟的机器人和物理机器人必须成功地将钉子,齿轮和插头插入各自的插座。本文随附的其他材料和视频可在https://sites.google.com/view/rlfd/上提供。

Manipulation skills involving contact and friction are inherent to many robotics tasks. Using the class of motor primitives for peg-in-hole like insertions, we study how robots can learn such skills. Dynamic Movement Primitives (DMP) are a popular way of extracting such policies through behaviour cloning (BC) but can struggle in the context of insertion. Policy adaptation strategies such as residual learning can help improve the overall performance of policies in the context of contact-rich manipulation. However, it is not clear how to best do this with DMPs. As a result, we consider several possible ways for adapting a DMP formulation and propose ``residual Learning from Demonstration`` (rLfD), a framework that combines DMPs with Reinforcement Learning (RL) to learn a residual correction policy. Our evaluations suggest that applying residual learning directly in task space and operating on the full pose of the robot can significantly improve the overall performance of DMPs. We show that rLfD offers a gentle to the joints solution that improves the task success and generalisation of DMPs \rb{and enables transfer to different geometries and frictions through few-shot task adaptation}. The proposed framework is evaluated on a set of tasks. A simulated robot and a physical robot have to successfully insert pegs, gears and plugs into their respective sockets. Other material and videos accompanying this paper are provided at https://sites.google.com/view/rlfd/.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源