替代辅助控制器，用于昂贵的进化增强学习

论文标题

替代辅助控制器，用于昂贵的进化增强学习

A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement Learning

论文作者

Wang, Yuxing, Zhang, Tiantian, Chang, Yongzhe, Liang, Bin, Wang, Xueqian, Yuan, Bo

论文摘要

增强学习（RL）和进化算法（EAS）的整合旨在同时利用样本效率以及两个范式的多样性和鲁棒性。最近，基于此原则的混合学习框架在各种挑战的机器人控制任务中取得了巨大成功。但是，在这些方法中，通过与实际环境的相互作用来评估遗传人群的政策，从而限制了它们在计算昂贵的问题中的适用性。在这项工作中，我们提出了替代辅助控制器（SC），这是一个新颖而有效的模块，可以将其集成到现有框架中，以通过部分替换昂贵的政策评估来减轻EAS的计算负担。应用此模块的关键挑战是防止优化过程被替代物引入的可能的错误最小值误导。为了解决这个问题，我们提出了SC控制混合框架工作流程的两种策略。 OpenAI健身平台上六个连续控制任务的实验表明，SC不仅可以显着降低健身评估的成本，而且可以通过协作学习和进化过程提高原始混合框架的性能。

The integration of Reinforcement Learning (RL) and Evolutionary Algorithms (EAs) aims at simultaneously exploiting the sample efficiency as well as the diversity and robustness of the two paradigms. Recently, hybrid learning frameworks based on this principle have achieved great success in various challenging robot control tasks. However, in these methods, policies from the genetic population are evaluated via interactions with the real environments, limiting their applicability in computationally expensive problems. In this work, we propose Surrogate-assisted Controller (SC), a novel and efficient module that can be integrated into existing frameworks to alleviate the computational burden of EAs by partially replacing the expensive policy evaluation. The key challenge in applying this module is to prevent the optimization process from being misled by the possible false minima introduced by the surrogate. To address this issue, we present two strategies for SC to control the workflow of hybrid frameworks. Experiments on six continuous control tasks from the OpenAI Gym platform show that SC can not only significantly reduce the cost of fitness evaluations, but also boost the performance of the original hybrid frameworks with collaborative learning and evolutionary processes.

下载PDF全文

下载文献需遵守相关版权规定

论文标题