深度加固学习，教科书

论文标题

深度加固学习，教科书

Deep Reinforcement Learning, a textbook

论文作者

Plaat, Aske

论文摘要

深度强化学习最近引起了很多关注。在自动驾驶，游戏玩法，分子重组和机器人技术等各种活动中取得了令人印象深刻的结果。在所有这些领域，计算机程序都教会自己解决困难问题。他们学会了飞行模型直升机，并进行特技飞行器，例如循环和滚动。在某些应用中，它们甚至比在Atari，Go，Poker和Starcraft中的人类变得更好。深度强化学习探索复杂环境的方式使我们想起了孩子们如何通过调皮尝试，获得反馈并重新尝试的方式学习。计算机似乎真正拥有人类学习的各个方面。这是人工智能梦的核心。教育工作者的研究成功并没有引起人们的注意，大学开始提供有关该学科的课程。本书的目的是提供深入强化学习领域的全面概述。这本书是为人工智能的研究生以及希望更好地了解深入强化学习方法及其挑战的研究人员和从业人员而撰写的。我们假设对计算机科学和人工智能的了解的本科级别；本书的编程语言是Python。我们描述了深度强化学习的基础，算法和应用。我们涵盖了构成该领域基础的已建立的无模型和基于模型的方法。发展迅速，我们还涵盖了先进的主题：深度多代理的增强学习，深层的层次结构增强学习和深度元学习。

Deep reinforcement learning has gathered much attention recently. Impressive results were achieved in activities as diverse as autonomous driving, game playing, molecular recombination, and robotics. In all these fields, computer programs have taught themselves to solve difficult problems. They have learned to fly model helicopters and perform aerobatic manoeuvers such as loops and rolls. In some applications they have even become better than the best humans, such as in Atari, Go, poker and StarCraft. The way in which deep reinforcement learning explores complex environments reminds us of how children learn, by playfully trying out things, getting feedback, and trying again. The computer seems to truly possess aspects of human learning; this goes to the heart of the dream of artificial intelligence. The successes in research have not gone unnoticed by educators, and universities have started to offer courses on the subject. The aim of this book is to provide a comprehensive overview of the field of deep reinforcement learning. The book is written for graduate students of artificial intelligence, and for researchers and practitioners who wish to better understand deep reinforcement learning methods and their challenges. We assume an undergraduate-level of understanding of computer science and artificial intelligence; the programming language of this book is Python. We describe the foundations, the algorithms and the applications of deep reinforcement learning. We cover the established model-free and model-based methods that form the basis of the field. Developments go quickly, and we also cover advanced topics: deep multi-agent reinforcement learning, deep hierarchical reinforcement learning, and deep meta learning.

下载PDF全文

下载文献需遵守相关版权规定

论文标题