多基因系统中的价值比对平衡

论文标题

多基因系统中的价值比对平衡

Value Alignment Equilibrium in Multiagent Systems

论文作者

Montes, Nieves, Sierra, Carles

论文摘要

近年来，价值一致性已成为产生有益和正念人工智能系统的基本原则。它主要指出，自治实体应以与我们的人类价值观保持一致的方式行为。在这项工作中，我们总结了一个先前开发的模型，该模型将价值视为对世界状态的偏好，并定义了管理规范与价值观之间的一致性。我们使用迭代囚犯的困境模型为此框架提供了用例，我们用来示例我们审查的定义。我们利用此用例来引入与既定框架集成的新概念：对齐平衡和帕累托最佳对齐。这些是在经典的NASH平衡和帕累托最优性上灵感的，但旨在考虑我们希望在系统中建模的任何值。

Value alignment has emerged in recent years as a basic principle to produce beneficial and mindful Artificial Intelligence systems. It mainly states that autonomous entities should behave in a way that is aligned with our human values. In this work, we summarize a previously developed model that considers values as preferences over states of the world and defines alignment between the governing norms and the values. We provide a use-case for this framework with the Iterated Prisoner's Dilemma model, which we use to exemplify the definitions we review. We take advantage of this use-case to introduce new concepts to be integrated with the established framework: alignment equilibrium and Pareto optimal alignment. These are inspired on the classical Nash equilibrium and Pareto optimality, but are designed to account for any value we wish to model in the system.

下载PDF全文

下载文献需遵守相关版权规定

论文标题