论文标题

结构化访问:用于安全AI部署的新兴范式

Structured access: an emerging paradigm for safe AI deployment

论文作者

Shevlane, Toby

论文摘要

结构化访问是用于安全部署人工智能(AI)的新兴范式。开发人员没有公开传播AI系统,而是促进了受控的,而是与其AI系统的长度相互作用。目的是防止危险的AI功能可广泛访问,同时保存可以安全使用的AI功能。开发人员必须既必须限制如何使用AI系统,又可以防止用户通过修改或反向工程对AI系统的修改或反向工程来规避这些限制。通过基于云的AI服务实施结构化访问最有效,而不是传播在用户硬件上本地运行的AI软件。基于云的接口为AI开发人员提供了更大的范围,用于控制AI系统的使用方式,并防止对系统设计的未经授权修改。本章扩展了对AI社区中“出版规范”的讨论,迄今为止,该社区的重点是如何传播AI研究项目的信息内容(例如,代码和模型)。尽管这是一个重要的问题,但通过控制信息流可以实现的目标是有限的。结构化的访问视图AI软件不仅是可以共享的信息,还可以作为用户可以具有ARM长度交互的工具。 AI开发人员正在实践结构化访问的早期例子,但是在基于云的接口的功能和更广泛的机构框架中,还有很多进一步开发的空间。

Structured access is an emerging paradigm for the safe deployment of artificial intelligence (AI). Instead of openly disseminating AI systems, developers facilitate controlled, arm's length interactions with their AI systems. The aim is to prevent dangerous AI capabilities from being widely accessible, whilst preserving access to AI capabilities that can be used safely. The developer must both restrict how the AI system can be used, and prevent the user from circumventing these restrictions through modification or reverse engineering of the AI system. Structured access is most effective when implemented through cloud-based AI services, rather than disseminating AI software that runs locally on users' hardware. Cloud-based interfaces provide the AI developer greater scope for controlling how the AI system is used, and for protecting against unauthorized modifications to the system's design. This chapter expands the discussion of "publication norms" in the AI community, which to date has focused on the question of how the informational content of AI research projects should be disseminated (e.g., code and models). Although this is an important question, there are limits to what can be achieved through the control of information flows. Structured access views AI software not only as information that can be shared but also as a tool with which users can have arm's length interactions. There are early examples of structured access being practiced by AI developers, but there is much room for further development, both in the functionality of cloud-based interfaces and in the wider institutional framework.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源