Bare Minimum Mitigations for Autonomous AI Development
Artificial intelligence (AI) is advancing rapidly, with the potential to significantly automate AI research and development itself in the near future. In 2024, international scientists, including Turing Award recipients, warned of risks from autonomous AI research and development (R&D), proposing a red line: no AI system should be able to improve itself or other AI systems without explicit human approval and assistance. However, the criteria for meaningful human approval remain unclear, and there is limited analysis of the specific risks of autonomous AI R&D, how they arise, and how to mitigate them. In this brief paper, we outline how these risks may emerge and propose four minimum safeguard recommendations applicable when AI agents significantly automate or accelerate AI development.
Joshua Clymer, Isabella Duan, Chris Cundy, Yawen Duan, Fynn Heide, Chaochao Lu, Søren Mindermann, Conor McGurk, Xudong Pan, Saad Siddiqui, Jingren Wang, Min Yang, Xianyuan Zhan
Subject categories: Fundamental theory of automation; computing technology; computer technology
Joshua Clymer, Isabella Duan, Chris Cundy, Yawen Duan, Fynn Heide, Chaochao Lu, Søren Mindermann, Conor McGurk, Xudong Pan, Saad Siddiqui, Jingren Wang, Min Yang, Xianyuan Zhan. Bare Minimum Mitigations for Autonomous AI Development [EB/OL]. (2025-04-21) [2025-05-28]. https://arxiv.org/abs/2504.15416.