Formalizing Embeddedness Failures in Universal Artificial Intelligence
Formalizing Embeddedness Failures in Universal Artificial Intelligence
We rigorously discuss the commonly asserted failures of the AIXI reinforcement learning agent as a model of embedded agency. We attempt to formalize these failure modes and prove that they occur within the framework of universal artificial intelligence, focusing on a variant of AIXI that models the joint action/percept history as drawn from the universal distribution. We also evaluate the progress that has been made towards a successful theory of embedded agency based on variants of the AIXI agent.
Cole Wyeth、Marcus Hutter
自动化基础理论
Cole Wyeth,Marcus Hutter.Formalizing Embeddedness Failures in Universal Artificial Intelligence[EB/OL].(2025-05-23)[2025-07-16].https://arxiv.org/abs/2505.17882.点此复制
评论