UI-Evol: Automatic Knowledge Evolving for Computer Use Agents

Source: arXiv
Abstract

External knowledge has played a crucial role in the recent development of computer use agents. We identify a critical knowledge-execution gap: retrieved knowledge often fails to translate into effective real-world task execution. Our analysis shows that even knowledge that is 90% correct yields only a 41% execution success rate. To bridge this gap, we propose UI-Evol, a plug-and-play module for autonomous GUI knowledge evolution. UI-Evol consists of two stages: a Retrace Stage that extracts faithful, objective action sequences from actual agent-environment interactions, and a Critique Stage that refines existing knowledge by comparing these sequences against external references. We conduct comprehensive experiments on the OSWorld benchmark with the state-of-the-art Agent S2. Our results demonstrate that UI-Evol not only significantly boosts task performance but also addresses a previously overlooked issue, the high behavioral standard deviation of computer use agents, yielding both better task performance and substantially improved agent reliability.

Ziyun Zhang, Xinyi Liu, Xiaoyi Zhang, Jun Wang, Gang Chen, Yan Lu

Subject: Computing technology, computer technology

Ziyun Zhang, Xinyi Liu, Xiaoyi Zhang, Jun Wang, Gang Chen, Yan Lu. UI-Evol: Automatic Knowledge Evolving for Computer Use Agents [EB/OL]. (2025-05-28) [2025-06-19]. https://arxiv.org/abs/2505.21964.
