|国家预印本平台
首页|EvoCurr: Self-evolving Curriculum with Behavior Code Generation for Complex Decision-making

EvoCurr: Self-evolving Curriculum with Behavior Code Generation for Complex Decision-making

EvoCurr: Self-evolving Curriculum with Behavior Code Generation for Complex Decision-making

来源:Arxiv_logoArxiv
英文摘要

Large Language Models (LLMs) have demonstrated remarkable capabilities across diverse domains, including programming, planning, and decision-making. However, their performance often degrades when faced with highly complex problem instances that require deep reasoning over long horizons. In such cases, direct problem-solving approaches can lead to inefficiency or failure due to the lack of structured intermediate guidance. To address this, we propose a novel self-evolve framework, EvoCurr, in which a dedicated curriculum-generation LLM constructs a sequence of problem instances with gradually increasing difficulty, tailored to the solver LLM's learning progress. The curriculum dynamically adapts easing challenges when the solver struggles and escalating them when success is consistent, thus maintaining an optimal learning trajectory. This approach enables the solver LLM, implemented as a code-generation model producing Python decision-tree scripts, to progressively acquire the skills needed for complex decision-making tasks. Experimental results on challenging decision-making benchmarks show that our method significantly improves task success rates and solution efficiency compared to direct-solving baselines. These findings suggest that LLM-driven curriculum learning holds strong potential for enhancing automated reasoning in real-world, high-complexity domains.

Yang Cheng、Zilai Wang、Weiyu Ma、Wenhui Zhu、Yue Deng、Jian Zhao

计算技术、计算机技术

Yang Cheng,Zilai Wang,Weiyu Ma,Wenhui Zhu,Yue Deng,Jian Zhao.EvoCurr: Self-evolving Curriculum with Behavior Code Generation for Complex Decision-making[EB/OL].(2025-08-20)[2025-08-24].https://arxiv.org/abs/2508.09586.点此复制

评论