AGORA: Incentivizing Group Emergence Capability in LLMs via Group Distillation
AGORA: Incentivizing Group Emergence Capability in LLMs via Group Distillation
Progress in complex reasoning is constrained by the static nature of the current training datasets. We propose structured interaction as a new scaling axis, moving beyond the prevailing paradigm of increasing model parameters. Our self-evolving framework, AGORA, enables a collaborative ensemble to achieve reasoning performance exceeding state-of-the-art monolithic systems by up to 4.45 percentage points on challenging mathematical benchmarks. This gain stems from group emergent ability-the synthesis of collective capabilities unattainable by isolated models, validating interaction as a scalable driver of intelligence. Our results position the engineering of collaborative ecosystems as a vital frontier for capability emergence.
Ren Zhuang、Ben Wang、Shuifa Sun
计算技术、计算机技术
Ren Zhuang,Ben Wang,Shuifa Sun.AGORA: Incentivizing Group Emergence Capability in LLMs via Group Distillation[EB/OL].(2025-07-25)[2025-08-11].https://arxiv.org/abs/2507.21166.点此复制
评论