首页|Efficient Function Orchestration for Large Language Models

Efficient Function Orchestration for Large Language Models

来源：

英文摘要

Function calling is a fundamental capability of today's large language models, but sequential function calling posed efficiency problems. Recent studies have proposed to request function calls with parallelism support in order to alleviate this issue. However, they either delegate the concurrent function calls to users for execution which are conversely executed sequentially, or overlook the relations among various function calls, rending limited efficiency. This paper introduces LLMOrch, an advanced framework for automated, parallel function calling in large language models. The key principle behind LLMOrch is to identify an available processor to execute a function call while preventing any single processor from becoming overburdened. To this end, LLMOrch models the data relations (i.e., def-use) among different function calls and coordinates their executions by their control relations (i.e., mutual-exclusion) as well as the working status of the underlying processors. When comparing with state-of-the-art techniques, LLMOrch demonstrated comparable efficiency improvements in orchestrating I/O-intensive functions, while significantly outperforming (2$\times$) them with compute-intensive functions. LLMOrch's performance even showed a linear correlation to the number of allocated processors. We believe that these results highlight the potential of LLMOrch as an efficient solution for parallel function orchestration in the context of large language models.

作者：Xiaoxia Liu、Peng Di、Cong Li、Jun Sun、Jingyi Wang

作者单位：

学科分类：计算技术、计算机技术自动化技术、自动化技术设备

推荐引用：Xiaoxia Liu,Peng Di,Cong Li,Jun Sun,Jingyi Wang.Efficient Function Orchestration for Large Language Models[EB/OL].(2025-04-21)[2025-04-30].https://arxiv.org/abs/2504.14872.点此复制

Efficient Function Orchestration for Large Language Models

Efficient Function Orchestration for Large Language Models

评论