|国家预印本平台
首页|SpectR: Dynamically Composing LM Experts with Spectral Routing

SpectR: Dynamically Composing LM Experts with Spectral Routing

SpectR: Dynamically Composing LM Experts with Spectral Routing

来源:Arxiv_logoArxiv
英文摘要

Training large, general-purpose language models poses significant challenges. The growing availability of specialized expert models, fine-tuned from pretrained models for specific tasks or domains, offers a promising alternative. Leveraging the potential of these existing expert models in real-world applications requires effective methods to select or merge the models best suited for a given task. This paper introduces SPECTR, an approach for dynamically composing expert models at each time step during inference. Notably, our method requires no additional training and enables flexible, token- and layer-wise model combinations. Our experimental results demonstrate that SPECTR improves routing accuracy over alternative training-free methods, increasing task performance across expert domains.

William Fleshman、Benjamin Van Durme

计算技术、计算机技术

William Fleshman,Benjamin Van Durme.SpectR: Dynamically Composing LM Experts with Spectral Routing[EB/OL].(2025-04-04)[2025-05-02].https://arxiv.org/abs/2504.03454.点此复制

评论