首页|ConstitutionalExperts: Training a Mixture of Principle-based Prompts

ConstitutionalExperts: Training a Mixture of Principle-based Prompts

来源：

英文摘要

Large language models (LLMs) are highly capable at a variety of tasks given the right prompt, but writing one is still a difficult and tedious process. In this work, we introduce ConstitutionalExperts, a method for learning a prompt consisting of constitutional principles (i.e. rules), given a training dataset. Unlike prior methods that optimize the prompt as a single entity, our method incrementally improves the prompt by surgically editing individual principles. We also show that we can improve overall performance by learning unique prompts for different semantic regions of the training data and using a mixture-of-experts (MoE) architecture to route inputs at inference time. We compare our method to other state of the art prompt-optimization techniques across six benchmark datasets. We also investigate whether MoE improves these other techniques. Our results suggest that ConstitutionalExperts outperforms other prompt optimization techniques by 10.9% (F1) and that mixture-of-experts improves all techniques, suggesting its broad applicability.

作者：James Wexler、Savvas Petridis、Ben Wedin、Ann Yuan、Nithum Thain

作者单位：

学科分类：计算技术、计算机技术政治理论

推荐引用：James Wexler,Savvas Petridis,Ben Wedin,Ann Yuan,Nithum Thain.ConstitutionalExperts: Training a Mixture of Principle-based Prompts[EB/OL].(2024-03-07)[2025-08-16].https://arxiv.org/abs/2403.04894.点此复制

ConstitutionalExperts: Training a Mixture of Principle-based Prompts

ConstitutionalExperts: Training a Mixture of Principle-based Prompts

评论