首页|Fine-Grained control over Music Generation with Activation Steering

Fine-Grained control over Music Generation with Activation Steering

来源：

英文摘要

We present a method for fine-grained control over music generation through inference-time interventions on an autoregressive generative music transformer called MusicGen. Our approach enables timbre transfer, style transfer, and genre fusion by steering the residual stream using weights of linear probes trained on it, or by steering the attention layer activations in a similar manner. We observe that modelling this as a regression task provides improved performance, hypothesizing that the mean-squared-error better preserve meaningful directional information in the activation space. Combined with the global conditioning offered by text prompts in MusicGen, our method provides both global and local control over music generation. Audio samples illustrating our method are available at our demo page.

作者：Jayden Koshy Joe、Harshith M R、Swathi Narashiman、Pranay Mathur、Anish Veerakumar、Aniruddh Krishna、Keerthiharan A、Dipanshu Panda

作者单位：

学科分类：计算技术、计算机技术

推荐引用：Jayden Koshy Joe,Harshith M R,Swathi Narashiman,Pranay Mathur,Anish Veerakumar,Aniruddh Krishna,Keerthiharan A,Dipanshu Panda.Fine-Grained control over Music Generation with Activation Steering[EB/OL].(2025-06-11)[2025-06-24].https://arxiv.org/abs/2506.10225.点此复制

Fine-Grained control over Music Generation with Activation Steering

Fine-Grained control over Music Generation with Activation Steering

评论