|国家预印本平台
首页|TuneComp: Joint Fine-tuning and Compression for Large Foundation Models

TuneComp: Joint Fine-tuning and Compression for Large Foundation Models

TuneComp: Joint Fine-tuning and Compression for Large Foundation Models

来源:Arxiv_logoArxiv
英文摘要

To reduce model size during post-training, compression methods, including knowledge distillation, low-rank approximation, and pruning, are often applied after fine-tuning the model. However, sequential fine-tuning and compression sacrifices performance, while creating a larger than necessary model as an intermediate step. In this work, we aim to reduce this gap, by directly constructing a smaller model while guided by the downstream task. We propose to jointly fine-tune and compress the model by gradually distilling it to a pruned low-rank structure. Experiments demonstrate that joint fine-tuning and compression significantly outperforms other sequential compression methods.

Xiangyu Chen、Jing Liu、Ye Wang、Matthew Brand、Pu、Wang、Toshiaki Koike-Akino

PerryPerryPerryPerryPerry

计算技术、计算机技术

Xiangyu Chen,Jing Liu,Ye Wang,Matthew Brand,Pu,Wang,Toshiaki Koike-Akino.TuneComp: Joint Fine-tuning and Compression for Large Foundation Models[EB/OL].(2025-05-27)[2025-06-09].https://arxiv.org/abs/2505.21835.点此复制

评论