Improving BERT for Symbolic Music Understanding Using Token Denoising and Pianoroll Prediction
Improving BERT for Symbolic Music Understanding Using Token Denoising and Pianoroll Prediction
We propose a pre-trained BERT-like model for symbolic music understanding that achieves competitive performance across a wide range of downstream tasks. To achieve this target, we design two novel pre-training objectives, namely token correction and pianoroll prediction. First, we sample a portion of note tokens and corrupt them with a limited amount of noise, and then train the model to denoise the corrupted tokens; second, we also train the model to predict bar-level and local pianoroll-derived representations from the corrupted note tokens. We argue that these objectives guide the model to better learn specific musical knowledge such as pitch intervals. For evaluation, we propose a benchmark that incorporates 12 downstream tasks ranging from chord estimation to symbolic genre classification. Results confirm the effectiveness of the proposed pre-training objectives on downstream tasks.
Jun-You Wang、Li Su
计算技术、计算机技术
Jun-You Wang,Li Su.Improving BERT for Symbolic Music Understanding Using Token Denoising and Pianoroll Prediction[EB/OL].(2025-07-07)[2025-07-16].https://arxiv.org/abs/2507.04776.点此复制
评论