|国家预印本平台
首页|Improving BERT for Symbolic Music Understanding Using Token Denoising and Pianoroll Prediction

Improving BERT for Symbolic Music Understanding Using Token Denoising and Pianoroll Prediction

Improving BERT for Symbolic Music Understanding Using Token Denoising and Pianoroll Prediction

来源:Arxiv_logoArxiv
英文摘要

We propose a pre-trained BERT-like model for symbolic music understanding that achieves competitive performance across a wide range of downstream tasks. To achieve this target, we design two novel pre-training objectives, namely token correction and pianoroll prediction. First, we sample a portion of note tokens and corrupt them with a limited amount of noise, and then train the model to denoise the corrupted tokens; second, we also train the model to predict bar-level and local pianoroll-derived representations from the corrupted note tokens. We argue that these objectives guide the model to better learn specific musical knowledge such as pitch intervals. For evaluation, we propose a benchmark that incorporates 12 downstream tasks ranging from chord estimation to symbolic genre classification. Results confirm the effectiveness of the proposed pre-training objectives on downstream tasks.

Jun-You Wang、Li Su

计算技术、计算机技术

Jun-You Wang,Li Su.Improving BERT for Symbolic Music Understanding Using Token Denoising and Pianoroll Prediction[EB/OL].(2025-07-07)[2025-07-16].https://arxiv.org/abs/2507.04776.点此复制

评论