|国家预印本平台
首页|Score-Informed BiLSTM Correction for Refining MIDI Velocity in Automatic Piano Transcription

Score-Informed BiLSTM Correction for Refining MIDI Velocity in Automatic Piano Transcription

Score-Informed BiLSTM Correction for Refining MIDI Velocity in Automatic Piano Transcription

来源:Arxiv_logoArxiv
英文摘要

MIDI is a modern standard for storing music, recording how musical notes are played. Many piano performances have corresponding MIDI scores available online. Some of these are created by the original performer, recording on an electric piano alongside the audio, while others are through manual transcription. In recent years, automatic music transcription (AMT) has rapidly advanced, enabling machines to transcribe MIDI from audio. However, these transcriptions often require further correction. Assuming a perfect timing correction, we focus on the loudness correction in terms of MIDI velocity (a parameter in MIDI for loudness control). This task can be approached through score-informed MIDI velocity estimation, which has undergone several developments. While previous approaches introduced specifically built models to re-estimate MIDI velocity, thereby replacing AMT estimates, we propose a BiLSTM correction module to refine AMT-estimated velocity. Although we did not reach state-of-the-art performance, we validated our method on the well-known AMT system, the high-resolution piano transcription (HPT), and achieved significant improvements.

Zhanhong He、Huang、Roberto Togneri、Defeng

计算技术、计算机技术

Zhanhong He,Huang,Roberto Togneri,Defeng.Score-Informed BiLSTM Correction for Refining MIDI Velocity in Automatic Piano Transcription[EB/OL].(2025-08-11)[2025-08-24].https://arxiv.org/abs/2508.07757.点此复制

评论