Neural Spectral Band Generation for Audio Coding

Source: arXiv
Abstract

Audio bandwidth extension (BWE) is the task of reconstructing the missing high-frequency components of bandwidth-limited audio signals. Bandwidth limitation is a common issue that arises for several reasons, including channel capacity and data constraints. Conventional spectral band replication (SBR) is a well-established parametric approach to audio bandwidth extension, but it usually entails coarse feature extraction and reconstruction techniques, which leads to limitations when processing various types of audio signals. In parallel, numerous deep neural network (DNN)-based audio bandwidth extension methods have been proposed. These DNN-based methods are usually referred to as blind BWE, because they do not rely on prior information extracted from the original signals and use only the given low-frequency band to estimate the missing high-frequency components. Simply adopting existing DNN-based methods as a replacement for conventional SBR therefore results in suboptimal performance, owing to the blindness of these methods. Our proposed research suggests a new approach to parametric non-blind bandwidth extension, in which DNN-based side information extraction and DNN-based bandwidth extension are performed only at the front and end of the audio coding pipeline.
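
To make the blind versus non-blind distinction in the abstract concrete, here is a minimal sketch of a conventional SBR-style parametric scheme on a toy magnitude spectrum. It is not the DNN-based method proposed in the paper. The function names, the equal-width band layout, and the use of per-band RMS as side information are all assumptions chosen for illustration. The encoder computes a coarse envelope of the high band from the original signal, and the decoder copies the low band upward and rescales it to match that envelope. A blind method would have to work without the `side_info` argument.

```python
import math

def extract_side_info(spectrum, cutoff, n_bands):
    """Encoder side (hypothetical helper): coarse per-band RMS of the high band.

    Assumes the number of bins above `cutoff` is divisible by `n_bands`.
    """
    high = spectrum[cutoff:]
    band = len(high) // n_bands
    return [
        math.sqrt(sum(x * x for x in high[b * band:(b + 1) * band]) / band)
        for b in range(n_bands)
    ]

def reconstruct_high_band(low, side_info, band):
    """Decoder side (hypothetical helper): replicate the low band, then rescale.

    Assumes the low band has at least len(side_info) * band bins.
    """
    n_high = len(side_info) * band
    patch = low[-n_high:]  # copy the upper part of the low band upward
    out = []
    for b, target in enumerate(side_info):
        seg = patch[b * band:(b + 1) * band]
        rms = math.sqrt(sum(x * x for x in seg) / band)
        gain = target / rms if rms > 0 else 0.0  # match the transmitted envelope
        out.extend(x * gain for x in seg)
    return out

# Toy 16-bin magnitude spectrum; bins 8..15 are the "missing" high band.
spectrum = [8, 7, 6, 5, 4, 3, 2, 1, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1]
side = extract_side_info(spectrum, cutoff=8, n_bands=2)
recon = reconstruct_high_band(spectrum[:8], side, band=4)
```

In this sketch the reconstructed high band matches the original only in its coarse per-band energy, not in its fine structure, which is the kind of limitation the abstract attributes to coarse feature extraction and reconstruction.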

Woongjib Choi, Byeong Hyeon Kim, Hyungseob Lim, Inseon Jang, Hong-Goo Kang

Radio equipment, telecommunications equipment; communications

Woongjib Choi, Byeong Hyeon Kim, Hyungseob Lim, Inseon Jang, Hong-Goo Kang. Neural Spectral Band Generation for Audio Coding [EB/OL]. (2025-06-07) [2025-07-19]. https://arxiv.org/abs/2506.06732.
