Parametric Object Coding in IVAS: Efficient Coding of Multiple Audio Objects at Low Bit Rates

Source: arXiv
Abstract

The recently standardized 3GPP codec for Immersive Voice and Audio Services (IVAS) includes a parametric mode for efficiently coding multiple audio objects at low bit rates. In this mode, parametric side information is obtained from both the object metadata and the input audio objects. The side information comprises directional information, indices of two dominant objects, and the power ratio between these two dominant objects. It is transmitted to the decoder along with a stereo downmix. In IVAS, parametric object coding allows for transmitting three or four arbitrarily placed objects at bit rates of 24.4 or 32 kbit/s and faithfully reconstructing the spatial image of the original audio scene. Subjective listening tests confirm that IVAS provides a comparable immersive experience at a lower bit rate and lower complexity than coding the audio objects independently using Enhanced Voice Services (EVS).

Andrea Eichenseer, Srikanth Korse, Guillaume Fuchs, Markus Multrus

Communications

Andrea Eichenseer, Srikanth Korse, Guillaume Fuchs, Markus Multrus. Parametric Object Coding in IVAS: Efficient Coding of Multiple Audio Objects at Low Bit Rates [EB/OL]. (2025-07-07) [2025-07-19]. https://arxiv.org/abs/2507.05409.
