Parametric Object Coding in IVAS: Efficient Coding of Multiple Audio Objects at Low Bit Rates
Parametric Object Coding in IVAS: Efficient Coding of Multiple Audio Objects at Low Bit Rates
The recently standardized 3GPP codec for Immersive Voice and Audio Services (IVAS) includes a parametric mode for efficiently coding multiple audio objects at low bit rates. In this mode, parametric side information is obtained from both the object metadata and the input audio objects. The side information comprises directional information, indices of two dominant objects, and the power ratio between these two dominant objects. It is transmitted to the decoder along with a stereo downmix. In IVAS, parametric object coding allows for transmitting three or four arbitrarily placed objects at bit rates of 24.4 or 32 kbit/s and faithfully reconstructing the spatial image of the original audio scene. Subjective listening tests confirm that IVAS provides a comparable immersive experience at lower bit rate and complexity compared to coding the audio objects independently using Enhanced Voice Services (EVS).
Andrea Eichenseer、Srikanth Korse、Guillaume Fuchs、Markus Multrus
通信
Andrea Eichenseer,Srikanth Korse,Guillaume Fuchs,Markus Multrus.Parametric Object Coding in IVAS: Efficient Coding of Multiple Audio Objects at Low Bit Rates[EB/OL].(2025-07-07)[2025-07-19].https://arxiv.org/abs/2507.05409.点此复制
评论