BINAQUAL: A Full-Reference Objective Localization Similarity Metric for Binaural Audio
Spatial audio enhances immersion in applications such as virtual reality, augmented reality, gaming, and cinema by creating a three-dimensional auditory experience. Ensuring the spatial fidelity of binaural audio is crucial, because processes such as compression, encoding, and transmission can alter localization cues. While subjective listening tests such as MUSHRA remain the gold standard for evaluating spatial localization quality, they are costly and time-consuming. This paper introduces BINAQUAL, a full-reference objective metric designed to assess localization similarity in binaural audio recordings. BINAQUAL adapts the AMBIQUAL metric, originally developed for localization quality assessment in the Ambisonics audio format, to the binaural domain. We evaluate BINAQUAL across five key research questions, examining its sensitivity to variations in sound source locations, angle interpolations, surround speaker layouts, audio degradations, and content diversity. Results demonstrate that BINAQUAL effectively differentiates between subtle spatial variations and correlates strongly with subjective listening tests, making it a reliable metric for binaural localization quality assessment. The proposed metric provides a robust benchmark for ensuring spatial accuracy in binaural audio processing, paving the way for improved objective evaluations in immersive audio applications.
Davoud Shariat Panah, Dan Barry, Alessandro Ragano, Jan Skoglund, Andrew Hines
Electronic Technology Applications
Davoud Shariat Panah, Dan Barry, Alessandro Ragano, Jan Skoglund, Andrew Hines. BINAQUAL: A Full-Reference Objective Localization Similarity Metric for Binaural Audio [EB/OL]. (2025-05-17) [2025-06-13]. https://arxiv.org/abs/2505.11915.