|国家预印本平台
首页|Binamix -- A Python Library for Generating Binaural Audio Datasets

Binamix -- A Python Library for Generating Binaural Audio Datasets

Binamix -- A Python Library for Generating Binaural Audio Datasets

来源:Arxiv_logoArxiv
英文摘要

The increasing demand for spatial audio in applications such as virtual reality, immersive media, and spatial audio research necessitates robust solutions to generate binaural audio data sets for use in testing and validation. Binamix is an open-source Python library designed to facilitate programmatic binaural mixing using the extensive SADIE II Database, which provides Head Related Impulse Response (HRIR) and Binaural Room Impulse Response (BRIR) data for 20 subjects. The Binamix library provides a flexible and repeatable framework for creating large-scale spatial audio datasets, making it an invaluable resource for codec evaluation, audio quality metric development, and machine learning model training. A range of pre-built example scripts, utility functions, and visualization plots further streamline the process of custom pipeline creation. This paper presents an overview of the library's capabilities, including binaural rendering, impulse response interpolation, and multi-track mixing for various speaker layouts. The tools utilize a modified Delaunay triangulation technique to achieve accurate HRIR/BRIR interpolation where desired angles are not present in the data. By supporting a wide range of parameters such as azimuth, elevation, subject Impulse Responses (IRs), speaker layouts, mixing controls, and more, the library enables researchers to create large binaural datasets for any downstream purpose. Binamix empowers researchers and developers to advance spatial audio applications with reproducible methodologies by offering an open-source solution for binaural rendering and dataset generation. We release the library under the Apache 2.0 License at https://github.com/QxLabIreland/Binamix/

Dan Barry、Alessandro Ragano、Davoud Shariat Panah、Jan Skoglund、Andrew Hines

计算技术、计算机技术

Dan Barry,Alessandro Ragano,Davoud Shariat Panah,Jan Skoglund,Andrew Hines.Binamix -- A Python Library for Generating Binaural Audio Datasets[EB/OL].(2025-05-02)[2025-06-15].https://arxiv.org/abs/2505.01369.点此复制

评论