|国家预印本平台
首页|基于全景视频内容的空间音频生成方法

基于全景视频内容的空间音频生成方法

Spatial Audio Generation Method Based On Panoramic Video Content

中文摘要英文摘要

随着虚拟现实技术的发展,360度全景视频作为一种新的视频形式,其可全角度环绕式观赏的特点为用户带来了沉浸式的体验。为了在全景视频基础上获得更强的沉浸感,业界一般会为视频添加与之匹配的空间音频,而如何生成逼真的空间音频融入视频的场景中仍然是一个难点。本文首先将生成空间音频所需要素分解成声音对象、房间混响、环境音效三部分,基于视频内容,分别通过多目标检测与跟踪、房间参数回归、场景分类模块实现这些要素,并建立环境音效数据库来辅助这一过程。利用本文提出的算法实现了空间音频生成管线,在此基础上对算法设计并进行了用户研究,通过分析统计数据了解了本算法具有的优势和算法中各组件的相对重要性。

With the development of virtual reality technology, 360-degree panoramic video, as a new form of video, provides a more immersive experience to users due to its feature of full-angle surround view. Generally, in order to obtain a deeper sense of immersion on the basis of panoramic video, typically a matching spatial audio is added to the video. However, it is still a technical difficulty to generate realistic spatial audio to integrate into the video scenes. In this paper, firstly we decompose the elements required to generate spatial audio into three main parts: sound objects, room reverberation, and ambient sound. Based on the video content, these elements are produced through multi-target detection and tracking, room parameter regression, and scene classification modules, respectively. And an ambient sounds database is established to facilitate this process. The spatial audio generation system is established using the method proposed in this paper. On this basis we designed and conducted a user study to verify the effectiveness of this algorithm. Through analyzing statistical data, we verified the effectiveness of this algorithm, and we studied the relative importance of each element in the algorithm.

黄琪东、廖建新

声学工程电子技术应用电子对抗

全景视频空间音频多目标跟踪场景检测与分类。

Panoramic VideoSpatial AudioMulti-object TrackingScene Detection And Classfication

黄琪东,廖建新.基于全景视频内容的空间音频生成方法[EB/OL].(2021-03-10)[2025-08-04].http://www.paper.edu.cn/releasepaper/content/202103-101.点此复制

评论