论文标题

具有感知验证的虚拟应用程序的声场翻译和混合源模型

Sound Field Translation and Mixed Source Model for Virtual Applications with Perceptual Validation

论文作者

Birnie, Lachlan, Abhayapala, Thushara, Tourbabin, Vladimir, Samarasinghe, Prasanga

论文摘要

诸如电影院电影之类的非交互和线性体验提供了高质量的环绕声音频以增强沉浸感,但是听众的体验通常固定在单个声学视角上。随着虚拟现实的兴起,需要以一种允许用户在复制范围内进行互动和移动的方式记录和重现现实世界的体验。传统的声场翻译技术采用录制并将其扩展到相同的虚拟来源环境中。但是,商业高阶麦克风的有限抽样在虚拟繁殖中产生了声学甜点。结果,该技术仍然限制了听众的通航区域。在本文中,我们提出了一种在声学繁殖中进行听众翻译的方法,该方法在稀疏扩展的虚拟环境中融合了近场和远场来源的混合物。我们通过使用隐藏的参考和锚(Mushra)实验的多个刺激来感知该方法。与PlaneWave基准相比,所提出的方法既可以提高源的本地化性和鲁棒性,使得在翻译位置处的光谱扭曲。与数值模拟的盘问表明,稀疏膨胀会放松固有的甜点约束,从而改善了稀疏环境的可靠性。此外,所提出的方法可以更好地再现近场环境的强度和双耳脉冲响应光谱,从而进一步支持了强烈的感知结果。

Non-interactive and linear experiences like cinema film offer high quality surround sound audio to enhance immersion, however the listener's experience is usually fixed to a single acoustic perspective. With the rise of virtual reality, there is a demand for recording and recreating real-world experiences in a way that allows for the user to interact and move within the reproduction. Conventional sound field translation techniques take a recording and expand it into an equivalent environment of virtual sources. However, the finite sampling of a commercial higher order microphone produces an acoustic sweet-spot in the virtual reproduction. As a result, the technique remains to restrict the listener's navigable region. In this paper, we propose a method for listener translation in an acoustic reproduction that incorporates a mixture of near-field and far-field sources in a sparsely expanded virtual environment. We perceptually validate the method through a Multiple Stimulus with Hidden Reference and Anchor (MUSHRA) experiment. Compared to the planewave benchmark, the proposed method offers both improved source localizability and robustness to spectral distortions at translated positions. A cross-examination with numerical simulations demonstrated that the sparse expansion relaxes the inherent sweet-spot constraint, leading to the improved localizability for sparse environments. Additionally, the proposed method is seen to better reproduce the intensity and binaural room impulse response spectra of near-field environments, further supporting the strong perceptual results.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源