论文标题
自然主义现场声学环境对法医独立的扬声器验证系统的影响
Impact of Naturalistic Field Acoustic Environments on Forensic Text-independent Speaker Verification System
论文作者
论文摘要
法医扬声器验证的音频分析在系统性能中提供了独特的挑战,部分原因是在自然主义现场的声学环境中收集的数据,在法医数据收集过程中,位置/场景不确定性很常见。法医语音数据作为潜在的证据可以在随机自然主义环境中获得,从而导致数据质量可变。语音样本可能包括由于声音努力而引起的可变性,例如大喊911个紧急电话,而其他人可能会在野外地点或面试室中窃窃私语或情境强调的声音。这种语音的可变性由内在和外在特征组成,并使法医验证成为复杂而艰巨的任务。外部属性包括记录设备,例如麦克风类型和放置,环境噪声,包括混响在内的房间配置以及其他基于环境的问题。某些因素,例如噪声和非目标语音,将通过仅出现就会影响验证系统的性能。为了调查现场声学环境的影响,我们根据CRSS-Ferensic语料库进行了一项演讲者验证研究,并从8个现场位置收集的音频(包括警察采访)。这项调查包括对使用X-Vector系统的七个看不见的声学环境对说话者验证系统性能的影响的分析。
Audio analysis for forensic speaker verification offers unique challenges in system performance due in part to data collected in naturalistic field acoustic environments where location/scenario uncertainty is common in the forensic data collection process. Forensic speech data as potential evidence can be obtained in random naturalistic environments resulting in variable data quality. Speech samples may include variability due to vocal efforts such as yelling over 911 emergency calls, whereas others might be whisper or situational stressed voice in a field location or interview room. Such speech variability consists of intrinsic and extrinsic characteristics and makes forensic speaker verification a complicated and daunting task. Extrinsic properties include recording equipment such as microphone type and placement, ambient noise, room configuration including reverberation, and other environmental scenario-based issues. Some factors, such as noise and non-target speech, will impact the verification system performance by their mere presence. To investigate the impact of field acoustic environments, we performed a speaker verification study based on the CRSS-Forensic corpus with audio collected from 8 field locations including police interviews. This investigation includes an analysis of the impact of seven unseen acoustic environments on speaker verification system performance using an x-Vector system.