论文标题
使用光谱熵,K-均值聚类和连续小波变换的软输出信号检测用于鲸类发声
Soft-Output Signal Detection for Cetacean Vocalizations Using Spectral Entropy, K-Means Clustering and the Continuous Wavelet Transform
论文作者
论文摘要
水下声学监测系统记录了许多小时的海洋研究音频数据,从而使快速,可靠的非因果信号检测至高无上。此类探测器有助于减少信号注释所需的劳动量,这些人通常包含没有信号的大部分。 研究基于光谱熵的鲸类发声检测是一种发声发现的手段。使用光谱熵(SE)的先前技术主要考虑熵度量的时频增强,并利用STFT作为其时间频率(TF)分解。 SE方法还要求用户手动设置检测阈值,该阈值需要了解生产的熵措施。 本文将中位过滤视为一种简单,有效的方法,可以为熵测量提供时间稳定,并将CWT视为替代性TF分解。 K-均值聚类用于确定准确分离信号/无信号熵测量所需的阈值,从而导致一维的两类分类问题。该类均值用于执行伪造的软级分配,这是算法开发中的有用指标。研究了中值滤波,信噪比和所选TF分解的影响。 所提出的方法显示出检测准确性和特异性的显着提高,同时还通过软类分配提供了更容易解释的检测阈值设置。
Underwater acoustic monitoring systems record many hours of audio data for marine research, making fast and reliable non-causal signal detection paramount. Such detectors assist in reducing the amount of labor required for signal annotations, which often contain large portions devoid of signals. Cetacean vocalization detection based on spectral entropy is investigated as a means of vocalization discovery. Previous techniques using spectral entropy (SE) mostly consider time-frequency enhancement of the entropy measure, and utilize the STFT as its time-frequency (TF) decomposition. SE methods also requires the user to set a detection threshold manually, which call for knowledge of the produced entropy measures. This paper considers median filtering as a simple, effective way to provide temporal stabilization to the entropy measure, and considers the CWT as an alternative TF decomposition. K-means clustering is used to determine the threshold required to accurately separate the signal/no-signal entropy measures, resulting in a one-dimensional, two-class classification problem. The class means are used to perform pseudo-probabilistic soft class assignment, which is a useful metric in algorithmic development. The effect of median filtering, signal-to-noise ratio and the chosen TF decomposition are investigated. The proposed method shows a significant improvement in detection accuracy and specificity, while also providing a more interpretable detection threshold setting via soft class assignment.