论文标题
Emo-CNN可感知音频信号的压力:脑化学方法
Emo-CNN for Perceiving Stress from Audio Signals: A Brain Chemistry Approach
论文作者
论文摘要
情感在医疗保健等许多应用中起着关键作用,以收集患者的情感行为。由于它们在理解人类的感觉方面的有效性,因此有些情绪更为重要。在本文中,我们提出了一种模拟音频信号人体压力的方法。语音情感检测中的研究挑战是定义压力的含义,并能够精确地对其进行分类。监督的机器学习模型,包括最先进的深度学习分类方法,依赖于清洁和标记的数据的可用性。情感计算和情绪检测的问题之一是压力的注释数据有限。现有标记的压力情绪数据集对注释者的感知高度主观。 我们通过利用在卷积神经网络中使用传统的MFCC功能来解决第一期特征选择。我们的实验表明,Emo-CNN始终如一,并且显着优于多个数据集的流行现有方法。它在EMO-DB数据集上实现了90.2%的分类精度。为了解决应力标签中的第二个和更重要的主观性问题,我们使用Lovheim的立方体,这是情绪的三维投影。该立方体旨在解释这些神经递质与3D空间中情绪的位置之间的关系。来自EMO-CNN的学习情绪表示形式使用三个组件PCA(主要成分分析)映射到立方体,然后将其用于模拟人类压力。这种提出的方法不仅规避了对压力数据的需求,而且还符合Lovheim Cube给出的情感心理理论。我们认为,这项工作是建立人工智能与人类情感化学之间建立联系的第一步。
Emotion plays a key role in many applications like healthcare, to gather patients emotional behavior. There are certain emotions which are given more importance due to their effectiveness in understanding human feelings. In this paper, we propose an approach that models human stress from audio signals. The research challenge in speech emotion detection is defining the very meaning of stress and being able to categorize it in a precise manner. Supervised Machine Learning models, including state of the art Deep Learning classification methods, rely on the availability of clean and labelled data. One of the problems in affective computation and emotion detection is the limited amount of annotated data of stress. The existing labelled stress emotion datasets are highly subjective to the perception of the annotator. We address the first issue of feature selection by exploiting the use of traditional MFCC features in Convolutional Neural Network. Our experiments show that Emo-CNN consistently and significantly outperforms the popular existing methods over multiple datasets. It achieves 90.2% categorical accuracy on the Emo-DB dataset. To tackle the second and the more significant problem of subjectivity in stress labels, we use Lovheim's cube, which is a 3-dimensional projection of emotions. The cube aims at explaining the relationship between these neurotransmitters and the positions of emotions in 3D space. The learnt emotion representations from the Emo-CNN are mapped to the cube using three component PCA (Principal Component Analysis) which is then used to model human stress. This proposed approach not only circumvents the need for labelled stress data but also complies with the psychological theory of emotions given by Lovheim's cube. We believe that this work is the first step towards creating a connection between Artificial Intelligence and the chemistry of human emotions.