论文标题
可变形的卷积LSTM用于人体情感识别
Deformable Convolutional LSTM for Human Body Emotion Recognition
论文作者
论文摘要
人们以多种方式代表自己的情绪。最重要的是全身表达式,它们在不同领域(例如人类计算机相互作用(HCI))中具有许多应用。人类情感识别中最重要的挑战之一是,人们使用自己的脸和身体以各种方式表达相同的感觉。最近,许多方法试图使用深层神经网络(DNN)克服这些挑战。但是,这些方法中的大多数仅基于图像或面部表达式,并且不考虑在图像中可能发生的变形,例如缩放和旋转,可能会对识别精度产生不利影响。在这项工作中,是在最新的可变形卷积研究的推动下,我们将可变形的行为纳入了卷积长的短期记忆(Convlstm)的核心,以提高图像中这些变形的鲁棒性,从而提高其对视频的精确度。我们在GEMEP数据集上进行了实验,并在验证集对整个人体情感识别的任务上实现了98.8%的最新精度。
People represent their emotions in a myriad of ways. Among the most important ones is whole body expressions which have many applications in different fields such as human-computer interaction (HCI). One of the most important challenges in human emotion recognition is that people express the same feeling in various ways using their face and their body. Recently many methods have tried to overcome these challenges using Deep Neural Networks (DNNs). However, most of these methods were based on images or on facial expressions only and did not consider deformation that may happen in the images such as scaling and rotation which can adversely affect the recognition accuracy. In this work, motivated by recent researches on deformable convolutions, we incorporate the deformable behavior into the core of convolutional long short-term memory (ConvLSTM) to improve robustness to these deformations in the image and, consequently, improve its accuracy on the emotion recognition task from videos of arbitrary length. We did experiments on the GEMEP dataset and achieved state-of-the-art accuracy of 98.8% on the task of whole human body emotion recognition on the validation set.