EAT-RADAR：使用FMCW雷达和3D临时卷积网络连续的细粒进气手势检测

论文标题

EAT-RADAR：使用FMCW雷达和3D临时卷积网络连续的细粒进气手势检测

Eat-Radar: Continuous Fine-Grained Intake Gesture Detection Using FMCW Radar and 3D Temporal Convolutional Network with Attention

论文作者

Wang, Chunzhuo, Kumar, T. Sunil, De Raedt, Walter, Camps, Guido, Hallez, Hans, Vanrumste, Bart

论文摘要

不健康的饮食习惯被认为是各种慢性疾病的主要原因，包括肥胖和糖尿病。自动食品摄入监测系统有可能通过饮食评估来改善与饮食相关疾病的人的生活质量（QOL）。在这项工作中，我们提出了一种新型的基于非接触式雷达的方法进行食物摄入监测。具体而言，使用频率调制连续波（FMCW）雷达传感器来识别细粒的饮食和饮用手势。细粒度的饮食手势包含一系列的动作，从抬起手到嘴，直到将手从嘴里放开。通过处理Range-Doppler Cube（RD Cube），开发了具有自我注意力（3D-TCN-ATT）的3D时间卷积网络（3D-TCN-ATT）。与以前的基于雷达的研究不同，这项工作在连续的饮食课程中收集数据（更现实的场景）。我们创建了一个公共数据集，其中包括70个餐点（4,132次进餐手势和893个饮酒手势），总持续时间为1155分钟。该数据集包括四种饮食样式（叉子和刀，筷子，勺子，手）。为了验证所提出的方法的性能，应用了七倍的交叉验证方法。 3D-TCN-ATT模型的饮食手势分别达到了0.896和0.868的分段F1分数。拟议方法的结果表明，在饮食课程中使用雷达将雷达用于细粒度的饮食手势检测和分割。

Unhealthy dietary habits are considered as the primary cause of various chronic diseases, including obesity and diabetes. The automatic food intake monitoring system has the potential to improve the quality of life (QoL) of people with diet-related diseases through dietary assessment. In this work, we propose a novel contactless radar-based approach for food intake monitoring. Specifically, a Frequency Modulated Continuous Wave (FMCW) radar sensor is employed to recognize fine-grained eating and drinking gestures. The fine-grained eating/drinking gesture contains a series of movements from raising the hand to the mouth until putting away the hand from the mouth. A 3D temporal convolutional network with self-attention (3D-TCN-Att) is developed to detect and segment eating and drinking gestures in meal sessions by processing the Range-Doppler Cube (RD Cube). Unlike previous radar-based research, this work collects data in continuous meal sessions (more realistic scenarios). We create a public dataset comprising 70 meal sessions (4,132 eating gestures and 893 drinking gestures) from 70 participants with a total duration of 1,155 minutes. Four eating styles (fork & knife, chopsticks, spoon, hand) are included in this dataset. To validate the performance of the proposed approach, seven-fold cross-validation method is applied. The 3D-TCN-Att model achieves a segmental F1-score of 0.896 and 0.868 for eating and drinking gestures, respectively. The results of the proposed approach indicate the feasibility of using radar for fine-grained eating and drinking gesture detection and segmentation in meal sessions.

下载PDF全文

下载文献需遵守相关版权规定

论文标题