摄像机模型识别和图像操纵检测的L2约束remnet

论文标题

摄像机模型识别和图像操纵检测的L2约束remnet

L2-Constrained RemNet for Camera Model Identification and Image Manipulation Detection

论文作者

Rafi, Abdul Muntakim, Wu, Jonathan, Hasan, Md. Kamrul

论文摘要

源摄像头模型识别（CMI）和图像操纵检测在图像取证中至关重要。在本文中，我们提出了一个由L2约束的残留卷积神经网络（L2限制的Remnet）来执行这两个关键任务。所提出的网络体系结构由动态预处理器块和一个分类块组成。 L2损失应用于预处理器块的输出，并根据分类块的输出来计算分类的Crossentropy损失。通过最小化总损失，以端到端的方式训练了整个网络，这是L2损失和分类Crossentropy损失的组合。在L2损失的帮助下，数据自适应的预处理器学会了抑制不必要的图像内容，并有助于分类块提取强大的图像取证功能。我们在德累斯顿数据库上训练和测试网络，并达到98.15％的总体准确性，其中所有测试图像均来自培训期间未使用的设备和场景来复制实际应用。即使操纵图像，该网络也优于其他最先进的CNN。此外，我们在图像操纵检测中达到了99.68％的总体精度，这意味着它可以用作图像取证任务的通用网络。

Source camera model identification (CMI) and image manipulation detection are of paramount importance in image forensics. In this paper, we propose an L2-constrained Remnant Convolutional Neural Network (L2-constrained RemNet) for performing these two crucial tasks. The proposed network architecture consists of a dynamic preprocessor block and a classification block. An L2 loss is applied to the output of the preprocessor block, and categorical crossentropy loss is calculated based on the output of the classification block. The whole network is trained in an end-to-end manner by minimizing the total loss, which is a combination of the L2 loss and the categorical crossentropy loss. Aided by the L2 loss, the data-adaptive preprocessor learns to suppress the unnecessary image contents and assists the classification block in extracting robust image forensics features. We train and test the network on the Dresden database and achieve an overall accuracy of 98.15%, where all the test images are from devices and scenes not used during training to replicate practical applications. The network also outperforms other state-of-the-art CNNs even when the images are manipulated. Furthermore, we attain an overall accuracy of 99.68% in image manipulation detection, which implies that it can be used as a general-purpose network for image forensic tasks.

下载PDF全文

下载文献需遵守相关版权规定

论文标题