基于暹罗的神经网络，用于离线作者在单词级别数据上的识别

论文标题

基于暹罗的神经网络，用于离线作者在单词级别数据上的识别

Siamese based Neural Network for Offline Writer Identification on word level data

论文作者

Kumar, Vineet, Sundaram, Suresh

论文摘要

手写识别是文档理解和分析的理想属性之一。它与文档的写作风格和特征有关，以区分作者。文本图像的多样性，特别是在带有不同笔迹的图像中，在很少的数据可用的情况下，学习良好特征的过程很难。在本文中，我们提出了一个新的方案，以根据输入单词图像来识别文档的作者。我们的方法是独立文本的，并且对正在检查的输入图像的大小没有任何限制。首先，我们使用比例不变特征变换（SIFT）检测笔迹中的关键组件，并提取周围的区域。这些补丁旨在捕获单个写作功能（包括字符的分类，字符或字符组合），这些特征对于单个作家而言可能是唯一的。然后，这些特征通过深层卷积神经网络（CNN），通过使用暹罗网络应用相似性学习的概念来学习权重。暹罗网络通过绘制不同输入图像对之间的相似性来增强CNN的歧视能力。使用稀疏PCA对提取的SIFT密钥点的不同尺度学到的特征进行编码，分配了稀疏PCA的每个组件，分配了显着性评分，这在有效区分不同作家时表示其显着性水平。最后，将对应于每个筛分密钥点的加权稀疏PCA组合在一起，以得出每个作者的最终分类分数。与其他基于深度学习的算法相比，在两个公开数据库（即IAM和CVL）上评估了所提出的算法。

Handwriting recognition is one of the desirable attributes of document comprehension and analysis. It is concerned with the documents writing style and characteristics that distinguish the authors. The diversity of text images, notably in images with varying handwriting, makes the process of learning good features difficult in cases where little data is available. In this paper, we propose a novel scheme to identify the author of a document based on the input word image. Our method is text independent and does not impose any constraint on the size of the input image under examination. To begin with, we detect crucial components in handwriting and extract regions surrounding them using Scale Invariant Feature Transform (SIFT). These patches are designed to capture individual writing features (including allographs, characters, or combinations of characters) that are likely to be unique for an individual writer. These features are then passed through a deep Convolutional Neural Network (CNN) in which the weights are learned by applying the concept of Similarity learning using Siamese network. Siamese network enhances the discrimination power of CNN by mapping similarity between different pairs of input image. Features learned at different scales of the extracted SIFT key-points are encoded using Sparse PCA, each components of the Sparse PCA is assigned a saliency score signifying its level of significance in discriminating different writers effectively. Finally, the weighted Sparse PCA corresponding to each SIFT key-points is combined to arrive at a final classification score for each writer. The proposed algorithm was evaluated on two publicly available databases (namely IAM and CVL) and is able to achieve promising result, when compared with other deep learning based algorithm.

下载PDF全文

下载文献需遵守相关版权规定

论文标题