论文标题

孟加拉二十年的手写数字识别:一项调查

Two Decades of Bengali Handwritten Digit Recognition: A Survey

论文作者

Rahman, A. B. M. Ashikur, Hasan, Md. Bakhtiar, Ahmed, Sabbir, Ahmed, Tasnim, Ashmafee, Md. Hamjajul, Kabir, Mohammad Ridwan, Kabir, Md. Hasanul

论文摘要

手写数字识别(HDR)是光学特征识别(OCR)领域中最具挑战性的任务之一。不管语言如何,HDR都存在一些固有的挑战,这主要是由于个人编写样式的差异,编写媒介和环境,无法在反复编写任何数字等时保持相同的笔触,除此之外,特定语言数字的结构性复杂性可能会导致HDR歧义情况。多年来,研究人员开发了许多离线和在线HDR管道,其中不同的图像处理技术与传统的机器学习(ML)基于基于的机器学习和/或基于深度学习(DL)的体系结构相结合。尽管文献中存在有关HDR的广泛审查研究的证据,例如英语,阿拉伯语,印度,波尔西,中文等,但几乎没有对孟加拉人HDR(BHDR)的调查,这缺乏对挑战,基本识别过程以及可能的未来方向的全面分析。在本文中,已经分析了孟加拉语手写数字的特征和固有的歧义,以及二十年来最先进的数据集的全面见解和脱机BHDR的方法。此外,还详细讨论了一些涉及BHDR的现实应用特定研究。本文还将成为对离线BHDR背后科学感兴趣的研究人员的汇编,促进了对相关研究的新途径的探索,这可能会进一步导致在不同应用领域对孟加拉语手写数字的更好地识别。

Handwritten Digit Recognition (HDR) is one of the most challenging tasks in the domain of Optical Character Recognition (OCR). Irrespective of language, there are some inherent challenges of HDR, which mostly arise due to the variations in writing styles across individuals, writing medium and environment, inability to maintain the same strokes while writing any digit repeatedly, etc. In addition to that, the structural complexities of the digits of a particular language may lead to ambiguous scenarios of HDR. Over the years, researchers have developed numerous offline and online HDR pipelines, where different image processing techniques are combined with traditional Machine Learning (ML)-based and/or Deep Learning (DL)-based architectures. Although evidence of extensive review studies on HDR exists in the literature for languages, such as English, Arabic, Indian, Farsi, Chinese, etc., few surveys on Bengali HDR (BHDR) can be found, which lack a comprehensive analysis of the challenges, the underlying recognition process, and possible future directions. In this paper, the characteristics and inherent ambiguities of Bengali handwritten digits along with a comprehensive insight of two decades of state-of-the-art datasets and approaches towards offline BHDR have been analyzed. Furthermore, several real-life application-specific studies, which involve BHDR, have also been discussed in detail. This paper will also serve as a compendium for researchers interested in the science behind offline BHDR, instigating the exploration of newer avenues of relevant research that may further lead to better offline recognition of Bengali handwritten digits in different application areas.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源