Paper Title

FC2RN: A Fully Convolutional Corner Refinement Network for Accurate Multi-Oriented Scene Text Detection

Authors

Xugong Qin, Yu Zhou, Dayan Wu, Yinliang Yue, Weiping Wang

Abstract

Recent scene text detection work mainly focuses on curved text detection. However, in real applications, curved text is scarcer than multi-oriented text. Accurate detection of multi-oriented text with large variations in scale, orientation, and aspect ratio is of great significance. Among multi-oriented detection methods, direct regression of the geometry of scene text offers a simple yet powerful pipeline and is popular in both academic and industrial communities, but it may produce imperfect detections, especially for long text, due to the limited receptive field. In this work, we aim to improve on this while keeping the pipeline simple. A fully convolutional corner refinement network (FC2RN) is proposed for accurate multi-oriented text detection, in which an initial corner prediction and a refined corner prediction are obtained in one pass. With a novel quadrilateral RoI convolution operation tailored for multi-oriented scene text, the initial quadrilateral prediction is encoded into the feature maps, which can then be used to predict the offset between the initial prediction and the ground truth as well as to output a refined confidence score. Experimental results on four public datasets, MSRA-TD500, ICDAR2017-RCTW, ICDAR2015, and COCO-Text, demonstrate that FC2RN can outperform state-of-the-art methods. The ablation study shows the effectiveness of corner refinement and scoring for accurate text localization.
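
The refinement step described in the abstract, in which a predicted offset is applied to an initial quadrilateral, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `refine_quad`, the corner ordering, and the use of raw pixel offsets (rather than some normalized encoding) are all assumptions made for illustration, and the sample coordinates are invented.

```python
from typing import List, Tuple

Point = Tuple[float, float]
Quad = List[Point]  # four corners; clockwise from top-left is an assumption


def refine_quad(initial: Quad, offsets: Quad) -> Quad:
    """Add a per-corner (dx, dy) offset to each corner of the initial quad.

    Hypothetical helper: the abstract only states that offsets between the
    initial prediction and the ground truth are predicted, not how they
    are encoded.
    """
    if len(initial) != 4 or len(offsets) != 4:
        raise ValueError("a quadrilateral needs exactly four corners")
    return [(x + dx, y + dy) for (x, y), (dx, dy) in zip(initial, offsets)]


# Invented example: an initial box that cuts a long text line short on the
# right, and offsets that stretch the two right-hand corners outward.
initial = [(10.0, 20.0), (110.0, 22.0), (108.0, 52.0), (8.0, 50.0)]
offsets = [(-2.0, 1.0), (15.0, 0.0), (16.0, -1.0), (-1.0, 2.0)]
refined = refine_quad(initial, offsets)
```

In the network described above, the offsets and the refined confidence score would come from features gathered by the quadrilateral RoI convolution; here they are supplied by hand.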
