TextDragon: An End-to-End Framework for Arbitrary Shaped Text Spotting (Paper)

2019, cited by 234

Handwritten Text Recognition Techniques; Vehicle License Plate Recognition; Advanced Image and Video Retrieval Techniques

Abstract

Most existing text spotting methods either focus on horizontal or oriented text, or perform arbitrary-shaped text spotting with character-level annotations. In this paper, we propose a novel text spotting framework that detects and recognizes text of arbitrary shapes in an end-to-end manner, using only word-level or line-level annotations for training. Inspired by the name of TextSnake, which is only a detection model, we call the proposed text spotting framework TextDragon. In TextDragon, a text detector describes the shape of text with a series of quadrangles, which can handle text of arbitrary shapes. To extract arbitrary text regions from feature maps, we propose a new differentiable operator named RoISlide, which is the key to connecting arbitrary-shaped text detection and recognition. Based on the features extracted through RoISlide, a CNN- and CTC-based text recognizer is introduced, which frees the framework from labeling the location of characters. The proposed method achieves state-of-the-art performance on two curved text benchmarks, CTW1500 and Total-Text, and competitive results on the ICDAR 2015 dataset.
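The abstract notes that a CTC-based recognizer removes the need for character location labels. As a rough illustration of why, the sketch below shows generic greedy CTC decoding: per-frame label predictions are collapsed by merging repeats and dropping blanks, so no frame-to-character alignment is ever annotated. This is not the authors' implementation; the function name `ctc_greedy_decode`, the alphabet, and the blank index are assumptions made for the example.

```python
from itertools import groupby

BLANK = 0  # assumed index of the CTC blank symbol (hypothetical choice)


def ctc_greedy_decode(frame_labels, alphabet, blank=BLANK):
    """Greedy CTC decoding: merge consecutive repeats, then drop blanks."""
    # groupby yields one key per run of identical consecutive labels
    collapsed = [label for label, _ in groupby(frame_labels)]
    return "".join(alphabet[label] for label in collapsed if label != blank)


# Hypothetical alphabet: index 0 is the blank, the rest are characters.
ALPHABET = ["-", "a", "d", "g", "n", "o", "r"]

# Per-frame argmax labels along a text line (made-up values for illustration)
frames = [2, 2, 0, 6, 1, 1, 0, 3, 5, 5, 4]
print(ctc_greedy_decode(frames, ALPHABET))  # prints "dragon"
```

A blank between two identical labels keeps them distinct, which is how a doubled character survives the merge step: `[1, 0, 1]` decodes to `"aa"`, while `[1, 1]` decodes to `"a"`.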

Authors (4 of 5 listed)

Cheng-Lin Liu

Xu-Yao Zhang

Fei Yin

Wenhao He
