Title | A unified deep neural network for scene text detection |
Authors | Li, Yixin; Ma, Jinwen |
Affiliation | LMAM, Department of Information Science, School of Mathematical Sciences, Peking University, Beijing, 100871, China |
Issue Date | 2017 |
Publisher | 13th International Conference on Intelligent Computing, ICIC 2017 |
Citation | 13th International Conference on Intelligent Computing, ICIC 2017. 2017, 10361 LNCS, 101-112. |
Abstract | Scene text detection is important and valuable for text recognition in natural scenes, but it is still a very challenging problem. In this paper, we propose a unified deep neural network for scene text detection, which is composed of a Fully Convolutional Network (FCN) for text saliency map generation and a Bounding box Regression Network (BRN) for text bounding boxes prediction. The FCN is trained with a hybrid loss function based on two types of pixel-wise ground truth masks while the unified neural network is fine-tuned with a multitask loss function. Additionally, the post-processing procedures including scoring the predicted bounding boxes by the saliency map and eliminating the redundant boxes via the Non-Maximum Suppression (NMS) method are applied to improve the final text detection results. It is demonstrated by the experimental results on ICDAR2013 benchmark that our proposed unified deep neural network can achieve good performance of text detection and process images at 5 fps, being faster than most of the existing text detection methods. © Springer International Publishing AG 2017. |
URI | http://hdl.handle.net/20.500.11897/505064 |
ISBN | 9783319633084 |
DOI | 10.1007/978-3-319-63309-1_10 |
Indexed | EI |
Appears in Collections: | School of Information Science and Technology; Key Laboratory of Mathematics and Its Applications (Ministry of Education) |
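
The abstract describes a post-processing stage in which predicted bounding boxes are scored by the saliency map and redundant boxes are removed with Non-Maximum Suppression (NMS). The record does not give the exact scoring rule or thresholds, so the following is only a minimal sketch under stated assumptions: the score is taken to be the mean saliency inside each box, boxes are axis-aligned `(x1, y1, x2, y2)` tuples with exclusive upper bounds, and the names `score_box`, `iou`, `nms` and the `iou_threshold` value are hypothetical, not taken from the paper.

```python
from typing import List, Sequence, Tuple

# Axis-aligned box (x1, y1, x2, y2); x2 and y2 are exclusive (assumed convention).
Box = Tuple[int, int, int, int]


def score_box(saliency: Sequence[Sequence[float]], box: Box) -> float:
    """Mean saliency inside the box (an assumed scoring rule, not the paper's)."""
    x1, y1, x2, y2 = box
    values = [saliency[y][x] for y in range(y1, y2) for x in range(x1, x2)]
    return sum(values) / len(values) if values else 0.0


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes: List[Box], scores: List[float], iou_threshold: float = 0.3) -> List[int]:
    """Greedy NMS: keep the highest-scoring box, drop boxes overlapping a kept one."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in keep):
            keep.append(i)
    return keep


# Toy 6x8 saliency map with a "text" region at rows 1..3, columns 1..4.
saliency = [[1.0 if 1 <= y < 4 and 1 <= x < 5 else 0.0 for x in range(8)]
            for y in range(6)]
boxes = [(1, 1, 5, 4), (0, 0, 6, 5), (6, 4, 8, 6)]
scores = [score_box(saliency, b) for b in boxes]
kept = nms(boxes, scores)
```

In this toy example the tight box fully covers the salient region and scores highest, so the looser box that overlaps it is suppressed, while the non-overlapping box is kept. A real pipeline would typically also discard boxes whose saliency score falls below some threshold.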