Title A unified deep neural network for scene text detection
Authors Li, Yixin
Ma, Jinwen
Affiliation LMAM, Department of Information Science, School of Mathematical Sciences, Peking University, Beijing, 100871, China
Issue Date 2017
Publisher 13th International Conference on Intelligent Computing, ICIC 2017
Citation 13th International Conference on Intelligent Computing, ICIC 2017. 2017, 10361 LNCS, 101-112.
Abstract Scene text detection is important and valuable for text recognition in natural scenes, but it remains a very challenging problem. In this paper, we propose a unified deep neural network for scene text detection, composed of a Fully Convolutional Network (FCN) for text saliency map generation and a Bounding box Regression Network (BRN) for text bounding box prediction. The FCN is trained with a hybrid loss function based on two types of pixel-wise ground truth masks, while the unified neural network is fine-tuned with a multitask loss function. Additionally, two post-processing procedures are applied to improve the final text detection results: scoring the predicted bounding boxes by the saliency map and eliminating redundant boxes via Non-Maximum Suppression (NMS). Experimental results on the ICDAR2013 benchmark demonstrate that the proposed unified deep neural network achieves good text detection performance and processes images at 5 fps, faster than most existing text detection methods. © Springer International Publishing AG 2017.
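The two post-processing steps named in the abstract can be illustrated with a minimal sketch. The abstract does not state the exact scoring rule or the overlap threshold, so the mean-saliency score, the `iou_threshold` of 0.5, and the function names `score_box`, `iou`, and `nms` below are assumptions made for illustration, not the authors' implementation.

```python
from typing import List, Sequence, Tuple

# Box as (x1, y1, x2, y2) in pixel coordinates; x2 and y2 are exclusive.
Box = Tuple[int, int, int, int]


def score_box(saliency: Sequence[Sequence[float]], box: Box) -> float:
    """Score a box by the mean saliency inside it (assumed scoring rule)."""
    x1, y1, x2, y2 = box
    values = [saliency[y][x] for y in range(y1, y2) for x in range(x1, x2)]
    return sum(values) / len(values) if values else 0.0


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes: List[Box], scores: List[float],
        iou_threshold: float = 0.5) -> List[int]:
    """Greedy NMS: visit boxes by descending score and keep a box only if
    it does not overlap an already kept box by more than iou_threshold."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in keep):
            keep.append(i)
    return keep


if __name__ == "__main__":
    # Toy 4x8 saliency map: a salient region on the left, background on the right.
    saliency = [[1.0] * 4 + [0.0] * 4 for _ in range(3)] + [[0.5] * 4 + [0.0] * 4]
    boxes = [(0, 0, 4, 4), (0, 0, 4, 3), (4, 0, 8, 4)]
    scores = [score_box(saliency, b) for b in boxes]
    print(scores, nms(boxes, scores))
```

In practice a score threshold would presumably also be applied before NMS to discard boxes that cover little salient area; the value of such a threshold is not given in the abstract.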
URI http://hdl.handle.net/20.500.11897/505064
ISBN 9783319633084
DOI 10.1007/978-3-319-63309-1_10
Indexed EI
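Appears in Collections: School of Electronics Engineering and Computer Science
Key Laboratory of Mathematics and Its Applications (LMAM), Ministry of Education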


License: See PKU IR operational policies.