Locating Texts in Images by Fully Convolutional Network with Bounding Text Labeling

Yu Hung Hou, Guan Xin Zeng, Po Chyi Su

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Texts in images are often viewed as the regions of interest. Locating such areas for further analysis may help to extract image-related information and facilitate many interesting applications. Considering that pixel-based image segmentation is effective in identifying the areas containing texts, we propose a text detection scheme based on the Fully Convolutional Network (FCN), which employs Feature Pyramid Network (FPN) and Atrous Spatial Pyramid Pooling (ASPP) to effectively mark the pixels related to texts in an image. A comparatively efficient labeling approach is adopted to group related letters/characters and separate different words by oval-shaped text labeling masks with boundaries. The experimental results demonstrate feasibility of the proposed text-detection scheme when compared with the state-of-the-art text detection approaches.

Original languageEnglish
Title of host publication2020 IEEE International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2020
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9781728173993
DOIs
StatePublished - 28 Sep 2020
Event7th IEEE International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2020 - Taoyuan, Taiwan
Duration: 28 Sep 202030 Sep 2020

Publication series

Name2020 IEEE International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2020

Conference

Conference7th IEEE International Conference on Consumer Electronics - Taiwan, ICCE-Taiwan 2020
Country/TerritoryTaiwan
CityTaoyuan
Period28/09/2030/09/20

Fingerprint

Dive into the research topics of 'Locating Texts in Images by Fully Convolutional Network with Bounding Text Labeling'. Together they form a unique fingerprint.

Cite this