TitleA NEW PERSPECTIVE FOR FLEXIBLE FEATURE GATHERING IN SCENE TEXT RECOGNITION VIA CHARACTER ANCHOR POOLING
AuthorsLong, Shangbang
Guan, Yushuo
Bian, Kaigui
Yao, Cong
AffiliationPeking Univ, Beijing, Peoples R China
Megvii Face Technol Inc, Beijing, Peoples R China
Issue Date2020
Publisher2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING
AbstractIrregular scene text recognition has attracted much attention from the research community, mainly due to the complexity of shapes of text in natural scene. However, recent methods either rely on shape-sensitive modules such as bounding box regression, or discard sequence learning. To tackle these issues, we propose a pair of coupling modules, termed as Character Anchoring Module (CAM) and Anchor Pooling Module (APM), to extract high-level semantics from two-dimensional space to form feature sequences. The proposed CAM localizes the text in a shape-insensitive way by design by anchoring characters individually. APM then interpolates and gathers features flexibly along the character anchors which enables sequence learning. The complementary modules realize a harmonic unification of spatial information and sequence learning. With the proposed modules, our recognition system surpasses previous state-of-the-art scores on irregular and perspective text datasets, including, ICDAR 2015, CUTE, and Total-Text, while paralleling state-of-the-art performance on regular text datasets.
URIhttp://hdl.handle.net/20.500.11897/604772
ISBN978-1-5090-6631-5
ISSN1520-6149
IndexedCPCI-S(ISTP)
Appears in Collections:待认领

Files in This Work
There are no files associated with this item.

Web of Science®



Checked on Last Week

百度学术™



Checked on Current Time




License: See PKU IR operational policies.