GCCNet: Grouped channel composition network for scene text detection

Autor: Xiaobin Zhu, Lei Xiao, Long-Huang Wu, Jie-Bo Hou, Chun Yang, Xu-Cheng Yin, Chang Liu
Rok vydání: 2021
Předmět:
Zdroj: Neurocomputing. 454:135-151
ISSN: 0925-2312
DOI: 10.1016/j.neucom.2021.04.095
Popis: Anchor mechanism is widely applied in scene text detection methods and demonstrates promising performance. However, existing anchor mechanisms have two major limitations, namely handcrafted anchor design and hard-wired anchor assignment. We propose a novel Grouped Channels Composition(GCC) block to achieve the data-driven anchor design and adaptive anchor assignment. To be more specific, our GCC block uses optimizable anchor functions rather than handcrafted ones to achieve data-drive anchor design. In our GCC block, an adaptive anchor assignment is achieved with the attention mechanism instead of empirically assigning anchor according to the Intersection Over Union (IoU) between ground truth and targets. We then build a corresponding network named GCCNet with our GCC blocks. We also propose a Unified Loss Weighting module to alleviate the inconsistency between classification score and localization accuracy. Experiments conducted on publicly available datasets demonstrate the state-of-the-art performance of our methods.
Databáze: OpenAIRE