Hybrid Pyramid Convolutional Network for Multiscale Face Detection

Autor: Dongdong Fang, Guangqiang Yin, Shaoqi Hou, Yixi Pan, Ye Li
Rok vydání: 2021
Předmět:
Rotation
Article Subject
General Computer Science
Computer science
General Mathematics
Computer applications to medicine. Medical informatics
R858-859.7
ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION
Neurosciences. Biological psychiatry. Neuropsychiatry
Context (language use)
02 engineering and technology
Convolutional neural network
Convolution
Robustness (computer science)
0202 electrical engineering
electronic engineering
information engineering

Pyramid (image processing)
Face detection
business.industry
General Neuroscience
020206 networking & telecommunications
Pattern recognition
General Medicine
Semantics
Feature (computer vision)
Face (geometry)
020201 artificial intelligence & image processing
Neural Networks
Computer

Artificial intelligence
business
Algorithms
RC321-571
Research Article
Zdroj: Computational Intelligence and Neuroscience
Computational Intelligence and Neuroscience, Vol 2021 (2021)
ISSN: 1687-5273
1687-5265
DOI: 10.1155/2021/9963322
Popis: Face detection remains a challenging problem due to the high variability of scale and occlusion despite the strong representational power of deep convolutional neural networks and their implicit robustness. To handle hard face detection under extreme circumstances especially tiny faces detection, in this paper, we proposed a multiscale Hybrid Pyramid Convolutional Network (HPCNet), which is a one-stage fully convolutional network. Our HPCNet consists of three newly presented modules: firstly, we designed the Hybrid Dilated Convolution (HDC) module to replace the fully connected layers in VGG16, which enlarges receptive field and reduces its loss of local information; secondly, we constructed the Hybrid Feature Pyramid (HFP) to combine semantic information from higher layers together with details from lower layers; and thirdly, to deal with the problem of occlusion and blurring effectively, we introduced Context Information Extractor (CIE) in HPCNet. In addition, we presented an improved Online Hard Example Mining (OHEM) strategy, which can enhance the average precision of face detection by balancing the number of positive and negative samples. Our method has achieved an accuracy of 0.933, 0.924, and 0.848 on the Easy, Medium, and Hard subset of WIDER FACE, respectively, which surpasses most of the advanced algorithms.
Databáze: OpenAIRE