Nonlinear CNN: improving CNNs with quadratic convolutions
Autor: | Yiyang Jiang, Dian Zhou, Hengliang Zhu, Fan Yang, Xuan Zeng |
---|---|
Rok vydání: | 2019 |
Předmět: |
0209 industrial biotechnology
Contextual image classification Spacetime Computer science 02 engineering and technology Pascal (programming language) Convolutional neural network Object detection Convolution Nonlinear system 020901 industrial engineering & automation Quadratic equation Artificial Intelligence 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing computer Algorithm Software computer.programming_language |
Zdroj: | Neural Computing and Applications. 32:8507-8516 |
ISSN: | 1433-3058 0941-0643 |
DOI: | 10.1007/s00521-019-04316-4 |
Popis: | In this work, instead of designing deeper convolutional neural networks, we investigate the relationship between the nonlinearity of convolution layer and the performance of the network. We modify the normal convolution layer by inserting quadratic convolution units which can map linear features to a higher-dimensional space in a single layer so as to enhance the approximability of the network. A genetic algorithm-based training scheme is adopted to reduce the time and space complexity caused by the quadratic convolution. Our method is experimented on classical image classification architectures including VGG-16 Net and GoogLeNet and outperforms the original models on the ImageNet classification dataset. The experimental results also show that better performance of our method can be achieved with a shallower architecture. We notice that VGG-16 model is widely used in popular object detection frameworks such as faster R-CNN and SSD. We adopt our modified VGG-16 model in these frameworks and also achieve improvements on PASCAL VOC2007 and VOC2012 dataset. |
Databáze: | OpenAIRE |
Externí odkaz: |