Hierarchical bilinear convolutional neural network for image classification

Autor: Xiang Zhang, Lei Tang, Hangzai Luo, Sheng Zhong, Ziyu Guan, Long Chen, Chao Zhao, Jinye Peng, Jianping Fan
Jazyk: angličtina
Rok vydání: 2021
Předmět:
Zdroj: IET Computer Vision, Vol 15, Iss 3, Pp 197-207 (2021)
Druh dokumentu: article
ISSN: 1751-9640
1751-9632
DOI: 10.1049/cvi2.12023
Popis: Abstract Image classification is one of the mainstream tasks of computer vision. However, the most existing methods use labels of the same granularity level for training. This leads to ignoring the hierarchy that may help to differentiate different visual objects better. Embedding hierarchical information into the convolutional neural networks (CNNs) can effectively regulate the semantic space and thus reduce the ambiguity of prediction. To this end, a multi‐task learning framework, named as Hierarchical Bilinear Convolutional Neural Network (HB‐CNN), is developed by seamlessly integrating CNNs with multi‐task learning over the hierarchical visual concept structures. Specifically, the labels with a tree structure are used as the supervision to hierarchically train multiple branch networks. In this way, the model can not only learn additional information (e.g. context information) as the coarse‐level category features, but also focus the learned fine‐level category features on the object properties. To smoothly pass hierarchical conceptual information and encourage feature reuse, a connectivity pattern is proposed to connect features at different levels. Furthermore, a bilinear module is embedded to generalise various orderless texture feature descriptors so that our model can capture more discriminative features. The proposed method is extensively evaluated on the CIFAR‐10, CIFAR‐100, and ‘Orchid’ Plant image sets. The experimental results show the effectiveness and superiority of our method.
Databáze: Directory of Open Access Journals