IncDet: In Defense of Elastic Weight Consolidation for Incremental Object Detection
Autor: | Yimin Chen, Wenming Yang, Zhanghui Kuang, Liyang Liu, Jing-Hao Xue, Wayne Zhang |
---|---|
Rok vydání: | 2020 |
Předmět: |
Flexibility (engineering)
Class (computer programming) Forgetting Computer Networks and Communications Computer science 02 engineering and technology Pascal (programming language) Regularization (mathematics) Object detection Computer Science Applications Quadratic equation Artificial Intelligence Incremental learning 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing computer Algorithm Software computer.programming_language |
Zdroj: | IEEE transactions on neural networks and learning systems. 32(6) |
ISSN: | 2162-2388 |
Popis: | Elastic weight consolidation (EWC) has been successfully applied for general incremental learning to overcome the catastrophic forgetting issue. It adaptively constrains each parameter of the new model not to deviate much from its counterpart in the old model during fine-tuning on new class data sets, according to its importance weight for old tasks. However, the previous study demonstrates that it still suffers from catastrophic forgetting when directly used in object detection. In this article, we show EWC is effective for incremental object detection if with critical adaptations. First, we conduct controlled experiments to identify two core issues why EWC fails if trivially applied to incremental detection: 1) the absence of old class annotations in new class images makes EWC misclassify objects of old classes in these images as background and 2) the quadratic regularization loss in EWC easily leads to gradient explosion when balancing old and new classes. Then, based on the abovementioned findings, we propose the corresponding solutions to tackle these issues: 1) utilize pseudobounding box annotations of old classes on new data sets to compensate for the absence of old class annotations and 2) adopt a novel Huber regularization instead of the original quadratic loss to prevent from unstable training. Finally, we propose a general EWC-based incremental object detection framework and implement it under both Fast R-CNN and Faster R-CNN, showing its flexibility and versatility. In terms of either the final performance or the performance drop with respect to the upper bound of joint training on all seen classes, evaluations on the PASCAL VOC and COCO data sets show that our method achieves a new state of the art. |
Databáze: | OpenAIRE |
Externí odkaz: |