Any-Shot Object Detection
Autor: | Shafin Rahman, Nick Barnes, Fahad Shahbaz Khan, Salman H. Khan |
---|---|
Rok vydání: | 2021 |
Předmět: |
Class (computer programming)
Forgetting business.industry Computer science ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION Inference 02 engineering and technology Pascal (programming language) 010501 environmental sciences Overfitting Object (computer science) Semantics Machine learning computer.software_genre 01 natural sciences Object detection 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing Artificial intelligence business computer 0105 earth and related environmental sciences computer.programming_language |
Zdroj: | Computer Vision – ACCV 2020 ISBN: 9783030695347 ACCV (3) |
DOI: | 10.1007/978-3-030-69535-4_6 |
Popis: | Previous work on novel object detection considers zero or few-shot settings where none or few examples of each category are available for training. In real world scenarios, it is less practical to expect that ‘all’ the novel classes are either unseen or have few-examples. Here, we propose a more realistic setting termed ‘Any-shot detection’, where totally unseen and few-shot categories can simultaneously co-occur during inference. Any-shot detection offers unique challenges compared to conventional novel object detection such as, a high imbalance between unseen, few-shot and seen object classes, susceptibility to forget base-training while learning novel classes and distinguishing novel classes from the background. To address these challenges, we propose a unified any-shot detection model, that can concurrently learn to detect both zero-shot and few-shot object classes. Our core idea is to use class semantics as prototypes for object detection, a formulation that naturally minimizes knowledge forgetting and mitigates the class-imbalance in the label space. Besides, we propose a rebalanced loss function that emphasizes difficult few-shot cases but avoids overfitting on the novel classes to allow detection of totally unseen classes. Without bells and whistles, our framework can also be used solely for Zero-shot object detection and Few-shot object detection tasks. We report extensive experiments on Pascal VOC and MS-COCO datasets where our approach is shown to provide significant improvements. |
Databáze: | OpenAIRE |
Externí odkaz: |