HM-NAS: Efficient Neural Architecture Search via Hierarchical Masking
Autor: | Xiao Zeng, Yu Zheng, Biyi Fang, Zhang Fa'en, Hui Xu, Shen Yan, Mi Zhang |
---|---|
Rok vydání: | 2019 |
Předmět: |
FOS: Computer and information sciences
Computer Science - Machine Learning Network architecture Artificial neural network business.industry Computer science Computer Vision and Pattern Recognition (cs.CV) Distributed computing Computer Science - Computer Vision and Pattern Recognition Machine Learning (stat.ML) Masking (Electronic Health Record) Machine Learning (cs.LG) Statistics - Machine Learning Artificial intelligence Architecture business Heuristics |
Zdroj: | ICCV Workshops |
DOI: | 10.1109/iccvw.2019.00243 |
Popis: | The use of automatic methods, often referred to as Neural Architecture Search (NAS), in designing neural network architectures has recently drawn considerable attention. In this work, we present an efficient NAS approach, named HM- NAS, that generalizes existing weight sharing based NAS approaches. Existing weight sharing based NAS approaches still adopt hand-designed heuristics to generate architecture candidates. As a consequence, the space of architecture candidates is constrained in a subset of all possible architectures, making the architecture search results sub-optimal. HM-NAS addresses this limitation via two innovations. First, HM-NAS incorporates a multi-level architecture encoding scheme to enable searching for more flexible network architectures. Second, it discards the hand-designed heuristics and incorporates a hierarchical masking scheme that automatically learns and determines the optimal architecture. Compared to state-of-the-art weight sharing based approaches, HM-NAS is able to achieve better architecture search performance and competitive model evaluation accuracy. Without the constraint imposed by the hand-designed heuristics, our searched networks contain more flexible and meaningful architectures that existing weight sharing based NAS approaches are not able to discover. Comment: 9 pages, 6 figures, 6 tables. Nominated for ICCV 2019 Neural Architects Workshop Best Paper Award |
Databáze: | OpenAIRE |
Externí odkaz: |