HM-NAS: Efficient Neural Architecture Search via Hierarchical Masking

Authors:	Xiao Zeng, Yu Zheng, Biyi Fang, Zhang Fa'en, Hui Xu, Shen Yan, Mi Zhang
Year of publication:	2019
Subject:	FOS: Computer and information sciences; Computer Science - Machine Learning; Network architecture; Artificial neural network; business.industry; Computer science; Computer Vision and Pattern Recognition (cs.CV); Distributed computing; Computer Science - Computer Vision and Pattern Recognition; Machine Learning (stat.ML); Masking (Electronic Health Record); Machine Learning (cs.LG); Statistics - Machine Learning; Artificial intelligence; Architecture; business; Heuristics
Source:	ICCV Workshops
DOI:	10.1109/iccvw.2019.00243
Description:	The use of automatic methods, often referred to as Neural Architecture Search (NAS), in designing neural network architectures has recently drawn considerable attention. In this work, we present an efficient NAS approach, named HM-NAS, that generalizes existing weight sharing based NAS approaches. Existing weight sharing based NAS approaches still adopt hand-designed heuristics to generate architecture candidates. As a consequence, the space of architecture candidates is constrained to a subset of all possible architectures, making the architecture search results sub-optimal. HM-NAS addresses this limitation via two innovations. First, HM-NAS incorporates a multi-level architecture encoding scheme to enable searching for more flexible network architectures. Second, it discards the hand-designed heuristics and incorporates a hierarchical masking scheme that automatically learns and determines the optimal architecture. Compared to state-of-the-art weight sharing based approaches, HM-NAS is able to achieve better architecture search performance and competitive model evaluation accuracy. Without the constraint imposed by the hand-designed heuristics, our searched networks contain more flexible and meaningful architectures that existing weight sharing based NAS approaches are not able to discover. Comment: 9 pages, 6 figures, 6 tables. Nominated for ICCV 2019 Neural Architects Workshop Best Paper Award
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ec005dd3944aff372a8548d77abfce73 https://doi.org/10.1109/iccvw.2019.00243