A method for optimizing neural networks based on structural distillation using a genetic algorithm
Author: | Vladimir N. Kuzmin, Artem B. Menisov, Timur R. Sabirov |
---|---|
Language: | English, Russian |
Year of publication: | 2024 |
Subject: | |
Source: | Naučno-tehničeskij Vestnik Informacionnyh Tehnologij, Mehaniki i Optiki, Vol 24, Iss 5, Pp 770-778 (2024) |
Document type: | article |
ISSN: | 2226-1494 2500-0373 |
DOI: | 10.17586/2226-1494-2024-24-5-770-778 |
Description: | As neural networks grow more complex, their parameter counts and computational cost increase, which complicates deploying and operating artificial intelligence systems on edge devices. Structural distillation can significantly reduce the resources a neural network requires. The paper presents a method for optimizing neural networks that combines structural distillation with a genetic algorithm. Unlike evolutionary approaches that search for an optimal architecture or distillation of a neural network, the proposed method encodes not only the network's parameters but also the connections between neurons when generating candidate distillations. The experimental study used the VGG16 and ResNet18 models on the CIFAR-10 dataset. The results show that structural distillation reduces network size while preserving generalization ability, and that the genetic algorithm searches effectively for distillation candidates that balance structural complexity against performance. Overall, the proposed method reduced the size and improved the performance of the networks with an acceptable loss of quality. |
Database: | Directory of Open Access Journals |
External link: |
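
The abstract describes a genetic search over candidates that encode which connections between neurons are kept. As a rough illustration of that general idea only, and not the authors' actual encoding, fitness function, or operators (none of which are given in this record), the toy sketch below evolves a binary mask over a handful of connections. The names `fitness` and `evolve`, the `importance` scores, and the `penalty` weight are all hypothetical; a real setup would score each candidate by evaluating the pruned network on held-out data.

```python
import random


def fitness(mask, importance, penalty=0.3):
    """Toy score: share of importance kept minus a penalty for mask size.

    Stand-in (assumption) for a real trade-off between accuracy and
    structural complexity.
    """
    kept = sum(mask)
    retained = sum(w for w, m in zip(importance, mask) if m)
    return retained / sum(importance) - penalty * kept / len(mask)


def evolve(importance, pop_size=20, generations=40, mutation_rate=0.05, seed=0):
    """Evolve a binary connection mask with a minimal genetic algorithm."""
    rng = random.Random(seed)
    n = len(importance)
    # Each chromosome is a binary mask: 1 keeps a connection, 0 prunes it.
    population = [[rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=lambda m: fitness(m, importance), reverse=True)
        parents = population[: pop_size // 2]  # truncation selection, keeps the best
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n)  # single-point crossover
            child = a[:cut] + b[cut:]
            # Flip each gene with a small probability.
            child = [1 - g if rng.random() < mutation_rate else g for g in child]
            children.append(child)
        population = parents + children
    return max(population, key=lambda m: fitness(m, importance))


# Hypothetical per-connection importance scores (illustrative values only).
importance = [0.9, 0.8, 0.05, 0.02, 0.7, 0.01, 0.6, 0.03]
best = evolve(importance)
print(best, round(fitness(best, importance), 3))
```

The penalty term plays the role of the structural-complexity cost: raising `penalty` pushes the search toward sparser masks, while lowering it favours keeping more connections.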