Gradient Methods for Optimizing Metaparameters in the Knowledge Distillation Problem.

Autor: Gorpinich, M.1 (AUTHOR) gorpinich.m@phystech.edu, Bakhteev, O. Yu.2 (AUTHOR) bakhteev@phystech.edu, Strijov, V. V.2 (AUTHOR) strijov@gmail.com
Zdroj: Automation & Remote Control. Oct2022, Vol. 83 Issue 10, p1544-1554. 11p.
Databáze: Academic Search Ultimate