On decomposing a deep neural network into modules
Autor: | Hridesh Rajan, Rangeet Pan |
---|---|
Rok vydání: | 2020 |
Předmět: |
Structure (mathematical logic)
Modularity (networks) Artificial neural network Computer science business.industry Deep learning 020207 software engineering 02 engineering and technology Software 020204 information systems Component (UML) 0202 electrical engineering electronic engineering information engineering Software system Artificial intelligence business MNIST database |
Zdroj: | ESEC/SIGSOFT FSE |
DOI: | 10.1145/3368089.3409668 |
Popis: | Deep learning is being incorporated in many modern software systems. Deep learning approaches train a deep neural network (DNN) model using training examples, and then use the DNN model for prediction. While the structure of a DNN model as layers is observable, the model is treated in its entirety as a monolithic component. To change the logic implemented by the model, e.g. to add/remove logic that recognizes inputs belonging to a certain class, or to replace the logic with an alternative, the training examples need to be changed and the DNN needs to be retrained using the new set of examples. We argue that decomposing a DNN into DNN modules— akin to decomposing a monolithic software code into modules—can bring the benefits of modularity to deep learning. In this work, we develop a methodology for decomposing DNNs for multi-class problems into DNN modules. For four canonical problems, namely MNIST, EMNIST, FMNIST, and KMNIST, we demonstrate that such decomposition enables reuse of DNN modules to create different DNNs, enables replacement of one DNN module in a DNN with another without needing to retrain. The DNN models formed by composing DNN modules are at least as good as traditional monolithic DNNs in terms of test accuracy for our problems. |
Databáze: | OpenAIRE |
Externí odkaz: |