Computation Offloading for Machine Learning Web Apps in the Edge Server Environment
Autor: | InChang Jeong, Soo-Mook Moon, Hyuk-Jin Jeong, Hyeon-Jae Lee |
---|---|
Rok vydání: | 2018 |
Předmět: |
020203 distributed computing
ComputerSystemsOrganization_COMPUTERSYSTEMIMPLEMENTATION Computer science business.industry Mobile computing 020206 networking & telecommunications Cloud computing 02 engineering and technology computer.software_genre Edge server Virtual machine Server 0202 electrical engineering electronic engineering information engineering Operating system Computation offloading Web application The Internet business computer Edge computing |
Zdroj: | ICDCS |
DOI: | 10.1109/icdcs.2018.00154 |
Popis: | Machine leaning apps require heavy computations, especially with the use of the deep neural network (DNN), so an embedded device with limited hardware cannot run the apps by itself. One solution for this problem is to offload DNN computations from the client to a nearby edge server. Existing approaches to DNN offloading with edge servers either specialize the edge server for fixed, specific apps, or customize the edge server for diverse apps, yet after migrating a large VM image that contains the client's back-end software system. In this paper, we propose a new and simple approach to offload DNN computations in the context of web apps. We migrate the current execution state of a web app from the client to the edge server just before executing a DNN computation, so that the edge server can execute the DNN computation with its powerful hardware. Then, we migrate the new execution state from the edge server to the client so that the client can continue to execute the app. We can save the execution state of the web app in the form of another web app called the snapshot, which immensely simplifies saving and restoring the execution state with a small overhead. We can offload any DNN app to any generic edge server, equipped with a browser and our offloading system. We address some issues related to offloading DNN apps such as how to send the DNN model and how to improve the privacy of user data. We also discuss how to install our offloading system on the edge server on demand. Our experiment with real DNN-based web apps shows that snapshot-based offloading achieves a promising performance result, comparable to running the app entirely on the server. |
Databáze: | OpenAIRE |
Externí odkaz: |