MARVEL D3.6 - Efficient deployment of AI-optimised ML/DL models – final version

Autor: Manos Papoutsakis, Anthi Barmpaki, Emmanouil Michalodimitrakis
Jazyk: angličtina
Rok vydání: 2023
Předmět:
DOI: 10.5281/zenodo.8147020
Popis: The objective of this deliverable is to provide an overview of the MARVEL Edge- to-Fog-Cloud framework, which serves as the deployment layer for the AI/DL MARVEL components. This framework encompasses the deployment logic that operates behind MARVdash, a proposed Kubernetes dashboard used to instantiate services as orchestrated containers and deploy them to desired execution sites, following an optimisation strategy. The primary aim of the optimisation strategy is to ensure that MARVEL components are deployed onto Kubernetes nodes based on their specific resource requirements and the available resources on the respective nodes. Furthermore, this document will outline methods for compressing machine learning algorithms/models by leveraging the resources present at the edge, such as reducing the size and operation time of deep learning models with millions of parameters. This compression approach can help minimise the computational overhead on edge servers.
Databáze: OpenAIRE