Toward efficient data science: A comprehensive MLOps template for collaborative code development and automation

Autor: Ryan C. Godwin, Ryan L. Melvin
Jazyk: angličtina
Rok vydání: 2024
Předmět:
Zdroj: SoftwareX, Vol 26, Iss , Pp 101723- (2024)
Druh dokumentu: article
ISSN: 2352-7110
DOI: 10.1016/j.softx.2024.101723
Popis: In the era of big data analytics and AI applications, data provenance is as important as ever, particularly as applications emerge in vital industries like healthcare. Additionally, as the suites of tools and packages grow exponentially, code transparency and experiment record keeping are essential to ensuring full traceability of AI and ML models. This manuscript presents an open-source Machine Learning Operations (MLOps) Template that provides a consistent framework to support collaborative development and improve efficiency. The template provides a robust and reliable software structure incorporating essential development aspects. These tools include automated code documentation, built-in package management, experiment tracking, configuration and logging infrastructure, and more. The template is built on an agglomeration of best practices gleaned from industry and academia alike, providing a great starting point for any ML/AI project.
Databáze: Directory of Open Access Journals