Less is not more

Autor: Laurens Versluis, Mehmet Cetin, Caspar Greeven, Kristian Laursen, Damian Podareanu, Valeriu Codreanu, Alexandru Uta, Alexandru Iosup
Přispěvatelé: Computer Science, Computer Systems, Network Institute, Massivizing Computer Systems
Jazyk: angličtina
Rok vydání: 2023
Předmět:
Zdroj: Versluis, L, Cetin, M, Greeven, C, Laursen, K, Podareanu, D, Codreanu, V, Uta, A & Iosup, A 2023, ' Less is not more : We need rich datasets to explore ', Future Generation Computer Systems, vol. 142, pp. 117-130 . https://doi.org/10.1016/j.future.2022.12.022
Future Generation Computer Systems, 142, 117-130. Elsevier
ISSN: 1872-7115
0167-739X
Popis: Traditional datacenter analysis is based on high-level, coarse-grained metrics. This obscures our vision of datacenter behavior, as we do not observe the full picture nor subtleties that might make up these high-level, coarse metrics. There is room for operational improvement based on fine-grained temporal and spatial, low-level metric data. We leverage in this work one of the (rare) public datasets providing fine-grained information on datacenter operations, with over 60 billion measurements captured in 15-second intervals. We show evidence that fine-grained information reveals new operational aspects, that the different metrics cannot be derived from one another (and thus need to be captured), and that many low-level metrics, gathered frequently are key to understanding datacenter operations. We propose a holistic analysis for datacenter operations, providing statistical characterization of node and workload aspects. Our analysis reveals both generic and machine learning-specific aspects, summarized in over 30 observations, providing deep insight into this dataset and the originating cluster. We give actionable insights, surprising findings, and exemplify how our observations support performance-engineering tasks such as workload prediction and long-term datacenter design.
Databáze: OpenAIRE