DIAMetrics: Benchmarking Query Engines at Scale.

Autor: Deep, Shaleen, Gruenheid, Anja, Nagaraj, Kruthi, Naito, Hiro, Naughton, Jeff, Viglas, Stratis
Předmět:
Zdroj: Communications of the ACM; Dec2022, Vol. 65 Issue 12, p105-112, 8p, 4 Diagrams, 1 Graph
Abstrakt: This paper introduces DIAMetrics: a novel framework for end-to-end benchmarking and performance monitoring of query engines. DIAMetrics consists of a number of components supporting tasks such as automated workload summarization, data anonymization, benchmark execution, monitoring, regression identification, and alerting. The architecture of DIAMetrics is highly modular and supports multiple systems by abstracting their implementation details and relying on common canonical formats and pluggable software drivers. The end result is a powerful unified framework that is capable of supporting every aspect of benchmarking production systems and workloads. DIAMetrics has been developed in Google and is being used to benchmark various internal query engines. In this paper, we give an overview of DIAMetrics and discuss its design and implementation. Furthermore, we provide details about its deployment and example use cases. Given the variety of supported systems and use cases within Google, we argue that its core concepts can be used more widely to enable comparative end-to-end benchmarking in other industrial environments. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index