QCFE: An efficient Feature engineering for query cost estimation

Autor: Yan, Yu, Wang, Hongzhi, Huang, Junfang, Zhong, Dake, Yang, Man, Zhang, Kaixin, Yu, Tao, Wan, Tianqing
Rok vydání: 2023
Předmět:
Druh dokumentu: Working Paper
Popis: Query cost estimation is a classical task for database management. Recently, researchers apply the AI-driven model to implement query cost estimation for achieving high accuracy. However, two defects of feature design lead to poor cost estimation accuracy-time efficiency. On the one hand, existing works only encode the query plan and data statistics while ignoring some other important variables, like storage structure, hardware, database knobs, etc. These variables also have significant impact on the query cost. On the other hand, due to the straightforward encoding design, existing works suffer heavy representation learning burden on ineffective dimensions of input. To meet the above two problems, we first propose an efficient feature engineering for query cost estimation, called QCFE. Specifically, we design a novel feature called feature snapshot to efficiently integrate the influences of the ignored variables. Further, we propose a difference-propagation feature reduction method for query cost estimation to filter the useless features. The experimental results demonstrate our QCFE could largely improve the time-accuracy efficiency on extensive benchmarks.
Databáze: arXiv