Cohort Query Processing

Autor: Jiang, Dawei, Cai, Qingchao, Chen, Gang, Jagadish, H. V., Ooi, Beng Chin, Tan, Kian-Lee, Tung, Anthony K. H.
Rok vydání: 2016
Předmět:
Druh dokumentu: Working Paper
Popis: Modern Internet applications often produce a large volume of user activity records. Data analysts are interested in cohort analysis, or finding unusual user behavioral trends, in these large tables of activity records. In a traditional database system, cohort analysis queries are both painful to specify and expensive to evaluate. We propose to extend database systems to support cohort analysis. We do so by extending SQL with three new operators. We devise three different evaluation schemes for cohort query processing. Two of them adopt a non-intrusive approach. The third approach employs a columnar based evaluation scheme with optimizations specifically designed for cohort query processing. Our experimental results confirm the performance benefits of our proposed columnar database system, compared against the two non-intrusive approaches that implement cohort queries on top of regular relational databases.
Databáze: arXiv