Zobrazeno 1 - 10
of 53
pro vyhledávání: '"Aaron J. Elmore"'
Autor:
Siyuan Xia, Zhiru Zhu, Chris Zhu, Jinjin Zhao, Kyle Chard, Aaron J. Elmore, Ian Foster, Michael Franklin, Sanjay Krishnan, Raul Castro Fernandez
Pooling and sharing data increases and distributes its value. But since data cannot be revoked once shared, scenarios that require controlled release of data for regulatory, privacy, and legal reasons default to not sharing. Because selectively contr
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::8479d6adf52c87633f1b4225ecd68fb5
Publikováno v:
Proceedings of the VLDB Endowment. 14:2586-2598
Modern data-intensive applications often generate large amounts of low precision float data with a limited range of values. Despite the prevalence of such data, there is a lack of an effective solution to ingest, store, and analyze bounded, low-preci
Publikováno v:
Proceedings of the VLDB Endowment. 14:2795-2798
The ad-hoc, heterogeneous process of modern data science typically involves loading, cleaning, and mutating dataset(s) into multiple versions recorded as artifacts by various tools within a single data science workflow. Lineage information, including
Publikováno v:
Proceedings of the VLDB Endowment. 13:2937-2940
Existing stream processing and continuous query processing systems eagerly maintain standing queries by consuming all available resources to finish the jobs at hand, which can be a major source of wasting CPU cycles and memory resources. However, use
Publikováno v:
Proceedings of the VLDB Endowment. 13:925-938
We propose PIDS, Pattern Inference Decomposed Storage, an innovative storage method for decomposing string attributes in columnar stores. Using an unsupervised approach, PIDS identifies common patterns in string attributes from relational databases,
Publikováno v:
The VLDB Journal. 29:509-538
Data science teams often collaboratively analyze datasets, generating dataset versions at each stage of iterative exploration and analysis. There is a pressing need for a system that can support dataset versioning, enabling such teams to efficiently
Publikováno v:
UIST
Users face many challenges in keeping their personal file collections organized. While current file-management interfaces help users retrieve files in disorganized repositories, they do not aid in organization. Pertinent files can be difficult to fin
Publikováno v:
Aaron Elmore
SoCC
SoCC
We propose MindPalace, a prototype of a versioned database for efficient collaborative data management. MindPalace supports offline collaboration, where users work independently without real-time correspondence. The core of MindPalace is a critical s
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::572e102517d68a765ca1fa24a281d992
http://arxiv.org/abs/2110.01778
http://arxiv.org/abs/2110.01778
Publikováno v:
SIGIR
Prior work suggests that users conceptualize the organization of personal collections of digital files through the lens of similarity. However, it is unclear to what degree similar files are actually located near one another (e.g., in the same direct
Publikováno v:
SIGMOD Conference
Shared query execution can reduce resource consumption by sharing common sub-expressions across concurrent queries. We show that this is not always the case when regularly querying a dataset under change. Depending on latency goals, how eagerly to in