Výsledky vyhledávání - "Carey, Michael J."

Report

JEDI: These aren't the JSON documents you're looking for... (Extended Version*)

Autor: Hütter, Thomas, Augsten, Nikolaus, Kirsch, Christoph M., Carey, Michael J., Li, Chen

The JavaScript Object Notation (JSON) is a popular data format used in document stores to natively support semi-structured data. In this paper, we address the problem of JSON similarity lookup queries: given a query document and a distance threshold

Externí odkaz: http://arxiv.org/abs/2201.08099

Zobrazit plný text záznamu

Report

Design Trade-offs for a Robust Dynamic Hybrid Hash Join (Extended Version)

Autor: Jahangiri, Shiva, Carey, Michael J., Freytag, Johann-Christoph

The Join operator, as one of the most expensive and commonly used operators in database systems, plays a substantial role in Database Management System (DBMS) performance. Among the many different Join algorithms studied over the last decades, Hybrid

Externí odkaz: http://arxiv.org/abs/2112.02480

Zobrazit plný text záznamu

Report

Columnar Formats for Schemaless LSM-based Document Stores

Autor: Alkowaileet, Wail Y., Carey, Michael J.

In the last decade, document store database systems have gained more traction for storing and querying large volumes of semi-structured data. However, the flexibility of the document stores' data models has limited their ability to store data in a co

Externí odkaz: http://arxiv.org/abs/2111.11517

Zobrazit plný text záznamu

Report

DynaHash: Efficient Data Rebalancing in Apache AsterixDB (Extended Version)

Autor: Luo, Chen, Carey, Michael J.

Parallel shared-nothing data management systems have been widely used to exploit a cluster of machines for efficient and scalable data processing. When a cluster needs to be dynamically scaled in or out, data must be efficiently rebalanced. Ideally,

Externí odkaz: http://arxiv.org/abs/2105.11075

Zobrazit plný text záznamu

Report

Bridging BAD Islands: Declarative Data Sharing at Scale

Autor: Wang, Xikui, Carey, Michael J., Tsotras, Vassilis J.

In many Big Data applications today, information needs to be actively shared between systems managed by different organizations. To enable sharing Big Data at scale, developers would have to create dedicated server programs and glue together multiple

Externí odkaz: http://arxiv.org/abs/2101.01852

Zobrazit plný text záznamu

Report

PolyFrame: A Retargetable Query-based Approach to Scaling DataFrames (Extended Version)

Autor: Sinthong, Phanwadee, Carey, Michael J.

In the last few years, the field of data science has been growing rapidly as various businesses have adopted statistical and machine learning techniques to empower their decision making and applications. Scaling data analysis, possibly including the

Externí odkaz: http://arxiv.org/abs/2010.05529

Zobrazit plný text záznamu

Report

Revisiting Runtime Dynamic Optimization for Join Queries in Big Data Management Systems

Autor: Pavlopoulou, Christina, Carey, Michael J., Tsotras, Vassilis J.

Query Optimization remains an open problem for Big Data Management Systems. Traditional optimizers are cost-based and use statistical estimates of intermediate result cardinalities to assign costs and pick the best plan. However, such estimates tend

Externí odkaz: http://arxiv.org/abs/2010.00728

Zobrazit plný text záznamu

Report

Subscribing to Big Data at Scale

Autor: Wang, Xikui, Carey, Michael J., Tsotras, Vassilis J.

Today, data is being actively generated by a variety of devices, services, and applications. Such data is important not only for the information that it contains, but also for its relationships to other data and to interested users. Most existing Big

Externí odkaz: http://arxiv.org/abs/2009.04611

Zobrazit plný text záznamu

Report

Breaking Down Memory Walls: Adaptive Memory Management in LSM-based Storage Systems (Extended Version)

Autor: Luo, Chen, Carey, Michael J.

Log-Structured Merge-trees (LSM-trees) have been widely used in modern NoSQL systems. Due to their out-of-place update design, LSM-trees have introduced memory walls among the memory components of multiple LSM-trees and between the write memory and t

Externí odkaz: http://arxiv.org/abs/2004.10360

Zobrazit plný text záznamu

Report

BAD to the Bone: Big Active Data at its Core

Autor: Jacobs, Steven, Wang, Xikui, Carey, Michael J., Tsotras, Vassilis J., Uddin, Md Yusuf Sarwar

Virtually all of today's Big Data systems are passive in nature, responding to queries posted by their users. Instead, we are working to shift Big Data platforms from passive to active. In our view, a Big Active Data (BAD) system should continuously

Externí odkaz: http://arxiv.org/abs/2002.09755

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání