Showing 1 - 10
of 2,661
for search: '"Carey Michael"'
Within the dynamic world of Big Data, traditional systems typically operate in a passive mode, processing and responding to user queries by returning the requested data. However, this methodology falls short of meeting the evolving demands of users w
External link:
http://arxiv.org/abs/2412.14519
The JavaScript Object Notation (JSON) is a popular data format used in document stores to natively support semi-structured data. In this paper, we address the problem of JSON similarity lookup queries: given a query document and a distance threshold
External link:
http://arxiv.org/abs/2201.08099
The Join operator, as one of the most expensive and commonly used operators in database systems, plays a substantial role in Database Management System (DBMS) performance. Among the many different Join algorithms studied over the last decades, Hybrid
External link:
http://arxiv.org/abs/2112.02480
In the last decade, document store database systems have gained more traction for storing and querying large volumes of semi-structured data. However, the flexibility of the document stores' data models has limited their ability to store data in a co
External link:
http://arxiv.org/abs/2111.11517
Author:
Luo, Chen, Carey, Michael J.
Parallel shared-nothing data management systems have been widely used to exploit a cluster of machines for efficient and scalable data processing. When a cluster needs to be dynamically scaled in or out, data must be efficiently rebalanced. Ideally,
External link:
http://arxiv.org/abs/2105.11075
In many Big Data applications today, information needs to be actively shared between systems managed by different organizations. To enable sharing Big Data at scale, developers would have to create dedicated server programs and glue together multiple
External link:
http://arxiv.org/abs/2101.01852
Published in:
In Social Sciences & Humanities Open 2024 10
Author:
Sinthong, Phanwadee, Carey, Michael J.
In the last few years, the field of data science has been growing rapidly as various businesses have adopted statistical and machine learning techniques to empower their decision making and applications. Scaling data analysis, possibly including the
External link:
http://arxiv.org/abs/2010.05529