Výsledky vyhledávání

Transactional Panorama: A Conceptual Framework for User Perception in Analytical Visual Interfaces

Autor: Dixin Tang, Alan Fekete, Indranil Gupta, Aditya G. Parameswaran

Many tools empower analysts and data scientists to consume analysis results in a visual interface, such as a dashboard. When the underlying data changes, these results need to be updated, but this update can take a long time -- all while the user con

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::089265ae4d76bd96fb2de85adff9bbf3

Zobrazit plný text záznamu

Lux

Autor: Doris Jung-Lin Lee, Dixin Tang, Kunal Agarwal, Thyne Boonmark, Caitlyn Chen, Jake Kang, Ujjaini Mukhopadhyay, Jerry Song, Micah Yong, Marti A. Hearst, Aditya G. Parameswaran

Publikováno v: Proceedings of the VLDB Endowment. 15:727-738

Exploratory data science largely happens in computational notebooks with dataframe APIs, such as pandas, that support flexible means to transform, clean, and analyze data. Yet, visually exploring data in dataframes remains tedious, requiring substant

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::dda89587808b3af7b0c7cc91cb23e6ff
https://doi.org/10.14778/3494124.3494151

Zobrazit plný text záznamu

Flexible rule-based decomposition and metadata independence in modin

Autor: Devin Petersohn, Dixin Tang, Rehan Durrani, Areg Melik-Adamyan, Joseph E. Gonzalez, Anthony D. Joseph, Aditya G. Parameswaran

Publikováno v: Proceedings of the VLDB Endowment. 15:739-751

Dataframes have become universally popular as a means to represent data in various stages of structure, and manipulate it using a rich set of operators---thereby becoming an essential tool in the data scientists' toolbox. However, dataframe systems,

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::df1e9934f875790faedc6e662c0aaa0f
https://doi.org/10.14778/3494124.3494152

Zobrazit plný text záznamu

CrocodileDB in action

Autor: Zechao Shang, Sanjay Krishnan, Dixin Tang, Michael J. Franklin, Aaron J. Elmore

Publikováno v: Proceedings of the VLDB Endowment. 13:2937-2940

Existing stream processing and continuous query processing systems eagerly maintain standing queries by consuming all available resources to finish the jobs at hand, which can be a major source of wasting CPU cycles and memory resources. However, use

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::a38e608b47f0f3254cd346d59bcf6ebe
https://doi.org/10.14778/3415478.3415513

Zobrazit plný text záznamu

Resource-efficient Shared Query Execution via Exploiting Time Slackness

Autor: Zechao Shang, William W. Ma, Sanjay Krishnan, Dixin Tang, Aaron J. Elmore

Publikováno v: SIGMOD Conference

Shared query execution can reduce resource consumption by sharing common sub-expressions across concurrent queries. We show that this is not always the case when regularly querying a dataset under change. Depending on latency goals, how eagerly to in

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::da0302c52f77668dd720f1ef7fb618ba
https://doi.org/10.1145/3448016.3457282

Zobrazit plný text záznamu

Intermittent query processing

Autor: Michael J. Franklin, Dixin Tang, Aaron J. Elmore, Zechao Shang, Sanjay Krishnan

Publikováno v: Proceedings of the VLDB Endowment. 12:1427-1441

Many applications ingest data in an intermittent, yet largely predictable, pattern. Existing systems tend to ignore how data arrives when making decisions about how to update (or refresh) an ongoing query. To address this shortcoming we propose a new

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::a89e08d0b2e963d94890ac88ea0b76c5
https://doi.org/10.14778/3342263.3342278

Zobrazit plný text záznamu

CIAO: An Optimization Framework for Client-Assisted Data Loading

Autor: Sanjay Krishnan, Cong Ding, Dixin Tang, Xi Liang, Aaron J. Elmore

Publikováno v: ICDE
Aaron Elmore

Data loading has been one of the most common performance bottlenecks for many big data applications, especially when they are running on inefficient human-readable formats, such as JSON or CSV. Parsing, validating, integrity checking and data structu

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::fc459ebb4574060e35e3bcd95d35c15f
https://doi.org/10.1109/icde51399.2021.00187

Zobrazit plný text záznamu

Thrifty Query Execution via Incrementability

Autor: Dixin Tang, Sanjay Krishnan, Michael J. Franklin, Zechao Shang, Aaron J. Elmore

Publikováno v: SIGMOD Conference

Many applications schedule queries before all data is ready. To return fast query results, database systems can eagerly process existing data and incrementally incorporate new data into prior intermediate results, which often relies on incremental vi

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::712519a4bd3fce532b116e2ddb4003b5
https://doi.org/10.1145/3318464.3389756

Zobrazit plný text záznamu

Socrates

Autor: Krystyna Reisteter, Cristian Diaconu, Sandeep Lingam, Dixin Tang, Umar Farooq Minhas, Jack Hu, Vijendra Purohit, Alejandro Hernandez Saenz, Naveen Prakash, Hugh Qu, Sheetal Shrotri, Chaitanya Sreenivas Ravella, Alex Budovski, Hanuma Kodavalla, Vikram Wakade, Donald Kossmann, Panagiotis Antonopoulos

Publikováno v: SIGMOD Conference

The database-as-a-service paradigm in the cloud (DBaaS) is becoming increasingly popular. Organizations adopt this paradigm because they expect higher security, higher availability, and lower and more flexible cost with high performance. It has becom

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::6e7e0e76d0c2eed0ef68240e8ee226bc
https://doi.org/10.1145/3299869.3314047

Zobrazit plný text záznamu

SparkArray: An Array-Based Scientific Data Management System Built on Apache Spark

Autor: Wenjuan Wang, Rubao Lee, Hong Liu, Dixin Tang, Wei Li, Taoying Liu

Publikováno v: NAS

With the highly demanded requirements for manipulating large scientific datasets, scientists are in need of flexible cluster-level software to execute fast scientific data analysis. In this paper, we discuss whether the Apache Spark framework is suit

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::144702099d969347f7add885047ebdd6
https://doi.org/10.1109/nas.2016.7549422

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání