Zobrazeno 1 - 10
of 51
pro vyhledávání: '"Dmitri V. Kalashnikov"'
Publikováno v:
Proceedings of the 31st ACM International Conference on Information & Knowledge Management.
Publikováno v:
ICDE
A considerable amount of useful information on the web is (semi-)structured, such as tables and lists. An extensive corpus of prior work addresses the problem of making these human-readable representations interpretable by algorithms. Most of these w
Publikováno v:
WWW
Wikipedia is the largest encyclopedia to date. Scattered among its articles, there is an enormous number of tables that contain structured, relational information. In contrast to database tables, these webtables lack metadata, making it difficult to
Autor:
Felix Naumann, Leon Bornemann, Divesh Srivastava, Dmitri V. Kalashnikov, Tobias Bleifuß, Theodore Johnson
Publikováno v:
Proceedings of the VLDB Endowment. 12:85-98
Data and metadata in datasets experience many different kinds of change. Values are inserted, deleted or updated; rows appear and disappear; columns are added or repurposed, etc. In such a dynamic situation, users might have many questions related to
Publikováno v:
Datenbank-Spektrum. 18:79-87
Analysis of static data is one of the best studied research areas. However, data changes over time. These changes may reveal patterns or groups of similar values, properties, and entities. We study changes in large, publicly available data repositori
Publikováno v:
ACM Transactions on Knowledge Discovery from Data. 12:1-45
Entity resolution (ER) is the process of identifying which entities in a dataset refer to the same real-world object. In relational ER, the dataset consists of multiple entity-sets and relationships among them. Such relationships cause the resolution
Publikováno v:
IEEE Transactions on Knowledge and Data Engineering. 29:402-417
This paper addresses the problem of query-aware data cleaning in the context of a user query. In particular, we develop a novel Query-Driven Approach ( ${\tt QDA}$ ) that systematically exploits the semantics of the predicates in ${\tt SQL}$ -like se
Publikováno v:
IEEE Transactions on Image Processing. 25:4504-4513
In the era of big data, a traditional offline setting to processing image data is simply not tenable. We simply do not have the computational power to process every image with every possible tag; moreover, we will not have the manpower to clean up th
Publikováno v:
SIGMOD Conference
We study the problem of Query Reverse Engineering (QRE), where given a database and an output table, the task is to find a simple project-join SQL query that generates that table when applied on the database. This problem is known for its efficiency
Publikováno v:
Proceedings of the VLDB Endowment. 9:120-131
This paper explores an analysis-aware data cleaning architecture for a large class of SPJ SQL queries. In particular, we propose QuERy, a novel framework for integrating entity resolution (ER) with query processing. The aim of QuERy is to correctly a