Zobrazeno 1 - 10
of 28
pro vyhledávání: '"Jayant Madhavan"'
Autor:
Hongrae Lee, Alon Halevy, Daisy Zhe Wang, Michael Cafarella, Jayant Madhavan, Eugene Wu, Cong Yu
Publikováno v:
Proceedings of the VLDB Endowment. 11:2140-2149
In 2008, we wrote about WebTables, an effort to exploit the large and diverse set of structured databases casually published online in the form of HTML tables. The past decade has seen a flurry of research and commercial activities around the WebTabl
Publikováno v:
ACM Transactions on Database Systems. 38:1-35
Large-scale map visualization systems play an increasingly important role in presenting geographic datasets to end-users. Since these datasets can be extremely large, a map rendering system often needs to select a small fraction of the data to visual
Autor:
Warren Shen, Petros Venetis, Gengxin Miao, Chung Wu, Jayant Madhavan, Marius Pasca, Alon Halevy, Fei Wu
Publikováno v:
Proceedings of the VLDB Endowment. 4:528-538
The Web offers a corpus of over 100 million tables [6], but the meaning of each table is rarely explicit from the table itself. Header rows exist in few cases and even when they do, the attribute names are typically useless. We describe a system that
Publikováno v:
The VLDB Journal. 20:209-226
A large number of web pages contain data structured in the form of "lists". Many such lists can be further split into multi-column tables, which can then be used in more semantically meaningful tasks. However, harvesting relational tables from such l
Publikováno v:
Communications of the ACM. 54:72-79
Google's Web Tables and Deep Web Crawler identify and deliver this otherwise inaccessible resource directly to end users.
Publikováno v:
ACM SIGMOD Record. 37:55-61
A long-standing goal of Web research has been to construct a unified Web knowledge base. Information extraction techniques have shown good results on Web inputs, but even most domain-independent ones are not appropriate for Web-scale operation. In th
Publikováno v:
Proceedings of the VLDB Endowment. 1:1241-1252
The Deep Web, i.e., content hidden behind HTML forms, has long been acknowledged as a significant gap in search engine coverage. Since it represents a large portion of the structured data on the Web, accessing Deep-Web content has been a long-standin
Publikováno v:
IEEE Transactions on Knowledge and Data Engineering. 16:787-798
Intuitively, data management and data integration tools are well-suited for exchanging information in a semantically meaningful way. Unfortunately, they suffer from two significant problems: They typically require a comprehensive schema design before
Publikováno v:
The VLDB Journal The International Journal on Very Large Data Bases. 12:303-319
On the Semantic Web, data will inevitably come from many different ontologies, and information processing across ontologies is not possible without knowing the semantic mappings between them. Manually finding such mappings is tedious, error-prone, an
Autor:
Jayant Madhavan, Yana Kadiyska, Xin Dong, Alon Halevy, Dan Suciu, Nilesh Dalvi, Igor Tatarinov, Gerome Miklau, Peter Mork, Zachary G. Ives
Publikováno v:
ACM SIGMOD Record. 32:47-52
A major problem in today's information-driven world is that sharing heterogeneous, semantically rich data is incredibly difficult. Piazza is a peer data management system that enables sharing heterogeneous data in a distributed and scalable way. Piaz