Výsledky vyhledávání - "Eugene J. Shekita"

Robust Large-Scale Machine Learning in the Cloud

Autor: Eugene J. Shekita, Bor-Yiing Su, Steffen Rendle, Dennis Fetterly

Publikováno v: KDD

The convergence behavior of many distributed machine learning (ML) algorithms can be sensitive to the number of machines being used or to changes in the computing environment. As a result, scaling to a large number of machines can be challenging. In

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::6c745765956fcd1d637754fea6216381
https://doi.org/10.1145/2939672.2939790

Zobrazit plný text záznamu

Jaql

Autor: Mohamed Y. Eltabakh, Kevin Scott Beyer, Rainer Gemulla, Vuk Ercegovac, Carl-Christian Kanne, Andrey Balmin, Fatma Ozcan, Eugene J. Shekita

Publikováno v: Scopus-Elsevier

This paper describes Jaql, a declarative scripting language for analyzing large semistructured datasets in parallel using Hadoop's MapReduce framework. Jaql is currently used in IBM's InfoSphere BigInsights [5] and Cognos Consumer Insight [9] product

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7d37aeae1fe4282397362f2fc934051a
https://doi.org/10.14778/3402755.3402761

Zobrazit plný text záznamu

Column-oriented storage techniques for MapReduce

Autor: Avrilia Floratou, Eugene J. Shekita, Jignesh M. Patel, Sandeep Tata

Publikováno v: Proceedings of the VLDB Endowment. 4:419-429

Users of MapReduce often run into performance problems when they scale up their workloads. Many of the problems they encounter can be overcome by applying techniques learned from over three decades of research on parallel DBMSs. However, translating

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::4f3f181dd60f68f618da695e1f4fbef4
https://doi.org/10.14778/1988776.1988778

Zobrazit plný text záznamu

XTABLES: Bridging relational technology and XML

Autor: J. E. Funderburk, C. Wei, Eugene J. Shekita, Gerald G. Kiernan, Jayavel Shanmugasundaram

Publikováno v: IBM Systems Journal. 41:616-641

XML (Extensible Markup Language) has emerged as the standard data-exchange format for Internet-based business applications. These applications introduce a new set of data management requirements involving XML. However, for the foreseeable future, a s

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::9c5b59f60afa0a31f2842ef11f325867
https://doi.org/10.1147/sj.414.0616

Zobrazit plný text záznamu

Efficiently publishing relational data as XML documents

Autor: Rimon Barr, Michael J. Carey, Bruce G. Lindsay, Berthold Reinwald, Jayavel Shanmugasundaram, Eugene J. Shekita, Hamid Pirahesh

Publikováno v: VLDB

XML is rapidly emerging as a standard for exchanging business data on the World Wide Web. For the foreseeable future, however, most business data will continue to be stored in relational database systems. Consequently, if XML is to fulfill its potent

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::5579014ea0116755500000fe2d90a306
https://doi.org/10.1007/s007780100052

Zobrazit plný text záznamu

A general technique for querying XML documents using a relational database system

Autor: Jayavel Shanmugasundaram, Rajasekar Krishnamurthy, Eugene J. Shekita, Igor Tatarinov, Jeffrey F. Naughton, Efstratios Viglas, Jerry Kiernan

Publikováno v: ACM SIGMOD Record. 30:20-26

There has been recent interest in using relational database systems to store and query XML documents. Each of the techniques proposed in this context works by (a) creating tables for the purpose of storing XML documents (also called relational schema

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::b6cabe4e18a4264d9fe003918dd04df5
https://doi.org/10.1145/603867.603871

Zobrazit plný text záznamu

Improved histograms for selectivity estimation of range predicates

Autor: Eugene J. Shekita, Viswanath Poosala, Peter J. Haas, Yannis Ioannidis

Publikováno v: SIGMOD Conference

Many commercial database systems maintain histograms to summarize the contents of relations and permit efficient estimation of query result sizes and access plan costs. Although several types of histograms have been proposed in the past, there has ne

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c388edc7e560386c162c22a40959a1ce
https://doi.org/10.1145/235968.233342

Zobrazit plný text záznamu

Clydesdale

Autor: Sandeep Tata, Tim Kaldewey, Eugene J. Shekita

Publikováno v: EDBT

MapReduce has emerged as a promising architecture for large scale data analytics on commodity clusters. The rapid adoption of Hive, a SQL-like data processing language on Hadoop (an open source implementation of MapReduce), shows the increasing impor

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::df2d05d07c3b5942016cd44d8f204b8c
https://doi.org/10.1145/2247596.2247600

Zobrazit plný text záznamu

Using Paxos to Build a Scalable, Consistent, and Highly Available Datastore

Autor: Eugene J. Shekita, Sandeep Tata, Jun Rao

Spinnaker is an experimental datastore that is designed to run on a large cluster of commodity servers in a single datacenter. It features key-based range partitioning, 3-way replication, and a transactional get-put API with the option to choose eith

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::0db3ca23737edbc3b7f8763c9b485dcc
http://arxiv.org/abs/1103.2408

Zobrazit plný text záznamu

A comparison of join algorithms for log processing in MaPreduce

Autor: Jignesh M. Patel, Spyros Blanas, Vuk Ercegovac, Eugene J. Shekita, Jun Rao, Yuanyuan Tian

Publikováno v: SIGMOD Conference

The MapReduce framework is increasingly being used to analyze large volumes of data. One important type of data analysis done with MapReduce is log processing, in which a click-stream or an event log is filtered, aggregated, or mined for patterns. As

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::529f839ce395e8fa10c96a7a5194e770
https://doi.org/10.1145/1807167.1807273

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání