New Trends in Data Analysis: The Inside Story.

Authors: Mathur, Nishant; Bhandari, Mahesh Kumar
Subject:
Source: IUP Journal of Computer Sciences; Jan-Apr 2016, Vol. 10 Issue 1/2, p52-61, 10p
Abstract: Hadoop, an open-source framework for distributed storage developed by Apache, runs on Java. It stores large amounts of data efficiently and at low cost. SOLR, an open-source enterprise search platform from the Apache Lucene project, is among the most popular enterprise search engines; it is highly scalable and fault-tolerant, and it provides distributed search and index replication. Its major features include full-text search, faceted search, hit highlighting, real-time indexing, dynamic clustering, database integration and rich document handling. This paper briefly presents the main features of Hadoop and SOLR, and reports an analysis of a large sample of stock data covering shares, trading, etc. [ABSTRACT FROM AUTHOR]
Database: Complementary Index