Data Mining in Life Sciences: A Case Study on SAPs In-Memory Computing Engine
Autor: | Matthias Steinbrecher, Massimiliano Marcon, Miganoush Katrin Magarian, Cafer Tosun, Joos-Hendrik Boese, Gennadi Rabinovitch, Vishal Sikka |
---|---|
Rok vydání: | 2013 |
Předmět: | |
Zdroj: | Lecture Notes in Business Information Processing ISBN: 9783642398711 BIRTE |
DOI: | 10.1007/978-3-642-39872-8_2 |
Popis: | While column-oriented in-memory databases have been primarily designed to support fast OLAP queries and business intelligence applications, their analytical performance makes them a promising platform for data mining tasks found in life sciences. One such system is the HANA database, SAP’s in-memory data management solution. In this contribution, we show how HANA meets some inherent requirements of data mining in life sciences. Furthermore, we conducted a case study in the area of proteomics research. As part of this study, we implemented a proteomics analysis pipeline in HANA. We also implemented a flexible data analysis toolbox that can be used by life sciences researchers to easily design and evaluate their analysis models. |
Databáze: | OpenAIRE |
Externí odkaz: |