Computing Platforms for Big Biological Data Analytics: Perspectives and Challenges

Autor: Zekun Yin, Haidong Lan, Guangming Tan, Mian Lu, Athanasios V. Vasilakos, Weiguo Liu
Jazyk: angličtina
Rok vydání: 2017
Předmět:
Zdroj: Computational and Structural Biotechnology Journal, Vol 15, Iss , Pp 403-411 (2017)
Druh dokumentu: article
ISSN: 2001-0370
DOI: 10.1016/j.csbj.2017.07.004
Popis: The last decade has witnessed an explosion in the amount of available biological sequence data, due to the rapid progress of high-throughput sequencing projects. However, the biological data amount is becoming so great that traditional data analysis platforms and methods can no longer meet the need to rapidly perform data analysis tasks in life sciences. As a result, both biologists and computer scientists are facing the challenge of gaining a profound insight into the deepest biological functions from big biological data. This in turn requires massive computational resources. Therefore, high performance computing (HPC) platforms are highly needed as well as efficient and scalable algorithms that can take advantage of these platforms. In this paper, we survey the state-of-the-art HPC platforms for big biological data analytics. We first list the characteristics of big biological data and popular computing platforms. Then we provide a taxonomy of different biological data analysis applications and a survey of the way they have been mapped onto various computing platforms. After that, we present a case study to compare the efficiency of different computing platforms for handling the classical biological sequence alignment problem. At last we discuss the open issues in big biological data analytics. Keywords: Computational biology applications, Computing platforms, Big biological data, NGS, GPU, Intel MIC
Databáze: Directory of Open Access Journals