MzJava: An open source library for mass spectrometry data processing
Authors: | Oliver Horlacher, Davide Alocci, Markus Müller, Frédérique Lisacek, Julien Mariethoz, Frederic Nikitin |
---|---|
Language: | English |
Publication year: | 2015 |
Subject: | Proteomics; Source code; Java; Computer science; Protein digestion; Molecular Sequence Data; Biophysics; Information Storage and Retrieval; Peptide Mapping; Biochemistry; Mass spectrometry; Market fragmentation; User-Computer Interface; Sequence Analysis, Protein; Amino Acid Sequence; Databases, Protein; Cluster analysis; Glycomics; Spark; Application programming interface; Database; Proteins; Data structure; Hadoop; Database Management Systems; Programming Languages; Mass spectrometry data format; ddc:570; ddc:025.063 |
Source: | Journal of Proteomics, Vol. 129 (2015), pp. 63-70 |
ISSN: | 1874-3919 |
Description: | Mass spectrometry (MS) is a widely used and evolving technique for the high-throughput identification of molecules in biological samples. The need for sharing and reuse of code among bioinformaticians working with MS data prompted the design and implementation of MzJava, an open-source Java Application Programming Interface (API) for MS-related data processing. MzJava provides data structures and algorithms for representing and processing mass spectra and their associated biological molecules, such as metabolites, glycans and peptides. MzJava includes functionality to perform mass calculation, peak processing (e.g., centroiding, filtering, transforming), spectrum alignment and clustering, protein digestion, and fragmentation of peptides and glycans, as well as scoring functions for spectrum-spectrum and peptide/glycan-spectrum matches. For data import and export, MzJava implements readers and writers for commonly used data formats. For many classes, support for the Hadoop MapReduce (hadoop.apache.org) and Apache Spark (spark.apache.org) cluster computing frameworks was implemented. The library has been developed applying best practices of software engineering. To ensure that MzJava contains code that is correct and easy to use, the library's API was carefully designed and thoroughly tested. MzJava is an open-source project distributed under the AGPL v3.0 licence. MzJava requires Java 1.7 or higher. Binaries, source code and documentation can be downloaded from http://mzjava.expasy.org and https://bitbucket.org/sib-pig/mzjava. This article is part of a Special Issue entitled: Computational Proteomics. |
Database: | OpenAIRE |
External link: |
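
The description names protein digestion and mass calculation among the library's functions. As a rough illustration of what those two steps involve, the sketch below uses only the plain JDK; it is not MzJava's API, and the class name `DigestSketch` and its methods are hypothetical. It assumes a simplified trypsin rule (cleave after K or R unless the next residue is P, with no missed cleavages) and unmodified peptides, and computes monoisotopic masses from standard residue masses.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Illustrative only: plain JDK code, NOT the MzJava API.
final class DigestSketch {

    // Monoisotopic residue masses in Da for the 20 standard amino acids.
    private static final Map<Character, Double> RESIDUE_MASS = new HashMap<Character, Double>();
    static {
        String residues = "GASPVTCLINDQKEMHFRYW";
        double[] masses = {
            57.02146, 71.03711, 87.03203, 97.05276, 99.06841,
            101.04768, 103.00919, 113.08406, 113.08406, 114.04293,
            115.02694, 128.05858, 128.09496, 129.04259, 131.04049,
            137.05891, 147.06841, 156.10111, 163.06333, 186.07931
        };
        for (int i = 0; i < residues.length(); i++) {
            RESIDUE_MASS.put(residues.charAt(i), masses[i]);
        }
    }

    static final double WATER = 18.010565;  // H2O added for the peptide termini
    static final double PROTON = 1.007276;  // mass of one charge carrier

    // Simplified trypsin digestion: cleave after K or R unless followed by P.
    static List<String> digestTrypsin(String protein) {
        List<String> peptides = new ArrayList<String>();
        int start = 0;
        for (int i = 0; i < protein.length(); i++) {
            char c = protein.charAt(i);
            boolean atEnd = i == protein.length() - 1;
            boolean cleavable = (c == 'K' || c == 'R')
                    && (atEnd || protein.charAt(i + 1) != 'P');
            if (cleavable || atEnd) {
                peptides.add(protein.substring(start, i + 1));
                start = i + 1;
            }
        }
        return peptides;
    }

    // Neutral monoisotopic mass: sum of residue masses plus one water.
    static double monoisotopicMass(String peptide) {
        double mass = WATER;
        for (int i = 0; i < peptide.length(); i++) {
            Double residue = RESIDUE_MASS.get(peptide.charAt(i));
            if (residue == null) {
                throw new IllegalArgumentException("Unknown residue: " + peptide.charAt(i));
            }
            mass += residue;
        }
        return mass;
    }

    // m/z of the peptide ion carrying `charge` protons.
    static double mz(String peptide, int charge) {
        if (charge <= 0) {
            throw new IllegalArgumentException("Charge must be positive");
        }
        return (monoisotopicMass(peptide) + charge * PROTON) / charge;
    }

    public static void main(String[] args) {
        for (String peptide : digestTrypsin("MKRPAGK")) {
            System.out.printf("%s  M=%.4f  [M+2H]2+=%.4f%n",
                    peptide, monoisotopicMass(peptide), mz(peptide, 2));
        }
    }
}
```

A full library would additionally handle missed cleavages, modifications and other proteases; the sketch deliberately omits these.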