Four high-quality draft genome assemblies of the marine heterotrophic nanoflagellate Cafeteria roenbergensis.

Authors: Hackl T; Max Planck Institute for Medical Research, Department of Biomolecular Mechanisms, 69120, Heidelberg, Germany. thackl@mit.edu.; Massachusetts Institute of Technology, Department of Civil and Environmental Engineering, Cambridge, MA, 02139, USA. thackl@mit.edu., Martin R; Philipps-University of Marburg, Department of Mathematics & Computer Science, 35032, Marburg, Germany.; TUM Campus Straubing, Petersgasse 18, 94315, Straubing, Germany., Barenhoff K; Max Planck Institute for Medical Research, Department of Biomolecular Mechanisms, 69120, Heidelberg, Germany., Duponchel S; Max Planck Institute for Medical Research, Department of Biomolecular Mechanisms, 69120, Heidelberg, Germany., Heider D; Philipps-University of Marburg, Department of Mathematics & Computer Science, 35032, Marburg, Germany., Fischer MG; Max Planck Institute for Medical Research, Department of Biomolecular Mechanisms, 69120, Heidelberg, Germany. mfischer@mr.mpg.de.
Language: English
Source: Scientific data [Sci Data] 2020 Jan 21; Vol. 7 (1), pp. 29. Date of Electronic Publication: 2020 Jan 21.
DOI: 10.1038/s41597-020-0363-4
Abstract: The heterotrophic stramenopile Cafeteria roenbergensis is a globally distributed marine bacterivorous protist. This unicellular flagellate is host to the giant DNA virus CroV and the virophage mavirus. We sequenced the genomes of four cultured C. roenbergensis strains and generated 23.53 Gb of Illumina MiSeq data (99-282× coverage per strain) and 5.09 Gb of PacBio RSII data (13-45× coverage). Using the Canu assembler and customized curation procedures, we obtained high-quality draft genome assemblies with a total length of 34-36 Mbp per strain and contig N50 lengths of 148 kbp to 464 kbp. The C. roenbergensis genome has a GC content of ~70%, a repeat content of ~28%, and is predicted to contain approximately 7857-8483 protein-coding genes based on a combination of de novo, homology-based and transcriptome-supported annotation. These first high-quality genome assemblies of a bicosoecid fill an important gap in sequenced stramenopile representatives and enable a more detailed evolutionary analysis of heterotrophic protists.
Database: MEDLINE