Scipion3: A workflow engine for cryo-electron microscopy image processing and structural biology.

Autor: Conesa, Pablo, Fonseca, Yunior C., de la Morena, Jorge Jiménez, Sharov, Grigory, de la Rosa-Trevín, Jose Miguel, Cuervo, Ana, Mena, Alberto García, de Francisco, Borja Rodríguez, Hoyo, Daniel del, Herreros, David, Marchan, Daniel, Strelak, David, Fernández-Giménez, Estrella, Ramírez-Aportela, Erney, de Isidro-Gómez, Federico Pedro, Sánchez, Irene, Krieger, James, Vilas, José Luis, Cano, Laura del, Gragera, Marcos
Předmět:
Zdroj: Biological Imaging; 2023, Vol. 3, p1-18, 18p
Abstrakt: Image-processing pipelines require the design of complex workflows combining many different steps that bring the raw acquired data to a final result with biological meaning. In the image-processing domain of cryo-electron microscopy single-particle analysis (cryo-EM SPA), hundreds of steps must be performed to obtain the three-dimensional structure of a biological macromolecule by integrating data spread over thousands of micrographs containing millions of copies of allegedly the same macromolecule. The execution of such complicated workflows demands a specific tool to keep track of all these steps performed. Additionally, due to the extremely low signal-to-noise ratio (SNR), the estimation of any image parameter is heavily affected by noise resulting in a significant fraction of incorrect estimates. Although low SNR and processing millions of images by hundreds of sequential steps requiring substantial computational resources are specific to cryo-EM, these characteristics may be shared by other biological imaging domains. Here, we present Scipion, a Python generic open-source workflow engine specifically adapted for image processing. Its main characteristics are: (a) interoperability, (b) smart object model, (c) gluing operations, (d) comparison operations, (e) wide set of domain-specific operations, (f) execution in streaming, (g) smooth integration in high-performance computing environments, (h) execution with and without graphical capabilities, (i) flexible visualization, (j) user authentication and private access to private data, (k) scripting capabilities, (l) high performance, (m) traceability, (n) reproducibility, (o) self-reporting, (p) reusability, (q) extensibility, (r) software updates, and (s) non-restrictive software licensing. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index