Deep data analytics for genetic engineering of diatoms linking genotype to phenotype via machine learning
Autor: | Paulina K. Urbanowicz, Anton V. Ievlev, Alex Belianinov, Teresa J. Mathews, Olga S. Ovchinnikova, Maxim Ziatdinov, Artem A. Trofimov, Nikolay Borodinov, Alison A. Pawlicki, Shovon Mandal, Joshua K. Michener, Katherine A. Hausladen, Mark Hildebrand, Chad A. Steed, Rama K. Vasudevan |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2019 |
Předmět: |
Thalassiosira pseudonana
Machine learning computer.software_genre Genome engineering Cell wall lcsh:TA401-492 General Materials Science Gene lcsh:Computer software Artificial neural network biology business.industry Chemistry fungi biology.organism_classification Phenotype Computer Science Applications Diatom lcsh:QA76.75-76.765 Mechanics of Materials Modeling and Simulation lcsh:Materials of engineering and construction. Mechanics of materials Artificial intelligence business computer Biomineralization |
Zdroj: | npj Computational Materials, Vol 5, Iss 1, Pp 1-8 (2019) |
ISSN: | 2057-3960 |
DOI: | 10.1038/s41524-019-0202-3 |
Popis: | Genome engineering for materials synthesis is a promising avenue for manufacturing materials with unique properties under ambient conditions. Biomineralization in diatoms, unicellular algae that use silica to construct micron-scale cell walls with nanoscale features, is an attractive candidate for functional synthesis of materials for applications including photonics, sensing, filtration, and drug delivery. Therefore, controllably modifying diatom structure through targeted genetic modifications for these applications is a very promising field. In this work, we used gene knockdown in Thalassiosira pseudonana diatoms to create modified strains with changes to structural morphology and linked genotype to phenotype using supervised machine learning. An artificial neural network (NN) was developed to distinguish wild and modified diatoms based on the SEM images of frustules exhibiting phenotypic changes caused by a specific protein (Thaps3_21880), resulting in 94% detection accuracy. Class activation maps visualized physical changes that allowed the NNs to separate diatom strains, subsequently establishing a specific gene that controls pores. A further NN was created to batch process image data, automatically recognize pores, and extract pore-related parameters. Class interrelationship of the extracted paraments was visualized using a multivariate data visualization tool, called CrossVis, and allowed to directly link changes in morphological diatom phenotype of pore size and distribution with changes in the genotype. |
Databáze: | OpenAIRE |
Externí odkaz: |