Additional file 1 of Potentially adaptive SARS-CoV-2 mutations discovered with novel spatiotemporal and explainable AI models

Autor: Garvin, Michael R., Prates, Erica T., Pavicic, Mirko, Jones, Piet, B. Kirtley Amos, Geiger, Armin, Manesh B. Shah, Streich, Jared, Joao Gabriel Felipe Machado Gazolla, Kainer, David, Cliff, Ashley, Romero, Jonathon, Keith, Nathan, Brown, James B., Jacobson, Daniel
Rok vydání: 2020
Předmět:
DOI: 10.6084/m9.figshare.13487342.v1
Popis: Additional file 1. Supplementary Figures (Haplotype Success and Potential Adaptation Fig. S1. Codon usage frequency of the SARS-CoV-2 coronavirus, the pangolin from which it likely evolved, and based on the alternative alleles from all currently known mutations. The TTT codon is significantly greater than expected based on pangolin (p < 0.01), suggesting that there is selection for this mutation.) (Structural analysis of SARS-CoV-2 mutants – nsp2 Fig. S2. Ab initio predicted structure of the N-terminal domain of nsp2. Sites of non-conservative substitution are depicted in yellow.) (Structural analysis of SARS-CoV-2 mutants – nsp3 Fig. S3. Predicted structure of the Arg/Lys-rich nucleic acid-binding domain of nsp3 (a.a. 1089-1201). Non-conservative substitutions are depicted in green. Positively charged residues are also shown, in yellow.) (Structural analysis of SARS-CoV-2 mutants – nsp13 Fig. S4. Structure of SARS-CoV-2 nsp13 based on PDB 6jyt. Non-conservative substitutions are depicted in green. Zinc ions are represented as yellow spheres.) (Structural analysis of SARS-CoV-2 mutants – Nucleocapsid Fig. S5. Sites of proteolytic cleavage of the Ser/Arg-rich motif of SARS-CoV2 nucleocapsid, predicted by the ExPASy PeptideCutter tool. The proteases, thermolysin (Therm), Arg-C proteinase (ArgC), proteinase K (ProtK), clostripain (Clost), and trypsin (Tryps), are assigned in their putative specific site of cleavage. The site of cleavage between Ser202 and Arg203 is marked with *.) (Structural analysis of SARS-CoV-2 mutants – Nucleocapsid Fig. S6. Crystal structure of SARS-CoV-2 nucleocapsid RNA-binding domain (PDB id 6vyo). Non-conservative substitutions are depicted in orange green. Zinc ions are represented as violet spheres.) (Structural analysis of SARS-CoV-2 mutants – Spike glycoprotein Fig. S7. Probability density of residues in the receptor-binding domain of SARS-CoV-2 forming contacts with the ACE2. The cutoff distance of 4 Å between any atom in a pair of residues was used. Bars with standard deviation higher than 50% of the probability density are considered transient interactions in the simulations and not included in the plot, except for the contact of residue 483, which is included for reference.) (Concluding remarks Fig. S8. Haplotype network produced with 9,294 sequences demonstrating the same relationships with the larger sample size presented in Fig. 2 (main text).) and Supplementary Tables (Table S1. The 107 sites with more than 2 alleles (more than one mutational event). The most frequent changes are provided first (column four) followed by second or third mutations, if present. The site in the genome is based on the reference sequence NC_0045512. Table S1 (cont.)) (Table S2. Groups of conservative substitutions considered in this study. Amino acids in brackets are evaluated after structural analysis.) (Table S3. Results from iRF-LOOP. Significant loci based on their ability to predict or be predicted by other loci. Three sites (21575_T, 11083_T, and 11074_T) appear to be hypermutation sites that occur as the result of long stretches of thymines due to consecutive runs of phenylalanine, leucine, and valine residues.) (Table S4. Summary of the main mutations, possible observable characteristics they may affect, and suggested future experiments to assess that.) (Supplementary References 1. Jia Z, Yan L, Ren Z, Wu L, Wang J, Guo J, et al. Delicate structural coordination of the Severe Acute Respiratory Syndrome coronavirus Nsp13 upon ATP hydrolysis. Nucleic Acids Res. 2019;47:6538–50. 2. von Brunn A, Teepe C, Simpson JC, Pepperkok R, Friedel CC, Zimmer R, et al. Analysis of intraviral protein-protein interactions of the SARS coronavirus ORFeome. PLoS One. 2007;2:e459. 3. Neuman BW, Joseph JS, Saikatendu KS, Serrano P, Chatterjee A, Johnson MA, et al. Proteomics analysis unravels the functional repertoire of coronavirus nonstructural protein 3. J Virol. 2008;82:5279–94. 4. Jia Z, Yan L, Ren Z, Wu L, Wang J, Guo J, et al. Delicate structural coordination of the Severe Acute Respiratory Syndrome coronavirus Nsp13 upon ATP hydrolysis. Nucleic Acids Res. 2019;47:6538–50. 5. He R, Dobie F, Ballantine M, Leeson A, Li Y, Bastien N, et al. Analysis of multimerization of the SARS coronavirus nucleocapsid protein. Biochem Biophys Res Commun. 2004;316:476–83.)
Databáze: OpenAIRE