Sigma70Pred: A highly accurate method for predicting sigma70 promoter in Escherichia coli K-12 strains

Autor: Sumeet Patiyal, Nitindeep Singh, Mohd Zartab Ali, Dhawal Singh Pundir, Gajendra P. S. Raghava
Jazyk: angličtina
Rok vydání: 2022
Předmět:
Zdroj: Frontiers in Microbiology, Vol 13 (2022)
Druh dokumentu: article
ISSN: 1664-302X
DOI: 10.3389/fmicb.2022.1042127
Popis: Sigma70 factor plays a crucial role in prokaryotes and regulates the transcription of most of the housekeeping genes. One of the major challenges is to predict the sigma70 promoter or sigma70 factor binding site with high precision. In this study, we trained and evaluate our models on a dataset consists of 741 sigma70 promoters and 1,400 non-promoters. We have generated a wide range of features around 8,000, which includes Dinucleotide Auto-Correlation, Dinucleotide Cross-Correlation, Dinucleotide Auto Cross-Correlation, Moran Auto-Correlation, Normalized Moreau-Broto Auto-Correlation, Parallel Correlation Pseudo Tri-Nucleotide Composition, etc. Our SVM based model achieved maximum accuracy 97.38% with AUROC 0.99 on training dataset, using 200 most relevant features. In order to check the robustness of the model, we have tested our model on the independent dataset made by using RegulonDB10.8, which included 1,134 sigma70 and 638 non-promoters, and able to achieve accuracy of 90.41% with AUROC of 0.95. Our model successfully predicted constitutive promoters with accuracy of 81.46% on an independent dataset. We have developed a method, Sigma70Pred, which is available as webserver and standalone packages at https://webs.iiitd.edu.in/raghava/sigma70pred/. The services are freely accessible.
Databáze: Directory of Open Access Journals