Multi-relational Data Mining for Tetratricopeptide Repeats (TPR)-Like Superfamily Members in Leishmania spp.: Acting-by-Connecting Proteins
Authors: | Michely C. Diniz, Michel T. Kamimura, Diana Oliveira, Karen T. Girão, Maria Cristina da Silva, Samara C. Silva, Laura D.G. Carneiro, Fatima De Cassia E. Oliveira, Italo M. C. Maia, Ana Carolina Landim Pacheco, Kaio M. Farias, Carla R. F. Gadelha |
---|---|
Year of publication: | 2008 |
Subject: | |
Source: | Pattern Recognition in Bioinformatics ISBN: 9783540884347 PRIB |
DOI: | 10.1007/978-3-540-88436-1_31 |
Description: | The multi-relational data mining (MRDM) approach looks for patterns that involve multiple tables from a relational database made of complex, structured objects whose normalized representation requires multiple tables. We have applied MRDM methods (relational association rule discovery and probabilistic relational models) together with hidden Markov models (HMMs) and the Viterbi algorithm (VA) to mine tetratricopeptide repeat (TPR), pentatricopeptide repeat (PPR) and half-a-TPR (HAT) motifs in the genomes of the pathogenic protozoa Leishmania. TPR is a protein-protein interaction module, and TPR-containing proteins (TPRPs) act as scaffolds for the assembly of different multiprotein complexes. Our aim is to build a comprehensive panel of the TPR-like superfamily of Leishmania. Distributed relational state representations for complex stochastic processes were applied to the identification, clustering and classification of Leishmania genes, and we were able to detect 104 putative TPRPs, 36 PPRPs and 8 HATPs, which together comprise the TPR-like superfamily. We have also compared currently available resources (Pfam, SMART, SUPERFAMILY and TPRpred) with our approach (MRDM/HMM/VA). |
Database: | OpenAIRE |
External link: |