Combined analysis of chromosomal instabilities and gene expression for colon cancer progression inference
Autor: | Claudia Cava* 1, Italo Zoppis* 2, Manuela Gariboldi 3, 4, Isabella Castiglioni 1, Giancarlo Mauri 2, Marco Antoniotti 2 |
---|---|
Přispěvatelé: | Cava, C, Zoppis, I, Gariboldi, M, Castiglioni, I, Mauri, G, Antoniotti, M |
Jazyk: | angličtina |
Rok vydání: | 2014 |
Předmět: |
Support vector machine
Colon Cancer Colorectal cancer Research Disease progression Gene Expression INF/01 - INFORMATICA Inference Cancer Health Informatics Biology Copy number alteration Bioinformatics medicine.disease Dissimilarity representation ING-INF/05 - SISTEMI DI ELABORAZIONE DELLE INFORMAZIONI Malignant transformation Cancer Progression Genetic variation Gene expression medicine Gene |
Zdroj: | Journal of Clinical Bioinformatics Journal of clinical bioinformatics 4 (2014): 2. doi:10.1186/2043-9113-4-2 info:cnr-pdr/source/autori:Claudia Cava* 1; Italo Zoppis* 2; Manuela Gariboldi 3;4; Isabella Castiglioni 1; Giancarlo Mauri 2; Marco Antoniotti 2/titolo:Combined analysis of chromosomal instabilities and gene expression for colon cancer progression inference/doi:10.1186%2F2043-9113-4-2/rivista:Journal of clinical bioinformatics/anno:2014/pagina_da:2/pagina_a:/intervallo_pagine:2/volume:4 |
ISSN: | 2043-9113 |
DOI: | 10.1186/2043-9113-4-2 |
Popis: | Background: Copy number alterations (CNAs) represent an important component of genetic variations. Such alterations are related with certain type of cancer including those of the pancreas, colon, and breast, among others. CNAs have been used as biomarkers for cancer prognosis in multiple studies, but few works report on the relation of CNAs with the disease progression. Moreover, most studies do not consider the following two important issues. (I) The identification of CNAs in genes which are responsible for expression regulation is fundamental in order to define genetic events leading to malignant transformation and progression. (II) Most real domains are best described by structured data where instances of multiple types are related to each other in complex ways. Results: Our main interest is to check whether the colorectal cancer (CRC) progression inference benefits when considering both (I) the expression levels of genes with CNAs, and (II) relationships (i.e. dissimilarities) between patients due to expression level differences of the altered genes. We first evaluate the accuracy performance of a state-of-the-art inference method (support vector machine) when subjects are represented only through sets of available attribute values (i.e. gene expression level). Then we check whether the inference accuracy improves, when explicitly exploiting the information mentioned above. Our results suggest that the CRC progression inference improves when the combined data (i.e. CNA and expression level) and the considered dissimilarity measures are applied. Conclusions: Through our approach, classification is intuitively appealing and can be conveniently obtained in the resulting dissimilarity spaces. Different public datasets from Gene Expression Omnibus (GEO) were used to validate the results. |
Databáze: | OpenAIRE |
Externí odkaz: |