Unveiling epigenetic regulatory elements associated with breast cancer development.

Autor: Jardanowska-Kotuniak M; Computational Biology Group, Institute of Computer Science of the Polish Academy of Sciences, Warsaw, Poland.; Institute of Biochemistry and Biophysics of the Polish Academy of Sciences, Warsaw, Poland., Dramiński M; Computational Biology Group, Institute of Computer Science of the Polish Academy of Sciences, Warsaw, Poland., Własnowolski M; Laboratory of Bioinformatics and Computational Genomics, Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland., Łapiński M; Computational Biology Group, Institute of Computer Science of the Polish Academy of Sciences, Warsaw, Poland., Sengupta K; Laboratory of Bioinformatics and Computational Genomics, Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland., Agarwal A; Laboratory of Functional and Structural Genomics, Centre of New Technologies, University of Warsaw, Warsaw, Poland., Filip A; Computational Biology Group, Institute of Computer Science of the Polish Academy of Sciences, Warsaw, Poland., Ghosh N; Department of Computer Science and Information Technology, Institute of Technical Education and Research, Siksha O Anusandhan University, Bhubaneswar, Odisha, 751030, India., Pancaldi V; CRCT, Université de Toulouse, Inserm, CNRS, Université Toulouse III-Paul Sabatier, Centre de Recherches en Cancérologie de Toulouse, Toulouse, France., Grynberg M; Institute of Biochemistry and Biophysics of the Polish Academy of Sciences, Warsaw, Poland., Saha I; Department of Computer Science and Engineering, National Institute of Technical Teachers' Training and Research, Kolkata 700106, India., Plewczynski D; Laboratory of Bioinformatics and Computational Genomics, Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland.; Laboratory of Functional and Structural Genomics, Centre of New Technologies, University of Warsaw, Warsaw, Poland., Dąbrowski MJ; Computational Biology Group, Institute of Computer Science of the Polish Academy of Sciences, Warsaw, Poland.
Jazyk: angličtina
Zdroj: BioRxiv : the preprint server for biology [bioRxiv] 2024 Nov 15. Date of Electronic Publication: 2024 Nov 15.
DOI: 10.1101/2024.11.12.623187
Abstrakt: Breast cancer is the most common cancer in women and the 2nd most common cancer worldwide, yearly impacting over 2 million females and causing 650 thousand deaths. It has been widely studied, but its epigenetic variation is not entirely unveiled. We aimed to identify epigenetic mechanisms impacting the expression of breast cancer related genes to detect new potential biomarkers and therapeutic targets. We considered The Cancer Genome Atlas database with over 800 samples and several omics datasets such as mRNA, miRNA, DNA methylation, which we used to select 2701 features that were statistically significant to differ between cancer and control samples using the Monte Carlo Feature Selection and Interdependency Discovery algorithm, from an initial total of 417,486. Their biological impact on cancerogenesis was confirmed using: statistical analysis, natural language processing, linear and machine learning models as well as: transcription factors identification, drugs and 3D chromatin structure analyses. Classification of cancer vs control samples on the selected features returned high classification weighted Accuracy from 0.91 to 0.98 depending on feature-type: mRNA, miRNA, DNA methylation, and classification algorithm. In general, cancer samples showed lower expression of differentially expressed genes and increased β -values of differentially methylated sites. We identified mRNAs whose expression is well explained by miRNA expression and differentially methylated sites β -values. We recognized differentially methylated sites possibly affecting NRF1 and MXI1 transcription factors binding, causing a disturbance in NKAPL and PITX1 expression, respectively. Our 3D models showed more loosely packed chromatin in cancer. This study successfully points out numerous possible regulatory dependencies.
Databáze: MEDLINE