A comparative study on using unsupervised learning based data analysis techniques for breast cancer detection

Autor: Vlad-Ioan Tomescu, Gabriela Czibula, Stefan Nitica
Rok vydání: 2020
Předmět:
Zdroj: SACI
DOI: 10.1109/saci49304.2020.9118783
Popis: As stated by the World Health Organisation, breast cancer is the most frequent form of cancer among women, being responsible for 15% of all cancer-related deaths in this group. A lot of research has been carried out, so far, in using various machine learning models for breast cancer prediction, ranging from conventional classifiers to deep learning techniques. Three unsupervised learning models (t-Distributed Stochastic Neighbor Embedding, autoencoders and self-organizing maps) are comparatively analysed in this paper with the aim of unsupervisedly detecting the classes of benign and malignant instances. Experiments performed on data sets previously used in the literature for breast cancer detection reveal a good performance of the proposed unsupervised learning models. The best performance was obtained using autoencoders, which provided values higher than 0.935 for the area under the ROC curve evaluation measure.
Databáze: OpenAIRE