A COMPARISON OF CLASSIFIERS APPLIED TO THE PROBLEM OF BIOPSY IMAGES ANALYSIS
Authors: | Vladyslav Yaloveha, Nataliia Lukova-Chuiko, Andrii Podorozhniak, Daria Hlavcheva |
---|---|
Year of publication: | 2020 |
Subject: |
Information theory; medicine.diagnostic_test; business.industry; Computer science; Deep learning; General Engineering; Convolutional neural network; Pattern recognition; grid search; QA76.75-76.765; ComputingMethodologies_PATTERNRECOGNITION; machine learning; Biopsy; Hyperparameter optimization; medicine; Computer software; Artificial intelligence; Q350-390; business |
Source: | Сучасні інформаційні системи, Vol 4, Iss 2 (2020) |
ISSN: | 2522-9052 |
DOI: | 10.20998/2522-9052.2020.2.03 |
Description: | The purpose of the research is to compare classification algorithms for histopathological image analysis and to optimize their parameters to obtain better classification accuracy. The following tasks are solved in the article: preprocessing of the BreCaHAD dataset images; implementation and training of a CNN; application of the K-nearest neighbours, SVM, Random Forest, XGBoost, and perceptron algorithms to classify the features extracted by the CNN; and comparison of the results. The object of the research is the process of classifying tumor cells in microscopic biopsy images. The subject of the research is the use of ML algorithms to classify the features extracted by the CNN from an input biopsy image. The scientific novelty of the research is a comparative analysis of classifiers on the task of classifying “tumor” and “healthy” cell images from the processed BreCaHAD dataset. Among the chosen classifiers, SVM reached the highest accuracy on test data, 0.972, and is the only algorithm that outperforms the perceptron, which reaches a classification accuracy of 0.966. The K-nearest neighbours, Random Forest, and XGBoost algorithms achieved lower results. The algorithms' hyperparameters were optimized, and the results were compared with related works. The following research methods are used: the theory of deep learning, mathematical statistics, and parameter optimization. |
Database: | OpenAIRE |
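
The abstract describes classifying CNN-extracted features and tuning classifier hyperparameters by grid search. The snippet below is a minimal sketch of that idea only, not the authors' implementation: it uses synthetic Gaussian vectors as a hypothetical stand-in for CNN features, a plain K-nearest-neighbours classifier, and a grid over `k` scored on a validation split. All function names, the feature dimension, the split sizes, and the grid values are assumptions made for illustration.

```python
import random
from collections import Counter


def make_features(n_per_class, dim, seed):
    """Generate synthetic feature vectors (a hypothetical stand-in for CNN features)."""
    rng = random.Random(seed)
    data = []
    for label, center in (("healthy", 0.0), ("tumor", 2.0)):
        for _ in range(n_per_class):
            vector = [rng.gauss(center, 1.0) for _ in range(dim)]
            data.append((vector, label))
    rng.shuffle(data)
    return data


def knn_predict(train, x, k):
    """Predict the majority label among the k training vectors closest to x."""
    nearest = sorted(
        train,
        key=lambda item: sum((a - b) ** 2 for a, b in zip(item[0], x)),
    )[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def accuracy(train, samples, k):
    """Fraction of samples whose predicted label matches the true label."""
    hits = sum(knn_predict(train, x, k) == y for x, y in samples)
    return hits / len(samples)


def grid_search_k(train, val, grid):
    """Score every k in the grid on the validation split and return the best one."""
    scores = {k: accuracy(train, val, k) for k in grid}
    best_k = max(scores, key=scores.get)
    return best_k, scores


# Assumed sizes: 120 samples, 8 features, split 72 / 24 / 24.
data = make_features(n_per_class=60, dim=8, seed=0)
train, val, test = data[:72], data[72:96], data[96:]

best_k, scores = grid_search_k(train, val, grid=(1, 3, 5, 7, 9))
test_acc = accuracy(train, test, best_k)
print("validation accuracy per k:", scores)
print("selected k:", best_k, "test accuracy:", test_acc)
```

The hyperparameter is chosen on the validation split and the final accuracy is reported on a separate test split, so the reported figure is not biased by the search itself. The same selection loop would apply to the other classifiers named in the abstract, with their own parameter grids.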