Improving Skin-Disease Classification Based on Customized Loss Function Combined With Balanced Mini-Batch Logic and Real-Time Image Augmentation

Autor: Van-Dung Hoang, Cong-Thanh Tran, Antoine Doucet, Tri-Cong Pham, Chi-Mai Luong
Přispěvatelé: Thuy Loi University - TLU (VIETNAM), ICTLab, University of sciences and technologies of hanoi (USTH), FPT Software, Laboratoire Informatique, Image et Interaction - EA 2118 (L3I), Université de La Rochelle (ULR), Institute of Information Technology (IOIT), Institute of Information Technology, Ton Duc Thang University [Hô-Chi-Minh-City]
Jazyk: angličtina
Rok vydání: 2020
Předmět:
General Computer Science
Computer science
Early detection
02 engineering and technology
Skin disease
01 natural sciences
[INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL]
Image (mathematics)
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
[INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]
0202 electrical engineering
electronic engineering
information engineering

medicine
[INFO.INFO-DL]Computer Science [cs]/Digital Libraries [cs.DL]
hybrid method
General Materials Science
[INFO.INFO-HC]Computer Science [cs]/Human-Computer Interaction [cs.HC]
ComputingMilieux_MISCELLANEOUS
business.industry
Deep learning
010401 analytical chemistry
General Engineering
Disease classification
020206 networking & telecommunications
Pattern recognition
Function (mathematics)
medicine.disease
loss function
3. Good health
0104 chemical sciences
[INFO.INFO-TT]Computer Science [cs]/Document and Text Processing
deep neural networks
[INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR]
imbalanced dataset
lcsh:Electrical engineering. Electronics. Nuclear engineering
Artificial intelligence
Skin cancer
balanced mini-batch logic
business
lcsh:TK1-9971
Zdroj: IEEE Access
IEEE Access, IEEE, 2020, 8, pp.150725-150737. ⟨10.1109/ACCESS.2020.3016653⟩
IEEE Access, Vol 8, Pp 150725-150737 (2020)
ISSN: 2169-3536
DOI: 10.1109/ACCESS.2020.3016653⟩
Popis: Skin cancer is one of the most common cancers in the world. However, the disease is curable if detected in the beginning stage. Early detection of malignant lesions through accurate techniques and innovative technologies has a significant impact on reducing skin cancer mortality rates. Recently, artificial intelligence has come to the forefront to facilitate skin cancer diagnosis based on medical images. Many deep learning models have been studied and developed, but the imbalance of performance among classes in the multi-class classification is still a challenging problem. This study proposes a hybrid method for handling class imbalance of skin-disease classification. This method combines the data level method of balanced mini-batch logic followed by real-time image augmentation with the algorithm level method of designing new loss function. The training dataset includes 24,530 dermoscopic images of seven skin disease categories, which is by far the largest dataset of skin cancer. The performance metrics of six proposed methods are evaluated on a test dataset of 2,453 images. Our proposed EfficientNetB4-CLF model achieves the highest accuracy of 89.97% and also the highest mean recall of 86.13% with the smallest recalls' standard deviations of 7.60%. Compared to the original methods, our proposed solution not only surpasses 4.65% (86.13% vs 81.48%) of mean recalls but also reduces 4.24% of the recalls' standard deviations (from ±11.84% to ±7.60%). This result indicates that our hybrid method is highly effective in training the Deep CNN network on the skin-disease imbalanced dataset. It addresses the problem of slow learning of the minority classes in the networks by combining the data level method of balanced mini-batch logic followed by the real-time image augmentation with the algorithm level method of the newly designed loss function.
Databáze: OpenAIRE