Trade-off between training and testing ratio in machine learning for medical image processing

Autor:	Muthuramalingam Sivakumar, Sudhaman Parthasarathy, Thiyagarajan Padmapriya
Jazyk:	angličtina
Rok vydání:	2024
Předmět:	Medical image processing Train-test split Overfitting Underfitting Electronic computers. Computer science QA75.5-76.95
Zdroj:	PeerJ Computer Science, Vol 10, p e2245 (2024)
Druh dokumentu:	article
ISSN:	2376-5992
DOI:	10.7717/peerj-cs.2245
Popis:	Artificial intelligence (AI) and machine learning (ML) aim to mimic human intelligence and enhance decision making processes across various fields. A key performance determinant in a ML model is the ratio between the training and testing dataset. This research investigates the impact of varying train-test split ratios on machine learning model performance and generalization capabilities using the BraTS 2013 dataset. Logistic regression, random forest, k nearest neighbors, and support vector machines were trained with split ratios ranging from 60:40 to 95:05. Findings reveal significant variations in accuracies across these ratios, emphasizing the critical need to strike a balance to avoid overfitting or underfitting. The study underscores the importance of selecting an optimal train-test split ratio that considers tradeoffs such as model performance metrics, statistical measures, and resource constraints. Ultimately, these insights contribute to a deeper understanding of how ratio selection impacts the effectiveness and reliability of machine learning applications across diverse fields.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/798ede8b88d049cb85dae91af215c88f Zobrazit plný text záznamu View record in DOAJ