Autor: |
Muthuramalingam Sivakumar, Sudhaman Parthasarathy, Thiyagarajan Padmapriya |
Jazyk: |
angličtina |
Rok vydání: |
2024 |
Předmět: |
|
Zdroj: |
PeerJ Computer Science, Vol 10, p e2245 (2024) |
Druh dokumentu: |
article |
ISSN: |
2376-5992 |
DOI: |
10.7717/peerj-cs.2245 |
Popis: |
Artificial intelligence (AI) and machine learning (ML) aim to mimic human intelligence and enhance decision making processes across various fields. A key performance determinant in a ML model is the ratio between the training and testing dataset. This research investigates the impact of varying train-test split ratios on machine learning model performance and generalization capabilities using the BraTS 2013 dataset. Logistic regression, random forest, k nearest neighbors, and support vector machines were trained with split ratios ranging from 60:40 to 95:05. Findings reveal significant variations in accuracies across these ratios, emphasizing the critical need to strike a balance to avoid overfitting or underfitting. The study underscores the importance of selecting an optimal train-test split ratio that considers tradeoffs such as model performance metrics, statistical measures, and resource constraints. Ultimately, these insights contribute to a deeper understanding of how ratio selection impacts the effectiveness and reliability of machine learning applications across diverse fields. |
Databáze: |
Directory of Open Access Journals |
Externí odkaz: |
|