Comparison Analysis: Large Data Classification Using PLS-DA and Decision Trees

Autor: Norashikin Nasaruddin, Kartini Kassim, Nurazlina Abdul Rashid, Amirah Hazwani Abdul Rahim
Rok vydání: 2020
Předmět:
Zdroj: Mathematics and Statistics. 8:100-105
ISSN: 2332-2144
2332-2071
Popis: Classification studies are widely applied in many areas of research. In our study, we are using classification analysis to explore approaches for tackling the classification problem for a large number of measures using partial least square discriminant analysis (PLS-DA) and decision trees (DT). The performance for both methods was compared using a sample data of breast tissues from the University of Wisconsin Hospital. A partial least square discriminant analysis (PLS-DA) and decision trees (DT) predict the diagnosis of breast tissues (M = malignant, B = benign). A total of 699 patients diagnose (458 benign and 241 malignant) are used in this study. The performance of PLS-DA and DT has been evaluated based on the misclassification error and accuracy rate. The results show PLS-DA can be considered as a good and reliable technique to be used when dealing with a large dataset for the classification task and have good prediction accuracy.
Databáze: OpenAIRE