Autor: |
Ke Wang, Ying An, Jiancun Zhou, Yuehong Long, Xianlai Chen |
Jazyk: |
angličtina |
Rok vydání: |
2023 |
Předmět: |
|
Zdroj: |
Alexandria Engineering Journal, Vol 66, Iss , Pp 993-999 (2023) |
Druh dokumentu: |
article |
ISSN: |
1110-0168 |
DOI: |
10.1016/j.aej.2022.10.069 |
Popis: |
Radiomics is characterized by high-dimension and high redundancy. The existing Lasso-based feature selection does not consider features that are weakly correlated with the classification results, which will have a certain impact on the quality of feature subset. A multi-level feature selection algorithm based on Lasso coefficient threshold (Coe-Thr-Lasso) was proposed. Firstly, t-test and variance were used to remove the features that had little correlation with the classification results. Secondly, the proposed algorithm was used to remove features with redundancy and weak correlation of classification results. Three machine learning algorithms, including Logistic regression (LR), random forest (RF) and support vector machine (SVM), were verify the performance of the proposed algorithm on the non-small cell lung cancer subtype classification dataset. When modeling based on the feature subset generated by the proposed method, the proposed method achieved the best classification performance compared with other publication methods. Therefore, Coe-Thr-Lasso algorithm can effectively remove redundant and irrelevant features in radiomics, so as to improve the quality of feature subset and the ability of model generalization. |
Databáze: |
Directory of Open Access Journals |
Externí odkaz: |
|