A two-step feature selection procedure for relevant markers of Squamous Cell Lung Carcinoma using different survival models

Autor: Atanu Bhattacharjee, Samudranil Basak, Pragya Kumari
Jazyk: angličtina
Rok vydání: 2023
Předmět:
Zdroj: Healthcare Analytics, Vol 3, Iss , Pp 100168- (2023)
Druh dokumentu: article
ISSN: 2772-4425
DOI: 10.1016/j.health.2023.100168
Popis: There are potentially infinite gene expression markers for Lung Squamous Cell Carcinoma. This results in a high-dimensional data with a large number of features. The selection of relevant markers for analysis is thus, of utmost importance. In our study, we have aimed to select a subset of prominent and significant features from 31918 features of gene expressions. Analysis is then performed on the selected features using the Cox Proportional Hazards Model to know how each marker affects the survival estimates of a patient. We have employed a two-step selection process to select a subset of markers. The first step is done by L1 regularized Cox PH. Then the selected markers are screened a second time by running a univariate Cox PH model and checking for the p-value of each bio-marker via Wald inference (p
Databáze: Directory of Open Access Journals