Quadratic Programming Feature Selection

Autor: Irene Rodriguez Lujan, Huerta, R., Elkan, C., Cruz, C. S.
Přispěvatelé: UAM. Departamento de Ingeniería Informática, Aprendizaje Automático (ING EPS-001)
Jazyk: angličtina
Rok vydání: 2010
Předmět:
Zdroj: Scopus-Elsevier
Biblos-e Archivo. Repositorio Institucional de la UAM
instname
Popis: Identifying a subset of features that preserves classification accuracy is a problem of growing importance, because of the increasing size and dimensionality of real-world data sets. We propose a new feature selection method, named Quadratic Programming Feature Selection (QPFS), that reduces the task to a quadratic optimization problem. In order to limit the computational complexity of solving the optimization problem, QPFS uses the Nystr¨om method for approximate matrix diagonalization. QPFS is thus capable of dealing with very large data sets, for which the use of other methods is computationally expensive. In experiments with small and medium data sets, the QPFS method leads to classification accuracy similar to that of other successful techniques. For large data sets, QPFS is superior in terms of computational efficiency.
I.R.-L. is supported by an FPU grant from Universidad Autónoma de Madrid, and partially supported by the Universidad Autónoma de Madrid-IIC Chair. R.H. acknowledges partial support by ONR N00014-07-1-0741
Databáze: OpenAIRE