Adaptive variable-weighted support vector machine as optimized by particle swarm optimization algorithm with application of QSAR studies
Autor: | Ru-Qin Yu, Hai-Long Wu, Jian-Hui Wen, Guo-Li Shen, Jian-Hui Jiang, Li-Juan Tang, Ke-Jun Zhong |
---|---|
Rok vydání: | 2010 |
Předmět: |
Quantitative structure–activity relationship
Sulfonamides Chemistry Process (computing) Particle swarm optimization Quantitative Structure-Activity Relationship Feature selection Carbonic Anhydrase II Analytical Chemistry Weighting Support vector machine Maleimides Variable (computer science) Glycogen Synthase Kinase 3 Artificial Intelligence Animals Humans Carbonic Anhydrase Inhibitors Algorithm Protein Kinase Inhibitors Selection (genetic algorithm) Algorithms |
Zdroj: | Talanta. 84(1) |
ISSN: | 1873-3573 |
Popis: | Representing a compound by a numerous structural descriptors becomes common in quantitative structure-activity relationship (QSAR) studies. As every descriptor carries molecular structure information more or less, it seems more advisable to investigate all the possible descriptor vectors rather than traditional variable selection when building a QSAR model. Based on particle swarm optimization (PSO) algorithm, a more flexible descriptor selection and model construction method variable-weighted support vector machine (VW-SVM) is proposed. The new strategy adopted in this paper is to weight all structural descriptors with continuous non-negative values rather than removing or reserving any ones arbitrarily. The manner of invoking PSO to seek non-negative weights of variables can be regarded as a process of searching optimized rescaling for every molecular structural descriptor. Moreover, PSO is employed to search the optimal parameters of VW-SVM model besides variable weights, enables the construction of a rational and adaptive parameter-free QSAR model according to the performance of the total model. Results obtained by investigating glycogen synthase kinase-3α inhibitors and carbonic anhydrase II inhibitors indicate VW-SVM can hold more useful structure information of compounds than other methods as optimally weighting all the descriptors, consequently leading to precisely QSAR models coupled with developed performance both in training and in prediction. |
Databáze: | OpenAIRE |
Externí odkaz: |