Learning Ranking Functions For Information Retrieval Using Layered Multi-Population Genetic Programming
Autor: | Jung-Yi Lin, Jen-Yuan Yeh |
---|---|
Rok vydání: | 2017 |
Předmět: |
Information retrieval
General Computer Science business.industry Computer science InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL 05 social sciences 050301 education Genetic programming 02 engineering and technology Machine learning computer.software_genre Inductive programming Ranking (information retrieval) Okapi BM25 Ranking SVM 0202 electrical engineering electronic engineering information engineering Question answering 020201 artificial intelligence & image processing Learning to rank Artificial intelligence Document retrieval business 0503 education computer |
Zdroj: | Malaysian Journal of Computer Science. 30:27-47 |
ISSN: | 0127-9084 |
DOI: | 10.22452/mjcs.vol30no1.3 |
Popis: | Ranking plays a key role in many applications, such as document retrieval, recommendation, question answering, and machine translation. In practice, a ranking function (or model) is exploited to determine the rank-order relations between objects, with respect to a particular criterion. In this paper, a layered multipopulation genetic programming based method, known as RankMGP, is proposed to learn ranking functions for document retrieval by incorporating various types of retrieval models into a singular one with high effectiveness. RankMGP represents a potential solution (i.e., a ranking function) as an individual in a population of genetic programming and aims to directly optimize information retrieval evaluation measures in the evolution process. Overall, RankMGP consists of a set of layers and a sequential workflow running through the layers. In one layer, multiple populations evolve independently to generate a set of the best individuals. When the evolution process is completed, a new training dataset is created using the best individuals and the input training set of the layer. Then, the populations in the next layer evolve with the new training dataset. In the final layer, the best individual is obtained as the output ranking function. The proposed method is evaluated using the LETOR datasets and is found to be superior to classical information retrieval models, such as Okapi BM25. It is also statistically competitive with the state-of-the-art methods, including Ranking SVM, ListNet, AdaRank and RankBoost. |
Databáze: | OpenAIRE |
Externí odkaz: |