Parameter Learning for Statistical Machine Translation Using CMA-ES
Autor: | Huy-Quang Nguyen, Hoai-Xuan Nguyen, Viet-Hong Tran, Anh-Tuan Pham, Vinh-Van Nguyen |
---|---|
Rok vydání: | 2015 |
Předmět: |
Machine translation
Active learning (machine learning) Computer science business.industry Online machine learning Word error rate Machine learning computer.software_genre Example-based machine translation Computational learning theory Artificial intelligence business computer Metaheuristic Global optimization |
Zdroj: | Advances in Intelligent Systems and Computing ISBN: 9783319116792 KSE |
DOI: | 10.1007/978-3-319-11680-8_34 |
Popis: | Minimum error rate training (MERT) is probably still the most widely used parameter learning algorithm in statistical machine translation [1] (SMT). However, it does not support the use of large number of learning features (e.g. 30 features or more). Moreover, acting on parameter space, MERT is only a local optimization algorithm. In this paper, we investigate for the first time the use of metaheuristics and global optimization techniques for the problem of learning parameters in SMT. In particular, We replace MERT with the well-known meta-heuristics for global optimization called CovarianceMatrixAdaptation Evolution Strategy (CMAES) [2]. We test the effectiveness of CMA-ES by conducting SMT experiments on an English-Vietnamese corpus. The results show that the improved SMT system using CMA-ES achieved superior BLEU scores compared to the baseline SMT system using MERT both on the dev and test data sets. |
Databáze: | OpenAIRE |
Externí odkaz: |