Jane: an advanced freely available hierarchical machine translation toolkit

Autor: Hermann Ney, Daniel Stein, David Vilar, Matthias Huck
Rok vydání: 2012
Předmět:
Zdroj: Machine Translation. 26:197-216
ISSN: 1573-0573
0922-6567
Popis: In this article we will describe the design and implementation of Jane, an efficient hierarchical phrase-based (HPB) toolkit developed at RWTH Aachen University. The system has been used by RWTH at several international evaluation campaigns, including the WMT and NIST evaluations, and is now freely available for non-commercial application. We will go through the main features of Jane, which include, among others, support for different search strategies, different language model formats, support for syntax-based enhancements to the HPB machine translation paradigm, string-to-dependency translation, extended lexicon models, different methods for minimum-error-rate training and distributed operation on a computer cluster. Special attention has been paid to the efficiency of the decoder, clean code and quality assurance through unit and regression testing. Results on current machine translation tasks are reported, which show that the system is able to obtain state-of-the-art performance.
Databáze: OpenAIRE