Jane: an advanced freely available hierarchical machine translation toolkit
Autor: | Hermann Ney, Daniel Stein, David Vilar, Matthias Huck |
---|---|
Rok vydání: | 2012 |
Předmět: |
Linguistics and Language
Machine translation Syntax (programming languages) Computer science Programming language computer.software_genre Language and Linguistics Example-based machine translation Rule-based machine translation Artificial Intelligence Regression testing NIST Computer-assisted translation Language model computer Software |
Zdroj: | Machine Translation. 26:197-216 |
ISSN: | 1573-0573 0922-6567 |
Popis: | In this article we will describe the design and implementation of Jane, an efficient hierarchical phrase-based (HPB) toolkit developed at RWTH Aachen University. The system has been used by RWTH at several international evaluation campaigns, including the WMT and NIST evaluations, and is now freely available for non-commercial application. We will go through the main features of Jane, which include, among others, support for different search strategies, different language model formats, support for syntax-based enhancements to the HPB machine translation paradigm, string-to-dependency translation, extended lexicon models, different methods for minimum-error-rate training and distributed operation on a computer cluster. Special attention has been paid to the efficiency of the decoder, clean code and quality assurance through unit and regression testing. Results on current machine translation tasks are reported, which show that the system is able to obtain state-of-the-art performance. |
Databáze: | OpenAIRE |
Externí odkaz: |