Evaluating the Impact of Re-training a Lexical Disambiguation Model on Domain Adaptation of an HPSG Parser.

Autor: Hara, Tadayoshi, Miyao, Yusuke, Tsu ii, Jun-ichi
Zdroj: Trends in Parsing Technology; 2011, p257-275, 19p
Abstrakt: This chapter describes an effective approach to adapting an HPSG parser trained on the Penn Treebank to a biomedical domain. In this approach, we train probabilities of lexical entry assignments to words in a target domain and then incorporate them into the original parser. Experimental results show that this method can obtain higher parsing accuracy than previous work on domain adaptation for parsing the same data. Moreover, the results show that the combination of the proposed method and the existing method achieves parsing accuracy that is as high as that of an HPSG parser retrained from scratch, but with much lower training cost. We also evaluated our method on the Brown corpus to show the portability of our approach in another domain. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index