Supertagging for a Statistical HPSG Parser for Spanish
Authors: | Luis Chiruzzo, Dina Wonsever |
---|---|
Year of publication: | 2015 |
Subject: | |
Source: | Statistical Language and Speech Processing ISBN: 9783319257884 SLSP |
DOI: | 10.1007/978-3-319-25789-1_3 |
Description: | We created a supertagger for Spanish that disambiguates the HPSG lexical frames of the verbs in a sentence. The supertagger uses a CRF model and achieves an accuracy of 83.58% for the verb classes on the test set. The tagset contains 92 verb classes, extracted from a Spanish HPSG-compatible annotated corpus that was created by automatically transforming the Ancora Spanish corpus. The verb tags encode the argument structure and the syntactic categories of the arguments, so they can be easily translated into HPSG lexical entries. |
Database: | OpenAIRE |
External link: |