Supertagging for a Statistical HPSG Parser for Spanish

Authors: Luis Chiruzzo, Dina Wonsever
Publication year: 2015
Source: Statistical Language and Speech Processing (SLSP), ISBN: 9783319257884
DOI: 10.1007/978-3-319-25789-1_3
Description: We created a supertagger for the Spanish language aimed at disambiguating the HPSG lexical frames for the verbs in a sentence. The supertagger uses a CRF model and achieves an accuracy of 83.58% for the verb classes on the test set. The tagset contains 92 verb classes, extracted from a Spanish HPSG-compatible annotated corpus that was created by automatically transforming the AnCora Spanish corpus. The verb tags include information about the argument structure and the syntactic categories of the arguments, so they can be easily translated into HPSG lexical entries.
Database: OpenAIRE