Revision of Part-of-Speech Tagging in Stockholm Umeå Corpus 2.0
Autor: | Forsbom, Eva, Wilhelmsson, Kenneth |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2010 |
Předmět: |
General Language Studies and Linguistics
Studier av enskilda språk Jämförande språkvetenskap och allmän lingvistik ordklasstaggning InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL corpus linguistics korpuslingvistik språkteknologi Språkteknologi informationsvetenskap Specific Languages computational linguistics TheoryofComputation_MATHEMATICALLOGICANDFORMALLANGUAGES datalingvistik stockholm umeå corpus 2.0 language technology part-of-speech tagging |
Popis: | Many parsers use a part-of-speech tagger as a first step in parsing. The accuracy of the tagger naturally affects the performance of the parser. In this experiment, we revise 1500+ proposed errors in SUC 2.0 that were mainly found during work with schema parsing, and evaluate tagger instances trained on the revised corpus. The revisions turned out to be beneficial also for the taggers. Samarbete med Eva Forsbom, Uppsala universitet |
Databáze: | OpenAIRE |
Externí odkaz: |