Text Punctuation: An Inter-annotator Agreement Study
Authors: | Michal Rott, Marek Bohac, Vojtěch Kovář |
---|---|
Year of publication: | 2017 |
Subject: |
Computer science
05 social sciences; 050209 industrial relations; Punctuation; Linguistics; Transcription (linguistics); Phenomenon; 0502 economics and business; Written language; Artificial intelligence; 050203 business & management; Natural language processing; Utterance; Spoken language |
Source: | Text, Speech, and Dialogue (TSD), ISBN: 9783319642055 |
DOI: | 10.1007/978-3-319-64206-2_14 |
Description: | Spoken language is hard to annotate accurately. One of the most ambiguous tasks is inserting punctuation marks into a transcription of spoken language. The punctuation marks used often depend on how annotators understand the content of the transcription. This understanding may differ because spoken language often lacks the clear structure inherent to written language, owing to the spontaneity of utterances or to skipping between ideas. |
Database: | OpenAIRE |