Unifying dimensions in coherence relations: How various annotation frameworks are related
Author: | Jet Hoek, Ted Sanders, Vera Demberg, Jacqueline Evers-Vermeul, Merel Scholman, Sandrine Zufferey, Fatemeh Torabi Asr |
---|---|
Year of publication: | 2018 |
Subject: |
060201 languages & linguistics; Linguistics and Language; business.industry; 05 social sciences; 410 Linguistics; Applied linguistics; 06 humanities and the arts; Discourse connectives; computer.software_genre; 050105 experimental psychology; Language and Linguistics; Annotation; 0602 languages and literature; 440 French & related languages; 0501 psychology and cognitive sciences; Artificial intelligence; business; Psychology; computer; Natural language processing; Coherence (linguistics) |
Source: | Sanders, Ted; Demberg, Vera; Hoek, Jet; Scholman, Merel; Asr, Fatemeh; Zufferey, Sandrine; Evers-Vermeul, Jacqueline (2018). Unifying dimensions in discourse relations. How various annotation frameworks are related. Corpus Linguistics and Linguistic Theory, 17(1), pp. 1-71. De Gruyter 10.1515/cllt-2016-0078 |
ISSN: | 1613-7035 1613-7027 |
Description: | In this paper, we show how three widely used and seemingly different discourse annotation frameworks – Penn Discourse Treebank (PDTB), Rhetorical Structure Theory (RST), and Segmented Discourse Representation Theory (SDRT) – can be related by using a set of unifying dimensions. These dimensions are taken from the Cognitive approach to Coherence Relations and combined with more fine-grained additional features from the frameworks themselves to yield a posited set of dimensions that can successfully map the three frameworks onto one another. The resulting interface will allow researchers to find identical or at least closely related relations within sets of annotated corpora, even if they are annotated within different frameworks. Furthermore, we tested our unified dimension (UniDim) approach by comparing PDTB and RST annotations of identical newspaper texts and converting their original end label annotations of relations into the accompanying values per dimension. Subsequently, rates of overlap in the attributed values per dimension were analyzed. Results indicate that the proposed dimensions indeed create an interface that makes existing annotation systems “talk to each other.” |
Database: | OpenAIRE |
External link: |