Distant Rhythm: Automatic Enjambment Detection on Four Centuries of Spanish Sonnets
Authors: | Pablo Ruiz Fabo, Clara Martínez Cantón, Thierry Poibeau |
---|---|
Contributors: | Lattice - Langues, Textes, Traitements informatiques, Cognition - UMR 8094 (Lattice); Département Littératures et langage - ENS Paris (LILA); École normale supérieure - Paris (ENS Paris); Université Paris sciences et lettres (PSL); Centre National de la Recherche Scientifique (CNRS); Université Sorbonne Paris Cité (USPC); Université Sorbonne Nouvelle - Paris 3; Poibeau, Thierry |
Language: | English |
Publication year: | 2017 |
Subject: |
[SHS.LITT] Humanities and Social Sciences/Literature
[INFO.INFO-TT] Computer Science [cs]/Document and Text Processing; Literature; [SCCO.LING] Cognitive science/Linguistics; [SHS.LANGUE] Humanities and Social Sciences/Linguistics |
Source: | Digital Humanities 2017, Aug 2017, Montreal, Canada; Digital Humanities 2017. Conference Abstracts; ZENODO; HAL |
Description: | Enjambment takes place when a syntactic unit is broken up across two lines of poetry, giving rise to different stylistic effects. In Spanish literary studies, detailed case studies of the phenomenon exist, but they are based on single authors. A larger-scale study spanning hundreds of major and minor authors across several centuries has not been available so far. To address that need, we have developed software based on Natural Language Processing (NLP) that automatically identifies enjambment (and its type) in Spanish. To evaluate the system, we manually annotated two reference corpora (one diachronic, one from the 20th century). Results are satisfactory for the system's first version, with F1 varying depending on period and enjambment type. To apply the tool to a scholarly corpus, we created a diachronic corpus from public HTML sources, covering four centuries of sonnets (3750 poems). We applied the tool to analyze the occurrence of enjambment across stanzaic boundaries in different periods. |
Database: | OpenAIRE |