Distant Rhythm: Automatic Enjambment Detection on Four Centuries of Spanish Sonnets

Autor: Pablo Ruiz Fabo, Clara Martínez Cantón, Thierry Poibeau
Přispěvatelé: Lattice - Langues, Textes, Traitements informatiques, Cognition - UMR 8094 (Lattice), Département Littératures et langage - ENS Paris (LILA), École normale supérieure - Paris (ENS Paris), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-École normale supérieure - Paris (ENS Paris), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-Centre National de la Recherche Scientifique (CNRS)-Université Sorbonne Paris Cité (USPC)-Université Sorbonne Nouvelle - Paris 3, Département Littératures et langage (LILA), Poibeau, Thierry
Jazyk: angličtina
Rok vydání: 2017
Předmět:
Zdroj: Digital Humanities 2017
Digital Humanities 2017, Aug 2017, Montreal, Canada
Digital Humanities 2017. Conference Abstracts
ZENODO
HAL
Popis: Enjambment takes place when a syntactic unit is broken up across two lines of poetry, giving rise to different stylistic effects. In Spanish literary studies, detailed case-studies of the phenomenon based on single authors exist. However, a larger-scale study spanning hundreds of major and minor authors, across several centuries, is not available so far. Towards that need, we have developed software based on Natural Language Processing (NLP), to automatically identify enjambment (and its type) in Spanish. To evaluate the system, we manually annotated two reference corpora (one diachronic, one from the 20th century). Results are satisfactory for the system's first version, with F1 varying depending on period and enjambment type. As a scholarly corpus to apply the tool, from public HTML sources we created a diachronic corpus covering four centuries of sonnets (3750 poems). We applied the tool to analyze the occurrence of enjambment across stanzaic boundaries in different periods.
Databáze: OpenAIRE