The Diachronic Spanish Sonnet Corpus (DISCO): TEI and Linked Open Data Encoding, Data Distribution and Metrical Findings

Autor:	Elena González-Blanco, Clara Isabel Martínez Cantón, Pablo Ruiz Fabo, Helena Bermúdez Sabel
Přispěvatelé:	Linguistique, Langues et Parole (LILPA), Université de Strasbourg (UNISTRA), Université de Lausanne (UNIL), Universidad Nacional de Educación a Distancia (UNED), UNED - Universidad Nacional de Educación a Distancia, European Project: 679528,H2020,ERC-2015-STG,POSTDATA(2016)
Jazyk:	angličtina
Rok vydání:	2021
Předmět:	Linguistics and Language History [SHS.LITT]Humanities and Social Sciences/Literature Interoperability Aucun Distribution (economics) Language and Linguistics [INFO.INFO-CL]Computer Science [cs]/Computation and Language [cs.CL] Sonnet Encoding (semiotics) [SHS.LANGUE]Humanities and Social Sciences/Linguistics Sciences de l'Homme et Société/Littératures 060201 languages & linguistics Poetry business.industry 05 social sciences 06 humanities and the arts Linked data Linguistics Computer Science Applications Metadata 0602 languages and literature Sciences de l'Homme et Société/Linguistique 0509 other social sciences 050904 information & library sciences business Period (music) Information Systems
Zdroj:	Digital Scholarship in the Humanities Digital Scholarship in the Humanities, Oxford University Press, 2021, 36 (Supplement_1), pp.i68-i80. ⟨10.1093/llc/fqaa035⟩ HAL Digital Scholarship in the Humanities, 2021, 36 (Supplement_1), pp.i68-i80. ⟨10.1093/llc/fqaa035⟩
ISSN:	2055-7671 2055-768X
DOI:	10.1093/llc/fqaa035⟩
Popis:	How has the sonnet form in Spanish evolved over the centuries? What is the distribution of metrical patterns and combinations thereof, considering diachronic, geographical, and social factors? What rhyme schemes are favoured in different periods and regions? How is enjambment distributed within the sonnet? Providing quantitative answers to such questions requires a corpus spanning several centuries, annotated for the relevant literary features and containing author metadata. The absence of appropriate digital resources to undertake a macroanalytic study of the evolution of the sonnet in Spanish led us to create the Diachronic Spanish Sonnet Corpus. This article presents how the corpus was designed for providing quantitative evidence on the evolution of sonnets in Spanish, and our findings regarding metrics and enjambment. The corpus contains 4,085 sonnets by 1,204 Spanish and Latin American authors (15th to 19th centuries), encoded in TEI, with RDFa attributes. The corpus aims at breadth, including many peripheral authors besides some major ones. Author metadata were encoded (dates, origin, gender). Scansion and enjambment were annotated automatically, with the ADSO and ANJA tools. The range of authors and periods, the use of TEI and RDFa for interoperability, and the combination of metrical and enjambment annotations goes beyond previously available digital resources. The corpus allowed us to examine the evolution of metrical patterns and their combinations after the Golden Age, complementing earlier studies. We also observed an increase in enjambment across the tercets in the 19th century, which may indicate increased variety in the discourse organization of sonnets in the period.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::d89a2fb1fe8edb3867b9bc155972b67e https://hal.archives-ouvertes.fr/hal-02661650 Zobrazit plný text záznamu