The Russian Legislative Corpus

Autor: Saveliev, Denis, Kuchakov, Ruslan
Rok vydání: 2024
Předmět:
Druh dokumentu: Working Paper
Popis: We present the comprehensive Russian primary and secondary legislation corpus covering 1991 to 2023. The corpus collects all 281,413 texts (176,523,268 tokens) of non-secret federal regulations and acts, along with their metadata. The corpus has two versions the original text with minimal preprocessing and a version prepared for linguistic analysis with morphosyntactic markup.
Comment: 7 pages, 6 figures, 1 table
Databáze: arXiv