Zobrazeno 1 - 5
of 5
pro vyhledávání: '"Martin Stluka"'
Publikováno v:
Časopis pro Moderní Filologii, Vol 105, Iss 1, Pp 121-140 (2023)
The objective of the paper is to describe the principles for building the onemillionword DIA1900 Corpus consisting of Czech texts published between 1851 and 1900, designed to be both balanced and representative. There are two main goals determining t
Externí odkaz:
https://doaj.org/article/f51d195b91c84c42966d4fd6ddc03d73
Publikováno v:
Časopis pro Moderní Filologii, Vol 101, Iss 1, Pp 92-98 (2019)
The paper describes the principles and structure of the one-million-word DIA1900 Corpus built at the Institute of the Czech National Corpus (CNC) in Prague, focused on the language of Czech texts published in the years 1851 to 1900. The DIA1900, plan
Externí odkaz:
https://doaj.org/article/d5f16906538b45d4a9b1cf68af54b113
Publikováno v:
Časopis pro Moderní Filologii, Vol 101, Iss 1, Pp 92-98 (2019)
Časopis pro moderní filologii (Journal for Modern Philology) 2019(1),92-98 (2019)
Časopis pro moderní filologii (Journal for Modern Philology) 2019(1),92-98 (2019)
The paper describes the principles and structure of the one-million-word DIA1900 Corpus built at the Institute of the Czech National Corpus (CNC) in Prague, focused on the language of Czech texts published in the years 1851 to 1900. The DIA1900, plan
Autor:
Martin Stluka, Karel Kučera
Publikováno v:
Procedia - Social and Behavioral Sciences. 106:2217-2221
Integration of digital libraries into school education has been explored for almost two decades now, with a primary focus on fully searchable contemporary materials helping to shift the emphasis in science classes from instruction and memorization to
Autor:
Martin Stluka, Karel Kučera
Publikováno v:
DATeCH
The paper describes the processing of linguistic data obtained through OCR, namely their use for the construction of dictionary databases and subsequent lemmatization. The process is demonstrated on the Czech prints from the 19th century.