Abstract: |
This article gives an insight into the inner workings of "Tēzaurs", a machine-readable lexicon and online dictionary of Latvian, currently containing nearly 280 000 entries from over 280 sources. The article describes how "Tēzaurs" was constructed, its purpose, and its potential uses in linguistic research. Since "Tēzaurs" is a machine-readable lexicon, different types of data can be retrieved from it (e.g., grammatical information such as the inflectional paradigm, or semantic relations). To illustrate what kind of data can be retrieved, the article contains an analysis of 700 first-conjugation verbs, which are grouped according to their present, past and infinitive stems, as well as their derivation with prefixes and their homonyms. [ABSTRACT FROM AUTHOR] |