Zobrazeno 1 - 10
of 727
pro vyhledávání: '"lemmatization"'
Autor:
Hamest Tamrazyan, Gayane Hovhannisyan
Publikováno v:
Heritage, Vol 7, Iss 5, Pp 2296-2312 (2024)
In the face of geopolitical threats in Artsakh, the preservation of Armenia’s epigraphic heritage has become a mission of both historical and cultural urgency. This project delves deep into Armenian inscriptions, employing advanced digital tools an
Externí odkaz:
https://doaj.org/article/925999876a8c42fbb5459b90147b49f8
Publikováno v:
Časopis pro Moderní Filologii, Vol 105, Iss 1, Pp 121-140 (2023)
The objective of the paper is to describe the principles for building the onemillionword DIA1900 Corpus consisting of Czech texts published between 1851 and 1900, designed to be both balanced and representative. There are two main goals determining t
Externí odkaz:
https://doaj.org/article/f51d195b91c84c42966d4fd6ddc03d73
Publikováno v:
Časopis pro moderní filologii / Journal for Modern Philology. 105(1):121-140
Externí odkaz:
https://www.ceeol.com/search/article-detail?id=1147778
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Autor:
Roberto Torre Alonso
Publikováno v:
Revista de Lingüística y Lenguas Aplicadas, Vol 17, Pp 143-161 (2022)
The grammatical description of Old English lacks complete and systematic lemmatization, which hinders Natural Language Processing studies in this language, as they strongly rely on the existence of large, annotated corpora. Moreover, the inflectional
Externí odkaz:
https://doaj.org/article/4dada0161c944e93a639a897ccd5662d
Autor:
Křivan, Jan, Šindlerová, Jana
Publikováno v:
Slovo a slovesnost. 83(2):122-145
Externí odkaz:
https://www.ceeol.com/search/article-detail?id=1040741
Publikováno v:
Journal of Open Humanities Data, Vol 9, Pp 28-28 (2023)
The dataset contains a list of 215,102 Latin dictionary forms (known as canonical forms or lemmas). The dataset is a set of 1,699,687 Resource Description Framework (RDF) triples that describe, using a series of Web Ontology Language (OWL) ontologies
Externí odkaz:
https://doaj.org/article/9be3dc81f98444179d5440b5a6aee87e
Publikováno v:
SEEU Review, Vol 16, Iss 2, Pp 3-16 (2021)
An important element of Natural Language Processing is parts of speech tagging. With fine-grained word-class annotations, the word forms in a text can be enhanced and can also be used in downstream processes, such as dependency parsing. The improved
Externí odkaz:
https://doaj.org/article/e5fd67164f8e4bd58bd70ed0a98d6f08
Autor:
Roberto Torre Alonso
Publikováno v:
Journal of English Studies, Vol 20 (2022)
This article presents ALOEV3, a lemmatizer based on Morphological Generation that allows for the type-based automatic lemmatization of Old English Class III strong verbs beginning with the letters L–Y. The lemmatizer operates on the basis of the in
Externí odkaz:
https://doaj.org/article/4debec32199448fe91deb614737e9c6b