Showing 1 - 10 of 4,484 results for search: '"treebanks"'
Author:
Sarveswaran, Kengatharaiyer
Published in:
Sarveswaran, K. (2024). Building Tamil Treebanks. In Proceedings of the International Conference on Tamil Computing and Information Technology (ICTCIT 2024)/23rd Tamil Internet Conference (pp. 22-32). INFITT. ISSN: 2313-4887
Treebanks are important linguistic resources, which are structured and annotated corpora with rich linguistic annotations. These resources are used in Natural Language Processing (NLP) applications, supporting linguistic analyses, and are essential f
External link:
http://arxiv.org/abs/2409.14657
Author:
A. Abeillé
Linguists and engineers in Natural Language Processing tend to use electronic corpora more and more. Most research has long been limited to raw (unannotated) texts or to tagged texts (annotated with parts of speech only), but these approaches suffer
Existing Latin treebanks draw from Latin's long written tradition, spanning 17 centuries and a variety of cultures. Recent efforts have begun to harmonize these treebanks' annotations to better train and evaluate morphological taggers. However, the h
External link:
http://arxiv.org/abs/2408.06675
Descriptive grammars are highly valuable, but writing them is time-consuming and difficult. Furthermore, while linguists typically use corpora to create them, grammar descriptions often lack quantitative data. As for formal grammars, they can be chal
External link:
http://arxiv.org/abs/2403.17534
Over the last few decades, the widespread diffusion of digital technology has increased availability of primary textual sources, radically changing the everyday life of scholars in the humanities, who are now able to access, query and process a wealt
We introduce SPUD (Semantically Perturbed Universal Dependencies), a framework for creating nonce treebanks for the multilingual Universal Dependencies (UD) corpora. SPUD data satisfies syntactic argument structure, provides syntactic annotations, an
External link:
http://arxiv.org/abs/2311.07497
Author:
Zeldes, Amir, Schneider, Nathan
Recent efforts to consolidate guidelines and treebanks in the Universal Dependencies project raise the expectation that joint training and dataset comparison is increasingly possible for high-resource languages such as English, which have multiple co
External link:
http://arxiv.org/abs/2302.00636
Author:
Huber, Patrick, Carenini, Giuseppe
Published in:
CODI 2020
Discourse parsing is an essential upstream task in Natural Language Processing with strong implications for many real-world applications. Despite its widely recognized role, most recent discourse parsers (and consequently downstream tasks) still rely
External link:
http://arxiv.org/abs/2212.06038