Showing 1 - 10 of 12 for search: '"Martin Puttkammer"'
Published in:
Routledge Encyclopedia of Translation Technology ISBN: 9781003168348
External link:
https://explore.openaire.eu/search/publication?articleId=doi_________::ba90787911ae081e74435cd321acf6d8
https://doi.org/10.4324/9781003168348-23
Author:
Febe de Wet, Andiswa Bukula, Willem Karsten, Martin Puttkammer, Erwin Schillack, Roné Wierenga, Roald Eiselen
Published in:
Journal of the Digital Humanities Association of Southern Africa (DHASA). 4
Despite many attempts to address the situation, South Africa's official languages remain under-resourced in terms of the text and speech data required to implement state-of-the-art language technology. To ensure that no language is left behind, resou
Author:
Martin Puttkammer, Jakobus S. Du Toit
Published in:
Journal of the Digital Humanities Association of Southern Africa (DHASA). 3
Morphological analysis involves investigating the syntactic class of a word but can also extend to the decomposition and syntactic analysis of its underlying morpheme composition. This is especially relevant to languages with an agglutinative writing
Development of linguistically annotated parallel language resources for four South African languages
Author:
Tanja Gaustad, Martin Puttkammer
Published in:
Journal of the Digital Humanities Association of Southern Africa (DHASA). 3
For this project, we collected and annotated data to develop language resources for the four official South African Nguni languages written with a conjunctive orthography. The data for these four languages is parallel to allow for comparative (comput
Author:
Martin Puttkammer, Cindy A. McKellar
Published in:
Data in Brief, Vol 29 (2020)
This data article describes the Autshumato machine translation evaluation set. The evaluation set contains data that can be used to evaluate machine translation systems between any of the 11 official South African languages. The dataset is parallel w
Published in:
Information, Vol 12, Iss 12, p 520 (2021)
The creation of linguistic resources is crucial to the continued growth of research and development efforts in the field of natural language processing, especially for resource-scarce languages. In this paper, we describe the curation and annotation
Author:
Melinda Loubser, Martin Puttkammer
Published in:
Information, Vol 11, Iss 1, p 41 (2020)
In this paper, the viability of neural network implementations of core technologies (the focus of this paper is on text technologies) for 10 resource-scarce South African languages is evaluated. Neural networks are increasingly being used in place of
Published in:
ACL (4)
In this paper, we present a project where existing text-based core technologies were ported to Java-based web services from various architectures. These technologies were developed over a period of eight years through various government funded projec
Author:
Martin Puttkammer, Justin Hocking
Published in:
2016 Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech).
Optical Character Recognition (OCR) is an essential technology in the digitisation of printed media. Many OCR engines are language-specific and available for common languages such as English and other European languages, but less so for smaller langu
Published in:
Literator, Vol 29, Iss 1, Pp 21-42 (2008)
The development of a hyphenator and compound analyser for Afrikaans
The development of two core-technologies for Afrikaans, viz. a hyphenator and a compound analyser is described in this article. As no annotated Afrikaans data existed prior to this p