Showing 1 - 6 of 6 for search: '"Martin J. Puttkammer"'
Published in:
Data in Brief, Vol 57, Iss , Pp 110898- (2024)
This data article describes a machine translation training data set for translation between English and Tshivenḓa. The data set contains parallel, aligned English–Tshivenḓa data as well as monolingual Tshivenḓa data. The data was collected fr
External link:
https://doaj.org/article/98bdb0fb8eec491e98cebe20362759d1
Published in:
Data in Brief, Vol 54, Iss , Pp 110325- (2024)
This data article presents a dataset for Siswati, a Bantu language of the Nguni group that is one of the eleven official South African languages and the official language of Eswatini (together with English). The dataset contains parallel textual data
External link:
https://doaj.org/article/c96fa087c91740859792dba75c8220f0
Author:
Tanja Gaustad, Martin J. Puttkammer
Published in:
Data in Brief, Vol 41, Iss , Pp 107994- (2022)
This data article presents a linguistically annotated data set for four official South African languages with a conjunctive orthography, namely isiNdebele, isiXhosa, isiZulu and Siswati. The data set is parallel for all four languages and can be used
External link:
https://doaj.org/article/9e281df15bde4d74ac37364f1a173e5c
Published in:
Data in Brief, Vol 29, Iss , Pp - (2020)
This data article describes the Autshumato machine translation evaluation set. The evaluation set contains data that can be used to evaluate machine translation systems between any of the 11 official South African languages. The dataset is parallel w
External link:
https://doaj.org/article/6b253a05543c4ea095d0777617096c05
Published in:
Information, Vol 12, Iss 12, p 520 (2021)
The creation of linguistic resources is crucial to the continued growth of research and development efforts in the field of natural language processing, especially for resource-scarce languages. In this paper, we describe the curation and annotation
External link:
https://doaj.org/article/602fa7260d6f4aef911a1204565abfb1
Author:
Melinda Loubser, Martin J. Puttkammer
Published in:
Information, Vol 11, Iss 1, p 41 (2020)
In this paper, the viability of neural network implementations of core technologies (the focus of this paper is on text technologies) for 10 resource-scarce South African languages is evaluated. Neural networks are increasingly being used in place of
External link:
https://doaj.org/article/800ce4afe45649ebb52a8bf6d533781d