Translation crowdsourcing: Creating a multilingual corpus of online educational content

Authors: Sosoni, V., Kermanidis, K. L., Stasimioti, M., Naskos, T., Takoulidou, E., van Zaanen, M., Castilho, S., Georgakopoulou, P., Kordoni, V., Egg, M.
Subject:
Source: Scopus-Elsevier
Sosoni, Vilelmini ORCID: 0000-0002-9583-4651, Kermanidis, Katia Lida ORCID: 0000-0002-3270-5078, Stasimioti, Maria ORCID: 0000-0002-3270-5078, Naskos, Thanasis, Takoulidou, Eirini, van Zaanen, Menno, Castilho, Sheila ORCID: 0000-0002-8416-6555, Georgakopoulou, Panayota ORCID: 0000-0001-9780-1813, Kordoni, Valia and Egg, Markus (2018) Translation crowdsourcing: creating a multilingual corpus of online educational content. In: 11th International Conference on Language Resources and Evaluation, 7-12 May 2018, Miyazaki, Japan. ISBN 979-10-95546-00-9
Description: The present work describes a multilingual corpus of online content in the educational domain, namely Massive Open Online Course (MOOC) material ranging from course forum text to subtitles of online video lectures, that has been developed via large-scale crowdsourcing. The English source text is manually translated into 11 European and BRIC languages using the CrowdFlower platform. Several challenges arose during the process, mainly involving the in-domain text genre, the large text volume, the idiosyncrasies of each target language, the limitations of the crowdsourcing platform, and the quality assurance and workflow issues of the crowdsourcing process. The corpus is a product of the EU-funded TraMOOC project and is used within the project to train, tune and test machine translation engines.
Database: OpenAIRE