Document similarity in repeatedly translated corpora

Autor: Vladimir Mateljan, Vedran Juričić, Dario Ogrizović
Jazyk: angličtina
Rok vydání: 2017
Předmět:
Zdroj: Tehnički Vjesnik, Vol 24, Iss 2, Pp 599-602 (2017)
Druh dokumentu: article
ISSN: 1330-3651
1848-6339
DOI: 10.17559/TV-20150831012553
Popis: The paper analyses the changes in relationship between documents in textual corpus that occur due to the translation into another language. Authors analyzed the similarities between documents in original corpus, in Croatian, and compared them with the corresponding documents in translated corpus, in English. The changes were analyzed using two measures, chi-square test’s P-value and new proposed measure, correction coefficient.
Databáze: Directory of Open Access Journals