Monte Carlo modelling of confidence intervals in translation quality evaluation (TQE) and post-editing dstance (PED) measurement

Autor: Alekseeva, Alexandra, Gladkoff, Serge, Sorokina, Irina, Han, Lifeng
Jazyk: angličtina
Rok vydání: 2021
Předmět:
Zdroj: Alekseeva, Alexandra ORCID: 0000-0002-7990-4592 , Gladkoff, Serge, Sorokina, Irina and Han, Lifeng ORCID: 0000-0002-3221-2185 (2021) Monte Carlo modelling of confidence intervals in translation quality evaluation (TQE) and post-editing dstance (PED) measurement. In: Metrics 2021: Workshop on Informetric and Scientometric Research (SIG-MET), 23-24 Oct 2021, Online.
Popis: From both human translators (HT) and machine translation (MT) researchers' point of view, translation quality evaluation (TQE) is an essential task. This is especially the case, when language service providers (LSPs) face huge amount of request frequently from their clients and users to acquire high-quality translations. While automatic translation quality assessment (TQA) metrics and quality estimation (QE) tools are widely available and easy to access, human assessment from professional translators (HAP) are often chosen as the golden standard \cite{han-etal-2021-TQA}. One challenge that comes to this point is this: \textit{to avoid the overall text quality checking from both cost and efficiency perspectives, how to choose the confidence sample size of the translated text, so as to properly estimate the overall text or document translation quality}? This work carries out such an motivated research to correctly estimate the confidence intervals \cite{Brown_etal2001Interval} regarding the sample size of translated text, e.g. the amount of words or sentences, that needs to be taken into account for confident evaluation of overall translation quality. The methodology we applied for this work is from Bernoulli Statistical Distribution Modelling (BSDM) and Monte Carlo Sampling Analysis (MCSA).
Databáze: OpenAIRE