Turkish word error detection using syllable bigram statistics

Autor:	R. Asliyan, Korhan Günel
Jazyk:	turečtina
Rok vydání:	2006
Předmět:	Turkish Computer science business.industry Speech recognition Bigram Speech synthesis Optical character recognition computer.software_genre language.human_language ComputingMethodologies_PATTERNRECOGNITION language Language model Artificial intelligence Syllable business computer Natural language Natural language processing Word (group theory)
Popis:	In this study, we have designed and implemented a system, which uses n-gram statistical language model in order to facilitate Optical Character Recognition, Speech Synthesis and Recognition systems. First, the syllables bigram frequencies are extracted from Turkish corpora. Then, the test database including the words, which are written correctly and wrongly, is created. The probability of the words appears the given text is calculated and the wrongly and, correctly written words are determined. The system finds the wrongly written words about 86.13% with the proposed approach and the correctly written words are found about 88.32%.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::bb8c614a5351d0c754d9de7d6dd6edb0 https://avesis.deu.edu.tr/publication/details/51f5e301-bdbe-4169-a735-4ec279e207c0/oai Zobrazit plný text záznamu