Zobrazeno 1 - 6
of 6
pro vyhledávání: '"Johny, Cibu"'
This paper presents an open-source software library that provides a set of finite-state transducer (FST) components and corresponding utilities for manipulating the writing systems of languages that use the Perso-Arabic script. The operations include
Externí odkaz:
http://arxiv.org/abs/2301.11406
Since its original appearance in 1991, the Perso-Arabic script representation in Unicode has grown from 169 to over 440 atomic isolated characters spread over several code pages representing standard letters, various diacritics and punctuation for th
Externí odkaz:
http://arxiv.org/abs/2210.12273
Autor:
Kirov, Christo1 (AUTHOR) ckirov@google.com, Johny, Cibu1 (AUTHOR) cibu@google.com, Katanova, Anna1 (AUTHOR) akatanova@google.com, Gutkin, Alexander1 (AUTHOR) agutkin@google.com, Roark, Brian1 (AUTHOR) roark@google.com
Publikováno v:
Computational Linguistics. Jun2024, Vol. 50 Issue 2, p475-534. 60p.
Autor:
Butryna, Alena, Chu, Shan-Hui Cathy, Demirsahin, Isin, Gutkin, Alexander, Ha, Linne, He, Fei, Jansche, Martin, Johny, Cibu, Katanova, Anna, Kjartansson, Oddur, Li, Chenfang, Merkulova, Tatiana, Oo, Yin May, Pipatsrisawat, Knot, Rivera, Clara, Sarin, Supheakmungkol, de Silva, Pasindu, Sodimana, Keshan, Sproat, Richard, Wattanavekin, Theeraphol, Wibawa, Jaka Aris Eko
This paper presents an overview of a program designed to address the growing need for developing freely available speech resources for under-represented languages. At present we have released 38 datasets for building text-to-speech and automatic spee
Externí odkaz:
http://arxiv.org/abs/2010.06778
Autor:
Roark, Brian, Wolf-Sonkin, Lawrence, Kirov, Christo, Mielke, Sabrina J., Johny, Cibu, Demirsahin, Isin, Hall, Keith
This paper describes the Dakshina dataset, a new resource consisting of text in both the Latin and native scripts for 12 South Asian languages. The dataset includes, for each language: 1) native script Wikipedia text; 2) a romanization lexicon; and 3
Externí odkaz:
http://arxiv.org/abs/2007.01176
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.