Výsledky vyhledávání - "van Esch, Daan"

Report

Multimodal Modeling For Spoken Language Identification

Autor: Bharadwaj, Shikhar, Ma, Min, Vashishth, Shikhar, Bapna, Ankur, Ganapathy, Sriram, Axelrod, Vera, Dalmia, Siddharth, Han, Wei, Zhang, Yu, van Esch, Daan, Ritchie, Sandy, Talukdar, Partha, Riesa, Jason

Spoken language identification refers to the task of automatically predicting the spoken language in a given utterance. Conventionally, it is modeled as a speech-based language identification task. Prior techniques have been constrained to a single m

Externí odkaz: http://arxiv.org/abs/2309.10567

Zobrazit plný text záznamu

Report

Large vocabulary speech recognition for languages of Africa: multilingual modeling and self-supervised learning

Autor: Ritchie, Sandy, Cheng, You-Chi, Chen, Mingqing, Mathews, Rajiv, van Esch, Daan, Li, Bo, Sim, Khe Chai

Almost none of the 2,000+ languages spoken in Africa have widely available automatic speech recognition systems, and the required data is also only available for a few languages. We have experimented with two techniques which may provide pathways to

Externí odkaz: http://arxiv.org/abs/2208.03067

Zobrazit plný text záznamu

Report

Accented Speech Recognition: Benchmarking, Pre-training, and Diverse Data

Autor: Aksënova, Alëna, Chen, Zhehuai, Chiu, Chung-Cheng, van Esch, Daan, Golik, Pavel, Han, Wei, King, Levi, Ramabhadran, Bhuvana, Rosenberg, Andrew, Schwartz, Suzan, Wang, Gary

Building inclusive speech recognition systems is a crucial step towards developing technologies that speakers of all language varieties can use. Therefore, ASR systems must work for everybody independently of the way they speak. To accomplish this go

Externí odkaz: http://arxiv.org/abs/2205.08014

Zobrazit plný text záznamu

Report

Building Machine Translation Systems for the Next Thousand Languages

In this paper we share findings from our effort to build practical machine translation (MT) systems capable of translating across over one thousand languages. We describe results in three research domains: (i) Building clean, web-mined datasets for 1

Externí odkaz: http://arxiv.org/abs/2205.03983

Zobrazit plný text záznamu

Report

XTREME-S: Evaluating Cross-lingual Speech Representations

Autor: Conneau, Alexis, Bapna, Ankur, Zhang, Yu, Ma, Min, von Platen, Patrick, Lozhkov, Anton, Cherry, Colin, Jia, Ye, Rivera, Clara, Kale, Mihir, Van Esch, Daan, Axelrod, Vera, Khanuja, Simran, Clark, Jonathan H., Firat, Orhan, Auli, Michael, Ruder, Sebastian, Riesa, Jason, Johnson, Melvin

We introduce XTREME-S, a new benchmark to evaluate universal cross-lingual speech representations in many languages. XTREME-S covers four task families: speech recognition, classification, speech-to-text translation and retrieval. Covering 102 langua

Externí odkaz: http://arxiv.org/abs/2203.10752

Zobrazit plný text záznamu

Report

Handling Compounding in Mobile Keyboard Input

Autor: Kabel, Andreas, Hall, Keith, Ouyang, Tom, Rybach, David, van Esch, Daan, Beaufays, Françoise

This paper proposes a framework to improve the typing experience of mobile users in morphologically rich languages. Smartphone keyboards typically support features such as input decoding, corrections and predictions that all rely on language models.

Externí odkaz: http://arxiv.org/abs/2201.06469

Zobrazit plný text záznamu

Report

Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets

Autor: Kreutzer, Julia, Caswell, Isaac, Wang, Lisa, Wahab, Ahsan, van Esch, Daan, Ulzii-Orshikh, Nasanbayar, Tapo, Allahsera, Subramani, Nishant, Sokolov, Artem, Sikasote, Claytone, Setyawan, Monang, Sarin, Supheakmungkol, Samb, Sokhar, Sagot, Benoît, Rivera, Clara, Rios, Annette, Papadimitriou, Isabel, Osei, Salomey, Suarez, Pedro Ortiz, Orife, Iroro, Ogueji, Kelechi, Rubungo, Andre Niyongabo, Nguyen, Toan Q., Müller, Mathias, Müller, André, Muhammad, Shamsuddeen Hassan, Muhammad, Nanda, Mnyakeni, Ayanda, Mirzakhalov, Jamshidbek, Matangira, Tapiwanashe, Leong, Colin, Lawson, Nze, Kudugunta, Sneha, Jernite, Yacine, Jenny, Mathias, Firat, Orhan, Dossou, Bonaventure F. P., Dlamini, Sakhile, de Silva, Nisansa, Ballı, Sakine Çabuk, Biderman, Stella, Battisti, Alessia, Baruwa, Ahmed, Bapna, Ankur, Baljekar, Pallavi, Azime, Israel Abebe, Awokoya, Ayodele, Ataman, Duygu, Ahia, Orevaoghene, Ahia, Oghenefego, Agrawal, Sweta, Adeyemi, Mofetoluwa

Publikováno v: Transactions of the Association for Computational Linguistics (2022) 10: 50-72

With the success of large-scale pre-training and multilingual modeling in Natural Language Processing (NLP), recent years have seen a proliferation of large, web-mined text datasets covering hundreds of languages. We manually audit the quality of 205

Externí odkaz: http://arxiv.org/abs/2103.12028

Zobrazit plný text záznamu

Report

Mining Large-Scale Low-Resource Pronunciation Data From Wikipedia

Autor: Chakraborty, Tania, Prasad, Manasa, Breiner, Theresa, Ritchie, Sandy, van Esch, Daan

Pronunciation modeling is a key task for building speech technology in new languages, and while solid grapheme-to-phoneme (G2P) mapping systems exist, language coverage can stand to be improved. The information needed to build G2P models for many mor

Externí odkaz: http://arxiv.org/abs/2101.11575

Zobrazit plný text záznamu

Report

Language ID in the Wild: Unexpected Challenges on the Path to a Thousand-Language Web Text Corpus

Autor: Caswell, Isaac, Breiner, Theresa, van Esch, Daan, Bapna, Ankur

Large text corpora are increasingly important for a wide variety of Natural Language Processing (NLP) tasks, and automatic language identification (LangID) is a core technology needed to collect such datasets in a multilingual context. LangID is larg

Externí odkaz: http://arxiv.org/abs/2010.14571

Zobrazit plný text záznamu

Report

Writing Across the World's Languages: Deep Internationalization for Gboard, the Google Keyboard

Autor: van Esch, Daan, Sarbar, Elnaz, Lucassen, Tamar, O'Brien, Jeremy, Breiner, Theresa, Prasad, Manasa, Crew, Evan, Nguyen, Chieu, Beaufays, Françoise

This technical report describes our deep internationalization program for Gboard, the Google Keyboard. Today, Gboard supports 900+ language varieties across 70+ writing systems, and this report describes how and why we have been adding support for hu

Externí odkaz: http://arxiv.org/abs/1912.01218

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání