Výsledky vyhledávání - "Ciprian Chelba"

Lego-Features: Exporting modular encoder features for streaming and deliberation ASR

Autor: Rami Botros, Rohit Prabhavalkar, Johan Schalkwyk, Ciprian Chelba, Tara N. Sainath, Françoise Beaufays

In end-to-end (E2E) speech recognition models, a representational tight-coupling inevitably emerges between the encoder and the decoder. We build upon recent work that has begun to explore building encoders with modular encoded representations, such

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::8a9eb14f07d3c1400c300713bbe63244

Zobrazit plný text záznamu

Dynamically Composing Domain-Data Selection with Clean-Data Selection by 'Co-Curricular Learning' for Neural Machine Translation

Autor: Wei Wang, Isaac Caswell, Ciprian Chelba

Publikováno v: ACL (1)

Noise and domain are important aspects of data quality for neural machine translation. Existing research focus separately on domain-data selection, clean-data selection, or their static combination, leaving the dynamic interaction across them not exp

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::213ae8dd704c7dfaee19b092566bb6e6

Zobrazit plný text záznamu

Tagged Back-Translation

Autor: Ciprian Chelba, David Grangier, Isaac Caswell

Publikováno v: WMT (1)

Recent work in Neural Machine Translation (NMT) has shown significant quality gains from noised-beam decoding during back-translation, a method to generate synthetic parallel data. We show that the main role of such synthetic noise is not to diversif

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::17cb9e745e5422c553c84deb26579fde

Zobrazit plný text záznamu

Denoising Neural Machine Translation Training with Trusted Data and Online Data Selection

Autor: Tetsuji Nakagawa, Macduff Hughes, Taro Watanabe, Wei Wang, Ciprian Chelba

Publikováno v: WMT

Measuring domain relevance of data and identifying or selecting well-fit domain data for machine translation (MT) is a well-studied topic, but denoising is not yet. Denoising is concerned with a different type of data quality and tries to reduce the

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::ef4a6d844e7978ecf8c7d469f2e4acd9
https://doi.org/10.18653/v1/w18-6314

Zobrazit plný text záznamu

Sparse Non-Negative Matrix Language Modeling: Maximum Entropy Flexibility on the Cheap

Autor: Fadi Biadsy, Diamantino Caseiro, Ciprian Chelba

Publikováno v: INTERSPEECH

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::2f8729a3eacfa2e522891d99ba630891
https://doi.org/10.21437/interspeech.2017-493

Zobrazit plný text záznamu

Sparse non-negative matrix language modeling

Autor: Ciprian Chelba, Noam Shazeer, Joris Pelemans

We present Sparse Non-negative Matrix (SNM) estimation, a novel probability estimation technique for language modeling that can efficiently incorporate arbitrary features. We evaluate SNM language models on two corpora: the One Billion Word Benchmark

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::71185e1026c495bd2386ba3f8a2ce86e
https://lirias.kuleuven.be/handle/123456789/543949

Zobrazit plný text záznamu

Sparse non-negative matrix language modeling for geo-annotated query session data

Autor: Ciprian Chelba, Noam Shazeer

Publikováno v: ASRU

The paper investigates the impact on query language modeling when using skip-grams within query as well as across queries in a given search session, in conjunction with the geo-annotation available for the query stream data. As modeling tool we use t

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::115771f23f02a669c41210871678814c
https://doi.org/10.1109/asru.2015.7404767

Zobrazit plný text záznamu

Retrieval and browsing of spoken content

Autor: Murat Saraclar, Ciprian Chelba, Timothy J. Hazen

Publikováno v: IEEE Signal Processing Magazine. 25:39-49

Ever-increasing computing power and connectivity bandwidth, together with falling storage costs, are resulting in an overwhelming amount of data of various types being produced, exchanged, and stored. Consequently, information search and retrieval ha

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::28eeb1649d8ce9140edf41becb3794e7
https://doi.org/10.1109/msp.2008.917992

Zobrazit plný text záznamu

Soft indexing of speech content for search in spoken documents

Autor: Jorge F. Silva, Ciprian Chelba, Alex Acero

Publikováno v: Computer Speech & Language. 21:458-478

The paper presents the Position Specific Posterior Lattice (PSPL), a novel lossy representation of automatic speech recognition lattices that naturally lends itself to efficient indexing and subsequent relevance ranking of spoken documents. This tech

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::be4eec042e8739061df78d71ed7bc484
https://doi.org/10.1016/j.csl.2006.09.001

Zobrazit plný text záznamu

Geo-location for voice search language modeling

Autor: Ciprian Chelba, Xuedong Zhang, Keith Hall

Publikováno v: INTERSPEECH

We investigate the benefit of augmenting with geo-location information the language model used in speech recognition for voice-search. We observe reductions in perplexity of up to 15% relative on test sets obtained from both typed query data, as well

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::71d59c383b77cd878ec34061db61c915
https://doi.org/10.21437/interspeech.2015-344

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání