Search results - "Silovsky, Jan"

Report

Federated Learning with Differential Privacy for End-to-End Speech Recognition

Authors: Pelikan, Martin, Azam, Sheikh Shams, Feldman, Vitaly, Silovsky, Jan "Honza", Talwar, Kunal, Likhomanenko, Tatiana

While federated learning (FL) has recently emerged as a promising approach to train machine learning models, it is limited to only preliminary explorations in the domain of automatic speech recognition (ASR). Moreover, FL does not inherently guarante

External link: http://arxiv.org/abs/2310.00098

Report

Importance of Smoothness Induced by Optimizers in FL4ASR: Towards Understanding Federated Learning for End-to-End ASR

Authors: Azam, Sheikh Shams, Likhomanenko, Tatiana, Pelikan, Martin, Silovsky, Jan "Honza"

In this paper, we start by training End-to-End Automatic Speech Recognition (ASR) models using Federated Learning (FL) and examining the fundamental considerations that can be pivotal in minimizing the performance gap in terms of word error rate betw

External link: http://arxiv.org/abs/2309.13102

Report

Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers

Authors: Silovsky, Jan, Deng, Liuhui, Argueta, Arturo, Arvizo, Tresi, Hsiao, Roger, Kuznietsov, Sasha, Lin, Yiu-Chang, Xiao, Xiaoqiang, Zhang, Yuanyuan

Voice technology has become ubiquitous recently. However, the accuracy, and hence experience, in different languages varies significantly, which makes the technology not equally inclusive. The availability of data for different languages is one of th

External link: http://arxiv.org/abs/2305.13652

Report

Learning from Noisy Labels with Noise Modeling Network

Authors: Jiang, Zhuolin, Silovsky, Jan, Siu, Man-Hung, Hartmann, William, Gish, Herbert, Adali, Sancar

Multi-label image classification has generated significant interest in recent years and the performance of such systems often suffers from the not so infrequent occurrence of incorrect or missing labels in the training data. In this paper, we extend

External link: http://arxiv.org/abs/2005.00596

Report

Improving Language Identification for Multilingual Speakers

Authors: Titus, Andrew, Silovsky, Jan, Chen, Nanxin, Hsiao, Roger, Young, Mary, Ghoshal, Arnab

Spoken language identification (LID) technologies have improved in recent years from discriminating largely distinct languages to discriminating highly similar languages or even dialects of the same language. One aspect that has been mostly neglected

External link: http://arxiv.org/abs/2001.11019

Report

Towards a New Understanding of the Training of Neural Networks with Mislabeled Training Data

Authors: Gish, Herbert, Silovsky, Jan, Sung, Man-Ling, Siu, Man-Hung, Hartmann, William, Jiang, Zhuolin

We investigate the problem of machine learning with mislabeled training data. We try to make the effects of mislabeled training better understood through analysis of the basic model and equations that characterize the problem. This includes results a

External link: http://arxiv.org/abs/1909.09136

Periodical

First Czech Sentenced For ISIS Aspirations.

Published in: Transitions Online. 3/6/2017, p1-1. 1p.

Academic article

Speaker-adaptive speech recognition using speaker diarization for improved transcription of large spoken archives

Authors: Cerva, Petr, Silovsky, Jan, Zdansky, Jindrich, Nouza, Jan, Seps, Ladislav

Published in: Speech Communication, November-December 2013, 55(10):1033-1046

Academic article

Enhancement of emotion detection in spoken dialogue systems by combining several information sources

Authors: López-Cózar, Ramón, Silovsky, Jan, Kroul, Martin

Published in: Speech Communication, 2011, 53(9):1210-1228

Fast Keyword Spotting in Telephone Speech

Authors: Nouza, Jan, Silovsky, Jan

Published in: Radioengineering, Vol. 18, Iss. 4, pp. 665-670 (2009). ISSN 1210-2512

In the paper, we present a system designed for detecting keywords in telephone speech. We focus not only on achieving high accuracy but also on very short processing time. The keyword spotting system can run in three modes: a) an off-line mode requir

External links: https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::08ab2e3ef363c5306172d210b32f7990
http://www.radioeng.cz/fulltexts/2009/09_04_665_670.pdf
