Zobrazeno 1 - 10
of 10
pro vyhledávání: '"Dhananjay Ram"'
Publikováno v:
Speech Communication. 103:27-36
This paper addresses the problem of detecting speech utterances from a large audio archive using a simple spoken query, hence referring to this problem as “Query by Example Spoken Term Detection” (QbE-STD). This still open pattern matching proble
Publikováno v:
ASRU
State of the art solutions to query by example spoken term detection (QbE-STD) rely on bottleneck feature representation of the query and audio document. Here, we present a study on QbE-STD performance using several monolingual as well as multilingua
Publikováno v:
EMNLP
arXiv.org e-Print Archive
arXiv.org e-Print Archive
Neural Machine Translation (NMT) can be improved by including document-level contextual information. For this purpose, we propose a hierarchical attention model to capture the context in a structured and dynamic manner. The model is integrated in the
Publikováno v:
NAACL-HLT
arXiv.org e-Print Archive
Scopus-Elsevier
arXiv.org e-Print Archive
Scopus-Elsevier
Neural sequence-to-sequence networks with attention have achieved remarkable performance for machine translation. One of the reasons for their effectiveness is their ability to capture relevant source-side contextual information at each time-step pre
Publikováno v:
INTERSPEECH
We cast the query by example spoken term detection (QbE-STD) problem as subspace detection where query and background subspaces are modeled as union of low-dimensional subspaces. The speech exemplars used for subspace modeling are class-conditional p
Publikováno v:
INTERSPEECH
Sparse representation has been shown to be a powerful modeling framework for classification and detection tasks. In this paper, we propose a new keyword detection algorithm based on sparse representation of the posterior exemplars. The posterior exem
Publikováno v:
NCC
Large variation in speakers causes significant performance degradation of a speaker independent speech recognition system. In an attempt to compensate for this degradation in performance, this paper proposes a novel Bayesian approach to estimate spea
Publikováno v:
INTERSPEECH
In this work, we address the problem of query by example spoken term detection (QbE-STD) in zero-resource scenario. State of the art solutions usually rely on dynamic time warping (DTW) based template matching. In contrast, we propose here to tackle
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::14b221bc01d3f2758c4cf931391ff068
https://infoscience.epfl.ch/record/256312
https://infoscience.epfl.ch/record/256312
This paper focuses on the problem of query by example spoken term detection (QbE-STD) in zero-resource scenario. State-of-the-art approaches primarily rely on dynamic time warping (DTW) based template matching techniques using phone posterior or bott
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::23726a00f6ee7a165007c6806864eae5
https://infoscience.epfl.ch/record/278156
https://infoscience.epfl.ch/record/278156
Publikováno v:
INTERSPEECH
State of the art query by example spoken term detection (QbE-STD) systems in zero-resource conditions rely on representation of speech in terms of sequences of class-conditional posterior probabilities estimated by deep neural network (DNN). The post
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::005df051b0aef4afbfc8d4ea78fa5e57
https://infoscience.epfl.ch/record/256261
https://infoscience.epfl.ch/record/256261