Výsledky vyhledávání - "Shruti Palaskar"

Speech Technology for Unwritten Languages

Autor: Mark Hasegawa-Johnson, Lucas Ondel, Elin Larsen, Shruti Palaskar, Liming Wang, Sebastian Stüker, Francesco Ciannella, Markus Müller, Odette Scharenborg, Rachid Riad, Florian Metze, Pierre Godard, Laurent Besacier, Mingxing Du, Alan W. Black, Danny Merkx, Emmanuel Dupoux, Philip Arthur, Graham Neubig

Publikováno v: IEEE/ACM Transactions on Audio, Speech and Language Processing
IEEE/ACM Transactions on Audio, Speech and Language Processing, 2020, ⟨10.1109/TASLP.2020.2973896⟩
IEEE/ACM Transactions on Audio Speech and Language Processing, 28, 964-975
IEEE/ACM Transactions on Audio Speech and Language Processing, 28, pp. 964-975
IEEE/ACM Transactions on Audio, Speech and Language Processing, Institute of Electrical and Electronics Engineers, 2020, ⟨10.1109/TASLP.2020.2973896⟩
IEEE-ACM Transactions on Audio, Speech, and Language Processing, 28

International audience; Speech technology plays an important role in our everyday life. Speech is, among others, used for human-computer interaction, including, for instance, information retrieval and on-line shopping. In the case of an unwritten lan

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::91e8981c6d7470b56776e09c0e74b11f
https://doi.org/10.1109/taslp.2020.2973896

Zobrazit plný text záznamu

Multimodal Speech Summarization Through Semantic Concept Learning

Autor: Ruslan Salakhutdinov, Florian Metze, Alan W. Black, Shruti Palaskar

Publikováno v: Interspeech 2021.

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::359d8ca8abce8c7f500f4c39dc1bd623
https://doi.org/10.21437/interspeech.2021-1923

Zobrazit plný text záznamu

ASR Error Correction and Domain Adaptation Using Machine Translation

Autor: Florian Metze, Anirudh Mani, Shruti Palaskar, Sandeep Konam, Nimshi Venkat Meripo

Publikováno v: ICASSP

Off-the-shelf pre-trained Automatic Speech Recognition (ASR) systems are an increasingly viable service for companies of any size building speech-based products. While these ASR systems are trained on large amounts of data, domain mismatch is still a

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::08b58882fac7a441d7ee3900ece8b1c9
https://doi.org/10.1109/icassp40776.2020.9053126

Zobrazit plný text záznamu

Towards Understanding ASR Error Correction for Medical Conversations

Autor: Shruti Palaskar, Anirudh Mani, Sandeep Konam

Publikováno v: Proceedings of the First Workshop on Natural Language Processing for Medical Conversations.

Domain Adaptation for Automatic Speech Recognition (ASR) error correction via machine translation is a useful technique for improving out-of-domain outputs of pre-trained ASR systems to obtain optimal results for specific in-domain tasks. We use this

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::49f7f3bb511455ff3d306aab2015a7c5
https://doi.org/10.18653/v1/2020.nlpmc-1.2

Zobrazit plný text záznamu

How2Sign: A Large-scale Multimodal Dataset for Continuous American Sign Language

Autor: Xavier Giro-i-Nieto, Shruti Palaskar, Deepti Ghadiyaram, Kenneth DeHaan, Amanda Duarte, Lucas Ventura, Jordi Torres, Florian Metze

Publikováno v: UPCommons. Portal del coneixement obert de la UPC
Universitat Politècnica de Catalunya (UPC)
Digital.CSIC. Repositorio Institucional del CSIC
instname
CVPR

Trabajo presentado en la IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), celebrada de forma virtual del 19 al 25 de junio de 2021
One of the factors that have hindered progress in the areas of sign language recognition, tr

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::8b8f11f0f1f12b91b5cc9293c6c8c458

Zobrazit plný text záznamu

Learned in Speech Recognition: Contextual Acoustic Word Embeddings

Autor: Shruti Palaskar, Florian Metze, Vikas Raunak

Publikováno v: ICASSP

End-to-end acoustic-to-word speech recognition models have recently gained popularity because they are easy to train, scale well to large amounts of training data, and do not require a lexicon. In addition, word models may also be easier to integrate

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1ef6d47a6ca27240878ff996d40eae6b
https://doi.org/10.1109/icassp.2019.8683868

Zobrazit plný text záznamu

Multimodal Abstractive Summarization for How2 Videos

Autor: Florian Metze, Shruti Palaskar, Jindrich Libovický, Spandana Gella

Publikováno v: ACL (1)

In this paper, we study abstractive summarization for open-domain videos. Unlike the traditional text news summarization, the goal is less to "compress" text information but rather to provide a fluent textual summary of information that has been coll

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6aa27daa5a29233765228267168bf6df

Zobrazit plný text záznamu

Transfer learning for multimodal dialog

Autor: Ramon Sanabria, Florian Metze, Shruti Palaskar

Publikováno v: Computer Speech & Language. 64:101093

Audio-Visual Scene-Aware Dialog (AVSD) is best understood as an extension of Visual Question Answering, the task of generating a textual answer in response to a textual question on multi-media content. In AVSD, the answer-relevant “context” is ex

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::78dd0f2fb3e4db3a71cceba491cc7d8e
https://doi.org/10.1016/j.csl.2020.101093

Zobrazit plný text záznamu

Learning from Multiview Correlations in Open-Domain Videos

Autor: Florian Metze, Pranava Madhyastha, Shruti Palaskar, Raman Arora, Nils Holzenberger

Publikováno v: ICASSP

An increasing number of datasets contain multiple views, such as video, sound and automatic captions. A basic challenge in representation learning is how to leverage multiple views to learn better representations. This is further complicated by the e

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::63791503bec8fae677ca8433a62dce28
http://arxiv.org/abs/1811.08890

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání