Search results - "Speech understanding"

Report

Roadmap towards Superhuman Speech Understanding using Large Language Models

Authors: Bu, Fan; Zhang, Yuhao; Wang, Xidong; Wang, Benyou; Liu, Qun; Li, Haizhou

The success of large language models (LLMs) has prompted efforts to integrate speech and audio data, aiming to create general foundation models capable of processing both textual and non-textual inputs. Recent advances, such as GPT-4o, highlight the

External link: http://arxiv.org/abs/2410.13268


Report

TI-ASU: Toward Robust Automatic Speech Understanding through Text-to-speech Imputation Against Missing Speech Modality

Authors: Feng, Tiantian; Shi, Xuan; Gupta, Rahul; Narayanan, Shrikanth S.

Automatic Speech Understanding (ASU) aims at human-like speech interpretation, providing nuanced intent, emotion, sentiment, and content understanding from speech and language (text) content conveyed in speech. Typically, training a robust ASU model

External link: http://arxiv.org/abs/2404.17983


Report

Lightweight Protection for Privacy in Offloaded Speech Understanding

Author: Cai, Dongqi

Speech is a common input method for mobile embedded devices, but cloud-based speech recognition systems pose privacy risks. Disentanglement-based encoders, designed to safeguard user privacy by filtering sensitive information from speech signals, unf

External link: http://arxiv.org/abs/2401.11983


Report

Turbocharge Speech Understanding with Pilot Inference

Authors: Wang, Rongxiang; Lin, Felix Xiaozhu

Modern speech understanding (SU) runs a sophisticated pipeline: ingesting streaming voice input, the pipeline executes encoder-decoder based deep neural networks repeatedly; by doing so, the pipeline generates tentative outputs (called hypotheses), a

External link: http://arxiv.org/abs/2311.17065


Academic article

Free Markets and Free Speech: Understanding the Limits of the Noerr-Pennington Doctrine.

Author: Patel, Mitsoo K.¹

Published in: University of Chicago Law Review. 2024Special, p567-603. 37p.


Report

Speech Understanding on Tiny Devices with A Learning Cache

Authors: Benazir, Afsara; Xu, Zhiming; Lin, Felix Xiaozhu

This paper addresses spoken language understanding (SLU) on microcontroller-like embedded devices, integrating on-device execution with cloud offloading in a novel fashion. We leverage temporal locality in the speech inputs to a device and reuse rece

External link: http://arxiv.org/abs/2311.18188


Academic article

Investigation of Maximum Monosyllabic Word Recognition as a Predictor of Speech Understanding with Cochlear Implant.

Authors: Czurda, Ronja¹ (AUTHOR) ronja.czurda@uniklinik-freiburg.de; Wesarg, Thomas¹ (AUTHOR); Aschendorff, Antje¹ (AUTHOR) antje.aschendorff@uniklinik-freiburg.de; Beck, Rainer Linus¹ (AUTHOR) rainer.beck@uniklinik-freiburg.de; Hocke, Thomas² (AUTHOR) thocke@cochlear.com; Ketterer, Manuel Christoph¹ (AUTHOR) manuel.christoph.ketterer@uniklinik-freiburg.de; Arndt, Susan¹ (AUTHOR) susan.arndt@uniklinik-freiburg.de

Published in: Journal of Clinical Medicine. Feb2024, Vol. 13 Issue 3, p646. 12p.


Academic article

A Mixed-Rate Strategy on a Bilaterally-Synchronized Cochlear Implant Processor Offering the Opportunity to Provide Both Speech Understanding and Interaural Time Difference Cues.

Authors: Dennison, Stephen R.¹ (AUTHOR) srdennison@wisc.edu; Thakkar, Tanvi² (AUTHOR) tthakkar@uwlax.edu; Kan, Alan³ (AUTHOR) alan.kan@mq.edu.au; Svirsky, Mario A.⁴ (AUTHOR) mario.svirsky@nyulangone.org; Azadpour, Mahan⁴ (AUTHOR) mahan.azadpour@nyulangone.org; Litovsky, Ruth Y.¹ (AUTHOR) ruth.litovsky@wisc.edu

Published in: Journal of Clinical Medicine. Apr2024, Vol. 13 Issue 7, p1917. 17p.


Report

Privacy-preserving Representation Learning for Speech Understanding

Authors: Tran, Minh; Soleymani, Mohammad

Existing privacy-preserving speech representation learning methods target a single application domain. In this paper, we present a novel framework to anonymize utterance-level speech embeddings generated by pre-trained encoders and show its effective

External link: http://arxiv.org/abs/2310.17194


Report

Joint Audio and Speech Understanding

Authors: Gong, Yuan; Liu, Alexander H.; Luo, Hongyin; Karlinsky, Leonid; Glass, James

Humans are surrounded by audio signals that include both speech and non-speech sounds. The recognition and understanding of speech and non-speech audio events, along with a profound comprehension of the relationship between them, constitute fundament

External link: http://arxiv.org/abs/2309.14405

