Zobrazeno 1 - 10
of 245 090
pro vyhledávání: '"Asr"'
Publikováno v:
ASR Nederland NV MarketLine Company Profile. 11/22/2024, p1-27. 27p.
Automatic speech Recognition (ASR) is a fundamental and important task in the field of speech and natural language processing. It is an inherent building block in many applications such as voice assistant, speech translation, etc. Despite the advance
Externí odkaz:
http://arxiv.org/abs/2412.03075
Autor:
Nguyen, Thai-Binh, Waibel, Alexander
Speaker-attributed automatic speech recognition (SA-ASR) aims to transcribe speech while assigning transcripts to the corresponding speakers accurately. Existing methods often rely on complex modular systems or require extensive fine-tuning of joint
Externí odkaz:
http://arxiv.org/abs/2411.18152
Autor:
Zhang, Yue1 (AUTHOR) zhangyue19970309@163.com, Wang, Mengfan1 (AUTHOR) wangmengfan61@163.com, Kitashov, Andery V.2,3 (AUTHOR) kitashov@smbu.edu.cn, Yang, Ling1,2,4 (AUTHOR) yangl-cf@nefu.edu.cn
Publikováno v:
International Journal of Molecular Sciences. Oct2024, Vol. 25 Issue 19, p10283. 18p.
Approaching Speech-to-Text and Automatic Speech Recognition problems in low-resource languages is notoriously challenging due to the scarcity of validated datasets and the diversity of dialects. Arabic, Russian, and Portuguese exemplify these difficu
Externí odkaz:
http://arxiv.org/abs/2501.00425
This study investigates the impact of integrating a dataset of disordered speech recordings ($\sim$1,000 hours) into the fine-tuning of a near state-of-the-art ASR baseline system. Contrary to what one might expect, despite the data being less than 1
Externí odkaz:
http://arxiv.org/abs/2412.19315
Multilingual Automatic Speech Recognition (ASR) aims to recognize and transcribe speech from multiple languages within a single system. Whisper, one of the most advanced ASR models, excels in this domain by handling 99 languages effectively, leveragi
Externí odkaz:
http://arxiv.org/abs/2412.16474
Building a universal multilingual automatic speech recognition (ASR) model that performs equitably across languages has long been a challenge due to its inherent difficulties. To address this task we introduce a Language-Agnostic Multilingual ASR pip
Externí odkaz:
http://arxiv.org/abs/2412.15299
In recent years, the enhanced capabilities of ASR models and the emergence of multi-dialect datasets have increasingly pushed Arabic ASR model development toward an all-dialect-in-one direction. This trend highlights the need for benchmarking studies
Externí odkaz:
http://arxiv.org/abs/2412.13788
This study explores fine-tuning multilingual ASR (Automatic Speech Recognition) models, specifically OpenAI's Whisper-Tiny, to improve performance in Japanese. While multilingual models like Whisper offer versatility, they often lack precision in spe
Externí odkaz:
http://arxiv.org/abs/2412.10705