Výsledky vyhledávání

MarketLine Company Profile: ASR Nederland NV.

Publikováno v: ASR Nederland NV MarketLine Company Profile. 11/22/2024, p1-27. 27p.

Report

ASR-EC Benchmark: Evaluating Large Language Models on Chinese ASR Error Correction

Autor: Wei, Victor Junqiu, Wang, Weicheng, Jiang, Di, Song, Yuanfeng, Wang, Lu

Automatic speech Recognition (ASR) is a fundamental and important task in the field of speech and natural language processing. It is an inherent building block in many applications such as voice assistant, speech translation, etc. Despite the advance

Externí odkaz: http://arxiv.org/abs/2412.03075

Zobrazit plný text záznamu

Report

MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models

Autor: Nguyen, Thai-Binh, Waibel, Alexander

Speaker-attributed automatic speech recognition (SA-ASR) aims to transcribe speech while assigning transcripts to the corresponding speakers accurately. Existing methods often rely on complex modular systems or require extensive fine-tuning of joint

Externí odkaz: http://arxiv.org/abs/2411.18152

Zobrazit plný text záznamu

Akademický článek

Development History, Structure, and Function of ASR (Abscisic Acid-Stress-Ripening) Transcription Factor.

Autor: Zhang, Yue¹ (AUTHOR) zhangyue19970309@163.com, Wang, Mengfan¹ (AUTHOR) wangmengfan61@163.com, Kitashov, Andery V.^2,3 (AUTHOR) kitashov@smbu.edu.cn, Yang, Ling^1,2,4 (AUTHOR) yangl-cf@nefu.edu.cn

Publikováno v: International Journal of Molecular Sciences. Oct2024, Vol. 25 Issue 19, p10283. 18p.

Zobrazit plný text záznamu

Plný text ve formátu HTML

Report

Whisper Turns Stronger: Augmenting Wav2Vec 2.0 for Superior ASR in Low-Resource Languages

Autor: Anidjar, Or Haim, Marbel, Revital, Yozevitch, Roi

Approaching Speech-to-Text and Automatic Speech Recognition problems in low-resource languages is notoriously challenging due to the scarcity of validated datasets and the diversity of dialects. Arabic, Russian, and Portuguese exemplify these difficu

Externí odkaz: http://arxiv.org/abs/2501.00425

Zobrazit plný text záznamu

Report

Towards a Single ASR Model That Generalizes to Disordered Speech

Autor: Tobin, Jimmy, Tomanek, Katrin, Venugopalan, Subhashini

This study investigates the impact of integrating a dataset of disordered speech recordings ($\sim$1,000 hours) into the fine-tuning of a near state-of-the-art ASR baseline system. Contrary to what one might expect, despite the data being less than 1

Externí odkaz: http://arxiv.org/abs/2412.19315

Zobrazit plný text záznamu

Report

Enhancing Multilingual ASR for Unseen Languages via Language Embedding Modeling

Autor: Huang, Shao-Syuan, Huang, Kuan-Po, Liu, Andy T., Lee, Hung-yi

Multilingual Automatic Speech Recognition (ASR) aims to recognize and transcribe speech from multiple languages within a single system. Whisper, one of the most advanced ASR models, excels in this domain by handling 99 languages effectively, leveragi

Externí odkaz: http://arxiv.org/abs/2412.16474

Zobrazit plný text záznamu

Report

LAMA-UT: Language Agnostic Multilingual ASR through Orthography Unification and Language-Specific Transliteration

Autor: Lee, Sangmin, Chung, Woo-Jin, Kang, Hong-Goo

Building a universal multilingual automatic speech recognition (ASR) model that performs equitably across languages has long been a challenge due to its inherent difficulties. To address this task we introduce a Language-Agnostic Multilingual ASR pip

Externí odkaz: http://arxiv.org/abs/2412.15299

Zobrazit plný text záznamu

Report

Open Universal Arabic ASR Leaderboard

Autor: Wang, Yingzhi, Alhmoud, Anas, Alqurishi, Muhammad

In recent years, the enhanced capabilities of ASR models and the emergence of multi-dialect datasets have increasingly pushed Arabic ASR model development toward an all-dialect-in-one direction. This trend highlights the need for benchmarking studies

Externí odkaz: http://arxiv.org/abs/2412.13788

Zobrazit plný text záznamu

Report

Efficient Adaptation of Multilingual Models for Japanese ASR

Autor: Bajo, Mark, Fukukawa, Haruka, Morita, Ryuji, Ogasawara, Yuma

This study explores fine-tuning multilingual ASR (Automatic Speech Recognition) models, specifically OpenAI's Whisper-Tiny, to improve performance in Japanese. While multilingual models like Whisper offer versatility, they often lack precision in spe

Externí odkaz: http://arxiv.org/abs/2412.10705

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání