Výsledky vyhledávání

Report

Contextualization of ASR with LLM using phonetic retrieval-based augmentation

Autor: Lei, Zhihong, Na, Xingyu, Xu, Mingbin, Pusateri, Ernest, Van Gysel, Christophe, Zhang, Yuanyuan, Han, Shiyi, Huang, Zhen

Large language models (LLMs) have shown superb capability of modeling multimodal signals including audio and text, allowing the model to generate spoken or textual response given a speech input. However, it remains a challenge for the model to recogn

Externí odkaz: http://arxiv.org/abs/2409.15353

Zobrazit plný text záznamu

Report

Enhancing CTC-based speech recognition with diverse modeling units

Autor: Han, Shiyi, Lei, Zhihong, Xu, Mingbin, Na, Xingyu, Huang, Zhen

In recent years, the evolution of end-to-end (E2E) automatic speech recognition (ASR) models has been remarkable, largely due to advances in deep learning architectures like transformer. On top of E2E systems, researchers have achieved substantial ac

Externí odkaz: http://arxiv.org/abs/2406.03274

Zobrazit plný text záznamu

Report

Conformer-Based Speech Recognition On Extreme Edge-Computing Devices

Autor: Xu, Mingbin, Jin, Alex, Wang, Sicheng, Su, Mu, Ng, Tim, Mason, Henry, Han, Shiyi, Lei, Zhihong, Deng, Yaqiao, Huang, Zhen, Krishnamoorthy, Mahesh

With increasingly more powerful compute capabilities and resources in today's devices, traditionally compute-intensive automatic speech recognition (ASR) has been moving from the cloud to devices to better protect user privacy. However, it is still c

Externí odkaz: http://arxiv.org/abs/2312.10359

Zobrazit plný text záznamu

Report

Personalization of CTC-based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization

Autor: Lei, Zhihong, Pusateri, Ernest, Han, Shiyi, Liu, Leo, Xu, Mingbin, Ng, Tim, Travadi, Ruchir, Zhang, Youyuan, Hannemann, Mirko, Siu, Man-Hung, Huang, Zhen

Recent advances in deep learning and automatic speech recognition have improved the accuracy of end-to-end speech recognition systems, but recognition of personal content such as contact names remains a challenge. In this work, we describe our person

Externí odkaz: http://arxiv.org/abs/2310.09988

Zobrazit plný text záznamu

Report

Acoustic Model Fusion for End-to-end Speech Recognition

Autor: Lei, Zhihong, Xu, Mingbin, Han, Shiyi, Liu, Leo, Huang, Zhen, Ng, Tim, Zhang, Yuanyuan, Pusateri, Ernest, Hannemann, Mirko, Deng, Yaqiao, Siu, Man-Hung

Recent advances in deep learning and automatic speech recognition (ASR) have enabled the end-to-end (E2E) ASR system and boosted the accuracy to a new level. The E2E systems implicitly model all conventional ASR components, such as the acoustic model

Externí odkaz: http://arxiv.org/abs/2310.07062

Zobrazit plný text záznamu

Report

Training Large-Vocabulary Neural Language Models by Private Federated Learning for Resource-Constrained Devices

Autor: Xu, Mingbin, Song, Congzheng, Tian, Ye, Agrawal, Neha, Granqvist, Filip, van Dalen, Rogier, Zhang, Xiao, Argueta, Arturo, Han, Shiyi, Deng, Yaqiao, Liu, Leo, Walia, Anmol, Jin, Alex

Federated Learning (FL) is a technique to train models using data distributed across devices. Differential Privacy (DP) provides a formal privacy guarantee for sensitive data. Our goal is to train a large neural network language model (NNLM) on compu

Externí odkaz: http://arxiv.org/abs/2207.08988

Zobrazit plný text záznamu

Akademický článek

Effect of perioperative rehabilitation exercise on postoperative outcomes in patients aged ≥65 years undergoing gastrointestinal surgery: A multicenter randomized controlled trial

Autor: Lv, Xuecai, Hou, Aisheng, Han, Shiyi, Cao, Jiangbei, Lou, Jingsheng, Li, Hao, Min, Su, Tan, Hongyu, Li, Shuo, Lv, Feng, Zhou, Zhikang, Chi, Menglin, Zhang, Hong, Liu, Yanhong, Mi, Weidong

Publikováno v: In Journal of Clinical Anesthesia December 2024 99

Zobrazit plný text záznamu

Report

Neural Diffusion Model for Microscopic Cascade Prediction

Autor: Yang, Cheng, Sun, Maosong, Liu, Haoran, Han, Shiyi, Liu, Zhiyuan, Luan, Huanbo

The prediction of information diffusion or cascade has attracted much attention over the last decade. Most cascade prediction works target on predicting cascade-level macroscopic properties such as the final size of a cascade. Existing microscopic ca

Externí odkaz: http://arxiv.org/abs/1812.08933

Zobrazit plný text záznamu

Akademický článek

Beneficial herb-drug interaction of Gnaphalium affine extract on benzbromarone: A pharmacokinetic and pharmacodynamic study in rats

Autor: Liu, Xizi, Han, Shiyi, Yang, Qian, Fan, Siyang

Publikováno v: In Phytomedicine 20 July 2022 102

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Vyhledávací nástroje:

Upřesnit hledání