Zobrazeno 1 - 7
of 7
pro vyhledávání: '"Mitsuda, Koh"'
Multimodal language models that process both text and speech have a potential for applications in spoken dialogue systems. However, current models face two major challenges in response generation latency: (1) generating a spoken response requires the
Externí odkaz:
http://arxiv.org/abs/2406.12428
Autor:
Sawada, Kei, Zhao, Tianyu, Shing, Makoto, Mitsui, Kentaro, Kaga, Akio, Hono, Yukiya, Wakatsuki, Toshiaki, Mitsuda, Koh
AI democratization aims to create a world in which the average person can utilize AI techniques. To achieve this goal, numerous research institutes have attempted to make their results accessible to the public. In particular, large pre-trained models
Externí odkaz:
http://arxiv.org/abs/2404.01657
Advances in machine learning have made it possible to perform various text and speech processing tasks, such as automatic speech recognition (ASR), in an end-to-end (E2E) manner. E2E approaches utilizing pre-trained models are gaining attention for c
Externí odkaz:
http://arxiv.org/abs/2312.03668
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Autor:
Mitsuda, Koh
2020
この博士論文は、全文公表に適さないやむを得ない事由があり要約のみを公表していましたが、解消したため、2023年(令和5年)3月7日に全文を公表しました。
この博士論文は、全文公表に適さないやむを得ない事由があり要約のみを公表していましたが、解消したため、2023年(令和5年)3月7日に全文を公表しました。
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::1e0a5a0c6ea60321fdd4a46403107575
https://tsukuba.repo.nii.ac.jp/records/2000784
https://tsukuba.repo.nii.ac.jp/records/2000784
Publikováno v:
言語処理学会第21回年次大会発表論文集. :553-556
Publikováno v:
Proceedings of the 11th Workshop on Asian Language Resources. :19-26