Showing 1 - 7 of 7 for search: '"Wakatsuki, Toshiaki"'
Multimodal language models that process both text and speech have a potential for applications in spoken dialogue systems. However, current models face two major challenges in response generation latency: (1) generating a spoken response requires the
External link:
http://arxiv.org/abs/2406.12428
Author:
Sawada, Kei, Zhao, Tianyu, Shing, Makoto, Mitsui, Kentaro, Kaga, Akio, Hono, Yukiya, Wakatsuki, Toshiaki, Mitsuda, Koh
AI democratization aims to create a world in which the average person can utilize AI techniques. To achieve this goal, numerous research institutes have attempted to make their results accessible to the public. In particular, large pre-trained models
External link:
http://arxiv.org/abs/2404.01657
Advances in machine learning have made it possible to perform various text and speech processing tasks, such as automatic speech recognition (ASR), in an end-to-end (E2E) manner. E2E approaches utilizing pre-trained models are gaining attention for c
External link:
http://arxiv.org/abs/2312.03668
Published in:
DEIM Forum 2018論文集.
This paper demonstrates a fast Okapi's BM25 term weighting method on GPUs for information retrieval by combining a GPU-based dictionary using a succinct data structure and data parallel primitives. The problem of handling documents on GPUs is to proc
External link:
https://explore.openaire.eu/search/publication?articleId=jairo_______::553ce35ebbe0a3390938a1bd8cc19cdc
https://hdl.handle.net/10086/78420