Zobrazeno 1 - 3
of 3
pro vyhledávání: '"Wan-Cyuan Fan"'
Publikováno v:
ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
Publikováno v:
Proceedings of the AAAI Conference on Artificial Intelligence. 36:3036-3044
As a key characteristic in audio-visual speech recognition (AVSR), relating linguistic information observed across visual and audio data has been a challenge, benefiting not only audio/visual speech recognition (ASR/VSR) but also for manipulating dat
Publikováno v:
CVPR
When translating text inputs into layouts or images, existing works typically require explicit descriptions of each object in a scene, including their spatial information or the associated relationships. To better exploit the text input, so that impl