Showing 1 - 10 of 132 for search: '"Choi, Seongho"'
Vision-and-Language Navigation (VLN) agents navigate to a destination using natural language instructions and the visual information they observe. Existing methods for training VLN agents presuppose fixed datasets, leading to a significant limitation
External link:
http://arxiv.org/abs/2403.15049
Video moment retrieval (VMR) identifies a specific moment in an untrimmed video for a given natural language query. This task is prone to suffer the weak alignment problem innate in video datasets. Due to the ambiguity, a query does not fully cover t
External link:
http://arxiv.org/abs/2306.02728
Video corpus moment retrieval (VCMR) is the task to retrieve the most relevant video moment from a large video corpus using a natural language query. For narrative videos, e.g., dramas or movies, the holistic understanding of temporal dynamics and mu
External link:
http://arxiv.org/abs/2210.12617
Author:
Heo, Yu-Jung, Lee, Minsu, Choi, Seongho, Choi, Woo Suk, Shin, Minjung, Jung, Minjoon, Ryu, Jeh-Kwang, Zhang, Byoung-Tak
We aim to develop an AI agent that can watch video clips and have a conversation with human about the video story. Developing video understanding intelligence is a significantly challenging task, and evaluation methods for adequately measuring and an
External link:
http://arxiv.org/abs/2110.04203
Author:
Zhang, Junfang, Wang, Enze, Li, Qiang, Peng, Yinghua, Jin, Huaina, Naseem, Sajida, Sun, Bin, Park, Sungkwon, Choi, Seongho, Li, Xiangzi
Published in:
International Journal of Biological Macromolecules, August 2024, 275, Part 2
Video question answering has recently received a lot of attention from multimodal video researchers. Most video question answering datasets are usually in the form of multiple-choice. But, the model for the multiple-choice task does not infer the ans
External link:
http://arxiv.org/abs/2108.05158
We introduce CogME, a cognition-inspired, multi-dimensional evaluation metric designed for AI models focusing on story understanding. CogME is a framework grounded in human thinking strategies and story elements that involve story understanding. With
External link:
http://arxiv.org/abs/2107.09847
Author:
Choi, Seongho (choish@kyungnam.ac.kr)
Published in:
Religions. May 2024, Vol. 15, Issue 5, p. 600. 12 pp.
Author:
Choi, Seongho, On, Kyoung-Woon, Heo, Yu-Jung, Seo, Ahjeong, Jang, Youwon, Lee, Minsu, Zhang, Byoung-Tak
Despite recent progress on computer vision and natural language processing, developing a machine that can understand video story is still hard to achieve due to the intrinsic difficulty of video story. Moreover, researches on how to evaluate the degr
External link:
http://arxiv.org/abs/2005.03356
Author:
Ghodke, Swapnil, Muthusamy, Omprakash, Codrin, Kevin Delime, Choi, Seongho, Singh, Saurabh, Byeon, Dogyun, Adachi, Masahiro, Kiyama, Makoto, Matsuura, Takashi, Yamamoto, Yoshiyuki, Matsunami, Masaharu, Takeuchi, Tsunehiro
The efficiency of energy conversion in thermoelectric generators (TEGs) is directly proportional to electrical conductivity and Seebeck coefficient while inversely to thermal conductivity. The challenge is to optimize these interdependent parameters
External link:
http://arxiv.org/abs/1909.12476