Showing 1 - 10 of 1,146 for search: '"Song, Chan"'
Spatial understanding is a crucial capability for robots to make grounded decisions based on their environment. This foundational skill enables robots not only to perceive their surroundings but also to reason about and interact meaningfully within t
External link:
http://arxiv.org/abs/2411.16537
Author:
Liu, Xiao, Zhang, Tianjie, Gu, Yu, Iong, Iat Long, Xu, Yifan, Song, Xixuan, Zhang, Shudan, Lai, Hanyu, Liu, Xinyi, Zhao, Hanlin, Sun, Jiadai, Yang, Xinyue, Yang, Yu, Qi, Zehan, Yao, Shuntian, Sun, Xueqiao, Cheng, Siyi, Zheng, Qinkai, Yu, Hao, Zhang, Hanchen, Hong, Wenyi, Ding, Ming, Pan, Lihang, Gu, Xiaotao, Zeng, Aohan, Du, Zhengxiao, Song, Chan Hee, Su, Yu, Dong, Yuxiao, Tang, Jie
Large Multimodal Models (LMMs) have ushered in a new era in artificial intelligence, merging capabilities in both language and vision to form highly capable Visual Foundation Agents. These agents are postulated to excel across a myriad of tasks, pote
External link:
http://arxiv.org/abs/2408.06327
This study introduces an optimal mechanism in a dynamic stochastic knapsack environment. The model features a single seller who has a fixed quantity of a perfectly divisible item. Impatient buyers with a piece-wise linear utility function arrive rand
External link:
http://arxiv.org/abs/2402.14269
Automatic web navigation aims to build a web agent that can follow language instructions to execute complex and diverse tasks on real-world websites. Existing work primarily takes HTML documents as input, which define the contents and action spaces (
External link:
http://arxiv.org/abs/2402.04476
Author:
Levering, Miriam
Published in:
Journal of Song-Yuan Studies, 2000 Jan 01(30), 115-139.
External link:
https://www.jstor.org/stable/23495825
Author:
Stevens, Samuel, Wu, Jiaman, Thompson, Matthew J, Campolongo, Elizabeth G, Song, Chan Hee, Carlyn, David Edward, Dong, Li, Dahdul, Wasila M, Stewart, Charles, Berger-Wolf, Tanya, Chao, Wei-Lun, Su, Yu
Images of the natural world, collected by a variety of cameras, from drones to individual phones, are increasingly abundant sources of biological information. There is an explosion of computational methods and tools, particularly computer vision, for
External link:
http://arxiv.org/abs/2311.18803
This study focuses on using large language models (LLMs) as a planner for embodied agents that can follow natural language instructions to complete complex tasks in a visually-perceived environment. The high data cost and poor sample efficiency of ex
External link:
http://arxiv.org/abs/2212.04088
Author:
Song, Eunwoo, Yamamoto, Ryuichi, Kwon, Ohsung, Song, Chan-Ho, Hwang, Min-Jae, Oh, Suhyeon, Yoon, Hyun-Wook, Kim, Jin-Seob, Kim, Jae-Min
Recent advances in synthetic speech quality have enabled us to train text-to-speech (TTS) systems by using synthetic corpora. However, merely increasing the amount of synthetic data is not always advantageous for improving training efficiency. Our ai
External link:
http://arxiv.org/abs/2206.14984
Author:
Soo Kim, Hak, Seo, JeongMin, Moon, Sunyoung, Ho Kim, Dong, Jung, Yujun, Chung, Yoong, Hoon Lee, Kong, Ho Song, Chan
Published in:
In Energy Conversion and Management, 15 December 2024, 322
Published in:
In Biochemical Engineering Journal, December 2024, 212