Zobrazeno 1 - 10
of 140
pro vyhledávání: '"Kuo, Yen‐Ling"'
We introduce intra-class memorability, where certain images within the same class are more memorable than others despite shared category characteristics. To investigate what features make one object instance more memorable than others, we design and
Externí odkaz:
http://arxiv.org/abs/2412.20761
Autor:
Wang, Bo, Tan, Dingwei, Kuo, Yen-Ling, Sun, Zhaowei, Wolfe, Jeremy M., Cham, Tat-Jen, Zhang, Mengmi
Imagine searching a collection of coins for quarters ($0.25$), dimes ($0.10$), nickels ($0.05$), and pennies ($0.01$)-a hybrid foraging task where observers look for multiple instances of multiple target types. In such tasks, how do target values and
Externí odkaz:
http://arxiv.org/abs/2411.09176
Autor:
Lee, Sung-Wook, Kuo, Yen-Ling
Recently, diffusion policy has shown impressive results in handling multi-modal tasks in robotic manipulation. However, it has fundamental limitations in out-of-distribution failures that persist due to compounding errors and its limited capability t
Externí odkaz:
http://arxiv.org/abs/2410.14868
Autor:
Kang, Xuhui, Kuo, Yen-Ling
Understanding the progress of a task allows humans to not only track what has been done but also to better plan for future goals. We demonstrate TaKSIE, a novel framework that incorporates task progress knowledge into visual subgoal generation for ro
Externí odkaz:
http://arxiv.org/abs/2410.11013
Understanding people's social interactions in complex real-world scenarios often relies on intricate mental reasoning. To truly understand how and why people interact with one another, we must infer the underlying mental states that give rise to the
Externí odkaz:
http://arxiv.org/abs/2408.12574
Autor:
Cloos, Nathan, Jens, Meagan, Naim, Michelangelo, Kuo, Yen-Ling, Cases, Ignacio, Barbu, Andrei, Cueva, Christopher J.
Humans solve problems by following existing rules and procedures, and also by leaps of creativity to redefine those rules and objectives. To probe these abilities, we developed a new benchmark based on the game Baba Is You where an agent manipulates
Externí odkaz:
http://arxiv.org/abs/2407.13729
Humans utilize their gaze to concentrate on essential information while perceiving and interpreting intentions in videos. Incorporating human gaze into computational algorithms can significantly enhance model performance in video understanding tasks.
Externí odkaz:
http://arxiv.org/abs/2404.07347
Eye-tracking applications that utilize the human gaze in video understanding tasks have become increasingly important. To effectively automate the process of video analysis based on eye-tracking data, it is important to accurately replicate human gaz
Externí odkaz:
http://arxiv.org/abs/2404.07351
Autor:
Jin, Chuanyang, Wu, Yutong, Cao, Jing, Xiang, Jiannan, Kuo, Yen-Ling, Hu, Zhiting, Ullman, Tomer, Torralba, Antonio, Tenenbaum, Joshua B., Shu, Tianmin
Theory of Mind (ToM), the ability to understand people's mental states, is an essential ingredient for developing machines with human-level social intelligence. Recent machine learning models, particularly large language models, seem to show some asp
Externí odkaz:
http://arxiv.org/abs/2401.08743
Multi-agent interactions, such as communication, teaching, and bluffing, often rely on higher-order social inference, i.e., understanding how others infer oneself. Such intricate reasoning can be effectively modeled through nested multi-agent reasoni
Externí odkaz:
http://arxiv.org/abs/2308.11071