Search results

Report

IGOR: Image-GOal Representations are the Atomic Control Units for Foundation Models in Embodied AI

Authors: Chen, Xiaoyu; Guo, Junliang; He, Tianyu; Zhang, Chuheng; Zhang, Pushi; Yang, Derek Cathera; Zhao, Li; Bian, Jiang

We introduce Image-GOal Representations (IGOR), aiming to learn a unified, semantically consistent action space across human and various robots. Through this unified latent action space, IGOR enables knowledge transfer among large-scale robot and hum

External link: http://arxiv.org/abs/2411.00785

Report

Compositional 3D-aware Video Generation with LLM Director

Authors: Zhu, Hanxin; He, Tianyu; Tang, Anni; Guo, Junliang; Chen, Zhibo; Bian, Jiang

Published in: NeurIPS 2024

Significant progress has been made in text-to-video generation through the use of powerful generative models and large-scale internet data. However, substantial challenges remain in precisely controlling individual concepts within the generated video

External link: http://arxiv.org/abs/2409.00558

Report

A Generic Review of Integrating Artificial Intelligence in Cognitive Behavioral Therapy

Authors: Jiang, Meng; Zhao, Qing; Li, Jianqiang; Wang, Fan; He, Tianyu; Cheng, Xinyan; Yang, Bing Xiang; Ho, Grace W. K.; Fu, Guanghui

Cognitive Behavioral Therapy (CBT) is a well-established intervention for mitigating psychological issues by modifying maladaptive cognitive and behavioral patterns. However, delivery of CBT is often constrained by resource limitations and barriers t

External link: http://arxiv.org/abs/2407.19422

Report

Video In-context Learning

Authors: Zhang, Wentao; Guo, Junliang; He, Tianyu; Zhao, Li; Xu, Linli; Bian, Jiang

In-context learning for vision data has been underexplored compared with that in natural language. Previous works studied image in-context learning, urging models to generate a single image guided by demonstrations. In this paper, we propose and stud

External link: http://arxiv.org/abs/2407.07356

Report

GaussianSR: 3D Gaussian Super-Resolution with 2D Diffusion Priors

Authors: Yu, Xiqian; Zhu, Hanxin; He, Tianyu; Chen, Zhibo

Achieving high-resolution novel view synthesis (HRNVS) from low-resolution input views is a challenging task due to the lack of high-resolution data. Previous methods optimize high-resolution Neural Radiance Field (NeRF) from low-resolution input vie

External link: http://arxiv.org/abs/2406.10111

Report

Make Your Actor Talk: Generalizable and High-Fidelity Lip Sync with Motion and Appearance Disentanglement

Authors: Yu, Runyi; He, Tianyu; Zhang, Ailing; Wang, Yuchi; Guo, Junliang; Tan, Xu; Liu, Chang; Chen, Jie; Bian, Jiang

We aim to edit the lip movements in talking video according to the given speech while preserving the personal identity and visual details. The task can be decomposed into two sub-problems: (1) speech-driven lip motion generation and (2) visual appear

External link: http://arxiv.org/abs/2406.08096

Report

Grokking Modular Polynomials

Authors: Doshi, Darshil; He, Tianyu; Das, Aritra; Gromov, Andrey

Neural networks readily learn a subset of the modular arithmetic tasks, while failing to generalize on the rest. This limitation remains unmoved by the choice of architecture and training strategies. On the other hand, an analytical solution for the

External link: http://arxiv.org/abs/2406.03495

Report

Learning to grok: Emergence of in-context learning and skill composition in modular arithmetic tasks

Authors: He, Tianyu; Doshi, Darshil; Das, Aritra; Gromov, Andrey

Large language models can solve tasks that were not present in the training set. This capability is believed to be due to in-context learning and skill composition. In this work, we study the emergence of in-context learning and skill composition in

External link: http://arxiv.org/abs/2406.02550

Academic article

Relationship Between Brightness and Current of the Propagating Positive Leaders in Laboratory High Voltage Atmospheric Discharges

Authors: He Tianyu, Shengxin Huang, Dengfeng Cheng, Yufei Fu, Zhong Fu, Kai Bian

Published in: IEEE Access, Vol. 8, pp. 158559-158567 (2020)

The discharge current of propagating lightning leaders is critical to understand lightning physics and to design lightning protection systems but almost impossible to be measured directly with present-day technology. In this paper, we have investigat

External link: https://doaj.org/article/1560e1ee61564feb9be97768dbfc8533

Report

InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation

Authors: Wang, Yuchi; Guo, Junliang; Bai, Jianhong; Yu, Runyi; He, Tianyu; Tan, Xu; Sun, Xu; Bian, Jiang

Recent talking avatar generation models have made strides in achieving realistic and accurate lip synchronization with the audio, but often fall short in controlling and conveying detailed expressions and emotions of the avatar, making the generated

External link: http://arxiv.org/abs/2405.15758
