Showing 1 - 10 of 31 for search: '"Shan, Mengyi"'
Author:
Yang, Zhangsihao; Shan, Mengyi; Farazi, Mohammad; Zhu, Wenhui; Chen, Yanxi; Dong, Xuanzhao; Wang, Yalin
Human video generation task has gained significant attention with the advancement of deep generative models. Generating realistic videos with human movements is challenging in nature, due to the intricacies of human body topology and sensitivity to v
External link:
http://arxiv.org/abs/2409.01502
Author:
Shan, Mengyi; Dong, Lu; Han, Yutao; Yao, Yuan; Liu, Tao; Nwogu, Ifeoma; Qi, Guo-Jun; Hill, Mitch
This work aims to generate natural and diverse group motions of multiple humans from textual descriptions. While single-person text-to-motion generation is extensively studied, it remains challenging to synthesize motions for more than one or two sub
External link:
http://arxiv.org/abs/2405.18483
We present a system that automatically brings street view imagery to life by populating it with naturally behaving, animated pedestrians and vehicles. Our approach is to remove existing people and vehicles from the input image, insert moving objects
External link:
http://arxiv.org/abs/2310.08534
Author:
Or-El, Roy; Luo, Xuan; Shan, Mengyi; Shechtman, Eli; Park, Jeong Joon; Kemelmacher-Shlizerman, Ira
We introduce a high resolution, 3D-consistent image and shape generation technique which we call StyleSDF. Our method is trained on single-view RGB data only, and stands on the shoulders of StyleGAN2 for image generation, while solving two main chall
External link:
http://arxiv.org/abs/2112.11427
Published in:
In Bioorganic Chemistry August 2024 149
Author:
Shan, Mengyi; Tsai, TJ
This paper tackles the problem of verifying the authenticity of speech recordings from world leaders. Whereas previous work on detecting deep fake or tampered audio focuses on scrutinizing an audio recording in isolation, we instead reframe the problem
External link:
http://arxiv.org/abs/2010.12173
Author:
Shan, Mengyi; Tsai, TJ
This paper studies the problem of automatically generating piano score following videos given an audio recording and raw sheet music images. Whereas previous works focus on synthetic sheet music where the data has been cleaned and preprocessed, we in
External link:
http://arxiv.org/abs/2007.14580
This article investigates a cross-modal retrieval problem in which a user would like to retrieve a passage of music from a MIDI file by taking a cell phone picture of several lines of sheet music. This problem is challenging for two reasons: it has a
External link:
http://arxiv.org/abs/2004.11724
This paper investigates a cross-modal retrieval problem in which a user would like to retrieve a passage of music from a MIDI file by taking a cell phone picture of a physical page of sheet music. While audio-sheet music retrieval has been explored b
External link:
http://arxiv.org/abs/2004.10347
Published in:
In Computers in Biology and Medicine January 2023 152