Showing 1 - 10 of 10,939 for search: '"Cheng, Zhi"'
Achieving a balance between accuracy and efficiency is a critical challenge in facial landmark detection (FLD). This paper introduces the Parallel Optimal Position Search (POPoS), a high-precision encoding-decoding framework designed to address the f
External link:
http://arxiv.org/abs/2410.09583
Authors:
Ding, Zhe, Chen, Zhousheng, Fan, Xiaodong, Zhang, Weihui, Fu, Jun, Sun, Yumeng, Cheng, Zhi, Yu, Zhiwei, Yang, Kai, Li, Yuxin, Liu, Xing, Wang, Pengfei, Wang, Ya, Jiang, Jianhua, Zeng, Hualing, Zeng, Changgan, Shi, Guosheng, Shi, Fazhan, Du, Jiangfeng
The one-dimensional side gate based on graphene edges shows a significant capability of reducing the channel length of field-effect transistors, further increasing the integration density of semiconductor devices. The nano-scale electric field distri
External link:
http://arxiv.org/abs/2409.14942
Fashion image editing is a crucial tool for designers to convey their creative ideas by visualizing design concepts interactively. Current fashion image editing techniques, though advanced with multimodal prompts and powerful diffusion models, often
External link:
http://arxiv.org/abs/2409.01086
Authors:
Wang, Jue, Lin, Yuxiang, Yuan, Tianshuo, Cheng, Zhi-Qi, Wang, Xiaolong, GH, Jiao, Chen, Wei, Peng, Xiaojiang
Combining Vision Large Language Models (VLLMs) with diffusion models offers a powerful method for executing image editing tasks based on human language instructions. However, language instructions alone often fall short in accurately conveying user r
External link:
http://arxiv.org/abs/2408.12429
Authors:
Shikarpur, Nithya, Dendukuri, Krishna Maneesha, Wu, Yusong, Caillon, Antoine, Huang, Cheng-Zhi Anna
Hindustani music is a performance-driven oral tradition that exhibits the rendition of rich melodic patterns. In this paper, we focus on generative modeling of singers' vocal melodies extracted from audio recordings, as the voice is musically promine
External link:
http://arxiv.org/abs/2408.12658
Authors:
Li, Yang, Cai, Wen-Qi, Ren, Ji-Gang, Wang, Chao-Ze, Yang, Meng, Zhang, Liang, Wu, Hui-Ying, Chang, Liang, Wu, Jin-Cai, Jin, Biao, Xue, Hua-Jian, Li, Xue-Jiao, Liu, Hui, Yu, Guang-Wen, Tao, Xue-Ying, Chen, Ting, Liu, Chong-Fei, Luo, Wen-Bin, Zhou, Jie, Yong, Hai-Lin, Li, Yu-Huai, Li, Feng-Zhi, Jiang, Cong, Chen, Hao-Ze, Wu, Chao, Tong, Xin-Hai, Xie, Si-Jiang, Zhou, Fei, Liu, Wei-Yue, Liu, Nai-Le, Li, Li, Xu, Feihu, Cao, Yuan, Yin, Juan, Shu, Rong, Wang, Xiang-Bin, Zhang, Qiang, Wang, Jian-Yu, Liao, Sheng-Kai, Peng, Cheng-Zhi, Pan, Jian-Wei
A quantum network provides an infrastructure connecting quantum devices with revolutionary computing, sensing, and communication capabilities. As the best-known application of a quantum network, quantum key distribution (QKD) shares secure keys guara
External link:
http://arxiv.org/abs/2408.10994
Authors:
Cheng, Zebang, Tu, Shuyuan, Huang, Dawei, Li, Minghan, Peng, Xiaojiang, Cheng, Zhi-Qi, Hauptmann, Alexander G.
This paper presents our winning approach for the MER-NOISE and MER-OV tracks of the MER2024 Challenge on multimodal emotion recognition. Our system leverages the advanced emotional understanding capabilities of Emotion-LLaMA to generate high-quality
External link:
http://arxiv.org/abs/2408.10500
Authors:
Xu, Chao, Sun, Mingze, Cheng, Zhi-Qi, Wang, Fei, Liu, Yang, Sun, Baigui, Huang, Ruqi, Hauptmann, Alexander
In this paper, we propose a novel framework, Combo, for harmonious co-speech holistic 3D human motion generation and efficient customizable adaption. In particular, we identify one fundamental challenge as the multiple-input-multiple-output (MIM
External link:
http://arxiv.org/abs/2408.09397
Authors:
Cheng, Zhi-Qi, Dong, Yifei, Shi, Aike, Liu, Wei, Hu, Yuzhi, O'Connor, Jason, Hauptmann, Alexander, Whitefoot, Kate
The electric vehicle (EV) battery supply chain's vulnerability to disruptions necessitates advanced predictive analytics. We present SHIELD (Schema-based Hierarchical Induction for EV supply chain Disruption), a system integrating Large Language Mode
External link:
http://arxiv.org/abs/2408.05357
Authors:
Zhuang, Shi-Chang, Li, Bo, Zheng, Ming-Yang, Zeng, Yi-Xi, Wu, Hui-Nan, Li, Guang-Bing, Yao, Quan, Xie, Xiu-Ping, Li, Yu-Huai, Qin, Hao, You, Li-Xing, Xu, Fei-Hu, Yin, Juan, Cao, Yuan, Zhang, Qiang, Peng, Cheng-Zhi, Pan, Jian-Wei
Entangled photons are crucial resources for quantum communications and networking. Here, we present an ultra-bright polarization-entangled photon source based on a periodically poled lithium niobate waveguide designed for practical quantum commun
External link:
http://arxiv.org/abs/2408.04361