Zobrazeno 1 - 10
of 431
pro vyhledávání: '"Wu, Ziheng"'
Autor:
Hu, Yujian, Xiang, Yilang, Zhou, Yan-Jie, He, Yangyan, Yang, Shifeng, Du, Xiaolong, Den, Chunlan, Xu, Youyao, Wang, Gaofeng, Ding, Zhengyao, Huang, Jingyong, Zhao, Wenjun, Wu, Xuejun, Li, Donglin, Zhu, Qianqian, Li, Zhenjiang, Qiu, Chenyang, Wu, Ziheng, He, Yunjun, Tian, Chen, Qiu, Yihui, Lin, Zuodong, Zhang, Xiaolong, He, Yuan, Yuan, Zhenpeng, Zhou, Xiaoxiang, Fan, Rong, Chen, Ruihan, Guo, Wenchao, Zhang, Jianpeng, Mok, Tony C. W., Li, Zi, Lu, Le, Lang, Dehai, Li, Xiaoqiang, Wang, Guofu, Lu, Wei, Huang, Zhengxing, Xu, Minfeng, Zhang, Hongkun
Chest pain symptoms are highly prevalent in emergency departments (EDs), where acute aortic syndrome (AAS) is a catastrophic cardiovascular emergency with a high fatality rate, especially when timely and accurate treatment is not administered. Howeve
Externí odkaz:
http://arxiv.org/abs/2406.15222
Recently, diffusion-based deep generative models (e.g., Stable Diffusion) have shown impressive results in text-to-image synthesis. However, current text-to-image models often require multiple passes of prompt engineering by humans in order to produc
Externí odkaz:
http://arxiv.org/abs/2311.06752
Fine-tuning pre-trained Vision Transformers (ViTs) has showcased significant promise in enhancing visual recognition tasks. Yet, the demand for individualized and comprehensive fine-tuning processes for each task entails substantial computational and
Externí odkaz:
http://arxiv.org/abs/2310.05393
Stable Diffusion web UI (SD-WebUI) is a comprehensive project that provides a browser interface based on Gradio library for Stable Diffusion models. In this paper, We propose a novel WebUI plugin called EasyPhoto, which enables the generation of AI p
Externí odkaz:
http://arxiv.org/abs/2310.04672
Self-attention-based vision transformers (ViTs) have emerged as a highly competitive architecture in computer vision. Unlike convolutional neural networks (CNNs), ViTs are capable of global information sharing. With the development of various structu
Externí odkaz:
http://arxiv.org/abs/2309.12424
Autor:
Liu, Yang, Yu, Cheng, Shang, Lei, He, Yongyi, Wu, Ziheng, Wang, Xingjun, Xu, Chao, Xie, Haoyu, Wang, Weida, Zhao, Yuze, Zhu, Lin, Cheng, Chen, Chen, Weitao, Yao, Yuan, Zhou, Wenmeng, Xu, Jiaqi, Wang, Qiang, Chen, Yingda, Xie, Xuansong, Sun, Baigui
Recent advancement in personalized image generation have unveiled the intriguing capability of pre-trained text-to-image models on learning identity information from a collection of portrait images. However, existing solutions are vulnerable in produ
Externí odkaz:
http://arxiv.org/abs/2308.14256
In recent years, diffusion models have emerged as the most powerful approach in image synthesis. However, applying these models directly to video synthesis presents challenges, as it often leads to noticeable flickering contents. Although recently pr
Externí odkaz:
http://arxiv.org/abs/2308.03463
This paper presents a new vision Transformer, Scale-Aware Modulation Transformer (SMT), that can handle various downstream tasks efficiently by combining the convolutional network and vision Transformer. The proposed Scale-Aware Modulation (SAM) in t
Externí odkaz:
http://arxiv.org/abs/2307.08579
Visual question answering (VQA) is a critical multimodal task in which an agent must answer questions according to the visual cue. Unfortunately, language bias is a common problem in VQA, which refers to the model generating answers only by associati
Externí odkaz:
http://arxiv.org/abs/2304.01647
We develop an all-in-one computer vision toolbox named EasyCV to facilitate the use of various SOTA computer vision methods. Recently, we add YOLOX-PAI, an improved version of YOLOX, into EasyCV. We conduct ablation studies to investigate the influen
Externí odkaz:
http://arxiv.org/abs/2208.13040