Showing 1 - 10 of 127,459 results for search: '"zhiyong, An"'
Authors:
Chen, Zhe, Wang, Weiyun, Cao, Yue, Liu, Yangzhou, Gao, Zhangwei, Cui, Erfei, Zhu, Jinguo, Ye, Shenglong, Tian, Hao, Liu, Zhaoyang, Gu, Lixin, Wang, Xuehui, Li, Qingyun, Ren, Yimin, Chen, Zixuan, Luo, Jiapeng, Wang, Jiahao, Jiang, Tan, Wang, Bo, He, Conghui, Shi, Botian, Zhang, Xingcheng, Lv, Han, Wang, Yi, Shao, Wenqi, Chu, Pei, Tu, Zhongying, He, Tong, Wu, Zhiyong, Deng, Huipeng, Ge, Jiaye, Chen, Kai, Dou, Min, Lu, Lewei, Zhu, Xizhou, Lu, Tong, Lin, Dahua, Qiao, Yu, Dai, Jifeng, Wang, Wenhai
We introduce InternVL 2.5, an advanced multimodal large language model (MLLM) series that builds upon InternVL 2.0, maintaining its core model architecture while introducing significant enhancements in training and testing strategies as well as data
External link:
http://arxiv.org/abs/2412.05271
Authors:
Wang, Lening, Zheng, Wenzhao, Du, Dalong, Zhang, Yunpeng, Ren, Yilong, Jiang, Han, Cui, Zhiyong, Yu, Haiyang, Zhou, Jie, Lu, Jiwen, Zhang, Shanghang
4D driving simulation is essential for developing realistic autonomous driving simulators. Despite advancements in existing methods for generating driving scenes, significant challenges remain in view transformation and spatial-temporal dynamic model
External link:
http://arxiv.org/abs/2412.05280
Data is undoubtedly becoming a commodity like oil, land, and labor in the 21st century. Although there have been many successful marketplaces for data trading, the existing data marketplaces lack consideration of the case where buyers want to acquire
External link:
http://arxiv.org/abs/2412.04853
Authors:
Yang, Wenzhe, Wang, Sheng, Huang, Shixun, Liao, Yuyang, Sun, Yuan, Freire, Juliana, Peng, Zhiyong
There has been increased interest in data search as a means to find relevant datasets or data points in data lakes and repositories. Although approaches have been proposed to support spatial dataset search and data point search, they consider the two
External link:
http://arxiv.org/abs/2412.04805
Vector set search, an underexplored similarity search paradigm, aims to find vector sets similar to a query set. This search paradigm leverages the inherent structural alignment between sets and real-world entities to model more fine-grained and cons
External link:
http://arxiv.org/abs/2412.03301
Authors:
Kong, Weijie, Tian, Qi, Zhang, Zijian, Min, Rox, Dai, Zuozhuo, Zhou, Jin, Xiong, Jiangfeng, Li, Xin, Wu, Bo, Zhang, Jianwei, Wu, Kathrina, Lin, Qin, Yuan, Junkun, Long, Yanxin, Wang, Aladdin, Wang, Andong, Li, Changlin, Huang, Duojun, Yang, Fang, Tan, Hao, Wang, Hongmei, Song, Jacob, Bai, Jiawang, Wu, Jianbing, Xue, Jinbao, Wang, Joey, Wang, Kai, Liu, Mengyang, Li, Pengyu, Li, Shuai, Wang, Weiyan, Yu, Wenqing, Deng, Xinchi, Li, Yang, Chen, Yi, Cui, Yutao, Peng, Yuanbo, Yu, Zhentao, He, Zhiyu, Xu, Zhiyong, Zhou, Zixiang, Xu, Zunnan, Tao, Yangyu, Lu, Qinglin, Liu, Songtao, Zhou, Daquan, Wang, Hongfa, Yang, Yong, Wang, Di, Liu, Yuhong, Jiang, Jie, Zhong, Caesar
Recent advancements in video generation have significantly impacted daily life for both individuals and industries. However, the leading video generation models remain closed-source, resulting in a notable performance gap between industry capabilitie
External link:
http://arxiv.org/abs/2412.03603
The k-means algorithm can simplify large-scale spatial vectors, such as 2D geo-locations and 3D point clouds, to support fast analytics and learning. However, when processing large-scale datasets, existing k-means algorithms have been developed to ac
External link:
http://arxiv.org/abs/2412.02244
Authors:
Zhang, Qizhe, Cheng, Aosong, Lu, Ming, Zhuo, Zhiyong, Wang, Minqi, Cao, Jiajun, Guo, Shaobo, She, Qi, Zhang, Shanghang
Large vision-language models (VLMs) often rely on a substantial number of visual tokens when interacting with large language models (LLMs), which has proven to be inefficient. Recent efforts have aimed to accelerate VLM inference by pruning visual to
External link:
http://arxiv.org/abs/2412.01818
At present, in lattice-based linearly homomorphic signature schemes, especially under the standard model, there are very few schemes with tight security. This paper constructs the first lattice-based linearly homomorphic signature scheme that achieve
External link:
http://arxiv.org/abs/2412.01641
Authors:
Zhou, Shuoyi, Zhou, Yixuan, Li, Weiqing, Chen, Jun, Ye, Runchuan, Wu, Weihao, Lin, Zijian, Lei, Shun, Wu, Zhiyong
This paper describes the zero-shot spontaneous style TTS system for the ISCSLP 2024 Conversational Voice Clone Challenge (CoVoC). We propose a LLaMA-based codec language model with a delay pattern to achieve spontaneous style voice cloning. To improv
External link:
http://arxiv.org/abs/2412.01100