Zobrazeno 1 - 10
of 40
pro vyhledávání: '"Qiu, Jiantao"'
Autor:
Bai, Tianyi, Yang, Ling, Wong, Zhen Hao, Peng, Jiahui, Zhuang, Xinlin, Zhang, Chi, Wu, Lijun, Qiu, Jiantao, Zhang, Wentao, Yuan, Binhang, He, Conghui
Efficient data selection is crucial to accelerate the pretraining of large language models (LLMs). While various methods have been proposed to enhance data efficiency, limited research has addressed the inherent conflicts between these approaches to
Externí odkaz:
http://arxiv.org/abs/2410.08102
Autor:
Zhang, Chi, Zhong, Huaping, Zhang, Kuan, Chai, Chengliang, Wang, Rui, Zhuang, Xinlin, Bai, Tianyi, Qiu, Jiantao, Cao, Lei, Fan, Ju, Yuan, Ye, Wang, Guoren, He, Conghui
Data selection is of great significance in pre-training large language models, given the variation in quality within the large-scale available training corpora. To achieve this, researchers are currently investigating the use of data influence to mea
Externí odkaz:
http://arxiv.org/abs/2409.16986
Autor:
Cai, Zheng, Cao, Maosong, Chen, Haojiong, Chen, Kai, Chen, Keyu, Chen, Xin, Chen, Xun, Chen, Zehui, Chen, Zhi, Chu, Pei, Dong, Xiaoyi, Duan, Haodong, Fan, Qi, Fei, Zhaoye, Gao, Yang, Ge, Jiaye, Gu, Chenya, Gu, Yuzhe, Gui, Tao, Guo, Aijia, Guo, Qipeng, He, Conghui, Hu, Yingfan, Huang, Ting, Jiang, Tao, Jiao, Penglong, Jin, Zhenjiang, Lei, Zhikai, Li, Jiaxing, Li, Jingwen, Li, Linyang, Li, Shuaibin, Li, Wei, Li, Yining, Liu, Hongwei, Liu, Jiangning, Hong, Jiawei, Liu, Kaiwen, Liu, Kuikun, Liu, Xiaoran, Lv, Chengqi, Lv, Haijun, Lv, Kai, Ma, Li, Ma, Runyuan, Ma, Zerun, Ning, Wenchang, Ouyang, Linke, Qiu, Jiantao, Qu, Yuan, Shang, Fukai, Shao, Yunfan, Song, Demin, Song, Zifan, Sui, Zhihao, Sun, Peng, Sun, Yu, Tang, Huanze, Wang, Bin, Wang, Guoteng, Wang, Jiaqi, Wang, Jiayu, Wang, Rui, Wang, Yudong, Wang, Ziyi, Wei, Xingjian, Weng, Qizhen, Wu, Fan, Xiong, Yingtong, Xu, Chao, Xu, Ruiliang, Yan, Hang, Yan, Yirong, Yang, Xiaogui, Ye, Haochen, Ying, Huaiyuan, Yu, Jia, Yu, Jing, Zang, Yuhang, Zhang, Chuyu, Zhang, Li, Zhang, Pan, Zhang, Peng, Zhang, Ruijie, Zhang, Shuo, Zhang, Songyang, Zhang, Wenjian, Zhang, Wenwei, Zhang, Xingcheng, Zhang, Xinyue, Zhao, Hui, Zhao, Qian, Zhao, Xiaomeng, Zhou, Fengzhe, Zhou, Zaida, Zhuo, Jingming, Zou, Yicheng, Qiu, Xipeng, Qiao, Yu, Lin, Dahua
The evolution of Large Language Models (LLMs) like ChatGPT and GPT-4 has sparked discussions on the advent of Artificial General Intelligence (AGI). However, replicating such advancements in open-source models has been challenging. This paper introdu
Externí odkaz:
http://arxiv.org/abs/2403.17297
Autor:
Qiu, Jiantao, Lv, Haijun, Jin, Zhenjiang, Wang, Rui, Ning, Wenchang, Yu, Jia, Zhang, ChaoBin, Li, Zhenxiang, Chu, Pei, Qu, Yuan, Shi, Jin, Lu, Lindong, Peng, Runyu, Zeng, Zhiyuan, Tang, Huanze, Lei, Zhikai, Hong, Jiawei, Chen, Keyu, Fei, Zhaoye, Xu, Ruiliang, Li, Wei, Tu, Zhongying, Dahua, Lin, Qiao, Yu, Yan, Hang, He, Conghui
This paper presents WanJuan-CC, a safe and high-quality open-sourced English webtext dataset derived from Common Crawl data. The study addresses the challenges of constructing large-scale pre-training datasets for language models, which require vast
Externí odkaz:
http://arxiv.org/abs/2402.19282
Autor:
He, Conghui, Jin, Zhenjiang, Xu, Chao, Qiu, Jiantao, Wang, Bin, Li, Wei, Yan, Hang, Wang, Jiaqi, Lin, Dahua
The rise in popularity of ChatGPT and GPT-4 has significantly accelerated the development of large models, leading to the creation of numerous impressive large language models(LLMs) and multimodal large language models (MLLMs). These cutting-edge mod
Externí odkaz:
http://arxiv.org/abs/2308.10755
Autor:
Xu, Yuanfan, Yu, Jincheng, Tang, Jiahao, Qiu, Jiantao, Wang, Jian, Shen, Yuan, Wang, Yu, Yang, Huazhong
Autonomous exploration and mapping of unknown terrains employing single or multiple robots is an essential task in mobile robotics and has therefore been widely investigated. Nevertheless, given the lack of unified data sets, metrics, and platforms t
Externí odkaz:
http://arxiv.org/abs/2202.11931
Autor:
Song, Hongyu, Yu, Jincheng, Qiu, Jiantao, Sun, Zhixiao, Lang, Kuijun, Luo, Qing, Shen, Yuan, Wang, Yu
For scenes such as floods and earthquakes, the disaster area is large, and rescue time is tight. Multi-UAV exploration is more efficient than a single UAV. Existing UAV exploration work is modeled as a Coverage Path Planning (CPP) task to achieve ful
Externí odkaz:
http://arxiv.org/abs/2201.10150
Multi-agent formation as well as obstacle avoidance is one of the most actively studied topics in the field of multi-agent systems. Although some classic controllers like model predictive control (MPC) and fuzzy control achieve a certain measure of s
Externí odkaz:
http://arxiv.org/abs/2111.07334
Autor:
Xing, Yu, Liang, Shuang, Sui, Lingzhi, Jia, Xijie, Qiu, Jiantao, Liu, Xin, Wang, Yushun, Wang, Yu, Shan, Yi
The convolutional neural network (CNN) has become a state-of-the-art method for several artificial intelligence domains in recent years. The increasingly complex CNN models are both computation-bound and I/O-bound. FPGA-based accelerators driven by c
Externí odkaz:
http://arxiv.org/abs/1902.07463
Publikováno v:
Arabian Journal for Science & Engineering (Springer Science & Business Media B.V. ). 7/1/2022, Vol. 47 Issue 7, p8081-8091. 11p.