Výsledky vyhledávání

Report

Autor: Zhang, Bo-Wen, Wang, Liangdong, Li, Jijie, Gu, Shuhao, Wu, Xinya, Zhang, Zhengduo, Gao, Boyan, Ao, Yulong, Liu, Guang

This paper introduces the Aquila2 series, which comprises a wide range of bilingual models with parameter sizes of 7, 34, and 70 billion. These models are trained based on an innovative framework named HeuriMentor (HM), which offers real-time insight

Externí odkaz: http://arxiv.org/abs/2408.07410

Zobrazit plný text záznamu

Report

AquilaMoE: Efficient Training for MoE Models with Scale-Up and Scale-Out Strategies

In recent years, with the rapid application of large language models across various fields, the scale of these models has gradually increased, and the resources required for their pre-training have grown exponentially. Training an LLM from scratch wi

Externí odkaz: http://arxiv.org/abs/2408.06567

Zobrazit plný text záznamu

Report

End-to-end Adaptive Distributed Training on PaddlePaddle

Autor: Ao, Yulong, Wu, Zhihua, Yu, Dianhai, Gong, Weibao, Kui, Zhiqing, Zhang, Minxu, Ye, Zilingfeng, Shen, Liang, Ma, Yanjun, Wu, Tian, Wang, Haifeng, Zeng, Wei, Yang, Chao

Distributed training has become a pervasive and effective approach for training a large neural network (NN) model with processing massive data. However, it is very challenging to satisfy requirements from various NN models, diverse computing resource

Externí odkaz: http://arxiv.org/abs/2112.02752

Zobrazit plný text záznamu

Report

Adaptive SpMV/SpMSpV on GPUs for Input Vectors of Varied Sparsity

Autor: Li, Min, Ao, Yulong, Yang, Chao

Despite numerous efforts for optimizing the performance of Sparse Matrix and Vector Multiplication (SpMV) on modern hardware architectures, few works are done to its sparse counterpart, Sparse Matrix and Sparse Vector Multiplication (SpMSpV), not to

Externí odkaz: http://arxiv.org/abs/2006.16767

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

AutoWM: a novel domain-specific tool for universal multi-/many-core accelerations of the WRF cloud microphysics.

Autor: Zhang, Peng, Yang, Chao, Ao, Yulong

Publikováno v: Cluster Computing; Jun2021, Vol. 24 Issue 2, p935-951, 17p

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Vyhledávací nástroje:

Upřesnit hledání