Zobrazeno 1 - 10
of 35
pro vyhledávání: '"Zhiquan Lai"'
Publikováno v:
Applied Sciences, Vol 14, Iss 21, p 10025 (2024)
As Deep Neural Networks (DNNs) continue to increase in complexity, the computational demands of their training have become a significant bottleneck. Low-precision training has emerged as a crucial strategy, wherein full-precision values are quantized
Externí odkaz:
https://doaj.org/article/f4ef1ca0076647f3a20a60d4f1f249d1
Autor:
Zhiquan Lai, Shengwei Li, Xudong Tang, Keshi Ge, Weijie Liu, Yabo Duan, Linbo Qiao, Dongsheng Li
Publikováno v:
IEEE Transactions on Parallel and Distributed Systems. 34:1466-1478
Foundation models are becoming the dominant deep learning technologies. Pretraining a foundation model is always time-consumed due to the large scale of both the model parameter and training dataset. Besides being computing-intensive, the training pr
Publikováno v:
IEEE Transactions on Knowledge and Data Engineering. 35:3952-3965
Graph neural networks (GNN) have been proven to be mature enough for handling graph-structured data on node-level graph representation learning tasks. However, the graph pooling technique for learning expressive graph-level representation is critical
Publikováno v:
IEEE Journal on Selected Areas in Communications. 41:941-963
Publikováno v:
Science China Information Sciences. 66
Publikováno v:
Tsinghua Science and Technology. 27:114-126
In distributed training, increasing batch size can improve parallelism, but it can also bring many difficulties to the training process and cause training errors. In this work, we investigate the occurrence of training errors in theory and train ResN
Publikováno v:
2022 IEEE Intl Conf on Parallel & Distributed Processing with Applications, Big Data & Cloud Computing, Sustainable Computing & Communications, Social Computing & Networking (ISPA/BDCloud/SocialCom/SustainCom).
Publikováno v:
2022 IEEE 7th International Conference on Smart Cloud (SmartCloud).
Publikováno v:
2022 IEEE International Conference on Cluster Computing (CLUSTER).
Publikováno v:
2022 IEEE International Conference on Cluster Computing (CLUSTER).