Zobrazeno 1 - 10
of 75
pro vyhledávání: '"Do, Giang"'
Sparse mixture of experts (SMoE) have emerged as an effective approach for scaling large language models while keeping a constant computational cost. Regardless of several notable successes of SMoE, effective training such architecture remains elusiv
Externí odkaz:
http://arxiv.org/abs/2406.15883
Autor:
Pham, Quang, Do, Giang, Nguyen, Huy, Nguyen, TrungTin, Liu, Chenghao, Sartipi, Mina, Nguyen, Binh T., Ramasamy, Savitha, Li, Xiaoli, Hoi, Steven, Ho, Nhat
Sparse mixture of experts (SMoE) offers an appealing solution to scale up the model complexity beyond the mean of increasing the network's depth or width. However, effective training of SMoE has proven to be challenging due to the representation coll
Externí odkaz:
http://arxiv.org/abs/2402.02526
Autor:
Do, Giang, Le, Khiem, Pham, Quang, Nguyen, TrungTin, Doan, Thanh-Nam, Nguyen, Bint T., Liu, Chenghao, Ramasamy, Savitha, Li, Xiaoli, Hoi, Steven
By routing input tokens to only a few split experts, Sparse Mixture-of-Experts has enabled efficient training of large language models. Recent findings suggest that fixing the routers can achieve competitive performance by alleviating the collapsing
Externí odkaz:
http://arxiv.org/abs/2312.07035
Autor:
Schoenebeck, Sarita, Batool, Amna, Do, Giang, Darling, Sylvia, Grill, Gabriel, Wilkinson, Daricia, Khan, Mehtab, Toyama, Kentaro, Ashwell, Louise
Online harassment is a global problem. This article examines perceptions of harm and preferences for remedies associated with online harassment with nearly 4000 participants in 14 countries around the world. The countries in this work reflect a range
Externí odkaz:
http://arxiv.org/abs/2301.11715
Publikováno v:
Ho Chi Minh City Open University Journal of Science - Economics and Business Administration, Vol 14, Iss 3, Pp 18-43 (2024)
This study examines how psychological and external factors, including perceived uncertainty, resilience, value-added attribute, mass-media coverage, and travel constraints, affect perceived arousal, and lead to the behavior of seeking information abo
Externí odkaz:
https://doaj.org/article/88e60a40cfce441dbaa184ff6f63d69b
Autor:
Rachel E. Wittenberg, Kimberlee Gauvreau, Christopher P. Duggan, Xinwei Du, Do Giang, Kishore Jayanthi, Nestor Sandoval, Sivakumar Sivalingam, Xiaolei Zhao, Kathy J. Jenkins
Publikováno v:
Journal of the American Heart Association: Cardiovascular and Cerebrovascular Disease, Vol 13, Iss 13 (2024)
Background High energy requirements and poor feeding can lead to growth failure in patients with ventricular septal defect (VSD), but effects of preoperative malnutrition on surgical outcomes are poorly understood, especially in low‐resource settin
Externí odkaz:
https://doaj.org/article/74f36ae8c9604fad903c4cd797316a3e
Autor:
Paul, Larissa, Plum, Matthias, Schaufel, Merlin, Bretz, Thomas, Do, Giang, Hewitt, John W., Maslowski, Frank, Rehbein, Florian, Schäfer, Johannes, Zink, Adrian
IceAct is a proposed surface array of compact (50 cm diameter) and cost-effective Imaging Air Cherenkov Telescopes installed at the site of the IceCube Neutrino Observatory at the geographic South Pole. Since January 2019, two IceAct telescope demons
Externí odkaz:
http://arxiv.org/abs/2108.05572
Autor:
Nam, DO Giang, author, Kien, TRAN, author
Publikováno v:
Invalidity, 2022.
Externí odkaz:
https://doi.org/10.1093/oso/9780192859341.003.0023
Autor:
Wittenberg, Rachel E., Gauvreau, Kimberlee, Duggan, Christopher P., Xinwei Du, Do Giang, Jayanthi, Kishore, Sandoval, Nestor, Sivalingam, Sivakumar, Xiaolei Zhao, Jenkins, Kathy J.
Publikováno v:
Journal of the American Heart Association; 7/2/2024, Vol. 13 Issue 13, p1-13, 13p
Publikováno v:
In Chemical Engineering and Processing - Process Intensification February 2020 148