Showing 1 - 10 of 15,351 for search: '"Fishman, A P"'
We train, for the first time, large language models using FP8 precision on datasets up to 2 trillion tokens -- a 20-fold increase over previous limits. Through these extended training runs, we uncover critical instabilities in FP8 training that were
External link:
http://arxiv.org/abs/2409.12517
We study the Berry curvature and Chern number of a non-collinear spin state on a honeycomb lattice that evolves from coplanar to ferromagnetic with a magnetic field applied along the $z$ axis. The coplanar state is stabilized by nearest-neighbor ferr
External link:
http://arxiv.org/abs/2409.07319
We have investigated the title question for both a subset of the W4-11 total atomization energies benchmark, and for the A24x8 noncovalent interactions benchmark. Overall, counterpoise corrections to post-CCSD(T) contributions are about two orders of
External link:
http://arxiv.org/abs/2408.10034
Author:
Refael, Yehonathan, Hakim, Adam, Greenberg, Lev, Aviv, Tal, Lokam, Satya, Fishman, Ben, Seidman, Shachar
Large language models (LLMs) have recently seen widespread adoption, in both academia and industry. As these models grow, they become valuable intellectual property (IP), reflecting enormous investments by their owners. Moreover, the high cost of clo
External link:
http://arxiv.org/abs/2407.10886
Author:
Wang, Boyang, Sridhar, Nikhil, Feng, Chao, Van der Merwe, Mark, Fishman, Adam, Fazeli, Nima, Park, Jeong Joon
We propose a robot learning method for communicating, planning, and executing a wide range of tasks, dubbed This&That. We achieve robot planning for general tasks by leveraging the power of video generative models trained on internet-scale data conta
External link:
http://arxiv.org/abs/2407.05530
Tensor network contractions are widely used in statistical physics, quantum computing, and computer science. We introduce a method to efficiently approximate tensor network contractions using low-rank approximations, where each intermediate tensor ge
External link:
http://arxiv.org/abs/2406.09769
Author:
Rad, Shervin Salehi, Muhlbaier, Micheal, Fishman, Oleg, Chevinly, Javad, Nadi, Elias, Zhang, Hua, Lu, Fei
The design and development of power electronics converters pose a multitude of challenges. The evaluation of power electronics converters, particularly when operating at high power levels, presents a significant task, offering designers a deeper unde
External link:
http://arxiv.org/abs/2405.13243
Published in:
J. Phys. Chem. A 128, 7462-7470 (2024)
Basis set extrapolations are typically rationalized either from analytical arguments involving the partial-wave or principal expansions of the correlation energy in helium-like systems, or from fitting extrapolation parameters to reference energetics
External link:
http://arxiv.org/abs/2405.04658
Published in:
In: Longo, L., Lapuschkin, S., Seifert, C. (eds) Explainable Artificial Intelligence. xAI 2024. Communications in Computer and Information Science, vol 2155. Springer, Cham
Deep learning is dramatically transforming the field of medical imaging and radiology, enabling the identification of pathologies in medical images, including computed tomography (CT) and X-ray scans. However, the performance of deep learning models,
External link:
http://arxiv.org/abs/2404.12832
Author:
Baur, Sebastien, Nabulsi, Zaid, Weng, Wei-Hung, Garrison, Jake, Blankemeier, Louis, Fishman, Sam, Chen, Christina, Kakarmath, Sujay, Maimbolwa, Minyoi, Sanjase, Nsala, Shuma, Brian, Matias, Yossi, Corrado, Greg S., Patel, Shwetak, Shetty, Shravya, Prabhakara, Shruthi, Muyoyeta, Monde, Ardila, Diego
Health acoustic sounds such as coughs and breaths are known to contain useful health signals with significant potential for monitoring health and disease, yet are underexplored in the medical machine learning community. The existing deep learning sys
External link:
http://arxiv.org/abs/2403.02522