Zobrazeno 1 - 10
of 3 080
pro vyhledávání: '"A Daliri"'
Recently, 1-bit Large Language Models (LLMs) have emerged, showcasing an impressive combination of efficiency and performance that rivals traditional LLMs. Research by Wang et al. (2023); Ma et al. (2024) indicates that the performance of these 1-bit
Externí odkaz:
http://arxiv.org/abs/2411.01663
Suppose Alice has a distribution $P$ and Bob has a distribution $Q$. Alice wants to generate a sample $a\sim P$ and Bob a sample $b \sim Q$ such that $a = b$ with has as high of probability as possible. It is well-known that, by sampling from an opti
Externí odkaz:
http://arxiv.org/abs/2408.07978
Serving LLMs requires substantial memory due to the storage requirements of Key-Value (KV) embeddings in the KV cache, which grows with sequence length. An effective approach to compress KV cache is quantization. However, traditional quantization met
Externí odkaz:
http://arxiv.org/abs/2406.03482
The path to an autism diagnosis can be long and difficult, and delays can have serious consequences. Artificial intelligence can completely change the way autism is diagnosed, especially when it comes to situations where it is difficult to see the fi
Externí odkaz:
http://arxiv.org/abs/2311.04606
Autor:
Mahdieh Samei, Mahla Daliri, Masoumeh Sadeghi, Reza Ganji, Ali Parsa, Mohammad H. Ebrahimzadeh
Publikováno v:
BMC Musculoskeletal Disorders, Vol 25, Iss 1, Pp 1-13 (2024)
Abstract Background The use of a tourniquet is common during anterior cruciate ligament (ACL) reconstruction, offering convenience for the surgical procedure. However, the potential adverse effects of tourniquet use have gained increasing attention f
Externí odkaz:
https://doaj.org/article/580ccedf3f6a447c980e400608065a7b
Publikováno v:
Sustainable Earth Trends, Vol 4, Iss 4, Pp 73-82 (2024)
This study investigates the direct carbonylation of glycerol using a composite photocatalyst (TiO2 loaded with cellulose) and 2-cyanopyridine as a water-reducing agent. In this research, the performance of the photocatalytic system was evaluated unde
Externí odkaz:
https://doaj.org/article/0afd2b44af4e4d6eb3510544c78f01a7
Recently, Bessa et al. (PODS 2023) showed that sketches based on coordinated weighted sampling theoretically and empirically outperform popular linear sketching methods like Johnson-Lindentrauss projection and CountSketch for the ubiquitous problem o
Externí odkaz:
http://arxiv.org/abs/2309.16157
We prove a tight upper bound on the variance of the priority sampling method (aka sequential Poisson sampling). Our proof is significantly shorter and simpler than the original proof given by Mario Szegedy at STOC 2006, which resolved a conjecture by
Externí odkaz:
http://arxiv.org/abs/2308.05907
Publikováno v:
بومشناسی آبزیان, Vol 14, Iss 2, Pp 66-74 (2024)
The present study was carried out in the fishing grounds of small pelagic fish on Qeshm Island (namely Ramcha, Souza, Messen, and Salakh) from September 2022 to June 2023. Field sampling was performed by double-boat purse seiners and conducting 57 ne
Externí odkaz:
https://doaj.org/article/916301203ce94d79b383b6da25bc4bec
Dot-product attention mechanism plays a crucial role in modern deep architectures (e.g., Transformer) for sequence modeling, however, na\"ive exact computation of this model incurs quadratic time and memory complexities in sequence length, hindering
Externí odkaz:
http://arxiv.org/abs/2302.02451