Showing 1 - 10 of 16,089 results for search: '"AbdelFattah AS"'
When predicting the next token in a sequence, vanilla transformers compute attention over all previous tokens, resulting in quadratic scaling of compute with sequence length. State-space models compress the entire sequence of tokens into a fixed-dime
External link:
http://arxiv.org/abs/2411.17685
Author:
Abdelfattah, Ahmad, Ahrens, Willow, Anzt, Hartwig, Armstrong, Chris, Brock, Ben, Buluc, Aydin, Busato, Federico, Cojean, Terry, Davis, Tim, Demmel, Jim, Dinh, Grace, Gardener, David, Fiala, Jan, Gates, Mark, Haider, Azzam, Imamura, Toshiyuki, Lara, Pedro Valero, Moreira, Jose, Li, Sherry, Luszczek, Piotr, Melichenko, Max, Moeira, Jose, Mokwinski, Yvan, Murray, Riley, Patty, Spencer, Peles, Slaven, Ribizel, Tobias, Riedy, Jason, Rajamanickam, Siva, Sao, Piyush, Shantharam, Manu, Teranishi, Keita, Tomov, Stan, Tsai, Yu-Hsiang, Weichelt, Heiko
The standardization of an interface for dense linear algebra operations in the BLAS standard has enabled interoperability between different linear algebra libraries, thereby boosting the success of scientific computing, in particular in scientific HP
External link:
http://arxiv.org/abs/2411.13259
Author:
Chen, Yuzong, AbouElhamayed, Ahmed F., Dai, Xilai, Wang, Yang, Andronic, Marta, Constantinides, George A., Abdelfattah, Mohamed S.
Large language models (LLMs) have demonstrated remarkable performance across various machine learning tasks. Yet the substantial memory footprint of LLMs significantly hinders their deployment. In this paper, we improve the accessibility of LLMs thro
External link:
http://arxiv.org/abs/2411.11745
Optimal control theory extending from the calculus of variations has not been used to study the wind turbine power system (WTPS) control problem, which aims at achieving two targets: (i) maximizing power generation in lower wind speed conditions; and
External link:
http://arxiv.org/abs/2411.09830
Author:
Meskini, Majdi, Saafi, Houssem, Mlika, Abdelfattah, Arsicault, Marc, Zeghloul, Said, Laribi, Med Amine
Published in:
Robotica, 2023, 41 (10), pp.3175-3194
This paper focuses on developing a novel hybrid-haptic (nHH) device with a remote center of rotation with 4 DOFs (degrees of freedom) intended to be used as a haptic device. The new architecture is composed of two chains handling each one a part of
External link:
http://arxiv.org/abs/2410.00481
Tracing a student's knowledge growth given their past exercise answers is a vital objective in automatic tutoring systems to customize the learning experience. Yet, achieving this objective is a non-trivial task as it involves modeling the knowledge
External link:
http://arxiv.org/abs/2410.01836
Bit-level sparsity methods skip ineffectual zero-bit operations and are typically applicable within bit-serial deep learning accelerators. This type of sparsity at the bit-level is especially interesting because it is both orthogonal and compatible w
External link:
http://arxiv.org/abs/2409.05227
Author:
Chang, Chi-Chih, Lin, Wei-Cheng, Lin, Chien-Yu, Chen, Chong-Yan, Hu, Yu-Fang, Wang, Pei-Shuo, Huang, Ning-Chi, Ceze, Luis, Abdelfattah, Mohamed S., Wu, Kai-Chiang
Post-training KV-Cache compression methods typically either sample a subset of effectual tokens or quantize the data into lower numerical bit width. However, these methods cannot exploit redundancy in the hidden dimension of the KV tensors. This pape
External link:
http://arxiv.org/abs/2407.21118
Author:
Abdelfattah, Amr S.
Cloud-native systems represent a significant leap in constructing scalable, large systems, employing microservice architecture as a key element in developing distributed systems through self-contained components. However, the decentralized nature of
External link:
http://arxiv.org/abs/2407.16873
FPGAs offer a flexible platform for accelerating deep neural network (DNN) inference, particularly for non-uniform workloads featuring fine-grained unstructured sparsity and mixed arithmetic precision. To leverage these redundancies, an emerging appr
External link:
http://arxiv.org/abs/2407.06033