Showing 1 - 10
of 845,381
for search: '"Chi, AS"'
Author:
Huang, Ning-Chi, Chang, Chi-Chih, Lin, Wei-Cheng, Taka, Endri, Marculescu, Diana, Wu, Kai-Chiang
$N{:}M$ sparsity is an emerging model compression method supported by a growing number of accelerators to speed up sparse matrix multiplication in deep neural networks. Most existing $N{:}M$ sparsity methods compress neural networks with a uniform setting …
External link:
http://arxiv.org/abs/2409.09708
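The $N{:}M$ pattern mentioned in this abstract keeps at most $N$ non-zero weights in every group of $M$ consecutive weights. A minimal sketch of applying a uniform 2:4 magnitude mask (function name and shapes are illustrative, not taken from the paper):

```python
import numpy as np

def apply_nm_sparsity(weights, n=2, m=4):
    """Keep the n largest-magnitude entries in each group of m consecutive
    weights and zero out the rest (a uniform N:M sparsity mask)."""
    w = weights.reshape(-1, m)                       # group into rows of m
    # indices of the (m - n) smallest-magnitude entries in each group
    drop = np.argsort(np.abs(w), axis=1)[:, : m - n]
    mask = np.ones_like(w, dtype=bool)
    np.put_along_axis(mask, drop, False, axis=1)
    return np.where(mask, w, 0.0).reshape(weights.shape)

w = np.array([0.9, -0.1, 0.4, 0.05, -0.7, 0.2, 0.03, 0.6])
sparse_w = apply_nm_sparsity(w)   # each group of 4 keeps its 2 largest magnitudes
```

With 2:4 sparsity every row of a weight matrix is exactly 50% zero in a hardware-friendly layout, which is what lets sparse-matrix accelerators skip half the multiply-accumulates.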
Hyperspectral image (HSI) classification involves assigning specific labels to each pixel to identify various land cover categories. Although deep classifiers have shown high predictive accuracy in this field, quantifying their uncertainty remains a …
External link:
http://arxiv.org/abs/2409.01236
Author:
Zhang, Yiming, Rando, Javier, Evtimov, Ivan, Chi, Jianfeng, Smith, Eric Michael, Carlini, Nicholas, Tramèr, Florian, Ippolito, Daphne
Large language models are pre-trained on uncurated text datasets consisting of trillions of tokens scraped from the Web. Prior work has shown that: (1) web-scraped pre-training datasets can be practically poisoned by malicious actors; and (2) adversaries …
External link:
http://arxiv.org/abs/2410.13722
Applying the Velocity Gradient Technique in NGC 1333: Comparison with Dust Polarization Observations
Magnetic fields (B-fields) are ubiquitous in the interstellar medium (ISM), and they play an essential role in the formation of molecular clouds and subsequent star formation. However, B-fields in interstellar environments remain challenging to measure …
External link:
http://arxiv.org/abs/2410.13350
State Space Models (SSMs) have emerged as an appealing alternative to Transformers for large language models, achieving state-of-the-art accuracy with constant memory complexity, which allows for holding longer context lengths than attention-based networks …
External link:
http://arxiv.org/abs/2410.13229
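The constant-memory property this abstract refers to comes from the SSM recurrence updating a fixed-size hidden state per token, instead of caching all past keys and values as attention does. A toy discrete linear SSM illustrating this (the matrices here are arbitrary examples, not from the paper):

```python
import numpy as np

def ssm_scan(A, B, C, inputs):
    """Run a discrete linear state-space recurrence:
        x_t = A @ x_{t-1} + B * u_t,   y_t = C @ x_t
    Memory is O(state_dim) no matter how long the input sequence is."""
    x = np.zeros(A.shape[0])         # fixed-size hidden state
    outputs = []
    for u in inputs:                 # one token at a time
        x = A @ x + B * u            # state update, same size every step
        outputs.append(C @ x)        # readout
    return np.array(outputs)

A = np.diag([0.9, 0.5])              # stable diagonal transition
B = np.array([1.0, 1.0])
C = np.array([0.5, 0.5])
ys = ssm_scan(A, B, C, [1.0, 0.0, 0.0])   # impulse response decays over time
```

An attention layer over the same sequence would store a KV cache that grows linearly with context length; here the only carried state is the two-element vector `x`.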
In this paper, we study a class of degenerate mean field games (MFGs) with state-distribution dependent and unbounded functional diffusion coefficients. With a probabilistic method, we study the well-posedness of the forward-backward stochastic differential …
External link:
http://arxiv.org/abs/2410.12404
Author:
Lau, Jason, Xiao, Yuanlong, Xie, Yutong, Chi, Yuze, Song, Linghao, Xiang, Shaojie, Lo, Michael, Zhang, Zhiru, Cong, Jason, Guo, Licheng
Published in:
IEEE/ACM International Conference on Computer-Aided Design (2024), October 27-31, New York, NY, USA. ACM, New York, NY, USA, 11 pages
The increasing complexity of large-scale FPGA accelerators poses significant challenges in achieving high performance while maintaining design productivity. High-level synthesis (HLS) has been adopted as a solution, but the mismatch between the high-level …
External link:
http://arxiv.org/abs/2410.13079
Author:
Tang, Zhenheng, Kang, Xueze, Yin, Yiming, Pan, Xinglin, Wang, Yuxin, He, Xin, Wang, Qiang, Zeng, Rongfei, Zhao, Kaiyong, Shi, Shaohuai, Zhou, Amelie Chi, Li, Bo, He, Bingsheng, Chu, Xiaowen
To alleviate hardware scarcity in training large deep neural networks (DNNs), particularly large language models (LLMs), we present FusionLLM, a decentralized training system designed and implemented for training DNNs using geo-distributed GPUs across …
External link:
http://arxiv.org/abs/2410.12707
We present a unified deterministic approach for experimental design problems using the method of interlacing polynomials. Our framework recovers the best-known approximation guarantees for the well-studied D/A/E-design problems with simple analysis.
External link:
http://arxiv.org/abs/2410.11390
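As background for this entry, the D/A/E-design problems are the standard experimental design objectives: given vectors $v_1, \dots, v_n$ and a budget $k$, choose a subset $S$ of size $k$ to optimize a spectral function of $M_S = \sum_{i \in S} v_i v_i^{\top}$ (this is standard terminology, not a summary of the paper's contribution):

```latex
\text{D-design: } \max_{|S| = k} \det(M_S), \qquad
\text{A-design: } \min_{|S| = k} \operatorname{tr}\!\big(M_S^{-1}\big), \qquad
\text{E-design: } \max_{|S| = k} \lambda_{\min}(M_S).
```

Intuitively, all three reward picking measurements whose information matrix $M_S$ is as well-conditioned and large as possible, differing only in which spectral summary they optimize.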
Author:
Gao, Shangqian, Lin, Chi-Heng, Hua, Ting, Zheng, Tang, Shen, Yilin, Jin, Hongxia, Hsu, Yen-Chang
Large Language Models (LLMs) have achieved remarkable success in various natural language processing tasks, including language modeling, understanding, and generation. However, the increased memory and computational costs associated with these models …
External link:
http://arxiv.org/abs/2410.11988