Showing 1 - 10 of 1,509 for search: '"Guan, Hui"'
The rapid adoption of machine learning (ML) has underscored the importance of serving ML models with high throughput and resource efficiency. Traditional approaches to managing increasing query demands have predominantly focused on hardware scaling… (a batching sketch follows this record)
External link:
http://arxiv.org/abs/2407.03583
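The abstract above is cut off, but its throughput motivation can be made concrete. Below is a minimal, hypothetical sketch of request batching, one standard way to raise serving throughput without extra hardware; none of the names (serve_batched, MAX_BATCH) come from the paper, and this is not the system it proposes.

```python
# Hypothetical sketch: batch queued inference requests so that one
# forward pass serves many queries. Illustrates the throughput
# motivation only; it is not the system proposed in the paper.
import queue
import threading
import time

import torch

model = torch.nn.Linear(16, 4)   # stand-in for a served ML model
requests = queue.Queue()         # holds (input, callback) pairs
MAX_BATCH = 8                    # assumed batching cap

def serve_batched():
    while True:
        batch = [requests.get()]                      # block on first query
        while len(batch) < MAX_BATCH and not requests.empty():
            batch.append(requests.get())              # opportunistic drain
        inputs = torch.stack([x for x, _ in batch])
        with torch.no_grad():
            outputs = model(inputs)                   # one pass, many queries
        for (_, callback), out in zip(batch, outputs):
            callback(out)                             # deliver each result

threading.Thread(target=serve_batched, daemon=True).start()
requests.put((torch.randn(16), lambda out: print(out.shape)))
time.sleep(0.1)                  # give the daemon thread time to respond
```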
Training Large Language Models (LLMs) is extremely memory-hungry. To address this, existing work exploits the combination of CPU and GPU for the training process, such as ZeRO-Offload. Such a technique largely democratizes billion-scale model… (an offloading sketch follows this record)
External link:
http://arxiv.org/abs/2406.08334
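ZeRO-Offload is named explicitly, so a sketch of the underlying idea may help: keep master weights and optimizer state in CPU memory, run forward/backward on the GPU, and step the optimizer on the CPU. This is a simplified illustration under those assumptions, not the actual ZeRO-Offload implementation.

```python
# Simplified illustration of CPU offloading in the spirit of ZeRO-Offload
# (not its actual implementation): master weights and Adam state live in
# CPU RAM; the GPU holds only the working copy for forward/backward.
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"
gpu_model = torch.nn.Linear(1024, 1024).to(device)

cpu_params = [p.detach().cpu().clone() for p in gpu_model.parameters()]
opt = torch.optim.Adam(cpu_params, lr=1e-4)   # optimizer state stays on CPU

def step(batch, target):
    loss = torch.nn.functional.mse_loss(gpu_model(batch), target)
    loss.backward()                                    # backward on GPU
    for cpu_p, gpu_p in zip(cpu_params, gpu_model.parameters()):
        cpu_p.grad = gpu_p.grad.detach().cpu()         # offload gradients
        gpu_p.grad = None
    opt.step()                                         # update on CPU
    opt.zero_grad()
    with torch.no_grad():
        for cpu_p, gpu_p in zip(cpu_params, gpu_model.parameters()):
            gpu_p.copy_(cpu_p.to(device))              # refresh GPU weights

step(torch.randn(8, 1024, device=device), torch.randn(8, 1024, device=device))
```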
Graph Neural Networks (GNNs) have gained significant attention in recent years due to their ability to learn representations of graph-structured data. Two common methods for training GNNs are mini-batch training and full-graph training. Since these… (both modes are sketched below)
External link:
http://arxiv.org/abs/2406.00552
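The two training modes the abstract names can be contrasted in toy form. The sketch below uses a hypothetical one-layer message-passing model in plain PyTorch (real systems use samplers such as those in PyG or DGL); it only illustrates the memory/accuracy trade-off between full-graph and mini-batch steps.

```python
# Toy contrast of full-graph vs mini-batch GNN training using a
# hypothetical one-layer message-passing model in plain PyTorch.
import torch

N, F, C = 1000, 32, 7
A = torch.eye(N)                       # stand-in for a normalized adjacency
X = torch.randn(N, F)                  # node features
y = torch.randint(0, C, (N,))          # node labels
W = torch.nn.Linear(F, C)
opt = torch.optim.SGD(W.parameters(), lr=0.1)

def full_graph_step():
    # Touches every node and edge: exact gradients, memory scales with N.
    loss = torch.nn.functional.cross_entropy(W(A @ X), y)
    opt.zero_grad(); loss.backward(); opt.step()

def mini_batch_step(batch_size=64):
    # Trains on a sampled node set and its induced subgraph: approximate
    # gradients, but memory scales with the batch, not the whole graph.
    idx = torch.randperm(N)[:batch_size]
    sub_A, sub_X = A[idx][:, idx], X[idx]
    loss = torch.nn.functional.cross_entropy(W(sub_A @ sub_X), y[idx])
    opt.zero_grad(); loss.backward(); opt.step()

full_graph_step(); mini_batch_step()
```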
Finetuning large language models (LLMs) in federated learning (FL) settings has become important, as it allows resource-constrained devices to finetune a model using private data. However, finetuning LLMs using backpropagation requires excessive memory… (a backpropagation-free sketch follows this record)
External link:
http://arxiv.org/abs/2405.15551
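The abstract stops at the memory cost of backpropagation. One family of remedies in this literature estimates gradients from forward passes alone (MeZO-style zeroth-order optimization), so no activations need to be stored; whether this particular paper takes that route is an assumption here. A minimal sketch of the two-point estimator:

```python
# MeZO-style zeroth-order step (an assumption about the paper's direction):
# two forward passes under +eps/-eps parameter noise give a scalar
# directional derivative, so no backward pass or activation storage is
# needed. Names and sizes are illustrative.
import torch

model = torch.nn.Linear(64, 2)        # stand-in for an on-device LLM
eps, lr = 1e-3, 1e-2

def zo_step(batch, target, seed):
    def perturb(scale):
        torch.manual_seed(seed)        # regenerate the same noise z each time
        for p in model.parameters():
            p.data.add_(scale * eps * torch.randn_like(p))
    def loss():
        with torch.no_grad():
            return torch.nn.functional.cross_entropy(model(batch), target)
    perturb(+1); l_plus = loss()       # loss at theta + eps * z
    perturb(-2); l_minus = loss()      # loss at theta - eps * z
    perturb(+1)                        # restore theta
    g = (l_plus - l_minus) / (2 * eps) # projected gradient estimate
    torch.manual_seed(seed)
    for p in model.parameters():       # theta <- theta - lr * g * z
        p.data.sub_(lr * g * torch.randn_like(p))

zo_step(torch.randn(8, 64), torch.randint(0, 2, (8,)), seed=0)
```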
Graph pattern matching is a fundamental problem encountered by many common graph mining tasks and the basic building block of several graph mining systems. This paper explores for the first time how to proactively prune graphs to speed up graph pattern matching… (a pruning sketch follows this record)
External link:
http://arxiv.org/abs/2403.01050
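Proactive pruning can be illustrated with a simple degree filter: a vertex whose degree is below the pattern's minimum degree (2 for a triangle) can never participate in a match, so it can be deleted up front, and deletions cascade. The sketch below shows this general idea only, not the paper's algorithm.

```python
# Degree-based proactive pruning (general idea only, not the paper's
# algorithm): vertices that cannot satisfy the pattern's minimum degree
# are removed before matching, and removals cascade.
from collections import defaultdict

def prune(edges, min_deg=2):
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v); adj[v].add(u)
    worklist = [v for v in adj if len(adj[v]) < min_deg]
    while worklist:
        v = worklist.pop()
        for u in adj.pop(v, set()):          # delete v, update its neighbors
            adj[u].discard(v)
            if len(adj[u]) < min_deg:
                worklist.append(u)           # removal may cascade
    return sorted((u, v) for u in adj for v in adj[u] if u < v)

# The pendant vertex 3 can never be in a triangle, so edge (2, 3) goes.
print(prune([(0, 1), (1, 2), (0, 2), (2, 3)]))  # [(0, 1), (0, 2), (1, 2)]
```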
Author:
Zhang, Lijun, Liu, Xiao, Martin, Antoni Viros, Bearfield, Cindy Xiong, Brun, Yuriy, Guan, Hui
Watermarking images is critical for tracking image provenance and claiming ownership. With the advent of generative models such as Stable Diffusion that can create fake but realistic images, watermarking has become particularly important… (a toy embedding sketch follows this record)
External link:
http://arxiv.org/abs/2401.04247
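For readers unfamiliar with the setting, here is a toy watermark embed/extract pair using the classic least-significant-bit scheme. It is illustration only; the paper studies watermarking in the context of generative models, not this scheme.

```python
# Toy least-significant-bit watermark, for illustration only; the paper
# concerns watermarking in the presence of generative models, not LSB.
import numpy as np

def embed(img, bits):
    flat = img.flatten()                                    # copies the image
    flat[: len(bits)] = (flat[: len(bits)] & 0xFE) | bits   # overwrite LSBs
    return flat.reshape(img.shape)

def extract(img, n):
    return img.flatten()[:n] & 1                            # read the LSBs back

img = np.random.randint(0, 256, (8, 8), dtype=np.uint8)
mark = np.array([1, 0, 1, 1, 0, 1, 0, 0], dtype=np.uint8)
assert (extract(embed(img, mark), len(mark)) == mark).all()
```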
While existing strategies to execute deep learning-based classification on low-power platforms assume the models are trained on all classes of interest, this paper posits that adopting context-awareness, i.e., narrowing down a classification task to the… (a logit-masking sketch follows this record)
External link:
http://arxiv.org/abs/2310.19112
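The context-awareness idea can be sketched as restricting prediction to the classes plausible in the current context. Everything below (the context_classes set, the 1000-class output) is hypothetical; on low-power platforms the same idea can also justify loading a smaller model trained only on that subset.

```python
# Hypothetical logit-masking sketch of context-awareness: only classes
# plausible in the current context compete for the prediction.
import torch

logits = torch.randn(1, 1000)                  # full 1000-class model output
context_classes = torch.tensor([12, 40, 77])   # e.g., classes seen indoors

masked = torch.full_like(logits, float("-inf"))
masked[:, context_classes] = logits[:, context_classes]
pred = masked.argmax(dim=1)                    # best class within the context
print(pred.item() in context_classes.tolist())  # always True
```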
Multi-Task Learning (MTL) involves developing a single model, known as a multi-task model, to perform multiple tasks concurrently. While the security of single-task models has been thoroughly studied, multi-task models pose several critical security… (a multi-task model sketch follows this record)
External link:
http://arxiv.org/abs/2305.12066
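As background, a multi-task model in the sense used above is typically a shared backbone with one head per task; the sharing is exactly why an attack on one task can propagate to the others. A minimal sketch of that structure (not the models the paper studies):

```python
# Minimal multi-task model: a shared backbone with one head per task.
# Sketch of the general structure only, not the paper's models.
import torch

class MultiTaskModel(torch.nn.Module):
    def __init__(self, in_dim=64, hidden=128, task_dims=(10, 5)):
        super().__init__()
        self.backbone = torch.nn.Sequential(             # shared parameters
            torch.nn.Linear(in_dim, hidden), torch.nn.ReLU())
        self.heads = torch.nn.ModuleList(                # one head per task
            [torch.nn.Linear(hidden, d) for d in task_dims])

    def forward(self, x):
        h = self.backbone(x)                             # shared features
        return [head(h) for head in self.heads]          # per-task outputs

task_outputs = MultiTaskModel()(torch.randn(4, 64))      # two task outputs
```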
Published in:
Chengshi guidao jiaotong yanjiu, Vol 27, Iss 6, Pp 95-99 (2024)
Objective: Large-section tunnels, due to their flat shape and large span, are usually excavated in multiple steps and stages. During construction, the surrounding rock and the temporary support are repeatedly disturbed, affecting the stability…
External link:
https://doaj.org/article/5c5b53bf0878443bb56474d6b3520f40
Although multi-task deep neural network (DNN) models have computation and storage benefits over individual single-task DNN models, they can be further optimized via model compression. Numerous structured pruning methods have already been developed that can… (a channel-pruning sketch follows this record)
External link:
http://arxiv.org/abs/2304.06840
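Structured pruning removes whole structures (e.g., convolution channels) rather than scattered weights. The sketch below ranks filters by L1 norm and zeroes the weakest half; this is a generic criterion for illustration, and the paper's multi-task-aware method will differ.

```python
# Generic L1-norm channel pruning on one conv layer, for illustration;
# the paper's multi-task-aware pruning criterion will differ.
import torch

conv = torch.nn.Conv2d(16, 32, kernel_size=3)
keep_ratio = 0.5                                        # assumed sparsity

norms = conv.weight.detach().abs().sum(dim=(1, 2, 3))  # one L1 norm per filter
n_keep = int(keep_ratio * norms.numel())
drop = norms.argsort()[: norms.numel() - n_keep]        # weakest filters

with torch.no_grad():
    conv.weight[drop] = 0                               # zero whole channels
    conv.bias[drop] = 0                                 # so they can be removed
```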