Showing 1 - 10 of 1,183
for search: '"Sun, Guangyu"'
Author:
Cui, Fan, Yin, Chenyang, Zhou, Kexing, Xiao, Youwei, Sun, Guangyu, Xu, Qiang, Guo, Qipeng, Song, Demin, Lin, Dahua, Zhang, Xingcheng, Liang, Yun
Recent studies have demonstrated the significant potential of Large Language Models (LLMs) in generating Register Transfer Level (RTL) code, with notable advancements showcased by commercial models such as GPT-4 and Claude3-Opus. However, these propr…
External link:
http://arxiv.org/abs/2407.16237
Theseus: Towards High-Efficiency Wafer-Scale Chip Design Space Exploration for Large Language Models
The emergence of the large language model (LLM) poses an exponential growth of demand for computation throughput, memory capacity, and communication bandwidth. Such a demand growth has significantly surpassed the improvement of corresponding chip des…
External link:
http://arxiv.org/abs/2407.02079
Federated learning (FL) enables multiple clients to train models collectively while preserving data privacy. However, FL faces challenges in terms of communication cost and data heterogeneity. One-shot federated learning has emerged as a solution by…
External link:
http://arxiv.org/abs/2405.01494
Multi-modal transformers mark significant progress in different domains, but siloed high-quality data hinders their further improvement. To remedy this, federated learning (FL) has emerged as a promising privacy-preserving paradigm for training model…
External link:
http://arxiv.org/abs/2404.12467
Author:
Zhou, Zhe, Chen, Yiqi, Zhang, Tao, Wang, Yang, Shu, Ran, Xu, Shuotao, Cheng, Peng, Qu, Lei, Xiong, Yongqiang, Sun, Guangyu
The Compute Express Link (CXL) interconnect has provided the ability to integrate diverse memory types into servers via byte-addressable SerDes links. Harnessing the full potential of such heterogeneous memory systems requires efficient memory tierin…
External link:
http://arxiv.org/abs/2403.18702
Author:
Chen, Lei, Chen, Yiqi, Chu, Zhufei, Fang, Wenji, Ho, Tsung-Yi, Huang, Ru, Huang, Yu, Khan, Sadaf, Li, Min, Li, Xingquan, Li, Yu, Liang, Yun, Liu, Jinwei, Liu, Yi, Lin, Yibo, Luo, Guojie, Shi, Zhengyuan, Sun, Guangyu, Tsaras, Dimitrios, Wang, Runsheng, Wang, Ziyi, Wei, Xinming, Xie, Zhiyao, Xu, Qiang, Xue, Chenhao, Yan, Junchi, Yang, Jun, Yu, Bei, Yuan, Mingxuan, Young, Evangeline F. Y., Zeng, Xuan, Zhang, Haoyi, Zhang, Zuodong, Zhao, Yuxiang, Zhen, Hui-Ling, Zheng, Ziyang, Zhu, Binwu, Zhu, Keren, Zou, Sunan
Within the Electronic Design Automation (EDA) domain, AI-driven solutions have emerged as formidable tools, yet they typically augment rather than redefine existing methodologies. These solutions often repurpose deep learning models from other domain…
External link:
http://arxiv.org/abs/2403.07257
Author:
Yuan, Zhihang, Shang, Yuzhang, Zhou, Yang, Dong, Zhen, Zhou, Zhe, Xue, Chenhao, Wu, Bingzhe, Li, Zhikai, Gu, Qingyi, Lee, Yong Jae, Yan, Yan, Chen, Beidi, Sun, Guangyu, Keutzer, Kurt
The field of efficient Large Language Model (LLM) inference is rapidly evolving, presenting a unique blend of opportunities and challenges. Although the field has expanded and is vibrant, there hasn't been a concise framework that analyzes the variou…
External link:
http://arxiv.org/abs/2402.16363
Deep neural networks are widely deployed in many fields. Due to the in-situ computation (known as processing in memory) capacity of the Resistive Random Access Memory (ReRAM) crossbar, ReRAM-based accelerators show potential in accelerating DNNs with…
External link:
http://arxiv.org/abs/2402.06164
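The entry above refers to in-situ computation on a ReRAM crossbar, where weights are stored as cell conductances and a matrix-vector product emerges from Ohm's law and Kirchhoff's current summation along the bitlines. A minimal numerical sketch of that idea (not the paper's actual accelerator; the function name, the two-device-per-weight mapping, and the quantization level count are illustrative assumptions):

```python
import numpy as np

def crossbar_mvm(weights, x, levels=256):
    """Simulate an analog ReRAM crossbar computing y = W @ x.

    Each signed weight is split across a positive and a negative
    conductance (a common two-device-per-weight mapping), and
    conductances are quantized to a small number of discrete states
    to mimic limited device precision.
    """
    g_pos = np.clip(weights, 0, None)
    g_neg = np.clip(-weights, 0, None)
    g_max = max(g_pos.max(), g_neg.max(), 1e-12)

    def quantize(g):
        # Round each conductance to one of `levels` evenly spaced states.
        return np.round(g / g_max * (levels - 1)) / (levels - 1) * g_max

    # Ohm's law per cell, Kirchhoff summation per bitline: I = G @ V.
    return quantize(g_pos) @ x - quantize(g_neg) @ x

rng = np.random.default_rng(0)
W = rng.standard_normal((4, 8))   # weights programmed into the crossbar
x = rng.standard_normal(8)        # input voltages on the wordlines
exact = W @ x
approx = crossbar_mvm(W, x, levels=256)
print(np.max(np.abs(exact - approx)))  # small quantization error
```

Lowering `levels` shows why device precision matters: the quantization error of the analog result grows as the number of programmable conductance states shrinks.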
In this paper, we introduce a new post-training compression paradigm for Large Language Models (LLMs) to facilitate their wider adoption. We delve into LLM weight low-rank factorization, and find that the challenges of this task stem from the outlier…
External link:
http://arxiv.org/abs/2312.05821
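The entry above concerns compressing LLM weights by low-rank factorization. As a baseline sketch of the core idea (plain truncated SVD; the paper's contribution about handling activation outliers is not reproduced here, and all names are illustrative):

```python
import numpy as np

def low_rank_factorize(W, rank):
    """Post-training low-rank compression of a weight matrix via truncated SVD.

    W (m x n) is replaced by A @ B with A: (m x rank) and B: (rank x n),
    cutting the parameter count from m*n to rank*(m + n).
    """
    U, S, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :rank] * S[:rank]   # absorb singular values into A
    B = Vt[:rank, :]
    return A, B

rng = np.random.default_rng(0)
# A synthetic "weight" with a rapidly decaying spectrum, where a
# low-rank approximation loses little accuracy.
W = (rng.standard_normal((64, 16)) * (0.5 ** np.arange(16))) @ rng.standard_normal((16, 128))

A, B = low_rank_factorize(W, rank=8)
orig_params = W.size
compressed_params = A.size + B.size
rel_err = np.linalg.norm(W - A @ B) / np.linalg.norm(W)
print(orig_params, compressed_params, rel_err)
```

Real LLM weight matrices rarely decay this cleanly, which is exactly why naive SVD underperforms and outlier-aware schemes like the one in the entry above are needed.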
Personalized Federated Learning (PFL) represents a promising solution for decentralized learning in heterogeneous data environments. Partial model personalization has been proposed to improve the efficiency of PFL by selectively updating local model…
External link:
http://arxiv.org/abs/2308.09160