Výsledky vyhledávání - "Constantinides A"

Report

QERA: an Analytical Framework for Quantization Error Reconstruction

Autor: Zhang, Cheng, Wong, Jeffrey T. H., Xiao, Can, Constantinides, George A., Zhao, Yiren

he growing number of parameters and computational demands of large language models (LLMs) present significant challenges for their efficient deployment. Recently, there is an increasing interest in quantizing weights to extremely low precision while

Externí odkaz: http://arxiv.org/abs/2410.06040

Zobrazit plný text záznamu

Report

Co-designing an AI Impact Assessment Report Template with AI Practitioners and AI Compliance Experts

Autor: Bogucka, Edyta, Constantinides, Marios, Šćepanović, Sanja, Quercia, Daniele

In the evolving landscape of AI regulation, it is crucial for companies to conduct impact assessments and document their compliance through comprehensive reports. However, current reports lack grounding in regulations and often focus on specific aspe

Externí odkaz: http://arxiv.org/abs/2407.17374

Zobrazit plný text záznamu

Report

The Atlas of AI Incidents in Mobile Computing: Visualizing the Risks and Benefits of AI Gone Mobile

Autor: Bogucka, Edyta, Constantinides, Marios, Velazquez, Julia De Miguel, Šćepanović, Sanja, Quercia, Daniele, Gvirtz, Andrés

Today's visualization tools for conveying the risks and benefits of AI technologies are largely tailored for those with technical expertise. To bridge this gap, we have developed a visualization that employs narrative patterns and interactive element

Externí odkaz: http://arxiv.org/abs/2407.15685

Zobrazit plný text záznamu

Report

The Impact of Responsible AI Research on Innovation and Development

Autor: Septiandri, Ali Akbar, Constantinides, Marios, Quercia, Daniele

Translational research, especially in the fast-evolving field of Artificial Intelligence (AI), is key to converting scientific findings into practical innovations. In Responsible AI (RAI) research, translational impact is often viewed through various

Externí odkaz: http://arxiv.org/abs/2407.15647

Zobrazit plný text záznamu

Report

Good Intentions, Risky Inventions: A Method for Assessing the Risks and Benefits of AI in Mobile and Wearable Uses

Autor: Constantinides, Marios, Bogucka, Edyta, Scepanovic, Sanja, Quercia, Daniele

Integrating Artificial Intelligence (AI) into mobile and wearables offers numerous benefits at individual, societal, and environmental levels. Yet, it also spotlights concerns over emerging risks. Traditional assessments of risks and benefits have be

Externí odkaz: http://arxiv.org/abs/2407.09322

Zobrazit plný text záznamu

Report

Exploring FPGA designs for MX and beyond

Autor: Samson, Ebby, Mellempudi, Naveen, Luk, Wayne, Constantinides, George A.

A number of companies recently worked together to release the new Open Compute Project MX standard for low-precision computation, aimed at efficient neural network implementation. In this paper, we describe and evaluate the first open-source FPGA imp

Externí odkaz: http://arxiv.org/abs/2407.01475

Zobrazit plný text záznamu

Report

Unlocking the Global Synergies in Low-Rank Adapters

Autor: Zhang, Zixi, Zhang, Cheng, Gao, Xitong, Mullins, Robert D., Constantinides, George A., Zhao, Yiren

Low-rank Adaption (LoRA) has been the de-facto parameter-efficient fine-tuning technique for large language models. We present HeteroLoRA, a light-weight search algorithm that leverages zero-cost proxies to allocate the limited LoRA trainable paramet

Externí odkaz: http://arxiv.org/abs/2406.14956

Zobrazit plný text záznamu

Report

Optimised Grouped-Query Attention Mechanism for Transformers

Autor: Chen, Yuang, Zhang, Cheng, Gao, Xitong, Mullins, Robert D., Constantinides, George A., Zhao, Yiren

Grouped-query attention (GQA) has been widely adopted in LLMs to mitigate the complexity of multi-head attention (MHA). To transform an MHA to a GQA, neighbour queries in MHA are evenly split into groups where each group shares the value and key laye

Externí odkaz: http://arxiv.org/abs/2406.14963

Zobrazit plný text záznamu

Report

ROVER: RTL Optimization via Verified E-Graph Rewriting

Autor: Coward, Samuel, Drane, Theo, Constantinides, George A.

Manual RTL design and optimization remains prevalent across the semiconductor industry because commercial logic and high-level synthesis tools are unable to match human designs. Our experience in industrial datapath design demonstrates that manual op

Externí odkaz: http://arxiv.org/abs/2406.12421

Zobrazit plný text záznamu

Report

Soft GPGPU versus IP cores: Quantifying and Reducing the Performance Gap

Autor: Langhammer, Martin, Constantinides, George A.

eGPU, a recently-reported soft GPGPU for FPGAs, has demonstrated very high clock frequencies (more than 750 MHz) and small footprint. This means that for the first time, commercial soft processors may be competitive for the kind of heavy numerical co

Externí odkaz: http://arxiv.org/abs/2406.03227

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání