Showing 1 - 10 of 498 for search: '"Gross, Warren J."'
Author:
Tayaranian, Mohammadreza, Mozafari, Seyyed Hasan, Meyer, Brett H., Clark, James J., Gross, Warren J.
Transformer-based language models have shown state-of-the-art performance on a variety of natural language understanding tasks. To achieve this performance, these models are first pre-trained on a general corpus and then fine-tuned on downstream tasks.
External link:
http://arxiv.org/abs/2407.08887
GRAND features both soft-input and hard-input variants that are well suited to efficient hardware implementations whose achievable average and worst-case decoding latencies can be characterized. This paper introduces step-GRAND, a soft-input variant…
External link:
http://arxiv.org/abs/2307.07133
We present SSS3D, a fast multi-objective NAS framework designed to find computationally efficient 3D semantic scene segmentation networks. It uses RandLA-Net, an off-the-shelf point-based network, as a super-network to enable weight sharing and reduce…
External link:
http://arxiv.org/abs/2304.11207
We present FMAS, a fast multi-objective neural architecture search framework for semantic segmentation. FMAS subsamples the structure and pre-trained parameters of DeepLabV3+, without fine-tuning, dramatically reducing training time during search. To…
External link:
http://arxiv.org/abs/2303.16322
In this paper, we introduce stochastic simulated quantum annealing (SSQA) for large-scale combinatorial optimization problems. SSQA is designed based on stochastic computing and quantum Monte Carlo, which can simulate quantum annealing (QA) by using…
External link:
http://arxiv.org/abs/2302.12454
Knowledge distillation (KD) has gained a lot of attention in the field of model compression for edge devices thanks to its effectiveness in compressing large, powerful networks into smaller, lower-capacity models. Online distillation, in which both the…
External link:
http://arxiv.org/abs/2212.12965
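The online-distillation snippet above is cut off, but the core idea it describes, peer networks training together and distilling from each other's softened predictions, can be sketched with a symmetric KL loss. This is a generic illustration, not the loss from the paper; the function names, the temperature value, and the symmetric form are all illustrative assumptions.

```python
import numpy as np

def softmax(z, T=1.0):
    # Temperature-softened softmax, computed stably.
    z = z / T
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def mutual_kd_loss(logits_a, logits_b, T=2.0):
    """Online (mutual) distillation sketch: each peer network is pulled
    toward the other's temperature-softened output distribution via a
    symmetric KL divergence, scaled by T^2 as is conventional in KD."""
    p = softmax(logits_a, T)
    q = softmax(logits_b, T)
    kl_pq = np.sum(p * (np.log(p) - np.log(q)), axis=-1)
    kl_qp = np.sum(q * (np.log(q) - np.log(p)), axis=-1)
    return float(np.mean(kl_pq + kl_qp)) * T * T
```

When both peers produce identical logits the loss is zero; otherwise it is positive, pushing their predictions together during joint training.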
Author:
Vucetic, Danilo, Tayaranian, Mohammadreza, Ziaeefard, Maryam, Clark, James J., Meyer, Brett H., Gross, Warren J.
Fine-tuning BERT-based models is resource-intensive in memory, computation, and time. While many prior works aim to improve inference efficiency via compression techniques, e.g., pruning, these works do not explicitly address the computational challenges…
External link:
http://arxiv.org/abs/2208.02070
Author:
Vucetic, Danilo, Tayaranian, Mohammadreza, Ziaeefard, Maryam, Clark, James J., Meyer, Brett H., Gross, Warren J.
Resource-constrained devices are increasingly the deployment targets of machine learning applications. Static models, however, do not always suffice for dynamic environments. On-device training of models allows for quick adaptability to new scenarios…
External link:
http://arxiv.org/abs/2205.01541
Published in:
GLOBECOM 2022 Workshops
Guessing Random Additive Noise Decoding (GRAND) is a code-agnostic decoding technique for short-length and high-rate channel codes. GRAND tries to guess the channel noise by generating test error patterns (TEPs), and the sequence of the TEPs is the m…
External link:
http://arxiv.org/abs/2205.00030
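The entry above describes the core GRAND loop: guess the channel noise by applying test error patterns in a likelihood order until a valid codeword is found. A minimal hard-input sketch of that loop is below; it is a generic illustration assuming a binary symmetric channel (so lighter TEPs are more likely) and a parity-check-matrix membership test, not the scheduling studied in the paper. The function name and the `max_queries` abandonment cap are illustrative.

```python
import itertools
import numpy as np

def grand_decode(y, H, max_queries=10_000):
    """Hard-input GRAND sketch: apply test error patterns (TEPs) to the
    received word y in order of increasing Hamming weight (most likely
    first on a BSC with p < 0.5) until H @ (y + e) has zero syndrome,
    i.e. the corrected word is a codeword of the code defined by H."""
    n = len(y)
    queries = 0
    for w in range(n + 1):                       # TEP weight, lightest first
        for idx in itertools.combinations(range(n), w):
            e = np.zeros(n, dtype=int)
            e[list(idx)] = 1
            c = (y + e) % 2                      # tentative codeword guess
            queries += 1
            if not (H @ c % 2).any():            # zero syndrome -> codeword
                return c, queries
            if queries >= max_queries:
                return None, queries             # abandon (worst-case bound)
    return None, queries
```

The number of queries made before a hit is exactly the quantity whose average and worst case the hardware variants trade off.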
Quantization of deep neural networks is a promising approach that reduces the inference cost, making it feasible to run deep networks on resource-restricted devices. Inspired by existing methods, we propose a new framework to learn the quantization intervals…
External link:
http://arxiv.org/abs/2202.12422
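The quantization entry above is about learning the quantization intervals themselves. A minimal sketch of the underlying operation, uniform "fake" quantization whose step size is a learnable parameter, is below. This is a generic illustration (in the spirit of step-size-learning methods), not the paper's exact formulation; the function name, bit-width, and grid convention are assumptions.

```python
import numpy as np

def fake_quantize(w, step, n_bits=4):
    """Uniform fake quantization sketch: scale weights by a learnable
    step size (the quantization interval), round to the nearest integer,
    clamp to the signed n_bit range, and rescale. During training the
    rounding is typically bypassed with a straight-through estimator so
    gradients can flow to both w and step."""
    qmax = 2 ** (n_bits - 1) - 1
    q = np.clip(np.round(w / step), -qmax - 1, qmax)
    return q * step
```

Every output lands on a multiple of `step` inside the representable range, so shrinking or growing `step` trades clipping error against rounding error, which is exactly what makes the interval worth learning.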