Showing 1 - 10 of 6,720 for the search: '"Network compression"'
Author:
Mango, John, Katende, Ronald
This paper introduces a dynamic, error-bounded hierarchical matrix (H-matrix) compression method tailored for Physics-Informed Neural Networks (PINNs). The proposed approach reduces the computational complexity and memory demands of large-scale physi…
External link:
http://arxiv.org/abs/2409.07028
We present TropNNC, a framework for compressing neural networks with linear and convolutional layers and ReLU activations. TropNNC is a structured compression framework based on a geometrical approach to machine/deep learning, using tropical geometry…
External link:
http://arxiv.org/abs/2409.03945
Despite their high accuracy, complex neural networks demand significant computational resources, posing challenges for deployment on resource-constrained devices such as mobile phones and embedded systems. Compression algorithms have been developed t…
External link:
http://arxiv.org/abs/2409.03555
Deep neural networks typically impose significant computational loads and memory consumption. Moreover, their large parameter counts constrain deployment on edge devices such as embedded systems. Tensor decomposition offers a clear advanta…
External link:
http://arxiv.org/abs/2408.16289
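As context for the tensor-decomposition entry above, a minimal sketch of the simplest matrix-level case: factoring a dense weight matrix with a truncated SVD so two small factors replace one large matrix. This is a generic illustration under assumed shapes, not the method of the listed paper; all names here are illustrative.

```python
import numpy as np

def low_rank_compress(W, rank):
    """Factor a weight matrix W (out x in) into A (out x rank) and
    B (rank x in) via truncated SVD -- a basic matrix decomposition."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :rank] * s[:rank]   # fold singular values into the left factor
    B = Vt[:rank, :]
    return A, B

rng = np.random.default_rng(0)
W = rng.standard_normal((256, 512))
A, B = low_rank_compress(W, rank=32)
# Parameter count drops from 256*512 to (256 + 512)*32.
orig, comp = W.size, A.size + B.size
```

Applying the layer then costs two small matrix multiplies (`x @ B.T @ A.T` instead of `x @ W.T`), which is where the inference savings come from.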
Neural network pruning is a rich field with a variety of approaches. In this work, we propose to connect existing pruning concepts, such as leave-one-out pruning and oracle pruning, and develop them into a more general Shapley-value-based framework…
External link:
http://arxiv.org/abs/2407.15875
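The leave-one-out pruning signal mentioned in the entry above can be sketched in a few lines: score each weight by the loss increase when it alone is ablated. This is a toy linear-model illustration of the baseline concept the Shapley-value framework generalizes, not the paper's method; all names are illustrative.

```python
import numpy as np

def leave_one_out_scores(weights, inputs, targets):
    """Score each weight by the MSE increase when that weight alone
    is zeroed out (leave-one-out pruning signal)."""
    def loss(w):
        pred = inputs @ w
        return float(np.mean((pred - targets) ** 2))
    base = loss(weights)
    scores = []
    for j in range(weights.shape[0]):
        pruned = weights.copy()
        pruned[j] = 0.0              # ablate exactly one weight
        scores.append(loss(pruned) - base)
    return scores

rng = np.random.default_rng(1)
X = rng.standard_normal((100, 5))
w = rng.standard_normal(5)
y = X @ w                            # targets fit exactly, so base loss is 0
scores = leave_one_out_scores(w, X, y)
```

Weights with the smallest scores are the cheapest to prune; Shapley-style methods refine this by averaging a weight's contribution over many subsets rather than a single ablation.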
Author:
Wang, Xinren; Hu, Dongming; Fan, Xueqi; Liu, Huiyi; Yang, Chenbin (yangchenbin@hhu.edu.cn)
Published in:
Symmetry (ISSN 2073-8994), Nov 2024, Vol. 16, Issue 11, p. 1461, 19 pp.
Binary Neural Networks (BNNs) enable efficient deep learning by saving on storage and computational costs. However, as the size of neural networks continues to grow, meeting computational requirements remains a challenge. In this work, we propose a n…
External link:
http://arxiv.org/abs/2407.12075
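The basic BNN idea referenced in the entry above can be sketched as sign-binarization with a single real-valued scale, in the style of BinaryConnect/XNOR-Net (which use per-filter scales; a single per-matrix scale is used here for brevity). A generic illustration, not the proposal of the listed paper.

```python
import numpy as np

def binarize(W):
    """Binarize a weight matrix: keep one scale alpha = mean(|W|) and the
    signs of the weights, so each weight stores one bit instead of 32."""
    alpha = float(np.abs(W).mean())
    return alpha, np.sign(W)

rng = np.random.default_rng(2)
W = rng.standard_normal((4, 4))
alpha, Wb = binarize(W)
W_approx = alpha * Wb                # dequantized approximation of W
```

At inference time the binary weights let multiplications become sign flips and accumulations, which is where the compute savings of BNNs come from.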
Author:
Thrash, Chayne, Abbasi, Ali, Nooralinejad, Parsa, Koohpayegani, Soroush Abbasi, Andreas, Reed, Pirsiavash, Hamed, Kolouri, Soheil
The outstanding performance of large foundational models across diverse tasks -- from computer vision to speech and natural language processing -- has significantly increased their demand. However, storing and transmitting these models pose significant cha…
External link:
http://arxiv.org/abs/2406.19301
In real applications of Reinforcement Learning (RL), such as robotics, low-latency and energy-efficient inference is highly desirable. The use of sparsity and pruning to optimize neural network inference, and particularly to improve energy and latency…
External link:
http://arxiv.org/abs/2405.07748
Deep Neural Networks (DNNs) have achieved remarkable success in addressing many previously unsolvable tasks. However, the storage and computational requirements associated with DNNs pose a challenge for deploying these trained models on resource-limi…
External link:
http://arxiv.org/abs/2405.03089