Showing 1 - 10 of 77 results for search: '"Thottethodi, Mithuna"'
Emerging machine learning (ML) models (e.g., transformers) involve memory pin bandwidth-bound matrix-vector (MV) computation in inference. By avoiding pin crossings, processing in memory (PIM) can improve performance and energy for pin-bound workloads … (a rough arithmetic-intensity sketch follows the external link below).
External link:
http://arxiv.org/abs/2404.04708
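A quick back-of-envelope sketch (made-up layer sizes and fp16 elements, not taken from the paper) of why matrix-vector inference is pin-bandwidth bound: its arithmetic intensity is roughly 2 FLOPs per weight element, far below the batched matrix-matrix case.

```python
# Arithmetic intensity (FLOPs per byte of off-chip traffic) for a GEMV layer
# versus a batched GEMM, using made-up transformer-like sizes and fp16 data.
def gemv_intensity(m, n, bytes_per_elem=2):
    flops = 2 * m * n                            # one multiply + add per weight
    traffic = (m * n + n + m) * bytes_per_elem   # weights + input + output vectors
    return flops / traffic

def gemm_intensity(m, n, k, bytes_per_elem=2):   # batched case for contrast
    flops = 2 * m * n * k
    traffic = (m * k + k * n + m * n) * bytes_per_elem
    return flops / traffic

print("GEMV (4096x4096):          %.2f FLOPs/byte" % gemv_intensity(4096, 4096))
print("GEMM (4096x4096, batch 256): %.2f FLOPs/byte" % gemm_intensity(4096, 256, 4096))
```

With these numbers the GEMV lands near 1 FLOP/byte while the batched GEMM is in the hundreds, which is why unbatched MV inference cannot hide memory pin bandwidth.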
Memory consistency model (MCM) issues in out-of-order-issue microprocessor-based shared-memory systems are notoriously non-intuitive and a source of hardware design bugs. Prior hardware verification work is limited to in-order-issue processors … (a classic litmus-test sketch follows the external link below).
External link:
http://arxiv.org/abs/2404.03113
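The classic store-buffering litmus test illustrates how non-intuitive MCM outcomes get. The sketch below (illustrative only, not the paper's verification method) enumerates every sequentially consistent interleaving and shows that the outcome r0 = r1 = 0 never appears under SC, even though store-buffering hardware such as x86-TSO can produce it.

```python
from itertools import chain

# Store-buffering (SB) litmus test:
#   Thread 0: x = 1; r0 = y        Thread 1: y = 1; r1 = x
# Under sequential consistency (SC), (r0, r1) == (0, 0) is impossible;
# store buffers on real out-of-order/TSO hardware can expose it.
T0 = [("store", "x", 1), ("load", "y", "r0")]
T1 = [("store", "y", 1), ("load", "x", "r1")]

def interleavings(a, b):
    # All interleavings that preserve each thread's program order.
    if not a: yield list(b); return
    if not b: yield list(a); return
    for rest in interleavings(a[1:], b): yield [a[0]] + rest
    for rest in interleavings(a, b[1:]): yield [b[0]] + rest

outcomes = set()
for order in interleavings(T0, T1):
    mem, regs = {"x": 0, "y": 0}, {}
    for op, addr, dst in order:
        if op == "store": mem[addr] = dst
        else: regs[dst] = mem[addr]
    outcomes.add((regs["r0"], regs["r1"]))

print("SC-reachable (r0, r1):", sorted(outcomes))  # (0,1), (1,0), (1,1) -- never (0,0)
```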
Author:
Green, Conor, Thottethodi, Mithuna
Over the past few decades, network topology design for general-purpose, shared-memory multicores has been primarily driven by human experts who use their insights to arrive at network designs that balance the competing goals of performance requirements …
External link:
http://arxiv.org/abs/2404.02357
Spectre attacks exploit microprocessor speculative execution to read and transmit forbidden data outside the attacker's trust domain and sandbox. Recent hardware schemes allow potentially-unsafe speculative accesses but prevent the secret's transmission … (a toy model of this defense style follows the external link below).
External link:
http://arxiv.org/abs/2306.07785
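A toy Python model in the spirit of such delay-the-transmission defenses (a hypothetical structure for illustration, not the scheme proposed in the paper): values returned by speculative loads are tainted, and a tainted value cannot be used as a load address until the guarding branch resolves.

```python
# Toy "block the transmission" model (illustrative, NOT the paper's scheme):
# speculative loads are allowed, but their results are tainted and may not
# form a new load address while the guarding branch is still unresolved.
class Value:
    def __init__(self, v, tainted=False):
        self.v, self.tainted = v, tainted

class Core:
    def __init__(self, memory):
        self.mem = memory
        self.speculative = False          # inside an unresolved branch shadow?

    def load(self, addr):
        if addr.tainted and self.speculative:
            raise RuntimeError("blocked: tainted address used under speculation")
        # Data fetched under speculation is itself tainted until resolution.
        return Value(self.mem[addr.v], tainted=self.speculative)

    def resolve_branch(self):
        self.speculative = False

mem = {i: i * 3 % 64 for i in range(64)}  # made-up memory contents
core = Core(mem)
core.speculative = True                   # e.g., a mispredicted bounds check
secret = core.load(Value(7))              # the unsafe access itself is allowed...
try:
    core.load(secret)                     # ...but using it as an address is not
except RuntimeError as e:
    print(e)
core.resolve_branch()
print("after resolution:", core.load(secret).v)
```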
Convolutional neural networks (CNNs) are emerging as powerful tools for image processing in commercial applications. We focus on the important problem of improving the latency of image recognition. CNNs' large data at each layer's input, filters, … (a rough per-layer footprint calculation follows the external link below).
External link:
http://arxiv.org/abs/2106.14138
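A rough per-layer footprint calculation (made-up ResNet-style dimensions, fp16, not from the paper) illustrating how much input, filter, and output data a single convolution layer touches.

```python
# Per-layer data footprint and MAC count for one conv layer (made-up sizes).
def conv_layer_stats(h, w, c_in, c_out, k, bytes_per_elem=2):
    input_bytes  = h * w * c_in * bytes_per_elem
    filter_bytes = k * k * c_in * c_out * bytes_per_elem
    output_bytes = h * w * c_out * bytes_per_elem      # 'same' padding, stride 1
    macs = h * w * c_out * k * k * c_in
    return input_bytes, filter_bytes, output_bytes, macs

inp, filt, out, macs = conv_layer_stats(h=56, w=56, c_in=256, c_out=256, k=3)
print(f"input {inp/2**20:.1f} MiB, filters {filt/2**20:.1f} MiB, "
      f"output {out/2**20:.1f} MiB, {macs/1e9:.2f} GMACs")
```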
Convolutional neural networks (CNNs) are emerging as powerful tools for visual recognition. Recent architecture proposals for sparse CNNs exploit zeros in the feature maps and filters for performance and energy without losing accuracy. Sparse architectures … (a minimal zero-skipping sketch follows the external link below).
External link:
http://arxiv.org/abs/2104.08734
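A minimal software analogue of the zero-skipping idea that sparse accelerators exploit (not the hardware design in the paper): store only (index, value) pairs for nonzeros and multiply only where both operands are nonzero.

```python
# Zero-skipping dot product over compressed (index, value) pairs -- a software
# illustration of the core idea behind sparse CNN accelerators.
def compress(vec):
    return [(i, v) for i, v in enumerate(vec) if v != 0]

def sparse_dot(a_pairs, b_pairs):
    b_map = dict(b_pairs)
    # Only positions where BOTH operands are nonzero contribute any work.
    return sum(v * b_map[i] for i, v in a_pairs if i in b_map)

activations = [0, 0, 3, 0, 1, 0, 0, 2]    # feature-map slice with many zeros
weights     = [5, 0, 2, 0, 0, 0, 4, 1]    # pruned filter slice
a, w = compress(activations), compress(weights)
print("dense length:", len(activations), "-> nonzero pairs:", len(a), len(w))
print("dot =", sparse_dot(a, w))          # 3*2 + 2*1 = 8
```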
We propose Booster, a novel accelerator for gradient boosting trees based on the unique characteristics of gradient boosting models. We observe that the dominant steps of gradient boosting training (accounting for 90-98% of training time) … (a histogram-based split-finding sketch follows the external link below).
External link:
http://arxiv.org/abs/2011.02022
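One example of the simple, regular work that dominates gradient-boosted-tree training is histogram-based split finding over pre-binned features. The sketch below uses made-up data and an XGBoost-style gain formula; it is illustrative and not necessarily Booster's exact formulation.

```python
# Histogram-based split scoring: one pass to bin gradients, then a short scan
# over candidate split points (illustrative; not Booster's exact formulation).
import numpy as np

def best_split(binned_feature, grad, hess, n_bins=16, lam=1.0):
    g_hist = np.bincount(binned_feature, weights=grad, minlength=n_bins)
    h_hist = np.bincount(binned_feature, weights=hess, minlength=n_bins)
    g_tot, h_tot = g_hist.sum(), h_hist.sum()
    best_gain, best_bin = 0.0, None
    g_left = h_left = 0.0
    for b in range(n_bins - 1):                 # candidate split after bin b
        g_left += g_hist[b]; h_left += h_hist[b]
        g_right, h_right = g_tot - g_left, h_tot - h_left
        gain = (g_left**2 / (h_left + lam) + g_right**2 / (h_right + lam)
                - g_tot**2 / (h_tot + lam))
        if gain > best_gain:
            best_gain, best_bin = gain, b
    return best_bin, best_gain

rng = np.random.default_rng(0)
bins = rng.integers(0, 16, size=10_000)         # pre-binned feature values
grad = rng.normal(size=10_000) + (bins > 8)     # gradients correlated with bins
hess = np.ones(10_000)
print(best_split(bins, grad, hess))
```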
Author:
Xue, Jiachen, Chaudhry, Muhammad Usama, Vamanan, Balajee, Vijaykumar, T. N., Thottethodi, Mithuna
Though Remote Direct Memory Access (RDMA) promises to reduce datacenter network latencies significantly compared to TCP (e.g., 10x), end-to-end congestion control in the presence of incasts is a challenge. Targeting the full generality of the congestion … (a back-of-envelope incast calculation follows the external link below).
External link:
http://arxiv.org/abs/1805.11158
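A back-of-envelope incast calculation with made-up numbers (not from the paper): many synchronized senders share one receiver port, so the instantaneous offered load far exceeds the drain rate and the burst must either fit in switch buffers or be paced by congestion control.

```python
# Why incast is hard: synchronized responses converge on one receiver link.
line_rate_gbps = 100          # receiver port speed (assumed)
senders = 64                  # synchronized responders (assumed)
response_kb = 32              # per-sender burst size (assumed)

aggregate_gbps = senders * line_rate_gbps        # instantaneous offered load
burst_bytes = senders * response_kb * 1024
drain_bytes_per_us = line_rate_gbps * 1e9 / 8 / 1e6
print(f"offered load: {aggregate_gbps} Gbps into a {line_rate_gbps} Gbps port")
print(f"burst of {burst_bytes/1024:.0f} KiB drains in "
      f"{burst_bytes/drain_bytes_per_us:.1f} us (must be buffered or paced)")
```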
Enabling Efficient Dynamic Resizing of Large DRAM Caches via A Hardware Consistent Hashing Mechanism
Author:
Chang, Kevin K., Loh, Gabriel H., Thottethodi, Mithuna, Eckert, Yasuko, O'Connor, Mike, Manne, Srilatha, Hsu, Lisa, Subramanian, Lavanya, Mutlu, Onur
Die-stacked DRAM has been proposed for use as a large, high-bandwidth, last-level cache with hundreds or thousands of megabytes of capacity. Not all workloads (or phases) can productively utilize this much cache space, however. Unfortunately, the unused … (a minimal consistent-hashing sketch follows the external link below).
External link:
http://arxiv.org/abs/1602.00722
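A minimal software sketch of ring-based consistent hashing, the general technique named in the title (the paper's hardware mechanism presumably differs): when one capacity slice is removed, only the blocks that hashed to it are remapped rather than reshuffling everything.

```python
# Ring-based consistent hashing: removing one capacity "slice" only remaps the
# blocks that hashed to it. Software sketch, not the paper's hardware design.
import hashlib, bisect

def h(key):
    return int(hashlib.sha1(str(key).encode()).hexdigest(), 16) % (1 << 32)

class Ring:
    def __init__(self, slices):
        self.points = sorted((h(("slice", s)), s) for s in slices)

    def lookup(self, block_addr):
        keys = [p for p, _ in self.points]
        i = bisect.bisect(keys, h(block_addr)) % len(self.points)
        return self.points[i][1]

blocks = range(100_000)
before = Ring(range(8))                      # 8 capacity slices enabled
after  = Ring(range(7))                      # resize: disable slice 7
moved = sum(before.lookup(b) != after.lookup(b) for b in blocks)
print(f"{moved / len(blocks):.1%} of blocks remapped after removing 1 of 8 slices")
```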
Published in:
Journal of Parallel and Distributed Computing, May 2013, 73(5): 608-620