Zobrazeno 1 - 10
of 75
pro vyhledávání: '"Tor M. Aamodt"'
Publikováno v:
Proceedings of the 49th Annual International Symposium on Computer Architecture.
Recent works have demonstrated that large quantum circuits can be cut and decomposed into smaller clusters of quantum circuits with fewer qubits that can be executed independently on a small quantum computer. Classical post-processing then combines t
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::a446ab6df6877e71aeb2b3baa6d46276
Autor:
Ehsan Atoofian, Amirali Baniasadi, Milad Mohammadi, Tor M. Aamodt, William J. Dally, Song Han
Publikováno v:
IEEE Transactions on Computers. 69:453-465
The branch predictor unit (BPU) is among the main energy consuming components in out-of-order (OoO) processors. For integer applications, we find 16 percent of the processor energy is consumed by the BPU. BPU is accessed in parallel with the instruct
Autor:
Tor M. Aamodt, Scott Peverelle, Amogh Manjunath, Vijay Kandiah, Junrui Pan, Mahmoud Khairy, Timothy G. Rogers, Nikos Hardavellas
Publikováno v:
MICRO
Graphics Processing Units (GPUs) are rapidly dominating the accelerator space, as illustrated by their wide-spread adoption in the data analytics and machine learning markets. At the same time, performance per watt has emerged as a crucial evaluation
Autor:
Francois Demoullin, Lufei Liu, Mohammadreza Saed, Tor M. Aamodt, Tyler Nowicki, Wesley Chang, Yuan Hsi Chou, David Pankratz
Publikováno v:
MICRO
Ray tracing has been used for years in motion picture to generate photorealistic images while faster raster-based shading techniques have been preferred for video games to meet real-time requirements. However, recent Graphics Processing Units (GPUs)
Autor:
Matthew D. Sinclair, Joseph Devietti, Yuan Hsi Chou, Christopher Ng, Timothy G. Rogers, Jeremy Intan, Tor M. Aamodt, Shaylin Cattell
Publikováno v:
MICRO
Deterministic execution for GPUs is a desirable property as it helps with debuggability and reproducibility. It is also important for safety regulations, as safety critical workloads are starting to be deployed onto GPUs. Prior deterministic architec
Publikováno v:
PACT
A critical component of high-throughput processors such as GPGPUs is the network-on-chip (NoC) that interconnects the cores and the memory partitions together. Different NoC architectures for throughput processors have been proposed but they have oft
Autor:
Tor M. Aamodt, Negar Goli
Publikováno v:
CVPR
The success of Convolutional Neural Networks (CNNs) in various applications is accompanied by a significant increase in computation and training time. In this work, we focus on accelerating training by observing that about 90% of gradients are reusab
Publikováno v:
ISCA
A reduction in the time it takes to train machine learning models can be translated into improvements in accuracy. An important factor that increases training time in deep neural networks (DNNs) is the need to store large amounts of temporary data du
Publikováno v:
ISCA
In computer architecture, significant innovation frequently comes from industry. However, the simulation tools used by industry are often not released for open use, and even when they are, the exact details of industrial designs are not disclosed. As