Showing 1 - 10 of 111 for search: '"Sinclair, Matthew P."'
Modern accelerators like GPUs are increasingly executing independent operations concurrently to improve the device's compute utilization. However, effectively harnessing it on GPUs for important primitives such as general matrix multiplications (GEMM
External link:
http://arxiv.org/abs/2409.02227
Large-scale computing systems are increasingly using accelerators such as GPUs to enable peta- and exa-scale levels of compute to meet the needs of Machine Learning (ML) and scientific computing applications. Given the widespread and growing use of M
External link:
http://arxiv.org/abs/2408.11919
Authors:
Xu, Yiyang; Xu, Hao; Sinclair, Matthew; Puyol-Antón, Esther; Niederer, Steven A; Chiribiri, Amedeo; Williams, Steven E; Williams, Michelle C; Young, Alistair A
Cardiac magnetic resonance (CMR) imaging and computed tomography (CT) are two common non-invasive imaging methods for assessing patients with cardiovascular disease. CMR typically acquires multiple sparse 2D slices, with unavoidable respiratory motio
External link:
http://arxiv.org/abs/2408.07532
Large Language Models increasingly rely on distributed techniques for their training and inference. These techniques require communication across devices which can reduce scaling efficiency as the number of devices increases. While some distributed t
External link:
http://arxiv.org/abs/2401.16677
Convolutional neural networks (CNNs) often suffer from poor performance when tested on target data that differs from the training (source) data distribution, particularly in medical imaging applications where variations in imaging protocols across di
External link:
http://arxiv.org/abs/2307.00676
Authors:
Upasani, Gaurang; Sinclair, Matthew D.; Sampson, Adrian; Ranganathan, Parthasarathy; Patterson, David; Shah, Shaan; Parthasarathy, Nidhi; Jain, Rutwik
Computer Architecture, broadly, involves optimizing hardware and software for current and future processing systems. Although there are several other top venues to publish Computer Architecture research, including ASPLOS, HPCA, and MICRO, ISCA (the I
External link:
http://arxiv.org/abs/2306.03964
Accel-Sim is a widely used computer architecture simulator that models the behavior of modern NVIDIA GPUs in great detail. However, although Accel-Sim and the underlying GPGPU-Sim model many of the features of real GPUs, thus far it has not been able
External link:
http://arxiv.org/abs/2304.11136
Scaling neural network models has delivered dramatic quality gains across ML problems. However, this scaling has increased the reliance on efficient distributed training techniques. Accordingly, as with other distributed computing scenarios, it is im
External link:
http://arxiv.org/abs/2302.02825
Extracting complex structures from grid-based data is a common key step in automated medical image analysis. The conventional solution to recovering tree-structured geometries typically involves computing the minimal cost path through intermediate re
External link:
http://arxiv.org/abs/2301.00447
Authors:
Sinha, Prasoon; Guliani, Akhil; Jain, Rutwik; Tran, Brandon; Sinclair, Matthew D.; Venkataraman, Shivaram
Scientists are increasingly exploring and utilizing the massive parallelism of general-purpose accelerators such as GPUs for scientific breakthroughs. As a result, datacenters, hyperscalers, national computing centers, and supercomputers have procure
External link:
http://arxiv.org/abs/2208.11035