Zobrazeno 1 - 10
of 209
pro vyhledávání: '"Denolf P"'
Quantization reduces the model's hardware costs, such as data movement, storage, and operations like multiply and addition. It also affects the model's behavior by degrading the output quality. Therefore, there is a need for methods that preserve the
Externí odkaz:
http://arxiv.org/abs/2410.11203
Autor:
Rouhani, Bita Darvish, Zhao, Ritchie, More, Ankit, Hall, Mathew, Khodamoradi, Alireza, Deng, Summer, Choudhary, Dhruv, Cornea, Marius, Dellinger, Eric, Denolf, Kristof, Dusan, Stosic, Elango, Venmugil, Golub, Maximilian, Heinecke, Alexander, James-Roxby, Phil, Jani, Dharmesh, Kolhe, Gaurav, Langhammer, Martin, Li, Ada, Melnick, Levi, Mesmakhosroshahi, Maral, Rodriguez, Andres, Schulte, Michael, Shafipour, Rasoul, Shao, Lei, Siu, Michael, Dubey, Pradeep, Micikevicius, Paulius, Naumov, Maxim, Verrilli, Colin, Wittig, Ralph, Burger, Doug, Chung, Eric
Narrow bit-width data formats are key to reducing the computational and storage costs of modern deep learning applications. This paper evaluates Microscaling (MX) data formats that combine a per-block scaling factor with narrow floating-point and int
Externí odkaz:
http://arxiv.org/abs/2310.10537
Autor:
Singh, Gagandeep, Khodamoradi, Alireza, Denolf, Kristof, Lo, Jack, Gómez-Luna, Juan, Melber, Joseph, Bisca, Andra, Corporaal, Henk, Mutlu, Onur
Fast and accurate climate simulations and weather predictions are critical for understanding and preparing for the impact of climate change. Real-world weather and climate modeling consist of complex compound stencil kernels that do not perform well
Externí odkaz:
http://arxiv.org/abs/2303.03509
Autor:
Weng, Olivia, Marcano, Gabriel, Loncar, Vladimir, Khodamoradi, Alireza, Sheybani, Nojan, Meza, Andres, Koushanfar, Farinaz, Denolf, Kristof, Duarte, Javier Mauricio, Kastner, Ryan
Deep neural networks use skip connections to improve training convergence. However, these skip connections are costly in hardware, requiring extra buffers and increasing on- and off-chip memory utilization and bandwidth requirements. In this paper, w
Externí odkaz:
http://arxiv.org/abs/2301.07247
Autor:
Zhuang, Jinming, Lau, Jason, Ye, Hanchen, Yang, Zhuoping, Du, Yubo, Lo, Jack, Denolf, Kristof, Neuendorffer, Stephen, Jones, Alex, Hu, Jingtong, Chen, Deming, Cong, Jason, Zhou, Peipei
Dense matrix multiply (MM) serves as one of the most heavily used kernels in deep learning applications. To cope with the high computation demands of these applications, heterogeneous architectures featuring both FPGA and dedicated ASIC accelerators
Externí odkaz:
http://arxiv.org/abs/2301.02359
Autor:
Singh, Gagandeep, Alser, Mohammed, Denolf, Kristof, Firtina, Can, Khodamoradi, Alireza, Cavlak, Meryem Banu, Corporaal, Henk, Mutlu, Onur
Nanopore sequencing generates noisy electrical signals that need to be converted into a standard string of DNA nucleotide bases using a computational step called basecalling. The accuracy and speed of basecalling have critical implications for all la
Externí odkaz:
http://arxiv.org/abs/2211.03079
Autor:
Gagandeep Singh, Mohammed Alser, Kristof Denolf, Can Firtina, Alireza Khodamoradi, Meryem Banu Cavlak, Henk Corporaal, Onur Mutlu
Publikováno v:
Genome Biology, Vol 25, Iss 1, Pp 1-29 (2024)
Abstract Nanopore sequencing generates noisy electrical signals that need to be converted into a standard string of DNA nucleotide bases using a computational step called basecalling. The performance of basecalling has critical implications for all l
Externí odkaz:
https://doaj.org/article/10f69e22b63a4076b05bc9347cff3aa4
Publikováno v:
Journal of the Belgian Society of Radiology, Vol 108, Iss 1, Pp 64-64 (2024)
Epithelioid hemangioendothelioma (EHE) is a rare vascular tumor that can originate in various parenchymatous organs, soft tissue, and bone. Extrahepatic involvement is exceedingly rare. In this case, multifocal disease in the spleen and bone was pres
Externí odkaz:
https://doaj.org/article/a3e7f9c0a6f840aeacac00f8573a9b6d
Autor:
E. Wezenbeek, S. Denolf, J. G. Bourgois, R. M. Philippaerts, B. De Winne, T. M. Willems, E. Witvrouw, S. Verstockt, J. Schuermans
Publikováno v:
Annals of Medicine, Vol 55, Iss 1 (2023)
AbstractObjectives To investigate possible persistent performance deficits after severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection in elite athletes.Methods A prospective cohort study in three Belgian professional male football t
Externí odkaz:
https://doaj.org/article/a3f35e6b54ef49c2806167728b5037de
Autor:
Qasaimeh, Murad, Denolf, Kristof, Lo, Jack, Vissers, Kees, Zambreno, Joseph, Jones, Phillip H.
Developing high performance embedded vision applications requires balancing run-time performance with energy constraints. Given the mix of hardware accelerators that exist for embedded computer vision (e.g. multi-core CPUs, GPUs, and FPGAs), and thei
Externí odkaz:
http://arxiv.org/abs/1906.11879