Výsledky vyhledávání - "Weikang Qiao"

Autor: Yuze Chi, Weikang Qiao, Atefeh Sohrabizadeh, Jie Wang, Jason Cong

In the past few years, domain-specific accelerators (DSAs), such as Google's Tensor Processing Units, have shown to offer significant performance and energy efficiency over general-purpose CPUs. An important question is whether typical software devel

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::75cc995366ff32ba3b31df6d3d6b4a3d
http://arxiv.org/abs/2209.02951

Zobrazit plný text záznamu

TopSort: A High-Performance Two-Phase Sorting Accelerator Optimized on HBM-based FPGAs

Autor: Weikang Qiao, Licheng Guo, Zhenman Fang, Mau-Chung Frank Chang, Jason Cong

The emergence of high-bandwidth memory (HBM) brings new opportunities to boost the performance of sorting acceleration on FPGAs, which was conventionally bounded by the available off-chip memory bandwidth. However, it is nontrivial for designers to f

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::066ffbf9a970c0f4695d63ecee68dd83
http://arxiv.org/abs/2205.07991

Zobrazit plný text záznamu

RapidStream : parallel physical implementation of FPGA HLS designs

Autor: Licheng Guo, Pongstorn Maidee, Yun Zhou, Chris Lavin, Jie Wang, Yuze Chi, Weikang Qiao, Alireza Kaviani, Zhiru Zhang, Jason Cong

Publikováno v: FPGA '22 : proceedings of the 2022 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays

FPGAs require a much longer compilation cycle than conventional computing platforms like CPUs. In this paper, we shorten the overall compilation time by co-optimizing the HLS compilation (C-to-RTL) and the back-end physical implementation (RTL-to-bit

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c53a2ab5e75d3bc0862cc270c952fb41
https://biblio.ugent.be/publication/8742525

Zobrazit plný text záznamu

TAPA: A Scalable Task-Parallel Dataflow Programming Framework for Modern FPGAs with Co-Optimization of HLS and Physical Design

Autor: Licheng Guo, Zhenman Fang, Zhiru Zhang, Jason Cong, Yuze Chi, Jason Lau, Linghao Song, Xingyu Tian, Moazin Khatti, Weikang Qiao, Jie Wang, Ecenur Ustun

Publikováno v: Web of Science

In this paper, we propose TAPA, an end-to-end framework that compiles a C++ task-parallel dataflow program into a high-frequency FPGA accelerator. Compared to existing solutions, TAPA has two major advantages. First, TAPA provides a set of convenient

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::3b600742f18716ec6675d992c2d33add

Zobrazit plný text záznamu

FANS: FPGA-Accelerated Near-Storage Sorting

Autor: Jason Cong, Weikang Qiao, Mau-Chung Frank Chang, Licheng Guo, Jihun Oh

Publikováno v: FCCM

Large-scale sorting is always an important yet demanding task for data center applications. In addition to powerful processing capability, high-performance sorting system requires efficient utilization of the available bandwidth of various levels in

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::8d481e3629d24840a3f1335680df4a0d
https://doi.org/10.1109/fccm51124.2021.00020

Zobrazit plný text záznamu

HBM Connect: High-Performance HLS Interconnect for FPGA HBM

Autor: Jason Cong, Young-kyu Choi, Nikola Samardzic, Weikang Qiao, Yuze Chi

Publikováno v: FPGA

With the recent release of High Bandwidth Memory (HBM) based FPGA boards, developers can now exploit unprecedented external memory bandwidth. This allows more memory-bounded applications to benefit from FPGA acceleration. However, fully utilizing the

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::beb23f5b68267aab33f05d428df06543
https://doi.org/10.1145/3431920.3439301

Zobrazit plný text záznamu

AutoBridge: Coupling Coarse-Grained Floorplanning and Pipelining for High-Frequency HLS Design on Multi-Die FPGAs

Autor: Jie Wang, Jason Cong, Zhiru Zhang, Yuze Chi, Jason Lau, Licheng Guo, Weikang Qiao, Ecenur Ustun

Publikováno v: FPGA

Despite an increasing adoption of high-level synthesis (HLS) for its design productivity advantages, there remains a significant gap in the achievable clock frequency between an HLS-generated design and a handcrafted RTL one. A key factor that limits

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::3ff1a3952aee4f8a74022b6967f1a740
https://europepmc.org/articles/PMC8041363/

Zobrazit plný text záznamu

Bonsai: High-Performance Adaptive Merge Tree Sorting

Autor: Weikang Qiao, Nikola Samardzic, Mau-Chung Frank Chang, Vaibhav Aggarwal, Jason Cong

Publikováno v: ISCA

Sorting is a key computational kernel in many big data applications. Most sorting implementations focus on a specific input size, record width, and hardware configuration. This has created a wide array of sorters that are optimized only to a narrow a

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::3d0e2110cc974e0e882b1dcf0a763724
https://doi.org/10.1109/isca45697.2020.00033

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání