Autor: |
Li, Qinjian, Zhong, Chengwen, Zhao, Kaiyong, Mei, Xinxin, Chu, Xiaowen |
Zdroj: |
2012 IEEE 14th International Conference on High Performance Computing & Communication & 2012 IEEE 9th International Conference on Embedded Software & Systems; 1/ 1/2012, p843-848, 6p |
Abstrakt: |
GPU is continuing its trend of vastly outperforming CPU while becoming more general purpose. In order to improve the efficiency of AES algorithm, this paper proposed a CUDA implementation of Electronic Codebook (ECB) mode encoding process and Cipher Feedback (CBC) mode decoding process on GPU. In our implementation, the frequently accessed T-boxes were allocated on on-chip shared memory and the granularity that one thread handles a 16 Bytes AES block was adopted. Finally, we achieved the highest performance of around 60 Gbps throughput on NVIDIA Tesla C2050 GPU, which runs up to 50 times faster than a sequential implementation based on Intel Core i7-920 2.66GHz CPU. In addition, we discussed the optimization under some practical application scenarios such as overlapping GPU processing and data transfer. [ABSTRACT FROM PUBLISHER] |
Databáze: |
Complementary Index |
Externí odkaz: |
|