Fast motion estimation for HEVC on graphics processing unit (GPU)
Autor: | Dongkyu Lee, Seoung-Jun Oh, Dong-Gyu Sim, Kee-Seong Cho |
---|---|
Rok vydání: | 2015 |
Předmět: |
Computer science
Graphics processing unit 02 engineering and technology Parallel computing Thread (computing) 020202 computer hardware & architecture Computer graphics CUDA Motion estimation 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing General-purpose computing on graphics processing units Encoder Information Systems Data compression |
Zdroj: | Journal of Real-Time Image Processing. 12:549-562 |
ISSN: | 1861-8219 1861-8200 |
DOI: | 10.1007/s11554-015-0522-6 |
Popis: | The recent video compression standard, HEVC (high efficiency video coding), will most likely be used in various applications in the near future. However, the encoding process is far too slow for real-time applications. At the same time, computing capabilities of GPUs (graphics processing units) have become more powerful in these days. In this paper, we have proposed a GPU-based parallel motion estimation (ME) algorithm to enhance the performance of an HEVC encoder. A frame is partitioned into two subframes for pipelined execution to improve GPU utilization. The flow chart is redetermined to solve data hazards in the pipelined execution. Two new methods are introduced in the proposed ME: decision of a representative search center position (RSCP) and warp-based concurrent parallel reduction (WCPR). A RSCP employs motion vectors of a co-located CTU in a previously encoded frame to solve a dependency problem in parallel computation with negligible coding loss. WCPR concurrently executes several parallel reduction operations, which increases the thread utilization from 20 to 89 % without any thread synchronization. The proposed encoder can make the portion of ME in the encoder negligible with 2.2 % bitrate increase against the HEVC test model (HM) encoder. In terms of ME, the proposed ME is 130.7 times faster than that of the HM encoder. |
Databáze: | OpenAIRE |
Externí odkaz: |