Abstract: |
The quadtree partition process, which recursively divides coding tree units (CTUs) into coding units (CUs), accounts for much of the complexity of high-efficiency video coding (HEVC/H.265). Determining the coding unit partition depth through rate-distortion optimisation is computationally expensive in HEVC. This article proposes a deep-learning-based system that determines the coding unit partition depth in less time for HEVC intra-prediction. The proposed system reduces computational complexity by removing the rate-distortion optimisation. It comprises two main blocks: a pre-processing block and a deep learning block. During the pre-processing phase, the spatial resolution of the input data is drastically reduced, enabling the neural network model to adapt quickly to the input sample and extract more meaningful features. Two distinct deep learning architectures are proposed, CU-VGG16 and CU-VGG19. Pre-processed coding units (16 × 16) are the input to the deep learning architecture, and the corresponding coding unit depths (0, 1, 2, 3) are the output. To compare the accuracy of coding unit depth prediction in the two proposed models, we created a database spanning varying resolutions. The performance of the proposed models was evaluated by replacing the CU partition block of traditional HEVC with the proposed systems and comparing the bit rate and encoding time against traditional HEVC. The results demonstrate that the proposed architecture with the CU-VGG16 and CU-VGG19 designs speeds up coding unit partitioning by 87.15% and 87.70%, respectively, compared to standard HEVC. [ABSTRACT FROM AUTHOR] |
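The input/output contract described in the abstract (a 16 × 16 pre-processed block in, a depth label 0 to 3 out) can be illustrated with a minimal sketch. The abstract does not say how the spatial resolution is reduced, so the average pooling below is an assumption, as are the 64 × 64 CTU size, the 8-bit sample range, and the helper names `downsample_block` and `cu_size_for_depth`. Only the depth-to-size relation (each depth level halves the CU edge length) comes from the HEVC quadtree itself.

```python
import numpy as np

CTU_SIZE = 64  # assumed CTU size; HEVC also allows 16 and 32


def downsample_block(block: np.ndarray, out_size: int = 16) -> np.ndarray:
    """Average-pool a square 8-bit luma block to out_size x out_size.

    Average pooling is an illustrative choice, not the paper's
    documented pre-processing step.
    """
    n = block.shape[0]
    if block.shape != (n, n) or n % out_size != 0:
        raise ValueError("block must be square with a side divisible by out_size")
    factor = n // out_size
    # Split each axis into (out_size, factor) and average over each factor x factor tile.
    pooled = block.reshape(out_size, factor, out_size, factor).mean(axis=(1, 3))
    # Scale 8-bit samples to [0, 1], a common choice for network input.
    return pooled.astype(np.float32) / 255.0


def cu_size_for_depth(depth: int, ctu_size: int = CTU_SIZE) -> int:
    """Map a predicted quadtree depth (0..3) to the CU edge length in pixels."""
    if depth not in (0, 1, 2, 3):
        raise ValueError("depth must be 0, 1, 2 or 3")
    # Each quadtree level halves the edge length.
    return ctu_size >> depth
```

With a 64 × 64 CTU, depths 0, 1, 2 and 3 correspond to CU sizes of 64, 32, 16 and 8 pixels, so a classifier that outputs one of four labels per block is enough to describe the partition at that position.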