Fast Depth and Inter Mode Prediction for Quality Scalable High Efficiency Video Coding

Autor: Weisheng Li, Ce Zhu, Yu Sun, Frederic Dufaux, Dayong Wang
Přispěvatelé: Chongqing University of Posts and Telecommunications, Department of Computer and Information Sciences [Alabama] (CIS), University of Alabama at Birmingham [ Birmingham] (UAB), University of Electronic Science and Technology of China (UESTC), Laboratoire des signaux et systèmes (L2S), Université Paris-Sud - Paris 11 (UP11)-CentraleSupélec-Centre National de la Recherche Scientifique (CNRS)
Rok vydání: 2020
Předmět:
Zdroj: IEEE Transactions on Multimedia
IEEE Transactions on Multimedia, Institute of Electrical and Electronics Engineers, 2020, 22 (4), pp.833-845. ⟨10.1109/TMM.2019.2937240⟩
ISSN: 1941-0077
1520-9210
DOI: 10.1109/tmm.2019.2937240
Popis: International audience; The scalable high efficiency video coding (SHVC) is an extension of high efficiency video coding (HEVC), which introduces multiple layers and inter-layer prediction, thus significantly increases the coding complexity on top of the already complicated HEVC encoder. In inter prediction for quality SHVC, in order to determine the best possible mode at each depth level, a coding tree unit can be recursively split into four depth levels, including merge mode, inter2Nx2N, inter2NxN, interNx2N, interNxN, in-ter2NxnU, inter2NxnD, internLx2N and internRx2N, intra modes and inter-layer reference (ILR) mode. This can obtain the highest coding efficiency, but also result in very high coding complexity. Therefore, it is crucial to improve coding speed while maintaining coding efficiency. In this research, we have proposed a new depth level and inter mode prediction algorithm for quality SHVC. First, the depth level candidates are predicted based on inter-layer correlation, spatial correlation and its correlation degree. Second, for a given depth candidate, we divide mode prediction into square and non-square mode predictions respectively. Third, in the square mode prediction, ILR and merge modes are predicted according to depth correlation, and early terminated whether residual distribution follows a Gaussian distribution. Moreover, ILR mode, merge mode and inter2Nx2N are early terminated based on significant differences in Rate Distortion (RD) costs. Fourth, if the early termination condition cannot be satisfied, non-square modes are further predicted based on significant differences in expected values of residual coefficients. Finally, inter-layer and spatial correlations are combined with residual distribution to examine whether to early terminate depth selection. Experimental results have demonstrated that, on average, the proposed algorithm can achieve a time saving of 71.14%, with a bit rate increase of 1.27%.
Databáze: OpenAIRE