A first approach to speeding-up the inter mode selection in MPEG-2/H.264 transcoders using machine learning

Authors:
Gerardo Fernández-Escribano;Hari Kalva;Pedro Cuenca;Luis Orozco-Barbosa
Affiliations:
Instituto de Investigación en Informática, Universidad de Castilla-La Mancha, Albacete, Spain 02071;Department of Computer Science and Engineering, Florida Altantic University, Boca Raton, USA 33431;Instituto de Investigación en Informática, Universidad de Castilla-La Mancha, Albacete, Spain 02071;Instituto de Investigación en Informática, Universidad de Castilla-La Mancha, Albacete, Spain 02071
Venue:
Multimedia Tools and Applications
Year:
2007

Citing 6
Cited 0

C4.5: programs for machine learning

C4.5: programs for machine learning
Efficient block size selection for MPEG-2 to H.264 transcoding

Proceedings of the 12th annual ACM international conference on Multimedia
Very low complexity MPEG-2 to H.264 transcoding using machine learning

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Requirements for motion-estimation search range in MPEG-2 coded video

IBM Journal of Research and Development
Motion vector refinement for high-performance transcoding

IEEE Transactions on Multimedia

Quantified Score

Hi-index	0.00

Visualization

Abstract

The H.264 standard achieves much higher coding efficiency than the MPEG-2 standard, due to its improved inter and intra prediction modes which come with a cost of higher computation complexity. Transcoding MPEG-2 video to H.264 is important to enable gradual migration to H.264. However, given the significant differences between the MPEG-2 and the H.264 coding algorithms, transcoding is much more complex and new approaches to transcoding are necessary. In this paper, we introduce and evaluate a low complexity macroblock partition mode decision algorithm, to be used as part of a high-efficient inter-frame prediction in MPEG-2 to H.264 transcoder. The proposed tools are used to compute an optimal MB coding mode decision with significantly reduced computational complexity. Specifically, we achieve the computational savings by using the following MB information coming from MPEG-2: the MB coding modes, the coded block pattern (CBPC) in MPEG-2, and the mean and variance of the 16 4驴脳驴4 sub blocks of the MPEG-2 residual MBs. We use data mining algorithms to develop a decision tree for H.264 coding mode decisions. The decision trees are built using RD optimized mode decisions and result in highly efficient mode decisions, with significantly reduced computational complexity. The proposed transcoder is 35% faster than the RD optimized H.264 reference transcoder without a significant PSNR degradation (0.05 dB on average). The proposed transcoder performs over 0.4 dB better on average than the SAE cost based H.264 transcoding.