Automatic Image Colorization Via Multimodal Predictions

Authors:
Guillaume Charpiat;Matthias Hofmann;Bernhard Schölkopf
Affiliations:
Pulsar Team, INRIA, Sophia-Antipolis, France and Max Planck Institute for Biological Cybernetics, Tübingen, Germany;Wolfson Medical Vision Lab, Dpt. of Engineering Science, University of Oxford, UK and Max Planck Institute for Biological Cybernetics, Tübingen, Germany;Max Planck Institute for Biological Cybernetics, Tübingen, Germany
Venue:
ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part III
Year:
2008

Citing 0
Cited 7

Semantic colorization with internet images

Proceedings of the 2011 SIGGRAPH Asia Conference
Towards automatic concept transfer

Proceedings of the ACM SIGGRAPH/Eurographics Symposium on Non-Photorealistic Animation and Rendering
Automatic grayscale image colorization using histogram regression

Pattern Recognition Letters
Special Section on CANS: Toward automatic and flexible concept transfer

Computers and Graphics
Image colorization using similar images

Proceedings of the 20th ACM international conference on Multimedia
Image colorization with an affective word

CVM'12 Proceedings of the First international conference on Computational Visual Media
Probabilistic color-by-numbers: suggesting pattern colorizations using factor graphs

ACM Transactions on Graphics (TOG) - SIGGRAPH 2013 Conference Proceedings

Quantified Score

Hi-index	0.00

Visualization

Abstract

We aim to color greyscale images automatically, without any manual intervention. The color proposition could then be interactively corrected by user-provided color landmarks if necessary. Automatic colorization is nontrivial since there is usually no one-to-one correspondence between color and local texture. The contribution of our framework is that we deal directly with multimodality and estimate, for each pixel of the image to be colored, the probability distribution of all possible colors, instead of choosing the most probable color at the local level. We also predict the expected variation of color at each pixel, thus defining a non-uniform spatial coherency criterion. We then use graph cuts to maximize the probability of the whole colored image at the global level. We work in the L-a-b color space in order to approximate the human perception of distances between colors, and we use machine learning tools to extract as much information as possible from a dataset of colored examples. The resulting algorithm is fast, designed to be more robust to texture noise, and is above all able to deal with ambiguity, in contrary to previous approaches.