What are Textons?

  • Authors:
  • Song-Chun Zhu, Cheng-En Guo, Yizhou Wang, Zijian Xu

  • Affiliations:
  • Departments of Statistics and Computer Science, University of California, Los Angeles, Los Angeles, USA 90095 (all authors)

  • Venue:
  • International Journal of Computer Vision - Special Issue on Texture Analysis and Synthesis
  • Year:
  • 2005

Abstract

Textons refer to fundamental micro-structures in natural images (and videos) and are considered the atoms of pre-attentive human visual perception (Julesz, 1981). Unfortunately, the word "texton" remains a vague concept in the literature for lack of a good mathematical model. In this article, we first present a three-level generative image model for learning textons from texture images. In this model, an image is a superposition of a number of image bases selected from an over-complete dictionary of Gabor and Laplacian-of-Gaussian functions at various locations, scales, and orientations. These image bases are, in turn, generated by a smaller number of texton elements selected from a dictionary of textons. By analogy to the waveform-phoneme-word hierarchy in speech, the pixel-base-texton hierarchy presents an increasingly abstract visual description and leads to dimension reduction and variable decoupling. By fitting the generative model to observed images, we can learn the texton dictionary as parameters of the generative model. The paper then proceeds to study the geometric, dynamic, and photometric structures of the texton representation by further extending the generative model to account for motion and illumination variations. (1) For the geometric structures, a texton consists of a number of image bases with deformable spatial configurations. The geometric structures are learned from static texture images. (2) For the dynamic structures, the motion of a texton is characterized by a Markov chain model in time, which can sometimes switch geometric configurations during the movement. We call the moving textons "motons". The dynamic models are learned using the trajectories of the textons inferred from video sequences. (3) For the photometric structures, a texton represents the set of images of a 3D surface element under varying illuminations and is called a "lighton" in this paper. We adopt an illumination-cone representation where a lighton is a texton triplet; for a given light source, a lighton image is generated as a linear sum of the three texton bases. We present a sequence of experiments for learning the geometric, dynamic, and photometric structures from images and videos, and we also present comparison studies with K-means clustering, sparse coding, independent component analysis, and transformed component analysis. We shall discuss how general textons can be learned from generic natural images.
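
The first level of the model described above treats an image as a linear superposition of image bases drawn from an over-complete dictionary of Gabor and Laplacian-of-Gaussian functions at various locations, scales, and orientations. The sketch below is a minimal illustration of that idea only, not the authors' implementation; the particular basis formulas, parameter values, and the `superpose` helper are assumptions chosen for clarity.

```python
# Minimal sketch (illustrative assumptions, not the paper's code): render an
# image as a superposition of Gabor and Laplacian-of-Gaussian bases placed at
# chosen locations, scales, and orientations.
import numpy as np

def gabor(size, scale, orientation, phase=0.0):
    """Oriented Gabor patch of side `size` pixels."""
    r = size // 2
    y, x = np.mgrid[-r:r + 1, -r:r + 1]
    xr = x * np.cos(orientation) + y * np.sin(orientation)
    yr = -x * np.sin(orientation) + y * np.cos(orientation)
    envelope = np.exp(-(xr**2 + yr**2) / (2.0 * scale**2))
    carrier = np.cos(2.0 * np.pi * xr / (2.0 * scale) + phase)
    return envelope * carrier

def laplacian_of_gaussian(size, scale):
    """Isotropic blob-like LoG patch of side `size` pixels."""
    r = size // 2
    y, x = np.mgrid[-r:r + 1, -r:r + 1]
    r2 = (x**2 + y**2) / (2.0 * scale**2)
    return (1.0 - r2) * np.exp(-r2)

def superpose(shape, placements):
    """Sum selected bases into an image.

    `placements` is a list of (patch, row, col, coefficient) tuples giving
    each selected base, the pixel it is centered on, and its amplitude.
    """
    image = np.zeros(shape)
    for patch, row, col, coeff in placements:
        r = patch.shape[0] // 2
        image[row - r:row + r + 1, col - r:col + r + 1] += coeff * patch
    return image

# Example: two oriented Gabor bases and one blob composed into a 64x64 image.
img = superpose((64, 64), [
    (gabor(15, 3.0, 0.0), 20, 20, 1.0),
    (gabor(15, 3.0, np.pi / 4), 40, 40, 0.8),
    (laplacian_of_gaussian(11, 2.0), 30, 50, 1.2),
])
```

In the full model, the selection of bases and their coefficients would be inferred from an observed image, and groups of bases would in turn be explained by texton elements at the next level of the hierarchy.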