A parametric texture model based on deep convolutional features closely matches texture appearance for humans.

Journal: Journal of vision

Published Date: Oct 1, 2017

Abstract

Our visual environment is full of texture-"stuff" like cloth, bark, or gravel as distinct from "things" like dresses, trees, or paths-and humans are adept at perceiving subtle variations in material properties. To investigate image features important for texture perception, we psychophysically compare a recent parametric model of texture appearance (convolutional neural network [CNN] model) that uses the features encoded by a deep CNN (VGG-19) with two other models: the venerable Portilla and Simoncelli model and an extension of the CNN model in which the power spectrum is additionally matched. Observers discriminated model-generated textures from original natural textures in a spatial three-alternative oddity paradigm under two viewing conditions: when test patches were briefly presented to the near-periphery ("parafoveal") and when observers were able to make eye movements to all three patches ("inspection"). Under parafoveal viewing, observers were unable to discriminate 10 of 12 original images from CNN model images, and remarkably, the simpler Portilla and Simoncelli model performed slightly better than the CNN model (11 textures). Under foveal inspection, matching CNN features captured appearance substantially better than the Portilla and Simoncelli model (nine compared to four textures), and including the power spectrum improved appearance matching for two of the three remaining textures. None of the models we test here could produce indiscriminable images for one of the 12 textures under the inspection condition. While deep CNN (VGG-19) features can often be used to synthesize textures that humans cannot discriminate from natural textures, there is currently no uniformly best model for all textures and viewing conditions.

Authors

Thomas S A Wallis

Werner Reichardt Center for Integrative Neuroscience, Eberhard Karls Universität Tübingen, and the Bernstein Center for Computational Neuroscience, Tübingen, Germany.
Christina M Funke

Werner Reichardt Center for Integrative Neuroscience, Eberhard Karls Universität Tübingen, and the Bernstein Center for Computational Neuroscience, Tübingen, Germany.
Alexander S Ecker

Werner Reichardt Centre for Integrative Neuroscience and Institute of Theoretical Physics, University of Tübingen, Germany; Bernstein Center for Computational Neuroscience, Tübingen, Germany; Department of Neuroscience, Baylor College of Medicine, Houston, TX, USA.
Leon A Gatys

Werner Reichardt Centre for Integrative Neuroscience and Institute of Theoretical Physics, University of Tübingen, Germany; Bernstein Center for Computational Neuroscience, Tübingen, Germany; Graduate School for Neural Information Processing, Tübingen, Germany.
Felix A Wichmann

Neural Information Processing Group, Faculty of Science, Eberhard Karls Universität Tübingen, Bernstein Center for Computational Neuroscience, and the Max Planck Institute for Intelligent Systems, Empirical Inference Department, Tübingen, Germany.
Matthias Bethge

University of Tübingen, Tübingen, Germany.

Keywords

Eye Movements Fovea Centralis Humans Neural Networks, Computer Pattern Recognition, Visual Photic Stimulation Visual Perception

External Resources

View on PubMed Access via DOI PubMed (28983571)

A parametric texture model based on deep convolutional features closely matches texture appearance for humans.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals