Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Demodulation, predictive coding, and spatial vision

Not Accessible

Your library or personal account may give you access

Abstract

We argue that some aspects of human spatial vision, particularly for textured patterns and scenes, can be described in terms of demodulation and predictive coding. Such nonlinear processes encode a pattern into local phasors that represent it completely as a modulation, in phase and amplitude, of a prediction associated with the image structure in some region by its predominant undulation(s). The demodulation representation of a pattern is an anisotropic, second-order form of predictive coding, and it offers a particularly efficient way to analyze and encode textures, as it identifies and exploits their underlying redundancies. In addition, self-consistent domains of redundancy in image structure provide a basis for image segmentation. We first provide an algorithm for computing the three elements of a complete demodulation transform of any image, and we illustrate such decompositions for both natural and synthetic images. We then present psychophysical evidence from spatial masking experiments, as well as illustrations of perceptual organization, that suggest a possible role for such underlying representations in human vision. In psychophysical experiments employing masks with more than two oriented Fourier components, we find that peaks of threshold elevation occur at locations in the Fourier plane remote from the orientations and frequencies of the actual mask components. Rather, as would occur from demodulation, these peaks in the frequency plane are related to the vector difference frequencies between the actual masking components and their spectral centers of mass. We offer a neural interpretation of demodulation coding, and finally we demonstrate a practical application of this process in a system for automatic visual recognition of personal identity by demodulation of a facial feature.

© 1995 Optical Society of America

Full Article  |  PDF Article

Corrections

John G. Daugman and Cathryn J. Downing, "Demodulation, predictive coding, and spatial vision: erratum," J. Opt. Soc. Am. A 12, 2077-2077 (1995)
https://opg.optica.org/josaa/abstract.cfm?uri=josaa-12-9-2077

More Like This
Demodulation, predictive coding, and spatial vision: erratum

John G. Daugman and Cathryn J. Downing
J. Opt. Soc. Am. A 12(9) 2077-2077 (1995)

Characterization of the spatial-frequency spectrum in the perception of shape from texture

Ko Sakai and Leif H. Finkel
J. Opt. Soc. Am. A 12(6) 1208-1224 (1995)

Pattern and motion vision without Laplacian zero crossings

John G. Daugman
J. Opt. Soc. Am. A 5(7) 1142-1148 (1988)

Cited By

You do not have subscription access to this journal. Cited by links are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Figures (20)

You do not have subscription access to this journal. Figure files are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Equations (31)

You do not have subscription access to this journal. Equations are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All Rights Reserved