Extracting orientation and scale from smoothly varying textures with application to segmentation

Chang, Jason, Ph. D. Massachusetts Institute of Technology

The system will be going down for regular maintenance. Please save your work and logout.

Author(s)

Chang, Jason, Ph. D. Massachusetts Institute of Technology

DownloadFull printable version (26.39Mb)

Other Contributors

Massachusetts Institute of Technology. Dept. of Electrical Engineering and Computer Science.

Advisor

John W. Fisher, III.

Terms of use

M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582

Metadata

Show full item record

Abstract

The work in this thesis focuses on two main computer vision research topic: image segmentation and texture modeling. Information theoretic measures have been applied to image segmentation algorithms for the past decade. In previous work, common measures such as mutual information or J divergence have been used. Algorithms typically differ by the measure they use and the features they use to segment an image. When both the information measure and the features change, it is difficult to compare which algorithm actually performs better and for what reason. Though we do not provide a solution to this problem, we do compare and contrast three distances under two different measures. This thesis considers two forms of information theoretic based image segmentation algorithms that have previously been considered. We denote them here as the label method and the conditional method. Gradient ascent velocities are derived for a general Ali-Silvey distance for both methods, and a unique bijective mapping is shown to exist between the two methods when the Ali-Silvey distance takes on a specific form. While the conditional method is more commonly considered, it is implicitly limited by a two-region segmentation by construction. Using the derived mapping, one can easily extend a binary segmentation algorithm based on the conditional method to a multiregion segmentation algorithm based on the label method. The importance of initializations and local extrema is also considered, and a method of multiple random initializations is shown to produce better results.

(cont.) Additionally, segmentation results and methods for comparing the utility of the different measures are presented. This thesis also considers a novel texture model for representing textured regions with smooth variations in orientation and scale. By utilizing the steerable pyramid of Simoncelli and Freeman, the textured regions of natural images are decomposed into explicit local attributes of contrast, bias, scale, and orientation. Once found, smoothness in these attributes are imposed via estimation of Markov random fields. This combination allows for demonstrable improvements in common scene analysis applications including segmentation, reflectance and shading estimation, and estimation of the radiometric response function from a single grayscale image.

Description

Thesis (S.M.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2009.

Cataloged from PDF version of thesis.

Includes bibliographical references (p. 109-112).

Date issued

2009

URI

http://hdl.handle.net/1721.1/55147

Department

Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science

Publisher

Massachusetts Institute of Technology

Keywords

Electrical Engineering and Computer Science.

Collections

Graduate Theses