Higher-dimensional computational models of perceptual grouping and silhouette analysis and representation
Autor: | Twarog, Nathaniel R |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2012 |
Předmět: | |
Druh dokumentu: | Diplomová práce |
Popis: | Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Brain and Cognitive Sciences, 2012. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Cataloged from student-submitted PDF version of thesis. Includes bibliographical references (p. 100-105). In the following thesis, I describe the investigation of two problems related to the organization and structural analysis of visual information: perceptual grouping and silhouette analysis and representation. For the problem of perceptual grouping, an intuitive model framework was developed which operates on raw images and locates relevant groupings utilizing a higher dimensional space that contains not only the two spatial dimensions of the image but one or more dimension corresponding to relevant image features such as luminance, hue, or orientation. A psychophysical experiment was run to measure how human visual observers perform perceptual grouping across a variety of spatial scales and luminance differences. These results were compared with the predictions of our grouping model, and the model was able to capture much of the grouping behavior of the human subjects. A second experiment was run in which the perception of groups was disrupted by the presence of noise or shifts in brightness. Though the experiments showed only small effects resulting from these disruptions on the behavior of human subjects, the model was still able to successfully capture much of the image-to-image variability. For the question of silhouette representation and analysis, I suggest that human silhouette representation may be inextricably tied to 3D interpretation of 2D shapes. To support this, I propose a novel algorithm for 2D silhouette inflation called Puffball, which closely matches human intuition for a variety of simple shapes and can be run on almost any input. Using this algorithm, a new model of human part segmentation was derived using 2D-to-3D inflation; this model was evaluated against human-generated part segmentations and two competing part segmentation algorithms. Across a variety of different analyses, Puffball part segmentation performed as well or better than its competitors, suggesting a potential role for 2D-to-3D inflation in the segmentation of silhouette parts. Finally, I suggest several avenues of research which may further illuminate the role of inflation in the human representation and analysis of 2D and 3D shape. by Nathaniel R. Twarog. Ph.D. |
Databáze: | Networked Digital Library of Theses & Dissertations |
Externí odkaz: |