The utility of image descriptions in the initial stages of vision: A case study of printed text

Autor:	Roger Watt, Steven C. Dakin
Rok vydání:	2010
Předmět:	Vocabulary Visual perception media_common.quotation_subject ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION Image processing computer.software_genre Models Biological Pattern Recognition Automated Domain (software engineering) Visual processing Digital image Image Processing Computer-Assisted Humans Computer Simulation Computer vision Vision Ocular General Psychology media_common business.industry Signal Processing Computer-Assisted Image Enhancement Pattern recognition (psychology) Word recognition Visual Perception Artificial intelligence Psychology business computer Photic Stimulation Natural language processing
Zdroj:	British Journal of Psychology. 101:1-26
ISSN:	0007-1269
DOI:	10.1348/000712608x379070
Popis:	Vision research has made very substantial progress towards understanding how we see. It is one area of psychology where the three-way thrust of behavioural measurements (psychophysics), brain imaging, and computational studies have been combined quite routinely for some years. The purpose of this paper is to demonstrate a relatively unusual form of computational modelling that we characterise as involving image descriptions. Image descriptions are statements about structures in images and relationships between structures. Most modelling in vision is either conceived in fairly abstract terms, or is done at the level of images. Neither is entirely satisfactory, and image descriptions are a simple formulation of age-old ideas about a Vocabulary of image features that are detected and parameterized from actual digital images. For our example, we use the domain of the visual perception of printed text. This is an area that has been characterized by thorough, robust psychophysical experiments. The fundamental requirements of visual processing in this domain are: grouping of some parts if the image into words; at the same time segmenting words from each other. We show how these are readily understood in terms of our model of image descriptions, and show quantitatively that typographical practice, refined over centuries, is about optimum for the visual system at least as represented by our model. In addition, we show that the same notion of image descriptions could, in principle, support word recognition in certain circumstances.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::94a3a0da659c565a831d11a56b9657cf https://doi.org/10.1348/000712608x379070 Zobrazit plný text záznamu Plný text ve formátu PDF Plný text ve formátu HTML
Nepřihlášeným uživatelům se plný text nezobrazuje	K zobrazení výsledku je třeba se přihlásit.