Computational versus psychophysical bottom-up image saliency: A comparative evaluation study
Autor: | Alexander Toet |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2011 |
Předmět: |
Conspicuity
Visual perception Image compression Computer science Image quality Measurement procedures Content-aware Feature dimensions Adaptive content Pattern Recognition Automated Image analysis Engineering Human visual Target support Digital image storage Computer vision Image communication systems Region of interest Visualization Relative contribution Image segmentation visual search Applied Mathematics Video summarization Cognitive neuroscience of visual object recognition Psychophysical Pattern Recognition Visual Computational Theory and Mathematics Feature (computer vision) Computer Vision and Pattern Recognition Algorithms Human PCS - Perceptual and Cognitive Systems Models Neurological ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION Image scaling Visual targets Image processing Fixation Ocular Artificial Intelligence Salience (neuroscience) Image quality assessment Psychophysics Humans Computer Simulation Human visual inspection Rank order Lateral interactions Visual saliency Visual search Saliency business.industry Visual effects Measurements Static and dynamic Comparative evaluations Multiscale contrast Object recognition Progressive image transmission BSS - Behavioural and Societal Sciences Artificial intelligence business Additional knowledge Software Forecasting |
Zdroj: | IEEE Transactions on Pattern Analysis and Machine Intelligence, 11, 33, 2131-2146 |
Popis: | The predictions of 13 computational bottom-up saliency models and a newly introduced Multiscale Contrast Conspicuity (MCC) metric are compared with human visual conspicuity measurements. The agreement between human visual conspicuity estimates and model saliency predictions is quantified through their rank order correlation. The maximum of the computational saliency value over the target support area correlates most strongly with visual conspicuity for 12 of the 13 models. A simple multiscale contrast model and the MCC metric both yield the largest correlation with human visual target conspicuity ({}0.84). Local image saliency largely determines human visual inspection and interpretation of static and dynamic scenes. Computational saliency models therefore have a wide range of important applications, like adaptive content delivery, region-of-interest- based image compression, video summarization, progressive image transmission, image segmentation, image quality assessment, object recognition, and content-aware image scaling. However, current bottom-up saliency models do not incorporate important visual effects like crowding and lateral interaction. Additional knowledge about the exact nature of the interactions between the mechanisms mediating human visual saliency is required to develop these models further. The MCC metric and its associated psychophysical saliency measurement procedure are useful tools to systematically investigate the relative contribution of different feature dimensions to overall visual target saliency. © 2011 IEEE. |
Databáze: | OpenAIRE |
Externí odkaz: |