Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.14279/1951
Title: | An embedded saliency map estimator scheme: Application to video encoding | Authors: | Tsapatsoulis, Nicolas Rapantzikos, Konstantinos Pattichis, Constantinos S. |
Major Field of Science: | Natural Sciences | Keywords: | Embedded implementation;ROI-based video encoding;Visual attention model | Issue Date: | 2007 | Source: | International Journal of Neural Systems, 2007, vol. 17, no. 4, pp. 289-304. | Volume: | 17 | Issue: | 4 | Start page: | 289 | End page: | 304 | Journal: | International Journal of Neural Systems | Abstract: | In this paper we propose a novel saliency-based computational model for visual attention. This model processes both top-down (goal directed) and bottom-up information. Processing in the top-down channel creates the so called skin conspicuity map and emulates the visual search for human faces performed by humans. This is clearly a goal directed task but is generic enough to be context independent. Processing in the bottom-up information channel follows the principles set by Itti et al. but it deviates from them by computing the orientation, intensity and color conspicuity maps within a unified multi-resolution framework based on wavelet subband analysis. In particular, we apply a wavelet based approach for efficient computation of the topographic feature maps. Given that wavelets and multiresolution theory are naturally connected the usage of wavelet decomposition for mimicking the center surround process in humans is an obvious choice. However, our implementation goes further. We utilize the wavelet decomposition for inline computation of the features (such as orientation angles) that are used to create the topographic feature maps. The bottom-up topographic feature maps and the top-down skin conspicuity map are then combined through a sigmoid function to produce the final saliency map. A prototype of the proposed model was realized through the TMDSDMK642-0E DSP platform as an embedded system allowing real-time operation. For evaluation purposes, in terms of perceived visual quality and video compression improvement, a ROI-based video compression setup was followed. Extended experiments concerning both MPEG-1 as well as low bit-rate MPEG-4 video encoding were conducted showing significant improvement in video compression efficiency without perceived deterioration in visual quality. | URI: | https://hdl.handle.net/20.500.14279/1951 | DOI: | 10.1142/S0129065707001147 | Rights: | © World Scientific | Type: | Article | Affiliation : | University of Cyprus National Technical University Of Athens |
Publication Type: | Peer Reviewed |
Appears in Collections: | Άρθρα/Articles |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Tsapasoulis_An Embedded Saliency Map Scheme.pdf | 2.01 MB | Adobe PDF | View/Open |
CORE Recommender
SCOPUSTM
Citations
50
23
checked on Nov 9, 2023
WEB OF SCIENCETM
Citations
19
Last Week
0
0
Last month
0
0
checked on Oct 9, 2023
Page view(s) 5
675
Last Week
0
0
Last month
2
2
checked on Dec 3, 2024
Download(s) 50
487
checked on Dec 3, 2024
Google ScholarTM
Check
Altmetric
Items in KTISIS are protected by copyright, with all rights reserved, unless otherwise indicated.