Resolution enhancement of images for further pattern recognition applications

Autor: Maha Awad, Heba A. El-Khobby, Saleh A. Alshebeili, Mustafa M. Abd Elnaby, Salaheldin M. Diab, Fathi E. Abd El-Samie, El-Sayed M. El-Rabaie, B. M. Sallam, Said E. El Khamy, Fatma G. Hashad, Osama S. Faragallah, Alaa M. Abbas
Rok vydání: 2016
Předmět:
Discrete wavelet transform
Computer science
Feature extraction
ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION
Bilinear interpolation
02 engineering and technology
030507 speech-language pathology & audiology
03 medical and health sciences
0202 electrical engineering
electronic engineering
information engineering

Image scaling
Feature (machine learning)
Discrete cosine transform
Time domain
Electrical and Electronic Engineering
Decimation
business.industry
020206 networking & telecommunications
Pattern recognition
Atomic and Molecular Physics
and Optics

Electronic
Optical and Magnetic Materials

Discrete sine transform
Feature (computer vision)
Computer Science::Computer Vision and Pattern Recognition
Pattern recognition (psychology)
Bicubic interpolation
Artificial intelligence
Mel-frequency cepstrum
0305 other medical science
business
Interpolation
Zdroj: Optik. 127:484-492
ISSN: 0030-4026
DOI: 10.1016/j.ijleo.2015.08.122
Popis: In storing large databases of images such as fingerprint and medical databases, the required memory size becomes a great challenge. This work demonstrates a framework for reducing the size of large image databases used in pattern recognition applications with decimation, and reconstructing the images with their original sizes using interpolation for feature extraction. For pattern recognition applications, a new trend based on Mel-Frequency Cepstral Coefficients (MFCCs) is presented in the paper. To reconstruct the images to their original sizes, interpolation methods like bilinear, bicubic, warped-distance, and neural methods are investigated and compared. The sensitivity of the extracted features from the images to the interpolation method used is studied. For the feature extraction process, the interpolated images are converted into one dimensional signals with lexicographic ordering and employed in time domain or transformed to Discrete Wavelet Transform (DWT), Discrete Sine Transform (DST), or Discrete Cosine Transform (DCT) domain. The MFCCs and polynomial shape coefficients are then extracted to generate the database of features, which can be used for pattern identification using neural networks. The pattern recognition is conducted by getting features from the pattern image under test. Experimental results show that feature extraction from an interpolated image to retain the original image dimensions can be used robustly for pattern recognition. In addition, the results reveal that the best domain for feature extraction is the DCT.
Databáze: OpenAIRE