Introducing a new image dissimilarity measure with an application to character image clustering in degraded historical documents
Author: | Sebastian Colutto |
---|---|
Year of publication: | 2010 |
Subject: | Computer science; Business; ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION; Measure (physics); Software engineering; Pattern recognition; Engineering and technology; Curvature; Image (mathematics); ComputingMethodologies_PATTERNRECOGNITION; Character (mathematics); Font; Electrical engineering, electronic engineering, information engineering; Artificial intelligence & image processing; Computer vision; Artificial intelligence; Cluster analysis; Distance transform; Historical document |
Source: | Document Analysis Systems |
Description: | In this paper we present a novel method for calculating the distance between two input images that represent characters of a historical document. The ultimate goal is to create a high-quality clustering of the images, i.e. to extract an inventory of the document. Our image dissimilarity measure is based upon the Local Distance Map and robust curvature estimation using Integral Invariants. We demonstrate the superior behaviour of the image dissimilarity measure with experiments on three datasets of different font and quality, comparing it to standard shape descriptors as well as to clustering results produced by a state-of-the-art OCR engine. |
Database: | OpenAIRE |