String techniques for detecting duplicates in document databases

Authors: Andrew D. Bagdanov, Junichi Kanai
Year of publication: 2000
Subject:
Source: ICDAR
ISSN: 1433-2825, 1433-2833
DOI: 10.1007/s100320050005
Description: A new projection-profile-based algorithm is presented that extracts the fiducial points needed to estimate a skew angle by decoding a JBIG-compressed image. This algorithm and three other projection-profile-based algorithms were tested on 460 page images and 1246 single-column test zones extracted from those page images. Linear regression analyses of the experimental results showed that the new algorithm performed competitively with the other three.
Database: OpenAIRE