Extracting Images from Chinese PDF Documents

Autor: Quan Yin Zhu, Yong Hua Yin, Ying Jin, Yun Yang Yan
Rok vydání: 2014
Předmět:
Zdroj: Applied Mechanics and Materials. :887-890
ISSN: 1662-7482
DOI: 10.4028/www.scientific.net/amm.530-531.887
Popis: In order to efficient tap the potential value in Chinese PDF documents and use Chinese PDF documents, an unique idea that extracting images from Chinese PDF documents is proposed in this paper. The idea combines PDFs document structure and page tree to extract images. Based on this idea, the experiments in this paper are done with one hundred Chinese PDF documents. And the extraction rate of the experiments obtains 83.56 percent. According to the analysis of experimental results, it is proved that the idea proposed in this paper is applicable to most of Chinese PDF documents and it is able to meet most of the needs of practical application.
Databáze: OpenAIRE