Automated ID document extraction and classification: An integrated approach for efficient information retrieval from identity documents.

Autor: Shetty, S. Vijaya, Madhumitha, R., Fernandes, Roshan, Reddy, Mekala Meghana, Shettar, Shreya, Murthy, Tejashree Krishna
Předmět:
Zdroj: AIP Conference Proceedings; 2024, Vol. 3122 Issue 1, p1-12, 12p
Abstrakt: In the modern age, a significant number of official documents, including IDs approved by the government and certificates, are predominantly image-based or paper-based. Extracting, entering, and searching for information from these documents manually can be a cumbersome and time-consuming process for organizations. To address this challenge, Automated ID Document Extraction and Classification (AIDEX) is introduced. This model utilizes machine learning algorithms to automatically classify identity documents issued by the government into predetermined categories such as AADHAAR card, Driving License and PAN card. By identifying unique indicators present in these ID proofs, AIDEX effectively detects and displays vital information, eliminating the need for manual intervention and achieving automation. Through appropriate access and permissions, the extracted data can be securely saved and accessed by authenticated users as required. The application of this strategy significantly reduces the time spent on physical labor, leading to resource conservation. Notably, AIDEX stands out by integrating the identification and data extraction from images of Driving Licenses, distinguishing itself as a novel approach. This paper provides a comprehensive overview of the objectives set for AIDEX, the various techniques employed during its development, and the technical specifications of the resulting model. The findings highlight the potential of AIDEX to streamline information retrieval processes, enhance operational efficiency, and promote automation in handling government-issued identity documents. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index