Zobrazeno 1 - 3
of 3
pro vyhledávání: '"Hemmer, Arthur"'
Optical Character Recognition (OCR) continues to face accuracy challenges that impact subsequent applications. To address these errors, we explore the utility of OCR confidence scores for enhancing post-OCR error detection. Our study involves analyzi
Externí odkaz:
http://arxiv.org/abs/2409.04117
We explore the possibility of improving probabilistic models in structured prediction. Specifically, we combine the models with constrained decoding approaches in the context of token classification for information extraction. The decoding methods se
Externí odkaz:
http://arxiv.org/abs/2312.03367
Post-OCR processing has significantly improved over the past few years. However, these have been primarily beneficial for texts consisting of natural, alphabetical words, as opposed to documents of numerical nature such as invoices, payslips, medical
Externí odkaz:
http://arxiv.org/abs/2307.01020