Deep Learning based-framework for Math Formulas Understanding

Author: Afef Kacem
Language: English
Year of publication: 2024
Subject:
Source: ELCVIA Electronic Letters on Computer Vision and Image Analysis, Vol 23, Iss 2 (2024)
Document type: article
ISSN: 1577-5097
DOI: 10.5565/rev/elcvia.1833
Description: Extracting mathematical formulas from images of scientific documents and converting them into structured data for storage in a database is essential for their further use. However, recognizing and extracting math formulas automatically, rapidly, and effectively can be challenging. To address this problem, we propose a system with a deep learning architecture that uses the formula combination features to train the YOLOv8 model. This system detects and classifies formulas both inside the text (embedded) and outside it (isolated). Once the formulas are extracted, a robust end-to-end math formula recognition system automatically identifies and classifies math symbols using Faster R-CNN object detection, then applies a Convolutional Graph Neural Network (ConvGNN) to analyze the formula layout, since a formula is better represented as a graph with complex relationships and object interdependencies. ConvGNN can predict formula linkages without resorting to laborious feature engineering. Experimental results on the IBEM and CROHME 2019 datasets show that the proposed approach extracts isolated formulas with an mAP of 99.3% and embedded formulas with an mAP of 80.3%, detects symbols with an mAP of 87.3%, and analyzes formula layout with an accuracy of 92%. We also show that our system is competitive with related work.
Database: Directory of Open Access Journals
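
Note: The description states that a formula is better represented as a graph of symbols and their relationships. A minimal sketch of such a layout graph is given below, for illustration only. It is not the article's method: the `Symbol` class, the `relation` rule, the 0.3 threshold, and the bounding boxes are all hypothetical, and the hand-written geometric rule merely stands in for the edge predictions that the article obtains from a ConvGNN.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Symbol:
    label: str   # symbol class, e.g. as output by a symbol detector
    box: tuple   # (x_min, y_min, x_max, y_max) in pixels, y grows downward


def relation(anchor: Symbol, other: Symbol) -> str:
    """Hypothetical geometric rule standing in for a learned edge classifier."""
    anchor_cy = (anchor.box[1] + anchor.box[3]) / 2
    other_cy = (other.box[1] + other.box[3]) / 2
    anchor_h = anchor.box[3] - anchor.box[1]
    # Vertical offset of `other` relative to the anchor's height;
    # negative values mean `other` sits higher on the page.
    shift = (other_cy - anchor_cy) / anchor_h
    if shift < -0.3:   # assumed threshold
        return "superscript"
    if shift > 0.3:    # assumed threshold
        return "subscript"
    return "right"


def build_layout_graph(symbols):
    """Return labeled edges (parent, relation, child) in left-to-right order."""
    ordered = sorted(symbols, key=lambda s: s.box[0])
    edges = []
    parent = ordered[0]
    for sym in ordered[1:]:
        rel = relation(parent, sym)
        edges.append((parent.label, rel, sym.label))
        # Only symbols on the baseline become the anchor for later symbols.
        if rel == "right":
            parent = sym
    return edges


# Made-up boxes for the formula x^2 + 1.
example = [
    Symbol("x", (0, 10, 10, 20)),
    Symbol("2", (11, 2, 16, 9)),
    Symbol("+", (20, 10, 30, 20)),
    Symbol("1", (34, 10, 40, 20)),
]
edges = build_layout_graph(example)
```

In this sketch the nodes are detected symbols and each edge carries a spatial relation label. A learned model would replace `relation` and could score every candidate pair of symbols, rather than only each symbol against the current baseline symbol.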