Challenges in End-to-End Neural Scientific Table Recognition

Autor: Yuntian Deng, Gideon Mann, David Rosenberg
Rok vydání: 2019
Předmět:
Zdroj: ICDAR
Popis: In recent years, end-to-end trained neural models have been applied successfully to various optical character recognition (OCR) tasks. However, the same level of success has not yet been achieved in end-to-end neural scientific table recognition, which involves multi-row/multi-column structures and math formulas in cells. In this paper, we take a step forward to full end-to-end scientific table recognition by constructing a large dataset consisting of 450K table images paired with corresponding LaTeX sources. We apply a popular attentional encoder-decoder model to this dataset and show the potential of end-to-end trained neural systems, as well as associated challenges.
Databáze: OpenAIRE