Zobrazeno 1 - 2
of 2
pro vyhledávání: '"Polewczyk, Marek"'
Autor:
Spinaci, Marco, Polewczyk, Marek, Hoffart, Johannes, Kohler, Markus C., Thelin, Sam, Klein, Tassilo
Self-supervised learning on tabular data seeks to apply advances from natural language and image domains to the diverse domain of tables. However, current techniques often struggle with integrating multi-domain data and require data cleaning or speci
Externí odkaz:
http://arxiv.org/abs/2410.13516
Autor:
Polewczyk, Marek, Spinaci, Marco
We present a novel deep-learning-based method to cluster words in documents which we apply to detect and recognize tables given the OCR output. We interpret table structure bottom-up as a graph of relations between pairs of words (belonging to the sa
Externí odkaz:
http://arxiv.org/abs/2402.07502