Few-Shot Pixel-Precise Document Layout Segmentation via Dynamic Instance Generation and Local Thresholding.

Autor: De Nardin A; Department of Mathematics, Computer Science and Physics, Università degli Studi di Udine, Via delle Scienze 206, 33100 Udine, Italy., Zottin S; Department of Mathematics, Computer Science and Physics, Università degli Studi di Udine, Via delle Scienze 206, 33100 Udine, Italy., Piciarelli C; Department of Mathematics, Computer Science and Physics, Università degli Studi di Udine, Via delle Scienze 206, 33100 Udine, Italy., Colombi E; Department of Humanities and Cultural Heritage, Università degli Studi di Udine, Vicolo Florio 2/b, 33100 Udine, Italy., Foresti GL; Department of Mathematics, Computer Science and Physics, Università degli Studi di Udine, Via delle Scienze 206, 33100 Udine, Italy.
Jazyk: angličtina
Zdroj: International journal of neural systems [Int J Neural Syst] 2023 Oct; Vol. 33 (10), pp. 2350052. Date of Electronic Publication: 2023 Aug 10.
DOI: 10.1142/S0129065723500521
Abstrakt: Over the years, the humanities community has increasingly requested the creation of artificial intelligence frameworks to help the study of cultural heritage. Document Layout segmentation, which aims at identifying the different structural components of a document page, is a particularly interesting task connected to this trend, specifically when it comes to handwritten texts. While there are many effective approaches to this problem, they all rely on large amounts of data for the training of the underlying models, which is rarely possible in a real-world scenario, as the process of producing the ground truth segmentation task with the required precision to the pixel level is a very time-consuming task and often requires a certain degree of domain knowledge regarding the documents at hand. For this reason, in this paper, we propose an effective few-shot learning framework for document layout segmentation relying on two novel components, namely a dynamic instance generation and a segmentation refinement module. This approach is able of achieving performances comparable to the current state of the art on the popular Diva-HisDB dataset, while relying on just a fraction of the available data.
Databáze: MEDLINE