Výsledky vyhledávání - "Hradiš, Michal"

Report

A Comparative Study of Text Retrieval Models on DaReCzech

Autor: Stetina, Jakub, Fajcik, Martin, Stefanik, Michal, Hradis, Michal

This article presents a comprehensive evaluation of 7 off-the-shelf document retrieval models: Splade, Plaid, Plaid-X, SimCSE, Contriever, OpenAI ADA and Gemma2 chosen to determine their performance on the Czech retrieval dataset DaReCzech. The prima

Externí odkaz: http://arxiv.org/abs/2411.12921

Zobrazit plný text záznamu

Report

Self-supervised Pre-training of Text Recognizers

Autor: Kišš, Martin, Hradiš, Michal

In this paper, we investigate self-supervised pre-training methods for document text recognition. Nowadays, large unlabeled datasets can be collected for many research tasks, including text recognition, but it is costly to annotate them. Therefore, m

Externí odkaz: http://arxiv.org/abs/2405.00420

Zobrazit plný text záznamu

Report

Towards Writing Style Adaptation in Handwriting Recognition

Autor: Kohút, Jan, Hradiš, Michal, Kišš, Martin

One of the challenges of handwriting recognition is to transcribe a large number of vastly different writing styles. State-of-the-art approaches do not explicitly use information about the writer's style, which may be limiting overall accuracy due to

Externí odkaz: http://arxiv.org/abs/2302.06318

Zobrazit plný text záznamu

Report

Finetuning Is a Surprisingly Effective Domain Adaptation Baseline in Handwriting Recognition

Autor: Kohút, Jan, Hradiš, Michal

In many machine learning tasks, a large general dataset and a small specialized dataset are available. In such situations, various domain adaptation methods can be used to adapt a general model to the target dataset. We show that in the case of neura

Externí odkaz: http://arxiv.org/abs/2302.06308

Zobrazit plný text záznamu

Report

SoftCTC -- Semi-Supervised Learning for Text Recognition using Soft Pseudo-Labels

Autor: Kišš, Martin, Hradiš, Michal, Beneš, Karel, Buchal, Petr, Kula, Michal

This paper explores semi-supervised training for sequence tasks, such as Optical Character Recognition or Automatic Speech Recognition. We propose a novel loss function $\unicode{x2013}$ SoftCTC $\unicode{x2013}$ which is an extension of CTC allowing

Externí odkaz: http://arxiv.org/abs/2212.02135

Zobrazit plný text záznamu

Report

Importance of Textlines in Historical Document Classification

Autor: Kišš, Martin, Kohút, Jan, Beneš, Karel, Hradiš, Michal

This paper describes a system prepared at Brno University of Technology for ICDAR 2021 Competition on Historical Document Classification, experiments leading to its design, and the main findings. The solved tasks include script and font classificatio

Externí odkaz: http://arxiv.org/abs/2201.09575

Zobrazit plný text záznamu

Report

AT-ST: Self-Training Adaptation Strategy for OCR in Domains with Limited Transcriptions

Autor: Kišš, Martin, Beneš, Karel, Hradiš, Michal

This paper addresses text recognition for domains with limited manual annotations by a simple self-training strategy. Our approach should reduce human annotation effort when target domain data is plentiful, such as when transcribing a collection of s

Externí odkaz: http://arxiv.org/abs/2104.13037

Zobrazit plný text záznamu

Report

TS-Net: OCR Trained to Switch Between Text Transcription Styles

Autor: Kohút, Jan, Hradiš, Michal

Publikováno v: ICDAR 2021: Proceedings, Part IV 16 (pp. 478-493)

Users of OCR systems, from different institutions and scientific disciplines, prefer and produce different transcription styles. This presents a problem for training of consistent text recognition neural networks on real-world data. We propose to ext

Externí odkaz: http://arxiv.org/abs/2103.05489

Zobrazit plný text záznamu

Report

Page Layout Analysis System for Unconstrained Historic Documents

Autor: Kodym, Oldřich, Hradiš, Michal

Extraction of text regions and individual text lines from historic documents is necessary for automatic transcription. We propose extending a CNN-based text baseline detection system by adding line height and text block boundary predictions to the mo

Externí odkaz: http://arxiv.org/abs/2102.11838

Zobrazit plný text záznamu

Report

Brno Mobile OCR Dataset

Autor: Kišš, Martin, Hradiš, Michal, Kodym, Oldřich

We introduce the Brno Mobile OCR Dataset (B-MOD) for document Optical Character Recognition from low-quality images captured by handheld mobile devices. While OCR of high-quality scanned documents is a mature field where many commercial tools are ava

Externí odkaz: http://arxiv.org/abs/1907.01307

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání