Search results

Report

Towards Generative Class Prompt Learning for Few-shot Visual Recognition

Authors: Chattopadhyay, Soumitri; Biswas, Sanket; Vivoli, Emanuele; Lladós, Josep

Although foundational vision-language models (VLMs) have proven to be very successful for various semantic discrimination tasks, they still struggle to perform faithfully for fine-grained categorization. Moreover, foundational models trained on one d

External link: http://arxiv.org/abs/2409.01835

Report

CoMix: A Comprehensive Benchmark for Multi-Task Comic Understanding

Authors: Vivoli, Emanuele; Bertini, Marco; Karatzas, Dimosthenis

The comic domain is rapidly advancing with the development of single-page analysis and synthesis models. However, evaluation metrics and datasets lag behind, often limited to small-scale or single-style test sets. We introduce a novel benchmark, CoMi

External link: http://arxiv.org/abs/2407.03550

Report

Comics Datasets Framework: Mix of Comics datasets for detection benchmarking

Authors: Vivoli, Emanuele; Campaioli, Irene; Nardoni, Mariateresa; Biondi, Niccolò; Bertini, Marco; Karatzas, Dimosthenis

Comics, as a medium, uniquely combine text and images in styles often distinct from real-world visuals. For the past three decades, computational research on comics has evolved from basic object detection to more sophisticated tasks. However, the fie

External link: http://arxiv.org/abs/2407.03540

Report

Multimodal Transformer for Comics Text-Cloze

Authors: Vivoli, Emanuele; Baeza, Joan Lafuente; Llobet, Ernest Valveny; Karatzas, Dimosthenis

This work explores a closure task in comics, a medium where visual and textual elements are intricately intertwined. Specifically, Text-cloze refers to the task of selecting the correct text to use in a comic panel, given its neighboring panels. Trad

External link: http://arxiv.org/abs/2403.03719

Academic article

sezione lavoro; sentenza 22 giugno 1998, n. 6199; Pres. Sommella, Est. Coletti, P.M. Schirò (concl. diff.); Vivoli (Avv. Spallina, A. Tosi) c. Marasigan (Avv. Bellotti). Conferma Trib. Firenze 19 luglio 1995

Published in: Il Foro Italiano, 1998 Sep 01. 121(9), 2375/2376-2379/2380.

External link: https://www.jstor.org/stable/23194279

Report

Error assessment of microwave holography inversion for shallow buried objects

Authors: Vivoli, Emanuele; Bossi, Luca; Bertini, Marco; Falorni, Pierluigi; Capineri, Lorenzo

Holographic imaging is a technique that uses microwave energy to create a three-dimensional image of an object or scene. This technology has potential applications in land mine detection, as the long-wavelength microwave energy can penetrate the grou

External link: http://arxiv.org/abs/2303.15335

Report

CTE: A Dataset for Contextualized Table Extraction

Authors: Gemelli, Andrea; Vivoli, Emanuele; Marinai, Simone

Relevant information in documents is often summarized in tables, helping the reader to identify useful facts. Most benchmark datasets support either document layout analysis or table understanding, but lack in providing data to apply both tasks in a

External link: http://arxiv.org/abs/2302.01451

Academic article

Udienza 23 settembre 1904; Pres. Frigotto, Est. Torella; Frosali (Avv. Lipparini) c. Vivoli e Bellini (Avv. Bellini, Vivoli, Camporesi)

Published in: Il Foro Italiano, 1905 Jan 01. 30, 177/178-179/180.

External link: https://www.jstor.org/stable/23107486

Report

MUST-VQA: MUltilingual Scene-text VQA

Authors: Vivoli, Emanuele; Biten, Ali Furkan; Mafla, Andres; Karatzas, Dimosthenis; Gomez, Lluis

In this paper, we present a framework for Multilingual Scene Text Visual Question Answering that deals with new languages in a zero-shot fashion. Specifically, we consider the task of Scene Text Visual Question Answering (STVQA) in which the question

External link: http://arxiv.org/abs/2209.06730

Report

Graph Neural Networks and Representation Embedding for Table Extraction in PDF Documents

Authors: Gemelli, Andrea; Vivoli, Emanuele; Marinai, Simone

Tables are widely used in several types of documents since they can bring important information in a structured way. In scientific papers, tables can sum up novel discoveries and summarize experimental results, making the research comparable and easi

External link: http://arxiv.org/abs/2208.11203
