Showing 1 - 4 of 4 results for search: '"Tronchon, Léo"'
The field of vision-language models (VLMs), which take images and texts as inputs and output texts, is rapidly evolving and has yet to reach consensus on several key aspects of the development pipeline, including data, architecture, and training methods…
External link:
http://arxiv.org/abs/2408.12637
The growing interest in vision-language models (VLMs) has been driven by improvements in large language models and vision transformers. Despite the abundance of literature on this subject, we observe that critical decisions regarding the design of VLMs…
External link:
http://arxiv.org/abs/2405.02246
Using vision-language models (VLMs) in web development presents a promising strategy to increase efficiency and unblock no-code solutions: by providing a screenshot or a sketch of a UI, a VLM could generate the code to reproduce it, for instance in a…
External link:
http://arxiv.org/abs/2403.09029
Authors:
Laurençon, Hugo, Saulnier, Lucile, Tronchon, Léo, Bekman, Stas, Singh, Amanpreet, Lozhkov, Anton, Wang, Thomas, Karamcheti, Siddharth, Rush, Alexander M., Kiela, Douwe, Cord, Matthieu, Sanh, Victor
Large multimodal models trained on natural documents, which interleave images and text, outperform models trained on image-text pairs on various multimodal benchmarks. However, the datasets used to train these models have not been released, and the collection…
External link:
http://arxiv.org/abs/2306.16527