Zobrazeno 1 - 10
of 225
pro vyhledávání: '"Dufour , Nicolas"'
Global visual geolocation predicts where an image was captured on Earth. Since images vary in how precisely they can be localized, this task inherently involves a significant degree of ambiguity. However, existing approaches are deterministic and ove
Externí odkaz:
http://arxiv.org/abs/2412.06781
Autor:
Picard, David, Dufour, Nicolas
Diffusion models based on Multi-Head Attention (MHA) have become ubiquitous to generate high quality images and videos. However, encoding an image or a video as a sequence of patches results in costly attention patterns, as the requirements both in t
Externí odkaz:
http://arxiv.org/abs/2411.12663
Stories and emotions in movies emerge through the effect of well-thought-out directing decisions, in particular camera placement and movement over time. Crafting compelling camera trajectories remains a complex iterative process, even for skilful art
Externí odkaz:
http://arxiv.org/abs/2407.01516
Conditional diffusion models are powerful generative models that can leverage various types of conditional information, such as class labels, segmentation masks, or text captions. However, in many real-world scenarios, conditional information may be
Externí odkaz:
http://arxiv.org/abs/2405.20324
Autor:
Astruc, Guillaume, Dufour, Nicolas, Siglidis, Ioannis, Aronssohn, Constantin, Bouia, Nacim, Fu, Stephanie, Loiseau, Romain, Nguyen, Van Nguyen, Raude, Charles, Vincent, Elliot, XU, Lintao, Zhou, Hongyu, Landrieu, Loic
Determining the location of an image anywhere on Earth is a complex visual task, which makes it particularly relevant for evaluating computer vision algorithms. Yet, the absence of standard, large-scale, open-access datasets with reliably localizable
Externí odkaz:
http://arxiv.org/abs/2404.18873
Autor:
Wang, Xi, Dufour, Nicolas, Andreou, Nefeli, Cani, Marie-Paule, Abrevaya, Victoria Fernandez, Picard, David, Kalogeiton, Vicky
Classifier-Free Guidance (CFG) enhances the quality and condition adherence of text-to-image diffusion models. It operates by combining the conditional and unconditional predictions using a fixed weight. However, recent works vary the weights through
Externí odkaz:
http://arxiv.org/abs/2404.13040
Transformers were initially introduced for natural language processing (NLP) tasks, but fast they were adopted by most deep learning fields, including computer vision. They measure the relationships between pairs of input tokens (words in the case of
Externí odkaz:
http://arxiv.org/abs/2303.12068
A large body of recent work targets semantically conditioned image generation. Most such methods focus on the narrower task of pose transfer and ignore the more challenging task of subject transfer that consists in not only transferring the pose but
Externí odkaz:
http://arxiv.org/abs/2210.04883
Autor:
Arahmane, Hanan, Dumazert, Jonathan, Barat, Eric, Dautremer, Thomas, Carrel, Frédérick, Dufour, Nicolas, Michel, Maugan
Amongst the various technical challenges in the field of radiation detection is the need to carry out accurate low-level radioactivity measurements in the presence of large fluctuations in the natural radiation background, while lowering the false al
Externí odkaz:
http://arxiv.org/abs/2206.02615
Autor:
Arahmane, Hanan, Dumazert, Jonathan, Barat, Eric, Dautremer, Thomas, Carrel, Frédérick, Dufour, Nicolas, Michel, Maugan
Publikováno v:
In Results in Physics September 2024 64