Search results

Report

OnlyFlow: Optical Flow based Motion Conditioning for Video Diffusion Models

Author: Koroglu, Mathis, Caselles-Dupré, Hugo, Sanmiguel, Guillaume Jeanneret, Cord, Matthieu

We consider the problem of text-to-video generation tasks with precise control for various applications such as camera movement control and video-to-video editing. Most methods tackling this problem rely on providing user-defined controls, such as bin

External link: http://arxiv.org/abs/2411.10501

Report

Test-Time Adaptation for Keypoint-Based Spacecraft Pose Estimation Based on Predicted-View Synthesis

Author: Pérez-Villar, Juan Ignacio Bravo, García-Martín, Álvaro, Bescós, Jesús, SanMiguel, Juan C.

Published in: IEEE Transactions on Aerospace and Electronic Systems (2024)

Due to the difficulty of replicating the real conditions during training, supervised algorithms for spacecraft pose estimation experience a drop in performance when trained on synthetic data and applied to real operational data. To address this issue

External link: http://arxiv.org/abs/2410.04298

Report

Layer-wise Model Merging for Unsupervised Domain Adaptation in Segmentation Tasks

Author: Alcover-Couso, Roberto, SanMiguel, Juan C., Escudero-Viñolo, Marcos, Martínez, Jose M.

Merging parameters of multiple models has resurfaced as an effective strategy to enhance task performance and robustness, but prior work is limited by the high costs of ensemble creation and inference. In this paper, we leverage the abundance of free

External link: http://arxiv.org/abs/2409.15813

Report

Towards aerodynamic surrogate modeling based on $\beta$-variational autoencoders

Author: Francés-Belda, Víctor, Solera-Rico, Alberto, Nieto-Centenero, Javier, Andrés, Esther, Vila, Carlos Sanmiguel, Castellanos, Rodrigo

Surrogate models that combine dimensionality reduction and regression techniques are essential to reduce the need for costly high-fidelity computational fluid dynamics data. New approaches using $\beta$-Variational Autoencoder ($\beta$-VAE) architect

External link: http://arxiv.org/abs/2408.04969

Report

Gradient-based Class Weighting for Unsupervised Domain Adaptation in Dense Prediction Visual Tasks

Author: Alcover-Couso, Roberto, Escudero-Viñolo, Marcos, SanMiguel, Juan C., Bescós, Jesus

In unsupervised domain adaptation (UDA), where models are trained on source data (e.g., synthetic) and adapted to target data (e.g., real-world) without target annotations, addressing the challenge of significant class imbalance remains an open issue

External link: http://arxiv.org/abs/2407.01327

Report

Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models

Author: Marcos-Manchón, Pablo, Alcover-Couso, Roberto, SanMiguel, Juan C., Martínez, Jose M.

Published in: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)

Diffusion models represent a new paradigm in text-to-image generation. Beyond generating high-quality images from text prompts, models such as Stable Diffusion have been successfully extended to the joint generation of semantic segmentation pseudo-ma

External link: http://arxiv.org/abs/2403.14291

Report

The Robust Semantic Segmentation UNCV2023 Challenge Results

This paper outlines the winning solutions employed in addressing the MUAD uncertainty quantification challenge held at ICCV 2023. The challenge was centered around semantic segmentation in urban environments, with a particular focus on natural advers

External link: http://arxiv.org/abs/2309.15478

Report

$\beta$-Variational autoencoders and transformers for reduced-order modelling of fluid flows

Author: Solera-Rico, Alberto, Vila, Carlos Sanmiguel, Gómez, M. A., Wang, Yuning, Almashjary, Abdulrahman, Dawson, Scott T. M., Vinuesa, Ricardo

Variational autoencoder (VAE) architectures have the potential to develop reduced-order models (ROMs) for chaotic fluid flows. We propose a method for learning compact and near-orthogonal ROMs using a combination of a $\beta$-VAE and a transformer, t

External link: http://arxiv.org/abs/2304.03571

Report

Soft labelling for semantic segmentation: Bringing coherence to label down-sampling

Author: Alcover-Couso, Roberto, Escudero-Vinolo, Marcos, SanMiguel, Juan C., Martinez, Jose M.

In semantic segmentation, training data down-sampling is commonly performed due to limited resources, the need to adapt image size to the model input, or improve data augmentation. This down-sampling typically employs different strategies for the ima

External link: http://arxiv.org/abs/2302.13961

Report

Detection-aware multi-object tracking evaluation

Author: SanMiguel, Juan C., Muñoz, Jorge, Poiesi, Fabio

How would you fairly evaluate two multi-object tracking algorithms (i.e. trackers), each one employing a different object detector? Detectors keep improving, thus trackers can make less effort to estimate object states over time. Is it then fair to c

External link: http://arxiv.org/abs/2212.08536
