Showing 1 - 10 of 813 for search: '"Baumann, Stefan"'
Author:
Stracke, Nick, Baumann, Stefan Andreas, Susskind, Joshua M., Bautista, Miguel Angel, Ommer, Björn
Text-to-image generative models have become a prominent and powerful tool that excels at generating high-resolution realistic images. However, guiding the generative process of these models to consider detailed forms of conditioning reflecting style
External link:
http://arxiv.org/abs/2405.07913
Author:
Baumann, Stefan Andreas, Krause, Felix, Neumayr, Michael, Stracke, Nick, Hu, Vincent Tao, Ommer, Björn
In recent years, advances in text-to-image (T2I) diffusion models have substantially elevated the quality of their generated images. However, achieving fine-grained control over attributes remains a challenge due to the limitations of natural languag
External link:
http://arxiv.org/abs/2403.17064
Author:
Hu, Vincent Tao, Baumann, Stefan Andreas, Gui, Ming, Grebenkova, Olga, Ma, Pingchuan, Fischer, Johannes, Ommer, Björn
The diffusion model has long been plagued by scalability and quadratic complexity issues, especially within transformer-based structures. In this study, we aim to leverage the long sequence modeling capability of a State-Space Model called Mamba to e
External link:
http://arxiv.org/abs/2403.13802
Author:
Gui, Ming, Fischer, Johannes S., Prestel, Ulrich, Ma, Pingchuan, Kotovenko, Dmytro, Grebenkova, Olga, Baumann, Stefan Andreas, Hu, Vincent Tao, Ommer, Björn
Monocular depth estimation is crucial for numerous downstream vision tasks and applications. Current discriminative approaches to this problem are limited due to blurry artifacts, while state-of-the-art generative methods suffer from slow sampling du
External link:
http://arxiv.org/abs/2403.13788
Author:
Crowson, Katherine, Baumann, Stefan Andreas, Birch, Alex, Abraham, Tanishq Mathew, Kaplan, Daniel Z., Shippole, Enrico
We present the Hourglass Diffusion Transformer (HDiT), an image generative model that exhibits linear scaling with pixel count, supporting training at high-resolution (e.g. $1024 \times 1024$) directly in pixel-space. Building on the Transformer arch
External link:
http://arxiv.org/abs/2401.11605
Author:
Fischer, Johannes S., Gui, Ming, Ma, Pingchuan, Stracke, Nick, Baumann, Stefan A., Ommer, Björn
Recently, there has been tremendous progress in visual synthesis and the underlying generative models. Here, diffusion models (DMs) stand out particularly, but lately, flow matching (FM) has also garnered considerable interest. While DMs excel in pro
External link:
http://arxiv.org/abs/2312.07360
Author:
Tang, Yuning, Baumann, Stefan, Müller, Michael, Sebold, Doris, Nijmeijer, Arian, Guillon, Olivier, Meulenberg, Wilhelm A.
Published in:
In Journal of the European Ceramic Society December 2024 44(15)
Author:
Jennings, Dylan, Zahler, M. Pascal, Wang, Di, Ma, Qianli, Deibert, Wendelin, Kindelmann, Moritz, Kübel, Christian, Baumann, Stefan, Guillon, Olivier, Mayer, Joachim, Rheinheimer, Wolfgang
Published in:
In Acta Materialia 1 July 2024 273