Zobrazeno 1 - 10
of 657
pro vyhledávání: '"Kim, Taehwan"'
Traditional weak-lensing mass reconstruction techniques suffer from various artifacts, including noise amplification and the mass-sheet degeneracy. In Hong et al. (2021), we demonstrated that many of these pitfalls of traditional mass reconstruction
Externí odkaz:
http://arxiv.org/abs/2410.19907
Text-guided image editing and generation methods have diverse real-world applications. However, text-guided infinite image synthesis faces several challenges. First, there is a lack of text-image paired datasets with high-resolution and contextual di
Externí odkaz:
http://arxiv.org/abs/2407.12642
The Transformer architecture has revolutionized the field of deep learning over the past several years in diverse areas, including natural language processing, code generation, image recognition, time series forecasting, etc. We propose to apply Zami
Externí odkaz:
http://arxiv.org/abs/2404.00102
Recent advances in the diffusion models have significantly improved text-to-image generation. However, generating videos from text is a more challenging task than generating images from text, due to the much larger dataset and higher computational co
Externí odkaz:
http://arxiv.org/abs/2404.00234
Autor:
Bae, Jaeyeon, Jeong, Seokhoon, Kang, Seokun, Han, Namgi, Lee, Jae-Yon, Kim, Hyounghun, Kim, Taehwan
Storytelling is multi-modal in the real world. When one tells a story, one may use all of the visualizations and sounds along with the story itself. However, prior studies on storytelling datasets and tasks have paid little attention to sound even th
Externí odkaz:
http://arxiv.org/abs/2310.19264
Slogans play a crucial role in building the brand's identity of the firm. A slogan is expected to reflect firm's vision and brand's value propositions in memorable and likeable ways. Automating the generation of slogans with such characteristics is c
Externí odkaz:
http://arxiv.org/abs/2310.04472
Representing wild sounds as images is an important but challenging task due to the lack of paired datasets between sound and images and the significant differences in the characteristics of these two modalities. Previous studies have focused on gener
Externí odkaz:
http://arxiv.org/abs/2309.02405
Autor:
Kim, Taehwan
Photovoltaic cells have become ideal alternatives to conventional energy technologies due to their ability to convert clean, unlimited, and sustainable solar energy into electricity. However, the conventional rigid, planar structure of photovoltaic c
Autor:
Kim, Taehwan
In recent years, Internet-of-Things (IoT) devices generate a large amount of personal data. However, due to the privacy concern, collecting the private data in cloud centers for training Machine Learning (ML) models becomes unrealistic. To address th
Externí odkaz:
http://hdl.handle.net/10919/109670
This technical report presents the 2nd winning model for AQTC, a task newly introduced in CVPR 2022 LOng-form VidEo Understanding (LOVEU) challenges. This challenge faces difficulties with multi-step answers, multi-modal, and diverse and changing but
Externí odkaz:
http://arxiv.org/abs/2206.14555