Showing 1 - 10 of 8,764 for search: '"An,Sungwon"'
Recent years have seen significant progress in Text-To-Audio (TTA) synthesis, enabling users to enrich their creative workflows with synthetic audio generated from natural language prompts. Despite this progress, the effects of data, model architectu
External link:
http://arxiv.org/abs/2412.19351
As Generative AI continues to become more accessible, the case for robust detection of generated images in order to combat misinformation is stronger than ever. Invisible watermarking methods act as identifiers of generated content, embedding image-
External link:
http://arxiv.org/abs/2412.12511
Authors:
Lee, Jaeseong, Kang, Taewoong, Bühler, Marcel C., Kim, Min-Jung, Hwang, Sungwon, Hyung, Junha, Jang, Hyojin, Choo, Jaegul
Recent advancements in head avatar rendering using Gaussian primitives have achieved significantly high-fidelity results. Although precise head geometry is crucial for applications like mesh reconstruction and relighting, current methods struggle to
External link:
http://arxiv.org/abs/2410.11682
Mongolia is among the countries undergoing rapid urbanization, and its temporary nomadic dwellings, known as Ger, have expanded into urban areas. Ger settlements in cities are increasingly recognized as slums by their socio-economic deprivation. The di
External link:
http://arxiv.org/abs/2410.09522
In the domain of Aspect-Based Sentiment Analysis (ABSA), generative methods have shown promising results and achieved substantial advancements. However, despite these advancements, the tasks of extracting sentiment quadruplets, which capture the nuan
External link:
http://arxiv.org/abs/2410.02297
This paper studies a class of linear parabolic equations in non-divergence form in which the leading coefficients are measurable and they can be singular or degenerate as a weight belonging to the $A_{1+\frac{1}{n}}$ class of Muckenhoupt weights. Kry
External link:
http://arxiv.org/abs/2409.09437
We propose VoiceTailor, a parameter-efficient speaker-adaptive text-to-speech (TTS) system, by equipping a pre-trained diffusion-based TTS model with a personalized adapter. VoiceTailor identifies pivotal modules that benefit from the adapter based o
External link:
http://arxiv.org/abs/2408.14739
The spread of fake news negatively impacts individuals and is regarded as a significant social challenge that needs to be addressed. A number of algorithmic and insightful features have been identified for detecting fake news. However, with the recen
External link:
http://arxiv.org/abs/2406.11260
Authors:
Han, Sungwon, Ahn, Donghyun, Lee, Seungeon, Song, Minhyuk, Park, Sungwon, Park, Sangyoon, Kim, Jihee, Cha, Meeyoung
Moving beyond traditional surveys, combining heterogeneous data sources with AI-driven inference models brings new opportunities to measure socio-economic conditions, such as poverty and population, over expansive geographic areas. The current resear
External link:
http://arxiv.org/abs/2406.09799
The increasing frequency and intensity of natural disasters demand more sophisticated approaches for rapid and precise damage assessment. To tackle this issue, researchers have developed various methods on disaster benchmark datasets from satellite i
External link:
http://arxiv.org/abs/2406.08020