Showing 1 - 10 of 20,496 for search: '"Piñar A"'
Text-to-video models have demonstrated impressive capabilities in producing diverse and captivating video content, showcasing a notable advancement in generative AI. However, these models generally lack fine-grained control over motion patterns, limi
External link:
http://arxiv.org/abs/2412.05275
This study evaluates the effectiveness of Vision Language Models (VLMs) in representing and utilizing multimodal content for fact-checking. To be more specific, we investigate whether incorporating multimodal content improves performance compared to
External link:
http://arxiv.org/abs/2412.05155
Author:
Dalva, Yusuf, Li, Yijun, Liu, Qing, Zhao, Nanxuan, Zhang, Jianming, Lin, Zhe, Yanardag, Pinar
Large-scale diffusion models have achieved remarkable success in generating high-quality images from textual descriptions, gaining popularity across various applications. However, the generation of layered content, such as transparent images with for
External link:
http://arxiv.org/abs/2412.04460
Training deep learning models is a repetitive and resource-intensive process. Data scientists often train several models before landing on a set of parameters (e.g., hyper-parameter tuning), model architecture (e.g., neural architecture search), among
External link:
http://arxiv.org/abs/2409.18749
This paper considers the minimization of a continuously differentiable function over a cardinality constraint. We focus on smooth and relatively smooth functions. These smoothness criteria result in new descent lemmas. Based on the new descent lemmas
External link:
http://arxiv.org/abs/2409.12343
Author:
Özdemir, Övgü, İşyapar, M. Tuğberk, Karagöz, Pınar, Schmidt, Klaus Werner, Demir, Demet, Karagöz, N. Alpay
Modern vehicles are equipped with Electronic Control Units (ECU) that are used for controlling important vehicle functions including safety-critical operations. ECUs exchange information via in-vehicle communication buses, of which the Controller Are
External link:
http://arxiv.org/abs/2409.07505
Author:
Ghasemlou, Shervin, Katiyar, Ashish, Saraf, Aparajita, Moon, Seungwhan, Pujari, Mangesh, Donmez, Pinar, Damavandi, Babak, Kumar, Anuj
In this paper, we investigate the problem of "generation supervision" in large language models, and present a novel bicameral architecture to separate supervision signals from their core capability, helpfulness. Doppelgänger, a new module parallel
External link:
http://arxiv.org/abs/2409.06107
In therapeutic focused ultrasound (FUS), such as thermal ablation and hyperthermia, effective acousto-thermal manipulation requires precise targeting of complex geometries, sound wave propagation through irregular structures and selective focusing at
External link:
http://arxiv.org/abs/2409.01323
Author:
Ravva, Pavan Uttej, Kiafar, Behdokht, Kullu, Pinar, Li, Jicheng, Bhat, Anjana, Barmaki, Roghayeh Leila
Autism spectrum disorder (ASD) is characterized by significant challenges in social interaction and comprehending communication signals. Recently, therapeutic interventions for ASD have increasingly utilized deep learning-powered computer vision tech
External link:
http://arxiv.org/abs/2408.15077
Author:
Schwöbel, Pola, Franceschi, Luca, Zafar, Muhammad Bilal, Vasist, Keerthan, Malhotra, Aman, Shenhar, Tomer, Tailor, Pinal, Yilmaz, Pinar, Diamond, Michael, Donini, Michele
fmeval is an open source library to evaluate large language models (LLMs) in a range of tasks. It helps practitioners evaluate their model for task performance and along multiple responsible AI dimensions. This paper presents the library and exposes
External link:
http://arxiv.org/abs/2407.12872