Showing 1 - 10 of 47 334 for search: '"Afshin, A"'
Author:
Abbaszadeh, Afshin, Budhu, Jordan
A generalized dispersion equation is derived featuring coupled mode theory, parity-time symmetry, and leaky wave antennas of arbitrary periodic modulation. It can be specialized to each of these cases individually or can describe a structure containi
External link:
http://arxiv.org/abs/2408.02779
This paper proposes a hybrid fusion-based deep learning approach based on two different modalities, audio and video, to improve human activity recognition and violence detection in public places. To take advantage of audiovisual fusion, late fusion,
External link:
http://arxiv.org/abs/2408.02033
Author:
Kolahi, Sina Ghorbani, Chaharsooghi, Seyed Kamal, Khatibi, Toktam, Bozorgpour, Afshin, Azad, Reza, Heidari, Moein, Hacihaliloglu, Ilker, Merhof, Dorit
Medical image segmentation involves identifying and separating object instances in a medical image to delineate various tissues and structures, a task complicated by the significant variations in size, shape, and density of these features. Convolutio
External link:
http://arxiv.org/abs/2407.21640
Author:
Xu, Mingze, Gao, Mingfei, Gan, Zhe, Chen, Hong-You, Lai, Zhengfeng, Gang, Haiming, Kang, Kai, Dehghan, Afshin
We propose SlowFast-LLaVA (or SF-LLaVA for short), a training-free video large language model (LLM) that can jointly capture the detailed spatial semantics and long-range temporal context without exceeding the token budget of commonly used LLMs. This
External link:
http://arxiv.org/abs/2407.15841
Physics-inspired generative models, in particular diffusion and Poisson flow models, enhance Bayesian methods and promise great utilities in medical imaging. This review examines the transformative role of such generative methods. First, a variety of
External link:
http://arxiv.org/abs/2407.10856
Published in:
Philosophical Transactions of the Royal Society A, 380 (2022) 20210332
The Ogden model is often considered as a standard model in the literature for application to the deformation of brain tissue. Here we show that, in some of those applications, the use of the Ogden model leads to non-convexity of the strain-energy fun
External link:
http://arxiv.org/abs/2407.08372
Author:
Silvestri, Gianluigi, Massoli, Fabio Valerio, Orekondy, Tribhuvanesh, Abdi, Afshin, Behboodi, Arash
A promising way to mitigate the expensive process of obtaining a high-dimensional signal is to acquire a limited number of low-dimensional measurements and solve an under-determined inverse problem by utilizing the structural prior about the signal.
External link:
http://arxiv.org/abs/2407.07794
Author:
Ahmad, Hafiz Mughees, Rahimi, Afshin
Workplace accidents continue to pose significant risks for human safety, particularly in industries such as construction and manufacturing, and the necessity for effective Personal Protective Equipment (PPE) compliance has become increasingly paramou
External link:
http://arxiv.org/abs/2407.04590
Author:
Amirloo, Elmira, Fauconnier, Jean-Philippe, Roesmann, Christoph, Kerl, Christian, Boney, Rinu, Qian, Yusu, Wang, Zirui, Dehghan, Afshin, Yang, Yinfei, Gan, Zhe, Grasch, Peter
Preference alignment has become a crucial component in enhancing the performance of Large Language Models (LLMs), yet its impact in Multimodal Large Language Models (MLLMs) remains comparatively underexplored. Similar to language models, MLLMs for im
External link:
http://arxiv.org/abs/2407.02477
Author:
Bachmann, Roman, Kar, Oğuzhan Fatih, Mizrahi, David, Garjani, Ali, Gao, Mingfei, Griffiths, David, Hu, Jiaming, Dehghan, Afshin, Zamir, Amir
Current multimodal and multitask foundation models like 4M or UnifiedIO show promising results, but in practice their out-of-the-box abilities to accept diverse inputs and perform diverse tasks are limited by the (usually rather small) number of moda
External link:
http://arxiv.org/abs/2406.09406