Zobrazeno 1 - 10
of 4 405
pro vyhledávání: '"Cascante A"'
Autor:
Cubero-Cascante, José, Vaidyanathan, Arunkumar, Pelke, Rebecca, Pfeifer, Lorenzo, Leupers, Rainer, Joseph, Jan Moritz
The surge in AI usage demands innovative power reduction strategies. Novel Compute-in-Memory (CIM) architectures, leveraging advanced memory technologies, hold the potential for significantly lowering energy consumption by integrating storage with pa
Externí odkaz:
http://arxiv.org/abs/2405.04326
Visual Programming has recently emerged as an alternative to end-to-end black-box visual reasoning models. This type of method leverages Large Language Models (LLMs) to generate the source code for an executable computer program that solves a given p
Externí odkaz:
http://arxiv.org/abs/2403.16921
We introduce SynGround, a novel framework that combines data-driven learning and knowledge transfer from various large-scale pretrained models to enhance the visual grounding capabilities of a pretrained vision-and-language model. The knowledge trans
Externí odkaz:
http://arxiv.org/abs/2403.13804
Autor:
Pelke, Rebecca, Staudigl, Felix, Thomas, Niklas, Bosbach, Nils, Hossein, Mohammed, Cubero-Cascante, Jose, Poehls, Leticia Bolzani, Leupers, Rainer, Joseph, Jan Moritz
Resistive Random Access Memory (ReRAM) is a promising candidate for implementing Computing-in-Memory (CIM) architectures and neuromorphic circuits. ReRAM cells exhibit significant variability across different memristive devices and cycles, necessitat
Externí odkaz:
http://arxiv.org/abs/2403.13655
We introduce AutoVER, an Autoregressive model for Visual Entity Recognition. Our model extends an autoregressive Multi-modal Large Language Model by employing retrieval augmented constrained generation. It mitigates low performance on out-of-domain e
Externí odkaz:
http://arxiv.org/abs/2402.18695
In a recent paper of the authors together with A. Aleman, it is shown that the Bloch space $\mathcal{B}$ in the unit disc has the following radicality property: if an analytic function $g$ satisfies that $g^n\in \mathcal{B}$, then $g^m\in \mathcal{B}
Externí odkaz:
http://arxiv.org/abs/2402.16997
Vision-and-language models trained to match images with text can be combined with visual explanation methods to point to the locations of specific objects in an image. Our work shows that the localization --"grounding"-- abilities of these models can
Externí odkaz:
http://arxiv.org/abs/2312.04554
For a fixed analytic function g on the unit disc, we consider the analytic paraproducts induced by g, which are formally defined by $T_gf(z)=\int_0^zf(\zeta)g'(\zeta)d\zeta$, $S_gf(z)=\int_0^zf'(\zeta)g(\zeta)d\zeta$, and $M_gf(z)=g(z)f(z)$. We are c
Externí odkaz:
http://arxiv.org/abs/2311.05972
Detailed timing models are indispensable tools for the design space exploration of Multiprocessor Systems on Chip (MPSoCs). As core counts continue to increase, the complexity in memory hierarchies and interconnect topologies is also growing, making
Externí odkaz:
http://arxiv.org/abs/2308.09445
Autor:
Doveh, Sivan, Arbelle, Assaf, Harary, Sivan, Herzig, Roei, Kim, Donghyun, Cascante-bonilla, Paola, Alfassy, Amit, Panda, Rameswar, Giryes, Raja, Feris, Rogerio, Ullman, Shimon, Karlinsky, Leonid
Vision and Language (VL) models offer an effective method for aligning representation spaces of images and text, leading to numerous applications such as cross-modal retrieval, visual question answering, captioning, and more. However, the aligned ima
Externí odkaz:
http://arxiv.org/abs/2305.19595