Zobrazeno 1 - 10
of 61 660
pro vyhledávání: '"Ro AS"'
With the growing scale and complexity of video data, efficiently processing long video sequences poses significant challenges due to the quadratic increase in memory and computational demands associated with existing transformer-based Large Multi-mod
Externí odkaz:
http://arxiv.org/abs/2411.19460
Multispectral pedestrian detection is a crucial component in various critical applications. However, a significant challenge arises due to the misalignment between these modalities, particularly under real-world conditions where data often appear hea
Externí odkaz:
http://arxiv.org/abs/2411.17995
Despite advances in Large Multi-modal Models, applying them to long and untrimmed video content remains challenging due to limitations in context length and substantial memory overhead. These constraints often lead to significant information loss and
Externí odkaz:
http://arxiv.org/abs/2411.16173
Recent works on Generalized Referring Expression Segmentation (GRES) struggle with handling complex expressions referring to multiple distinct objects. This is because these methods typically employ an end-to-end foreground-background segmentation an
Externí odkaz:
http://arxiv.org/abs/2411.15087
Autor:
Lee, Chang-Gi, Chae, Byeong-Gyu, Ro, I-Jun, Jang, Kyuseon, Woods, Eric, Ahn, Jaemin, Park, Seong Yong, Gault, Baptiste, Kim, Se-Ho
Atom probe tomography (APT) enables near atomic scale three dimensional elemental mapping through the controlled field evaporation of surface atoms triggered by the combined application of a DC voltage and either voltage or laser pulses. As the selec
Externí odkaz:
http://arxiv.org/abs/2411.10506
Autor:
Cai, Ruisi, Ro, Yeonju, Kim, Geon-Woo, Wang, Peihao, Bejnordi, Babak Ehteshami, Akella, Aditya, Wang, Zhangyang
The proliferation of large language models (LLMs) has led to the adoption of Mixture-of-Experts (MoE) architectures that dynamically leverage specialized subnetworks for improved efficiency and performance. Despite their benefits, MoE models face sig
Externí odkaz:
http://arxiv.org/abs/2410.19123
Autor:
De Ro, Joeri
Given a locally compact quantum group and two $\mathbb{G}$-$W^*$-algebras $\alpha: A\curvearrowleft \mathbb{G}$ and $\beta: B\curvearrowleft \mathbb{G}$, we study the notion of equivariant $W^*$-Morita equivalence $(A, \alpha)\sim_{\mathbb{G}} (B, \b
Externí odkaz:
http://arxiv.org/abs/2410.17407
In-context learning (ICL) is a powerful paradigm where large language models (LLMs) benefit from task demonstrations added to the prompt. Yet, selecting optimal demonstrations is not trivial, especially for complex or multi-modal tasks where input an
Externí odkaz:
http://arxiv.org/abs/2410.14049
We further explore the notion of Ulam words considered by Bade, Cui, Labelle, and Li. We find that when interpreted as integers in a natural way, Ulam words appear to follow a new, unexplained distribution. Gaps between words and words of special typ
Externí odkaz:
http://arxiv.org/abs/2410.01217
The success of visual instruction tuning has accelerated the development of large language and vision models (LLVMs). Following the scaling laws of instruction-tuned large language models (LLMs), LLVMs either have further increased their sizes, reach
Externí odkaz:
http://arxiv.org/abs/2409.14713