Výsledky vyhledávání

Report

Robust Domain Generalization for Multi-modal Object Recognition

Autor: Qiao, Yuxin, Li, Keqin, Lin, Junhong, Wei, Rong, Jiang, Chufeng, Luo, Yang, Yang, Haoyu

In multi-label classification, machine learning encounters the challenge of domain generalization when handling tasks with distributions differing from the training data. Existing approaches primarily focus on vision object recognition and neglect th

Externí odkaz: http://arxiv.org/abs/2408.05831

Zobrazit plný text záznamu

Report

Adaptive Two-Stage Cloud Resource Scaling via Hierarchical Multi-Indicator Forecasting and Bayesian Decision-Making

Autor: Luo, Yang, Wang, Shiyu, Yu, Zhemeng, Lu, Wei, Gao, Xiaofeng, Ma, Lintao, Chen, Guihai

The surging demand for cloud computing resources, driven by the rapid growth of sophisticated large-scale models and data centers, underscores the critical importance of efficient and adaptive resource allocation. As major tech enterprises deploy mas

Externí odkaz: http://arxiv.org/abs/2408.01000

Zobrazit plný text záznamu

Report

Towards Open-World Mobile Manipulation in Homes: Lessons from the Neurips 2023 HomeRobot Open Vocabulary Mobile Manipulation Challenge

In order to develop robots that can effectively serve as versatile and capable home assistants, it is crucial for them to reliably perceive and interact with a wide variety of objects across diverse environments. To this end, we proposed Open Vocabul

Externí odkaz: http://arxiv.org/abs/2407.06939

Zobrazit plný text záznamu

Report

Robust Multimodal Learning via Representation Decoupling

Autor: Wei, Shicai, Luo, Yang, Wang, Yuji, Luo, Chunbo

Multimodal learning robust to missing modality has attracted increasing attention due to its practicality. Existing methods tend to address it by learning a common subspace representation for different modality combinations. However, we reveal that t

Externí odkaz: http://arxiv.org/abs/2407.04458

Zobrazit plný text záznamu

Report

Flat bands and distinct density wave orders in correlated Kagome superconductor CsCr$_3$Sb$_5$

Autor: Peng, Shuting, Han, Yulei, Li, Yongkai, Shen, Jianchang, Miao, Yu, Luo, Yang, Huai, Linwei, Ou, Zhipeng, Li, Hongyu, Xiang, Ziji, Liu, Zhengtai, Shen, Dawei, Hashimoto, Makoto, Lu, Donghui, Yao, Yugui, Qiao, Zhenhua, Wang, Zhiwei, He, Junfeng

Kagome metal CsV$_3$Sb$_5$ has attracted much recent attention due to the coexistence of multiple exotic orders and the associated proposals to mimic unconventional high temperature superconductors. Nevertheless, magnetism and strong electronic corre

Externí odkaz: http://arxiv.org/abs/2406.17769

Zobrazit plný text záznamu

Report

Exploring Adversarial Robustness of Deep State Space Models

Autor: Qi, Biqing, Luo, Yang, Gao, Junqi, Li, Pengfei, Tian, Kai, Ma, Zhiyuan, Zhou, Bowen

Deep State Space Models (SSMs) have proven effective in numerous task scenarios but face significant security challenges due to Adversarial Perturbations (APs) in real-world deployments. Adversarial Training (AT) is a mainstream approach to enhancing

Externí odkaz: http://arxiv.org/abs/2406.05532

Zobrazit plný text záznamu

Report

How Does the Textual Information Affect the Retrieval of Multimodal In-Context Learning?

Autor: Luo, Yang, Zheng, Zangwei, Zhu, Zirui, You, Yang

The increase in parameter size of multimodal large language models (MLLMs) introduces significant capabilities, particularly in-context learning, where MLLMs enhance task performance without updating pre-trained parameters. This effectiveness, howeve

Externí odkaz: http://arxiv.org/abs/2404.12866

Zobrazit plný text záznamu

Report

Learning to Rank Patches for Unbiased Image Redundancy Reduction

Autor: Luo, Yang, Chen, Zhineng, Zhou, Peng, Wu, Zuxuan, Gao, Xieping, Jiang, Yu-Gang

Images suffer from heavy spatial redundancy because pixels in neighboring regions are spatially correlated. Existing approaches strive to overcome this limitation by reducing less meaningful image regions. However, current leading methods rely on sup

Externí odkaz: http://arxiv.org/abs/2404.00680

Zobrazit plný text záznamu

Report

From Two-Stream to One-Stream: Efficient RGB-T Tracking via Mutual Prompt Learning and Knowledge Distillation

Autor: Luo, Yang, Guo, Xiqing, Li, Hao

Due to the complementary nature of visible light and thermal infrared modalities, object tracking based on the fusion of visible light images and thermal images (referred to as RGB-T tracking) has received increasing attention from researchers in rec

Externí odkaz: http://arxiv.org/abs/2403.16834

Zobrazit plný text záznamu

Report

Scale Decoupled Distillation

Autor: Luo, Shicai Wei Chunbo Luo Yang

Logit knowledge distillation attracts increasing attention due to its practicality in recent studies. However, it often suffers inferior performance compared to the feature knowledge distillation. In this paper, we argue that existing logit-based met

Externí odkaz: http://arxiv.org/abs/2403.13512

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání