Showing 1 - 10 of 10,255 results for search: '"modal interaction"'
Despite multimodal sentiment analysis being a fertile research area that merits further investigation, current approaches incur high annotation costs and suffer from label ambiguity, which hinders the acquisition of high-quality labeled data. Furthermore…
External link:
http://arxiv.org/abs/2412.09784
Visual Language Tracking (VLT) enhances tracking by mitigating the limitations of relying solely on the visual modality, utilizing high-level semantic information through language. This integration of language enables more advanced human-machine interaction…
External link:
http://arxiv.org/abs/2409.08887
The intricate nature of real-world driving environments, characterized by dynamic and diverse interactions among multiple vehicles and their possible future states, presents considerable challenges in accurately predicting the motion states of vehicles…
External link:
http://arxiv.org/abs/2409.11676
Image-text matching (ITM) is a fundamental problem in computer vision. The key issue lies in jointly learning visual and textual representations to estimate their similarity accurately. Most existing methods focus on feature enhancement within modalities…
External link:
http://arxiv.org/abs/2406.18579
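The core idea named in this entry, scoring jointly learned visual and textual representations by their similarity, can be sketched in a few lines. The sketch below is a generic CLIP-style illustration, not the cited paper's method: random stand-in features replace real encoders, and the shared dimension of 512 and the temperature of 0.07 are assumed values.

import torch
import torch.nn.functional as F

batch, dim = 4, 512  # assumed batch size and shared embedding dimension

# Stand-ins for the outputs of jointly trained visual and text encoders.
image_emb = F.normalize(torch.randn(batch, dim), dim=-1)
text_emb = F.normalize(torch.randn(batch, dim), dim=-1)

# Pairwise cosine similarities: entry (i, j) scores image i against text j.
sim = image_emb @ text_emb.t()

# InfoNCE-style matching loss: matched image-text pairs lie on the diagonal.
labels = torch.arange(batch)
loss = F.cross_entropy(sim / 0.07, labels)  # 0.07: assumed temperature
print(sim.shape, loss.item())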
Capturing complex temporal relationships between video and audio modalities is vital for Audio-Visual Emotion Recognition (AVER). However, existing methods pay little attention to local details, such as facial state changes between video frames, which can…
External link:
http://arxiv.org/abs/2405.16701
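To make the idea of relating audio to frame-level visual detail concrete, here is a hedged sketch using standard multi-head cross-attention, in which each video frame queries the audio sequence; all shapes are illustrative assumptions, and this is not the cited paper's architecture.

import torch
import torch.nn as nn

T_v, T_a, d = 16, 50, 256  # assumed: 16 video frames, 50 audio steps, width 256
video = torch.randn(1, T_v, d)  # frame-level visual features (batch of 1)
audio = torch.randn(1, T_a, d)  # audio features, e.g. spectrogram frames

cross_attn = nn.MultiheadAttention(embed_dim=d, num_heads=4, batch_first=True)

# Each video frame (query) gathers the audio context (keys/values) most
# relevant to it, so local changes between frames can be aligned with the
# corresponding audio segments.
fused, weights = cross_attn(query=video, key=audio, value=audio)
print(fused.shape, weights.shape)  # (1, 16, 256) and (1, 16, 50)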
Multi-modal learning that combines pathological images with genomic data has significantly enhanced the accuracy of survival prediction. Nevertheless, existing methods have not fully utilized the inherent hierarchical structure within both whole slide images…
External link:
http://arxiv.org/abs/2404.08027
Multi-modal entity alignment (MMEA) aims to identify equivalent entity pairs across different multi-modal knowledge graphs (MMKGs). Existing approaches focus on how to better encode and aggregate information from different modalities. However, it is…
External link:
http://arxiv.org/abs/2404.17590
Academic article
This result is only visible to logged-in users.
Tour guidance in virtual museums encourages multi-modal interactions to enhance user experience in terms of engagement, immersion, and spatial awareness. Nevertheless, achieving this goal is challenging due to the complexity of comprehending diverse user…
External link:
http://arxiv.org/abs/2401.11923
Published in:
Jisuanji kexue yu tansuo, Vol. 18, Iss. 5, pp. 1318-1327 (2024)
To address the insufficient modal fusion and weak interactivity caused by semantic feature differences between modalities in multimodal sentiment analysis, a temporal multimodal sentiment analysis model based on composite cross-modal interaction…
External link:
https://doaj.org/article/17957e740a5b459c8310509ae18e9757