Search results - "Gao, Guanglai"

Report

Fully Hyperbolic Rotation for Knowledge Graph Embedding

Authors: Liang, Qiuyu; Wang, Weihua; Bao, Feilong; Gao, Guanglai

Hyperbolic rotation is commonly used to effectively model knowledge graphs and their inherent hierarchies. However, existing hyperbolic rotation models rely on logarithmic and exponential mappings for feature transformation. These models only project

External link: http://arxiv.org/abs/2411.03622

Report

Leveraging Retrieval Augment Approach for Multimodal Emotion Recognition Under Missing Modalities

Authors: Fan, Qi; Yuan, Hongyu; Zuo, Haolin; Liu, Rui; Gao, Guanglai

Multimodal emotion recognition utilizes complete multimodal information and robust multimodal joint representation to gain high performance. However, the ideal condition of full modality integrity is often not applicable in reality and there always a

External link: http://arxiv.org/abs/2410.02804

Report

Leveraging Contrastive Learning and Self-Training for Multimodal Emotion Recognition with Limited Labeled Samples

Authors: Fan, Qi; Li, Yutong; Xin, Yi; Cheng, Xinyu; Gao, Guanglai; Ma, Miao

The Multimodal Emotion Recognition challenge MER2024 focuses on recognizing emotions using audio, language, and visual signals. In this paper, we present our submission solutions for the Semi-Supervised Learning Sub-Challenge (MER2024-SEMI), which ta

External link: http://arxiv.org/abs/2409.04447

Report

MCDubber: Multimodal Context-Aware Expressive Video Dubbing

Authors: Zhao, Yuan; Jia, Zhenqi; Liu, Rui; Hu, De; Bao, Feilong; Gao, Guanglai

Automatic Video Dubbing (AVD) aims to take the given script and generate speech that aligns with lip motion and prosody expressiveness. Current AVD models mainly utilize visual information of the current sentence to enhance the prosody of synthesized

External link: http://arxiv.org/abs/2408.11593

Report

Mitigating Heterogeneity among Factor Tensors via Lie Group Manifolds for Tensor Decomposition Based Temporal Knowledge Graph Embedding

Authors: Li, Jiang; Su, Xiangdong; Gong, Yeyun; Gao, Guanglai

Recent studies have highlighted the effectiveness of tensor decomposition methods in the Temporal Knowledge Graphs Embedding (TKGE) task. However, we found that inherent heterogeneity among factor tensors in tensor decomposition significantly hinders

External link: http://arxiv.org/abs/2404.09155

Report

L^2GC: Lorentzian Linear Graph Convolutional Networks for Node Classification

Authors: Liang, Qiuyu; Wang, Weihua; Bao, Feilong; Gao, Guanglai

Linear Graph Convolutional Networks (GCNs) are used to classify nodes in graph data. However, we note that most existing linear GCN models perform neural network operations in Euclidean space, which does not explicitly capture the tree-like hier

External link: http://arxiv.org/abs/2403.06064

Report

Learning Noise-Robust Joint Representation for Multimodal Emotion Recognition under Incomplete Data Scenarios

Authors: Fan, Qi; Zuo, Haolin; Liu, Rui; Lian, Zheng; Gao, Guanglai

Multimodal emotion recognition (MER) in practical scenarios is significantly challenged by the presence of missing or incomplete data across different modalities. To overcome these challenges, researchers have aimed to simulate incomplete conditions

External link: http://arxiv.org/abs/2311.16114

Report

TransERR: Translation-based Knowledge Graph Embedding via Efficient Relation Rotation

Authors: Li, Jiang; Su, Xiangdong; Zhang, Fujun; Gao, Guanglai

This paper presents a translation-based knowledge graph embedding method via efficient relation rotation (TransERR), a straightforward yet effective alternative to traditional translation-based knowledge graph embedding models. Different from the pr

External link: http://arxiv.org/abs/2306.14580

Report

Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion

Authors: Liu, Rui; Zhang, Jinhua; Gao, Guanglai; Li, Haizhou

Audio Deepfake Detection (ADD) aims to detect the fake audio generated by text-to-speech (TTS), voice conversion (VC) and replay, etc., which is an emerging topic. Traditionally we take the mono signal as input and focus on robust feature extraction

External link: http://arxiv.org/abs/2305.16353

Report

MnTTS2: An Open-Source Multi-Speaker Mongolian Text-to-Speech Synthesis Dataset

Authors: Liang, Kailin; Liu, Bin; Hu, Yifan; Liu, Rui; Bao, Feilong; Gao, Guanglai

Text-to-Speech (TTS) synthesis for low-resource languages is an attractive research issue in academia and industry nowadays. Mongolian is the official language of the Inner Mongolia Autonomous Region and a representative low-resource language spoken

External link: http://arxiv.org/abs/2301.00657
