Search results - "Zhang, Ruohong"

Report

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

Authors: Zhang, Ruohong; Gui, Liangke; Sun, Zhiqing; Feng, Yihao; Xu, Keyang; Zhang, Yuanhan; Fu, Di; Li, Chunyuan; Hauptmann, Alexander; Bisk, Yonatan; Yang, Yiming

Preference modeling techniques, such as direct preference optimization (DPO), have proven effective in enhancing the generalization abilities of large language models (LLMs). However, in tasks involving video instruction-following, providing informative

External link: http://arxiv.org/abs/2404.01258

Report

A Self-enhancement Approach for Domain-specific Chatbot Training via Knowledge Mining and Digest

Authors: Zhang, Ruohong; Gao, Luyu; Zheng, Chen; Fan, Zhen; Lai, Guokun; Zhang, Zheng; Ai, Fangzhou; Yang, Yiming; Yang, Hongxia

Large Language Models (LLMs), despite their great power in language generation, often encounter challenges when dealing with intricate and knowledge-demanding queries in specific domains. This paper introduces a novel approach to enhance LLMs by effe

External link: http://arxiv.org/abs/2311.10614

Report

SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents

Authors: Zhou, Xuhui; Zhu, Hao; Mathur, Leena; Zhang, Ruohong; Yu, Haofei; Qi, Zhengyang; Morency, Louis-Philippe; Bisk, Yonatan; Fried, Daniel; Neubig, Graham; Sap, Maarten

Humans are social beings; we pursue social goals in our daily interactions, which is a crucial aspect of social intelligence. Yet, AI systems' abilities in this realm remain elusive. We present SOTOPIA, an open-ended environment to simulate complex s

External link: http://arxiv.org/abs/2310.11667

Report

PESCO: Prompt-enhanced Self Contrastive Learning for Zero-shot Text Classification

Authors: Wang, Yau-Shian; Chi, Ta-Chung; Zhang, Ruohong; Yang, Yiming

Published in: ACL 2023

We present PESCO, a novel contrastive learning framework that substantially improves the performance of zero-shot text classification. We formulate text classification as a neural text matching problem where each document is treated as a query, and t

External link: http://arxiv.org/abs/2305.14963

Report

Generation-driven Contrastive Self-training for Zero-shot Text Classification with Instruction-following LLM

Authors: Zhang, Ruohong; Wang, Yau-Shian; Yang, Yiming

The remarkable performance of large language models (LLMs) in zero-shot language understanding has garnered significant attention. However, employing LLMs for large-scale inference or domain-specific fine-tuning requires immense computational resourc

External link: http://arxiv.org/abs/2304.11872

Report

Long-tailed Extreme Multi-label Text Classification with Generated Pseudo Label Descriptions

Authors: Zhang, Ruohong; Wang, Yau-Shian; Yang, Yiming; Yu, Donghan; Vu, Tom; Lei, Likun

Extreme Multi-label Text Classification (XMTC) has been a tough challenge in machine learning research and applications due to the sheer size of the label spaces and the severe data scarcity problem associated with the long tail of rare labels in high

External link: http://arxiv.org/abs/2204.00958

Report

Exploiting Local and Global Features in Transformer-based Extreme Multi-label Text Classification

Authors: Zhang, Ruohong; Wang, Yau-Shian; Yang, Yiming; Vu, Tom; Lei, Likun

Extreme multi-label text classification (XMTC) is the task of tagging each document with the relevant labels from a very large space of predefined categories. Recently, large pre-trained Transformer models have made significant performance improvemen

External link: http://arxiv.org/abs/2204.00933

Report

Knowledge Embedding Based Graph Convolutional Network

Authors: Yu, Donghan; Yang, Yiming; Zhang, Ruohong; Wu, Yuexin

Recently, a considerable literature has grown up around the theme of Graph Convolutional Network (GCN). How to effectively leverage the rich structural information in complex graphs, such as knowledge graphs with heterogeneous types of entities and r

External link: http://arxiv.org/abs/2006.07331

Report

Correlation-aware Unsupervised Change-point Detection via Graph Neural Networks

Authors: Zhang, Ruohong; Hao, Yu; Yu, Donghan; Chang, Wei-Cheng; Lai, Guokun; Yang, Yiming

Published in: ICONIP 2020: Neural Information Processing

Change-point detection (CPD) aims to detect abrupt changes over time series data. Intuitively, effective CPD over multivariate time series should require explicit modeling of the dependencies across input variables. However, existing CPD methods eith

External link: http://arxiv.org/abs/2004.11934

Report

Graph-Revised Convolutional Network

Authors: Yu, Donghan; Zhang, Ruohong; Jiang, Zhengbao; Wu, Yuexin; Yang, Yiming

Graph Convolutional Networks (GCNs) have received increasing attention in the machine learning community for effectively leveraging both the content features of nodes and the linkage patterns across graphs in various applications. As real-world graph

External link: http://arxiv.org/abs/1911.07123
