Search results

Report

AMBER: An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation

Authors: Wang, Junyang, Wang, Yuhang, Xu, Guohai, Zhang, Jing, Gu, Yukai, Jia, Haitao, Wang, Jiaqi, Xu, Haiyang, Yan, Ming, Zhang, Ji, Sang, Jitao

Despite making significant progress in multi-modal tasks, current Multi-modal Large Language Models (MLLMs) encounter the significant challenge of hallucinations, which may lead to harmful consequences. Therefore, evaluating MLLMs' hallucinations is

External link: http://arxiv.org/abs/2311.07397

Report

UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model

Authors: Ye, Jiabo, Hu, Anwen, Xu, Haiyang, Ye, Qinghao, Yan, Ming, Xu, Guohai, Li, Chenliang, Tian, Junfeng, Qian, Qi, Zhang, Ji, Jin, Qin, He, Liang, Lin, Xin Alex, Huang, Fei

Text is ubiquitous in our visual world, conveying crucial information, such as in documents, websites, and everyday photographs. In this work, we propose UReader, a first exploration of universal OCR-free visually-situated language understanding base

External link: http://arxiv.org/abs/2310.05126

Report

Evaluation and Analysis of Hallucination in Large Vision-Language Models

Authors: Wang, Junyang, Zhou, Yiyang, Xu, Guohai, Shi, Pengcheng, Zhao, Chenlin, Xu, Haiyang, Ye, Qinghao, Yan, Ming, Zhang, Ji, Zhu, Jihua, Sang, Jitao, Tang, Haoyu

Large Vision-Language Models (LVLMs) have recently achieved remarkable success. However, LVLMs are still plagued by the hallucination problem, which limits the practicality in many scenarios. Hallucination refers to the information of LVLMs' response

External link: http://arxiv.org/abs/2308.15126

Report

CValues: Measuring the Values of Chinese Large Language Models from Safety to Responsibility

Authors: Xu, Guohai, Liu, Jiayi, Yan, Ming, Xu, Haotian, Si, Jinghui, Zhou, Zhuoran, Yi, Peng, Gao, Xing, Sang, Jitao, Zhang, Rong, Zhang, Ji, Peng, Chao, Huang, Fei, Zhou, Jingren

With the rapid evolution of large language models (LLMs), there is a growing concern that they may pose risks or have negative social impacts. Therefore, evaluation of human values alignment is becoming increasingly important. Previous work mainly fo

External link: http://arxiv.org/abs/2307.09705

Report

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Authors: Ye, Jiabo, Hu, Anwen, Xu, Haiyang, Ye, Qinghao, Yan, Ming, Dan, Yuhao, Zhao, Chenlin, Xu, Guohai, Li, Chenliang, Tian, Junfeng, Qi, Qian, Zhang, Ji, Huang, Fei

Document understanding refers to automatically extracting, analyzing, and comprehending information from various types of digital documents, such as a web page. Existing Multi-modal Large Language Models (MLLMs), including mPLUG-Owl, have demonstrated promisi

External link: http://arxiv.org/abs/2307.02499

Report

Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks

Authors: Xu, Haiyang, Ye, Qinghao, Wu, Xuan, Yan, Ming, Miao, Yuan, Ye, Jiabo, Xu, Guohai, Hu, Anwen, Shi, Yaya, Xu, Guangwei, Li, Chenliang, Qian, Qi, Que, Maofei, Zhang, Ji, Zeng, Xiao, Huang, Fei

To promote the development of Vision-Language Pre-training (VLP) and multimodal Large Language Model (LLM) in the Chinese community, we firstly release the largest public Chinese high-quality video-language dataset named Youku-mPLUG, which is collect

External link: http://arxiv.org/abs/2306.04362

Report

Distinguish Before Answer: Generating Contrastive Explanation as Knowledge for Commonsense Question Answering

Authors: Chen, Qianglong, Xu, Guohai, Yan, Ming, Zhang, Ji, Huang, Fei, Si, Luo, Zhang, Yin

Existing knowledge-enhanced methods have achieved remarkable results in certain QA tasks via obtaining diverse knowledge from different knowledge bases. However, limited by the properties of retrieved knowledge, they still have trouble benefiting fro

External link: http://arxiv.org/abs/2305.08135

Report

AMTSS: An Adaptive Multi-Teacher Single-Student Knowledge Distillation Framework For Multilingual Language Inference

Authors: Chen, Qianglong, Ji, Feng, Li, Feng-Lin, Xu, Guohai, Yan, Ming, Zhang, Ji, Zhang, Yin

Knowledge distillation is of key importance to launching multilingual pre-trained language models for real applications. To support cost-effective language inference in multilingual settings, we propose AMTSS, an adaptive multi-teacher single-student

External link: http://arxiv.org/abs/2305.07928

Report

mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality

Authors: Ye, Qinghao, Xu, Haiyang, Xu, Guohai, Ye, Jiabo, Yan, Ming, Zhou, Yiyang, Wang, Junyang, Hu, Anwen, Shi, Pengcheng, Shi, Yaya, Li, Chenliang, Xu, Yuanhong, Chen, Hehong, Tian, Junfeng, Qian, Qi, Zhang, Ji, Huang, Fei, Zhou, Jingren

Large language models (LLMs) have demonstrated impressive zero-shot abilities on a variety of open-ended tasks, while recent research has also explored the use of LLMs for multi-modal generation. In this study, we introduce mPLUG-Owl, a novel trainin

External link: http://arxiv.org/abs/2304.14178

Report

ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human

Authors: Tian, Junfeng, Chen, Hehong, Xu, Guohai, Yan, Ming, Gao, Xing, Zhang, Jianhai, Li, Chenliang, Liu, Jiayi, Xu, Wenshen, Xu, Haiyang, Qian, Qi, Wang, Wei, Ye, Qinghao, Zhang, Jiejing, Zhang, Ji, Huang, Fei, Zhou, Jingren

In this paper, we present ChatPLUG, a Chinese open-domain dialogue system for digital human applications that instruction finetunes on a wide range of dialogue tasks in a unified internet-augmented format. Different from other open-domain dialogue mo

External link: http://arxiv.org/abs/2304.07849
