Showing 1 - 10 of 16
for the search: '"Xiong, Wayne"'
Author:
Cheng, Yi, Liang, Xiao, Gong, Yeyun, Xiao, Wen, Wang, Song, Zhang, Yuji, Hou, Wenjun, Xu, Kaishuai, Liu, Wenge, Li, Wenjie, Jiao, Jian, Chen, Qi, Cheng, Peng, Xiong, Wayne
Self-consistency-based approaches, which involve repeatedly sampling multiple outputs and selecting the most consistent one as the final response, prove to be remarkably effective in improving the factual accuracy of large language models. Nonetheless…
External link:
http://arxiv.org/abs/2410.01556
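The self-consistency idea summarized in the abstract above can be sketched in a few lines: sample several candidate answers and keep the one that occurs most often. The sampler below is a hypothetical stand-in for an actual LLM call, not part of the cited work.

```python
import random
from collections import Counter

def self_consistency(sample_fn, prompt, n=9):
    """Sample n candidate answers and return the most frequent one,
    i.e. the most consistent response across samples."""
    answers = [sample_fn(prompt) for _ in range(n)]
    return Counter(answers).most_common(1)[0][0]

# Toy sampler standing in for an LLM call (hypothetical): it usually
# answers "42" but occasionally disagrees with itself.
def toy_sampler(prompt):
    return random.choice(["42", "42", "42", "41"])

print(self_consistency(toy_sampler, "What is 6 * 7?"))
```

Majority voting like this only helps when sampling errors are uncorrelated; the abstract's caveat ("Nonetheless…") points at exactly such limitations.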
Currently, prompting techniques can be mainly divided into two categories: 1) the shot method implicitly inspires the model to answer the question by mimicking the steps in the given examples, e.g., few-shot CoT; 2) the guideline method explicitly instructs…
External link:
http://arxiv.org/abs/2409.12979
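The "shot method" described above can be illustrated with a minimal few-shot chain-of-thought template: a worked example implicitly shows the model the step-by-step pattern to imitate. The example question and template below are illustrative assumptions, not taken from the cited paper.

```python
# One worked example demonstrates the reasoning pattern; the model is
# expected to mimic the step-by-step style when answering the new question.
FEW_SHOT_COT = """\
Q: Roger has 5 balls. He buys 2 cans with 3 balls each. How many balls does he have now?
A: Roger started with 5 balls. 2 cans of 3 balls is 6 balls. 5 + 6 = 11. The answer is 11.

Q: {question}
A:"""

def build_cot_prompt(question):
    """Fill the user's question into the few-shot CoT template."""
    return FEW_SHOT_COT.format(question=question)

print(build_cot_prompt("A farm has 3 pens with 4 sheep each. How many sheep in total?"))
```

By contrast, the guideline method would replace the worked example with explicit written instructions on how to reason.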
Author:
Wang, Song, Wang, Xun, Mei, Jie, Xie, Yujia, Muarray, Sean, Li, Zhang, Wu, Lingfeng, Chen, Si-Qing, Xiong, Wayne
Hallucination, a phenomenon where large language models (LLMs) produce output that is factually incorrect or unrelated to the input, is a major challenge for LLM applications that require accuracy and dependability. In this paper, we introduce…
External link:
http://arxiv.org/abs/2407.15441
Author:
Cai, Zefan, Zhang, Yichi, Gao, Bofei, Liu, Yuliang, Liu, Tianyu, Lu, Keming, Xiong, Wayne, Dong, Yue, Chang, Baobao, Hu, Junjie, Xiao, Wen
In this study, we investigate whether attention-based information flow inside large language models (LLMs) is aggregated through noticeable patterns for long context processing. Our observations reveal that LLMs aggregate information through Pyramidal…
External link:
http://arxiv.org/abs/2406.02069
Summarizing lengthy documents is a common and essential task in our daily lives. Although recent advancements in neural summarization models can assist in crafting general-purpose summaries, human writers often have specific requirements that call for…
External link:
http://arxiv.org/abs/2306.03067
Author:
Zhang, Xingxing, Liu, Yiran, Wang, Xun, He, Pengcheng, Yu, Yang, Chen, Si-Qing, Xiong, Wayne, Wei, Furu
The input and output of most text generation tasks can be transformed to two sequences of tokens, and they can be modeled using sequence-to-sequence learning tools such as Transformers. These models are usually trained by maximizing the likelihood…
External link:
http://arxiv.org/abs/2212.04257
Author:
He, Pengcheng, Peng, Baolin, Lu, Liyang, Wang, Song, Mei, Jie, Liu, Yang, Xu, Ruochen, Awadalla, Hany Hassan, Shi, Yu, Zhu, Chenguang, Xiong, Wayne, Zeng, Michael, Gao, Jianfeng, Huang, Xuedong
This paper presents Z-Code++, a new pre-trained language model optimized for abstractive text summarization. The model extends the state-of-the-art encoder-decoder model using three techniques. First, we use a two-phase pre-training process to improve…
External link:
http://arxiv.org/abs/2208.09770
Author:
Yoshioka, Takuya, Abramovski, Igor, Aksoylar, Cem, Chen, Zhuo, David, Moshe, Dimitriadis, Dimitrios, Gong, Yifan, Gurvich, Ilya, Huang, Xuedong, Huang, Yan, Hurvitz, Aviv, Jiang, Li, Koubi, Sharon, Krupka, Eyal, Leichter, Ido, Liu, Changliang, Parthasarathy, Partha, Vinnikov, Alon, Wu, Lingfeng, Xiao, Xiong, Xiong, Wayne, Wang, Huaming, Wang, Zhenghao, Zhang, Jun, Zhao, Yong, Zhou, Tianyan
This paper describes a system that generates speaker-annotated transcripts of meetings by using a microphone array and a 360-degree camera. The hallmark of the system is its ability to handle overlapped speech, which has been an unsolved problem in…
External link:
http://arxiv.org/abs/1912.04979
Published in:
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 26 (2018) 184-196
Unsupervised single-channel overlapped speech recognition is one of the hardest problems in automatic speech recognition (ASR). Permutation invariant training (PIT) is a state-of-the-art model-based approach, which applies a single neural network to…
External link:
http://arxiv.org/abs/1707.07048
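The core PIT idea from the abstract above can be sketched with a toy loss: score every assignment of model output streams to reference streams and train against the best one, so the model is not penalized for emitting speakers in a different order. The scalar "streams" and squared-error loss below are illustrative assumptions, not the paper's actual acoustic features or training objective.

```python
from itertools import permutations

def pit_loss(outputs, targets, loss_fn):
    """Permutation-invariant loss: evaluate every pairing of output
    streams with reference streams and keep the minimum total loss."""
    best = None
    for perm in permutations(range(len(targets))):
        total = sum(loss_fn(outputs[i], targets[j]) for i, j in enumerate(perm))
        if best is None or total < best:
            best = total
    return best

# Toy scalar "streams" with squared error (illustrative only).
mse = lambda a, b: (a - b) ** 2
print(pit_loss([1.0, 5.0], [5.0, 1.0], mse))  # best assignment swaps streams -> 0.0
```

Enumerating all permutations costs O(S!) in the number of streams S, which is acceptable for the two- or three-speaker settings typical of overlapped-speech work.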
Academic article