Výsledky vyhledávání

Report

Autoregressive Speech Synthesis without Vector Quantization

Autor: Meng, Lingwei, Zhou, Long, Liu, Shujie, Chen, Sanyuan, Han, Bing, Hu, Shujie, Liu, Yanqing, Li, Jinyu, Zhao, Sheng, Wu, Xixin, Meng, Helen, Wei, Furu

We present MELLE, a novel continuous-valued tokens based language modeling approach for text to speech synthesis (TTS). MELLE autoregressively generates continuous mel-spectrogram frames directly from text condition, bypassing the need for vector qua

Externí odkaz: http://arxiv.org/abs/2407.08551

Zobrazit plný text záznamu

Report

Enhancing Language Model Rationality with Bi-Directional Deliberation Reasoning

Autor: Zhang, Yadong, Mao, Shaoguang, Wu, Wenshan, Xia, Yan, Ge, Tao, Lan, Man, Wei, Furu

This paper introduces BI-Directional DEliberation Reasoning (BIDDER), a novel reasoning approach to enhance the decision rationality of language models. Traditional reasoning methods typically rely on historical information and employ uni-directional

Externí odkaz: http://arxiv.org/abs/2407.06112

Zobrazit plný text záznamu

Report

Direct Preference Knowledge Distillation for Large Language Models

Autor: Li, Yixing, Gu, Yuxian, Dong, Li, Wang, Dequan, Cheng, Yu, Wei, Furu

In the field of large language models (LLMs), Knowledge Distillation (KD) is a critical technique for transferring capabilities from teacher models to student models. However, existing KD methods face limitations and challenges in distillation of LLM

Externí odkaz: http://arxiv.org/abs/2406.19774

Zobrazit plný text záznamu

Report

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Autor: Cheng, Daixuan, Gu, Yuxian, Huang, Shaohan, Bi, Junyu, Huang, Minlie, Wei, Furu

Unsupervised multitask pre-training has been the critical method behind the recent success of language models (LMs). However, supervised multitask learning still holds significant promise, as scaling it in the post-training stage trends towards bette

Externí odkaz: http://arxiv.org/abs/2406.14491

Zobrazit plný text záznamu

Report

Meta Reasoning for Large Language Models

Autor: Gao, Peizhong, Xie, Ao, Mao, Shaoguang, Wu, Wenshan, Xia, Yan, Mi, Haipeng, Wei, Furu

We introduce Meta-Reasoning Prompting (MRP), a novel and efficient system prompting method for large language models (LLMs) inspired by human meta-reasoning. Traditional in-context learning-based reasoning techniques, such as Tree-of-Thoughts, show p

Externí odkaz: http://arxiv.org/abs/2406.11698

Zobrazit plný text záznamu

Report

VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment

Autor: Han, Bing, Zhou, Long, Liu, Shujie, Chen, Sanyuan, Meng, Lingwei, Qian, Yanming, Liu, Yanqing, Zhao, Sheng, Li, Jinyu, Wei, Furu

With the help of discrete neural audio codecs, large language models (LLM) have increasingly been recognized as a promising methodology for zero-shot Text-to-Speech (TTS) synthesis. However, sampling based decoding strategies bring astonishing divers

Externí odkaz: http://arxiv.org/abs/2406.07855

Zobrazit plný text záznamu

Report

VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers

Autor: Chen, Sanyuan, Liu, Shujie, Zhou, Long, Liu, Yanqing, Tan, Xu, Li, Jinyu, Zhao, Sheng, Qian, Yao, Wei, Furu

This paper introduces VALL-E 2, the latest advancement in neural codec language models that marks a milestone in zero-shot text-to-speech synthesis (TTS), achieving human parity for the first time. Based on its predecessor, VALL-E, the new iteration

Externí odkaz: http://arxiv.org/abs/2406.05370

Zobrazit plný text záznamu

Report

xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token

Autor: Cheng, Xin, Wang, Xun, Zhang, Xingxing, Ge, Tao, Chen, Si-Qing, Wei, Furu, Zhang, Huishuai, Zhao, Dongyan

This paper introduces xRAG, an innovative context compression method tailored for retrieval-augmented generation. xRAG reinterprets document embeddings in dense retrieval--traditionally used solely for retrieval--as features from the retrieval modali

Externí odkaz: http://arxiv.org/abs/2405.13792

Zobrazit plný text záznamu

Report

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

Autor: Jiang, Ting, Huang, Shaohan, Luo, Shengyue, Zhang, Zihan, Huang, Haizhen, Wei, Furu, Deng, Weiwei, Sun, Feng, Zhang, Qi, Wang, Deqing, Zhuang, Fuzhen

Low-rank adaptation is a popular parameter-efficient fine-tuning method for large language models. In this paper, we analyze the impact of low-rank updating, as implemented in LoRA. Our findings suggest that the low-rank updating mechanism may limit

Externí odkaz: http://arxiv.org/abs/2405.12130

Zobrazit plný text záznamu

Report

You Only Cache Once: Decoder-Decoder Architectures for Language Models

Autor: Sun, Yutao, Dong, Li, Zhu, Yi, Huang, Shaohan, Wang, Wenhui, Ma, Shuming, Zhang, Quanlu, Wang, Jianyong, Wei, Furu

We introduce a decoder-decoder architecture, YOCO, for large language models, which only caches key-value pairs once. It consists of two components, i.e., a cross-decoder stacked upon a self-decoder. The self-decoder efficiently encodes global key-va

Externí odkaz: http://arxiv.org/abs/2405.05254

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání