Showing 1 - 10 of 173
for the search: '"Tang, Tianyi"'
Author:
Tang, Tianyi, Hu, Yiwen, Li, Bingqian, Luo, Wenyang, Qin, Zijing, Sun, Haoxiang, Wang, Jiapeng, Xu, Shiyi, Cheng, Xiaoxue, Guo, Geyang, Peng, Han, Zheng, Bowen, Tang, Yiru, Min, Yingqian, Chen, Yushuo, Chen, Jie, Zhao, Yuanqian, Ding, Luran, Wang, Yuhao, Dong, Zican, Xia, Chunxuan, Li, Junyi, Zhou, Kun, Zhao, Wayne Xin, Wen, Ji-Rong
To facilitate the research on large language models (LLMs), this paper presents a comprehensive and unified library, LLMBox, to ease the development, use, and evaluation of LLMs. This library is featured with three main merits: (1) a unified data int…
External link:
http://arxiv.org/abs/2407.05563
Author:
Chen, Yushuo, Tang, Tianyi, Xiang, Erge, Li, Linjiang, Zhao, Wayne Xin, Wang, Jing, Chai, Yunpeng, Wen, Ji-Rong
In real world, large language models (LLMs) can serve as the assistant to help users accomplish their jobs, and also support the development of advanced applications. For the wide application of LLMs, the inference efficiency is an essential concern…
External link:
http://arxiv.org/abs/2404.11502
Author:
Tang, Tianyi, Luo, Wenyang, Huang, Haoyang, Zhang, Dongdong, Wang, Xiaolei, Zhao, Xin, Wei, Furu, Wen, Ji-Rong
Large language models (LLMs) demonstrate remarkable multilingual capabilities without being pre-trained on specially curated multilingual parallel corpora. It remains a challenging problem to explain the underlying mechanisms by which LLMs process mu…
External link:
http://arxiv.org/abs/2402.16438
The direct growth of III-V semiconductors on silicon holds tremendous potential for photonics applications. However, the inherent differences in their properties lead to defects in the epitaxial layer, including threading dislocations (TDs), antiphas…
External link:
http://arxiv.org/abs/2312.15390
Alignment with human preference is a desired property of large language models (LLMs). Currently, the main alignment approach is based on reinforcement learning from human feedback (RLHF). Despite the effectiveness of RLHF, it is intricate to impleme…
External link:
http://arxiv.org/abs/2311.04072
Large language models (LLMs) have achieved dramatic proficiency over NLP tasks with normal length. Recently, multiple studies have committed to extending the context length and enhancing the long text modeling capabilities of LLMs. To comprehensively…
External link:
http://arxiv.org/abs/2309.13345
Interpreting ancient Chinese has been the key to comprehending vast Chinese literature, tradition, and civilization. In this paper, we propose Erya for ancient Chinese translation. From a dataset perspective, we collect, clean, and classify ancient C…
External link:
http://arxiv.org/abs/2308.00240
In this paper, we propose a novel language model guided captioning approach, LAMOC, for knowledge-based visual question answering (VQA). Our approach employs the generated captions by a captioning model as the context of an answer prediction model, w…
External link:
http://arxiv.org/abs/2305.17006
People often imagine relevant scenes to aid in the writing process. In this work, we aim to utilize visual information for composition in the same manner as humans. We propose a method, LIVE, that makes pre-trained language models (PLMs) Learn to Ima…
External link:
http://arxiv.org/abs/2305.16944
Author:
Tang, Tianyi, Lu, Hongyuan, Jiang, Yuchen Eleanor, Huang, Haoyang, Zhang, Dongdong, Zhao, Wayne Xin, Kocmi, Tom, Wei, Furu
Most research about natural language generation (NLG) relies on evaluation benchmarks with limited references for a sample, which may result in poor correlations with human judgements. The underlying reason is that one semantic meaning can actually b…
External link:
http://arxiv.org/abs/2305.15067