Zobrazeno 1 - 10
of 23
pro vyhledávání: '"Han, Rujun"'
Autor:
Xu, Wenda, Han, Rujun, Wang, Zifeng, Le, Long T., Madeka, Dhruv, Li, Lei, Wang, William Yang, Agarwal, Rishabh, Lee, Chen-Yu, Pfister, Tomas
Recent advances in knowledge distillation (KD) have enabled smaller student models to approach the performance of larger teacher models. However, popular methods such as supervised KD and on-policy KD, are adversely impacted by the knowledge gaps bet
Externí odkaz:
http://arxiv.org/abs/2410.11325
Autor:
Wu, Zhengxuan, Zhang, Yuhao, Qi, Peng, Xu, Yumo, Han, Rujun, Zhang, Yian, Chen, Jifan, Min, Bonan, Huang, Zhiheng
Modern language models (LMs) need to follow human instructions while being faithful; yet, they often fail to achieve both. Here, we provide concrete evidence of a trade-off between instruction following (i.e., follow open-ended instructions) and fait
Externí odkaz:
http://arxiv.org/abs/2407.21417
Autor:
Han, Rujun, Zhang, Yuhao, Qi, Peng, Xu, Yumo, Wang, Jenyuan, Liu, Lan, Wang, William Yang, Min, Bonan, Castelli, Vittorio
Question answering based on retrieval augmented generation (RAG-QA) is an important research topic in NLP and has a wide range of real-world applications. However, most existing datasets for this task are either constructed using a single source corp
Externí odkaz:
http://arxiv.org/abs/2407.13998
Commonsense reasoning is omnipresent in human communications and thus is an important feature for open-domain dialogue systems. However, evaluating commonsense in dialogue systems is still an open challenge. We take the first step by focusing on even
Externí odkaz:
http://arxiv.org/abs/2305.07797
Story visualization advances the traditional text-to-image generation by enabling multiple image generation based on a complete story. This task requires machines to 1) understand long text inputs and 2) produce a globally consistent image sequence t
Externí odkaz:
http://arxiv.org/abs/2210.08465
Stories or narratives are comprised of a sequence of events. To compose interesting stories, professional writers often leverage a creative writing technique called flashback that inserts past events into current storylines as we commonly observe in
Externí odkaz:
http://arxiv.org/abs/2205.01898
Understanding how events are semantically related to each other is the essence of reading comprehension. Recent event-centric reading comprehension datasets focus mostly on event arguments or temporal relations. While these tasks partially evaluate m
Externí odkaz:
http://arxiv.org/abs/2104.08350
Answer Sentence Selection (AS2) is an efficient approach for the design of open-domain Question Answering (QA) systems. In order to achieve low latency, traditional AS2 models score question-answer pairs individually, ignoring any information from th
Externí odkaz:
http://arxiv.org/abs/2101.12093
Autor:
Ma, Mingyu Derek, Sun, Jiao, Yang, Mu, Huang, Kung-Hsiang, Wen, Nuan, Singh, Shikhar, Han, Rujun, Peng, Nanyun
We present EventPlus, a temporal event understanding pipeline that integrates various state-of-the-art event understanding components including event trigger and type detection, event argument detection, event duration and temporal relation extractio
Externí odkaz:
http://arxiv.org/abs/2101.04922
While pre-trained language models (PTLMs) have achieved noticeable success on many NLP tasks, they still struggle for tasks that require event temporal reasoning, which is essential for event-centric applications. We present a continual pre-training
Externí odkaz:
http://arxiv.org/abs/2012.15283