Showing 1 - 10 of 528 for the search: '"Korhonen, Anna"'
Author:
Fytas, Panagiotis, Breger, Anna, Selby, Ian, Baker, Simon, Shahipasand, Shahab, Korhonen, Anna
Developing imaging models capable of detecting pathologies from chest X-rays can be cost- and time-prohibitive for large datasets, as it requires supervision to attain state-of-the-art performance. Instead, labels extracted from radiology reports may …
External link:
http://arxiv.org/abs/2408.04121
Multiple choice question answering tasks evaluate the reasoning, comprehension, and mathematical abilities of Large Language Models (LLMs). While existing benchmarks employ automatic translation for multilingual evaluation, this approach is error-prone …
External link:
http://arxiv.org/abs/2407.12402
Human label variation (HLV) is a valuable source of information that arises when multiple human annotators provide different labels for valid reasons. In Natural Language Inference (NLI), earlier approaches to capturing HLV involve either collecting …
External link:
http://arxiv.org/abs/2406.17600
Large language models (LLMs) have shown promising abilities as cost-effective and reference-free evaluators for assessing language generation quality. In particular, pairwise LLM evaluators, which compare two generated texts and determine the preferred …
External link:
http://arxiv.org/abs/2406.11370
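The entry above describes pairwise LLM evaluators that compare two generated texts. As a minimal illustrative sketch (not the paper's method): such judges are known to be sensitive to presentation order, and one common mitigation is to query both orders and accept a verdict only when the two agree. Here `mock_judge` is a hypothetical stand-in for a real LLM judge.

```python
# Illustrative sketch of order-swapped pairwise evaluation.
# mock_judge is a placeholder, NOT a real LLM: it simply prefers
# the longer text, which is enough to demonstrate the protocol.

def mock_judge(first, second):
    """Stand-in for an LLM judge; returns which position it prefers."""
    return "first" if len(first) >= len(second) else "second"

def compare_with_swap(text_a, text_b, judge=mock_judge):
    """Return 'a', 'b', or 'tie' using order-swapped consistency."""
    verdict_ab = judge(text_a, text_b)   # text_a shown first
    verdict_ba = judge(text_b, text_a)   # text_b shown first
    prefers_a = verdict_ab == "first" and verdict_ba == "second"
    prefers_b = verdict_ab == "second" and verdict_ba == "first"
    if prefers_a:
        return "a"
    if prefers_b:
        return "b"
    return "tie"  # verdicts disagree across orders -> no stable preference

print(compare_with_swap("a detailed answer", "short"))
```

With this mock judge the first call prefers the longer candidate in both orders, so the verdict is stable; a judge that flips with position would instead yield a tie.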
The surge of interest in culturally aware and adapted Natural Language Processing (NLP) has inspired much recent research. However, the lack of a common understanding of the concept of "culture" has made it difficult to evaluate progress in this emerging …
External link:
http://arxiv.org/abs/2406.03930
Top-view perspective denotes a typical way in which humans read and reason over different types of maps, and it is vital for localization and navigation of humans as well as of 'non-human' agents, such as the ones backed by large Vision-Language Models …
External link:
http://arxiv.org/abs/2406.02537
Large language models (LLMs) often exhibit undesirable behaviours, such as generating untruthful or biased content. Editing their internal representations has been shown to be effective in mitigating such behaviours on top of the existing alignment …
External link:
http://arxiv.org/abs/2405.09719
Author:
Li, Yaoyiran, Zhai, Xiang, Alzantot, Moustafa, Yu, Keyi, Vulić, Ivan, Korhonen, Anna, Hammad, Mohamed
Traditional recommender systems such as matrix factorization methods rely on learning a shared dense embedding space to represent both items and user preferences. Sequence models such as RNNs, GRUs, and, recently, Transformers have also excelled in …
External link:
http://arxiv.org/abs/2405.02429
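The snippet above notes that matrix-factorization recommenders place users and items in one shared dense embedding space. A minimal sketch of that idea, with hypothetical hand-written embeddings (not from the paper): a user-item affinity is just the dot product of their vectors, and items are ranked by that score.

```python
# Minimal sketch of matrix-factorization-style scoring: users and items
# share one embedding space, and affinity is a dot product.

def dot(u, v):
    """Dot product of two equal-length embedding vectors."""
    return sum(a * b for a, b in zip(u, v))

# Hypothetical 3-dimensional embeddings, chosen by hand for illustration.
user_embedding = [0.9, 0.1, 0.3]
item_embeddings = {
    "item_a": [0.8, 0.0, 0.2],   # close to the user's direction
    "item_b": [0.1, 0.9, 0.1],   # far from the user's direction
}

# Rank items for this user by affinity score.
scores = {name: dot(user_embedding, vec) for name, vec in item_embeddings.items()}
ranked = sorted(scores, key=scores.get, reverse=True)
print(ranked[0])  # item_a has the higher dot product here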
Author:
Liu, Yinhong, Zhou, Han, Guo, Zhijiang, Shareghi, Ehsan, Vulić, Ivan, Korhonen, Anna, Collier, Nigel
Large Language Models (LLMs) have demonstrated promising capabilities as automatic evaluators in assessing the quality of generated natural language. However, LLMs still exhibit biases in evaluation and often struggle to generate coherent evaluations …
External link:
http://arxiv.org/abs/2403.16950
Supervised fine-tuning (SFT), supervised instruction tuning (SIT), and in-context learning (ICL) are three alternative, de facto standard approaches to few-shot learning. ICL has gained popularity recently with the advent of LLMs due to its simplicity …
External link:
http://arxiv.org/abs/2403.01929