Výsledky vyhledávání - "Raina, Vatsal"

Report

Finetuning LLMs for Comparative Assessment Tasks

Autor: Raina, Vatsal, Liusie, Adian, Gales, Mark

Automated assessment in natural language generation is a challenging task. Instruction-tuned large language models (LLMs) have shown promise in reference-free evaluation, particularly through comparative assessment. However, the quadratic computation

Externí odkaz: http://arxiv.org/abs/2409.15979

Zobrazit plný text záznamu

Report

Question-Based Retrieval using Atomic Units for Enterprise RAG

Autor: Raina, Vatsal, Gales, Mark

Enterprise retrieval augmented generation (RAG) offers a highly flexible framework for combining powerful large language models (LLMs) with internal, possibly temporally changing, documents. In RAG, documents are first chunked. Relevant chunks are th

Externí odkaz: http://arxiv.org/abs/2405.12363

Zobrazit plný text záznamu

Report

Efficient LLM Comparative Assessment: a Product of Experts Framework for Pairwise Comparisons

Autor: Liusie, Adian, Raina, Vatsal, Fathullah, Yassir, Gales, Mark

LLM-as-a-judge approaches are a practical and effective way of assessing a range of text tasks, aligning with human judgements especially when applied in a comparative assessment fashion. However, when using pairwise comparisons to rank a set of cand

Externí odkaz: http://arxiv.org/abs/2405.05894

Zobrazit plný text záznamu

Report

Question Difficulty Ranking for Multiple-Choice Reading Comprehension

Autor: Raina, Vatsal, Gales, Mark

Multiple-choice (MC) tests are an efficient method to assess English learners. It is useful for test creators to rank candidate MC questions by difficulty during exam curation. Typically, the difficulty is determined by having human test takers trial

Externí odkaz: http://arxiv.org/abs/2404.10704

Zobrazit plný text záznamu

Report

An Information-Theoretic Approach to Analyze NLP Classification Tasks

Autor: Wang, Luran, Gales, Mark, Raina, Vatsal

Understanding the importance of the inputs on the output is useful across many tasks. This work provides an information-theoretic framework to analyse the influence of inputs for text classification tasks. Natural language processing (NLP) tasks take

Externí odkaz: http://arxiv.org/abs/2402.00978

Zobrazit plný text záznamu

Report

Structural-Based Uncertainty in Deep Learning Across Anatomical Scales: Analysis in White Matter Lesion Segmentation

Autor: Molchanova, Nataliia, Raina, Vatsal, Malinin, Andrey, La Rosa, Francesco, Depeursinge, Adrien, Gales, Mark, Granziera, Cristina, Muller, Henning, Graziani, Mara, Cuadra, Meritxell Bach

This paper explores uncertainty quantification (UQ) as an indicator of the trustworthiness of automated deep-learning (DL) tools in the context of white matter lesion (WML) segmentation from magnetic resonance imaging (MRI) scans of multiple sclerosi

Externí odkaz: http://arxiv.org/abs/2311.08931

Zobrazit plný text záznamu

Report

Assessing Distractors in Multiple-Choice Tests

Autor: Raina, Vatsal, Liusie, Adian, Gales, Mark

Multiple-choice tests are a common approach for assessing candidates' comprehension skills. Standard multiple-choice reading comprehension exams require candidates to select the correct answer option from a discrete set based on a question in relatio

Externí odkaz: http://arxiv.org/abs/2311.04554

Zobrazit plný text záznamu

Report

Is it Possible to Modify Text to a Target Readability Level? An Initial Investigation Using Zero-Shot Large Language Models

Autor: Farajidizaji, Asma, Raina, Vatsal, Gales, Mark

Text simplification is a common task where the text is adapted to make it easier to understand. Similarly, text elaboration can make a passage more sophisticated, offering a method to control the complexity of reading comprehension tests. However, te

Externí odkaz: http://arxiv.org/abs/2309.12551

Zobrazit plný text záznamu

Report

Analyzing Multiple-Choice Reading and Listening Comprehension Tests

Autor: Raina, Vatsal, Liusie, Adian, Gales, Mark

Multiple-choice reading and listening comprehension tests are an important part of language assessment. Content creators for standard educational tests need to carefully curate questions that assess the comprehension abilities of candidates taking th

Externí odkaz: http://arxiv.org/abs/2307.01076

Zobrazit plný text záznamu

Report

Analysis of the Cambridge Multiple-Choice Questions Reading Dataset with a Focus on Candidate Response Distribution

Autor: Liusie, Adian, Raina, Vatsal, Mullooly, Andrew, Knill, Kate, Gales, Mark J. F.

Multiple choice exams are widely used to assess candidates across a diverse range of domains and tasks. To moderate question quality, newly proposed questions often pass through pre-test evaluation stages before being deployed into real-world exams.

Externí odkaz: http://arxiv.org/abs/2306.13047

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání