Showing 1 - 10 of 282 for search: '"Wallace, Byron C"'
Author:
Arroyo, Alberto Mario Ceballos, Munnangi, Monica, Sun, Jiuding, Zhang, Karen Y. C., McInerney, Denis Jered, Wallace, Byron C., Amir, Silvio
Instruction-tuned Large Language Models (LLMs) can perform a wide range of tasks given natural language instructions to do so, but they are sensitive to how such instructions are phrased. This issue is especially concerning in healthcare, as clinicia…
External link:
http://arxiv.org/abs/2407.09429
Recent work on evaluating the diversity of text generated by LLMs has focused on word-level features. Here we offer an analysis of syntactic features to characterize general repetition in models, beyond frequent n-grams. Specifically, we define synta…
External link:
http://arxiv.org/abs/2407.00211
Eliciting "chain of thought" (CoT) rationales -- sequences of tokens that convey a "reasoning" process -- has been shown to consistently improve LLM performance on tasks like question answering. More recent efforts have shown that such rationales can…
External link:
http://arxiv.org/abs/2406.14511
Entity matching is the task of linking records from different sources that refer to the same real-world entity. Past work has primarily treated entity linking as a standard supervised learning problem. However, supervised entity matching models often…
External link:
http://arxiv.org/abs/2406.09330
Meta-analyses statistically aggregate the findings of different randomized controlled trials (RCTs) to assess treatment effectiveness. Because this yields robust estimates of treatment effectiveness, results from meta-analyses are considered the stro…
External link:
http://arxiv.org/abs/2405.01686
Author:
Munnangi, Monica, Feldman, Sergey, Wallace, Byron C, Amir, Silvio, Hope, Tom, Naik, Aakanksha
Despite their general capabilities, LLMs still struggle on biomedical NER tasks, which are difficult due to the presence of specialized terminology and lack of training data. In this work we set out to improve LLM performance on biomedical NER in lim…
External link:
http://arxiv.org/abs/2404.00152
The diversity across outputs generated by large language models shapes the perception of their quality and utility. Prompt leaks, templated answer structure, and canned responses across different interactions are readily noticed by people, but there…
External link:
http://arxiv.org/abs/2403.00553
Modern instruction-tuned models have become highly capable in text generation tasks such as summarization, and are expected to be released at a steady pace. In practice one may now wish to choose confidently, but with minimal effort, the best perform…
External link:
http://arxiv.org/abs/2402.18756
With the advent of large language models (LLMs), there has been growing interest in exploring their potential for medical applications. This research aims to investigate the ability of LLMs, specifically ChatGPT, in the context of pharmacovigilance e…
External link:
http://arxiv.org/abs/2402.15663
Author:
Krishna, Kundan, Ramprasad, Sanjana, Gupta, Prakhar, Wallace, Byron C., Lipton, Zachary C., Bigham, Jeffrey P.
LLMs can generate factually incorrect statements even when provided access to reference documents. Such errors can be dangerous in high-stakes applications (e.g., document-grounded QA for healthcare or finance). We present GenAudit -- a tool intended…
External link:
http://arxiv.org/abs/2402.12566