Výsledky vyhledávání

Report

Do RAG Systems Cover What Matters? Evaluating and Optimizing Responses with Sub-Question Coverage

Autor: Xie, Kaige, Laban, Philippe, Choubey, Prafulla Kumar, Xiong, Caiming, Wu, Chien-Sheng

Evaluating retrieval-augmented generation (RAG) systems remains challenging, particularly for open-ended questions that lack definitive answers and require coverage of multiple sub-topics. In this paper, we introduce a novel evaluation framework base

Externí odkaz: http://arxiv.org/abs/2410.15531

Zobrazit plný text záznamu

Report

Custom Non-Linear Model Predictive Control for Obstacle Avoidance in Indoor and Outdoor Environments

Autor: Laban, Lara, Wzorek, Mariusz, Rudol, Piotr, Persson, Tommy

Navigating complex environments requires Unmanned Aerial Vehicles (UAVs) and autonomous systems to perform trajectory tracking and obstacle avoidance in real-time. While many control strategies have effectively utilized linear approximations, address

Externí odkaz: http://arxiv.org/abs/2410.02732

Zobrazit plný text záznamu

Report

Can AI writing be salvaged? Mitigating Idiosyncrasies and Improving Human-AI Alignment in the Writing Process through Edits

Autor: Chakrabarty, Tuhin, Laban, Philippe, Wu, Chien-Sheng

LLM-based applications are helping people write, and LLM-generated text is making its way into social media, journalism, and our classrooms. However, the differences between LLM-generated and human-written text remain unclear. To explore this, we hir

Externí odkaz: http://arxiv.org/abs/2409.14509

Zobrazit plný text záznamu

Report

LEXI: Large Language Models Experimentation Interface

Autor: Laban, Guy, Laban, Tomer, Gunes, Hatice

The recent developments in Large Language Models (LLM), mark a significant moment in the research and development of social interactions with artificial agents. These agents are widely deployed in a variety of settings, with potential impact on users

Externí odkaz: http://arxiv.org/abs/2407.01488

Zobrazit plný text záznamu

Report

Past, Present, and Future: A Survey of The Evolution of Affective Robotics For Well-being

Autor: Spitale, Micol, Axelsson, Minja, Jeong, Sooyeon, Tuttosı, Paige, Stamatis, Caitlin A., Laban, Guy, Lim, Angelica, Gunes, Hatice

Recent research in affective robots has recognized their potential in supporting human well-being. Due to rapidly developing affective and artificial intelligence technologies, this field of research has undergone explosive expansion and advancement

Externí odkaz: http://arxiv.org/abs/2407.02957

Zobrazit plný text záznamu

Report

Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems

Autor: Laban, Philippe, Fabbri, Alexander R., Xiong, Caiming, Wu, Chien-Sheng

LLMs and RAG systems are now capable of handling millions of input tokens or more. However, evaluating the output quality of such systems on long-context tasks remains challenging, as tasks like Needle-in-a-Haystack lack complexity. In this work, we

Externí odkaz: http://arxiv.org/abs/2407.01370

Zobrazit plný text záznamu

Report

Prompt Leakage effect and defense strategies for multi-turn LLM interactions

Autor: Agarwal, Divyansh, Fabbri, Alexander R., Risher, Ben, Laban, Philippe, Joty, Shafiq, Wu, Chien-Sheng

Prompt leakage poses a compelling security and privacy threat in LLM applications. Leakage of system prompts may compromise intellectual property, and act as adversarial reconnaissance for an attacker. A systematic evaluation of prompt leakage threat

Externí odkaz: http://arxiv.org/abs/2404.16251

Zobrazit plný text záznamu

Report

MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents

Autor: Tang, Liyan, Laban, Philippe, Durrett, Greg

Recognizing if LLM output can be grounded in evidence is central to many tasks in NLP: retrieval-augmented generation, summarization, document-grounded dialogue, and more. Current approaches to this kind of fact-checking are based on verifying each p

Externí odkaz: http://arxiv.org/abs/2404.10774

Zobrazit plný text záznamu

Report

A Longitudinal Study of Child Wellbeing Assessment via Online Interactions with a Social Robots

Autor: Abbasi, Nida Itrat, Laban, Guy, Ford, Tamsin, Jones, Peter B., Gunes, Hatice

Socially Assistive Robots are studied in different Child-Robot Interaction settings. However, logistical constraints limit accessibility, particularly affecting timely support for mental wellbeing. In this work, we have investigated whether online in

Externí odkaz: http://arxiv.org/abs/2404.10593

Zobrazit plný text záznamu

Report

Robotising Psychometrics: Validating Wellbeing Assessment Tools in Child-Robot Interactions

Autor: Abbasi, Nida Itrat, Laban, Guy, Ford, Tamsin, Jones, Peter B, Gunes, Hatice

The interdisciplinary nature of Child-Robot Interaction (CRI) fosters incorporating measures and methodologies from many established domains. However, when employing CRI approaches to sensitive avenues of health and wellbeing, caution is critical in

Externí odkaz: http://arxiv.org/abs/2402.18325

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání