Showing 1 - 10 of 235 results for search: '"Koller, Alexander"'
Author:
Hartmann, Mareike, Koller, Alexander
Goal-directed interactive agents, which autonomously complete tasks through interactions with their environment, can assist humans in various domains of their daily lives. Recent advances in large language models (LLMs) have led to a surge of new, more an…
External link:
http://arxiv.org/abs/2409.18538
We introduce Modelizer - a novel framework that, given a black-box program, learns a _model from its input/output behavior_ using _neural machine translation_. The resulting model _mocks_ the original program: given an input, the model predicts the o…
External link:
http://arxiv.org/abs/2407.08597
Models need appropriate inductive biases to effectively learn from small amounts of data and generalize systematically outside of the training distribution. While Transformers are highly versatile and powerful, they can still benefit from enhanced st…
External link:
http://arxiv.org/abs/2407.04543
Published in:
EMNLP 2024
Discourse Representation Theory (DRT) distinguishes itself from other semantic representation frameworks by its ability to model complex semantic and discourse phenomena through structural nesting and variable binding. While seq2seq models hold the s…
External link:
http://arxiv.org/abs/2407.01899
Author:
Bavaresco, Anna, Bernardi, Raffaella, Bertolazzi, Leonardo, Elliott, Desmond, Fernández, Raquel, Gatt, Albert, Ghaleb, Esam, Giulianelli, Mario, Hanna, Michael, Koller, Alexander, Martins, André F. T., Mondorf, Philipp, Neplenbroek, Vera, Pezzelle, Sandro, Plank, Barbara, Schlangen, David, Suglia, Alessandro, Surikuchi, Aditya K, Takmaz, Ece, Testoni, Alberto
There is an increasing trend towards evaluating NLP models with LLM-generated judgments instead of human judgments. In the absence of a comparison against human data, this raises concerns about the validity of these evaluations; in case they are cond…
External link:
http://arxiv.org/abs/2406.18403
We present a method for rewriting an input sentence to match specific values of nontrivial linguistic features, such as dependency depth. In contrast to earlier work, our method uses in-context learning rather than finetuning, making it applicable in…
External link:
http://arxiv.org/abs/2406.11338
Collaboration is an integral part of human dialogue. Typical task-oriented dialogue games assign asymmetric roles to the participants, which limits their ability to elicit naturalistic role-taking in collaboration and its negotiation. We present a no…
External link:
http://arxiv.org/abs/2406.08202
LLMs are increasingly being used for planning-style tasks, but their capabilities for planning and reasoning are poorly understood. We present AutoPlanBench, a novel method for automatically converting planning benchmarks written in PDDL into textual…
External link:
http://arxiv.org/abs/2311.09830
Author:
Yao, Yuekun, Koller, Alexander
The ability to predict an NLP model's accuracy on unseen, potentially out-of-distribution data is a prerequisite for trustworthiness. We present a novel model that establishes upper and lower bounds on the accuracy, without requiring gold labels for…
External link:
http://arxiv.org/abs/2311.09422
Author:
Prasad, Archiki, Koller, Alexander, Hartmann, Mareike, Clark, Peter, Sabharwal, Ashish, Bansal, Mohit, Khot, Tushar
Large Language Models (LLMs) are increasingly being used for interactive decision-making tasks requiring planning and adapting to the environment. Recent works employ LLMs-as-agents in broadly two ways: iteratively determining the next action (iterat…
External link:
http://arxiv.org/abs/2311.05772