Showing 1 - 10 of 2,440 results for search: '"Golebiowski A"'
Author:
Schneider, Lennart, Wistuba, Martin, Klein, Aaron, Golebiowski, Jacek, Zappella, Giovanni, Merra, Felice Antonio
Optimal prompt selection is crucial for maximizing large language model (LLM) performance on downstream tasks. As the most powerful models are proprietary and can only be invoked via an API, users often manually refine prompts in a black-box setting
External link:
http://arxiv.org/abs/2412.07820
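The snippet above frames prompt refinement as black-box optimization against an API. As a rough illustration of that framing only (not the method of the cited paper), the sketch below scores a handful of candidate prompts on a small validation set and keeps the best one; `call_llm` and `exact_match` are hypothetical placeholders for an API call and a task metric.

```python
# Minimal black-box prompt-selection sketch; all names are placeholders.
from typing import Callable, Sequence, Tuple


def call_llm(prompt: str, question: str) -> str:
    """Placeholder for a proprietary LLM API call (returns a canned answer here)."""
    return "42" if "answer" in prompt.lower() else ""


def exact_match(prediction: str, reference: str) -> float:
    """Toy task metric: 1.0 on exact string match, else 0.0."""
    return float(prediction.strip() == reference.strip())


def select_prompt(
    candidates: Sequence[str],
    dev_set: Sequence[Tuple[str, str]],
    llm: Callable[[str, str], str] = call_llm,
) -> Tuple[str, float]:
    """Score each candidate prompt on the dev set and return the best (prompt, score)."""
    best_prompt, best_score = candidates[0], float("-inf")
    for prompt in candidates:
        score = sum(exact_match(llm(prompt, q), a) for q, a in dev_set) / len(dev_set)
        if score > best_score:
            best_prompt, best_score = prompt, score
    return best_prompt, best_score


if __name__ == "__main__":
    prompts = ["Answer the question concisely:", "Respond:"]
    dev = [("What is 6 * 7?", "42")]
    print(select_prompt(prompts, dev))
```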
Author:
Gołębiowski, Krzysztof
The main aim of this article is to prove that for any continuous function $f \colon X \to X$, where $X$ is metrizable (or, more generally, for any family $\mathcal{F}$ of such functions, satisfying an additional condition), there exists a compatible
External link:
http://arxiv.org/abs/2412.03711
Techniques for knowledge graph (KG) enrichment have become increasingly crucial for commercial applications that rely on evolving product catalogues. However, because of the huge search space of potential enrichment, predictions from KG completion (KG
External link:
http://arxiv.org/abs/2406.07098
Author:
Kästner, Linh, Shcherbyna, Volodymyir, Zeng, Huajian, Le, Tuan Anh, Schreff, Maximilian Ho-Kyoung, Osmaev, Halid, Tran, Nam Truong, Diaz, Diego, Golebiowski, Jan, Soh, Harold, Lambrecht, Jens
Published in:
Robotics Science and Systems 2024, Delft Netherlands
Building upon our previous contributions, this paper introduces Arena 3.0, an extension of Arena-Bench, Arena 1.0, and Arena 2.0. Arena 3.0 is a comprehensive software stack containing multiple modules and simulation environments focusing on the deve
External link:
http://arxiv.org/abs/2406.00837
Pre-trained language models (PLMs), for example BERT or RoBERTa, mark the state of the art for natural language understanding tasks when fine-tuned on labeled data. However, their large size poses challenges in deploying them for inference in real-worl
External link:
http://arxiv.org/abs/2405.02267
Large language models (LLMs) encode vast amounts of world knowledge. However, since these models are trained on large swaths of internet data, they are at risk of inordinately capturing information about dominant groups. This imbalance can propagate
External link:
http://arxiv.org/abs/2310.14777
Off-policy evaluation (OPE) methods allow us to compute the expected reward of a policy by using the logged data collected by a different policy. OPE is a viable alternative to running expensive online A/B tests: it can speed up the development of ne
External link:
http://arxiv.org/abs/2305.03954
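The snippet above describes the core idea of off-policy evaluation: estimating a target policy's expected reward from data logged under a different policy. As a generic illustration of that idea only (not the estimator studied in the cited paper), the sketch below uses the standard inverse propensity scoring (IPS) estimator; variable names are illustrative.

```python
# Minimal IPS off-policy evaluation sketch.
import numpy as np


def ips_estimate(
    rewards: np.ndarray,               # rewards observed under the logging policy
    logging_propensities: np.ndarray,  # pi_b(a_i | x_i) for the logged actions
    target_propensities: np.ndarray,   # pi_e(a_i | x_i) for the same actions
) -> float:
    """Estimate the expected reward of the target policy from logged data."""
    weights = target_propensities / logging_propensities
    return float(np.mean(weights * rewards))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n = 10_000
    logging_p = np.full(n, 0.5)                  # uniform logging policy over 2 actions
    actions = rng.integers(0, 2, size=n)
    rewards = (actions == 1).astype(float)       # action 1 always pays reward 1
    target_p = np.where(actions == 1, 0.9, 0.1)  # target policy prefers action 1
    print(ips_estimate(rewards, logging_p, target_p))  # ~0.9, the target policy's true value
```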
Many state-of-the-art hyperparameter optimization (HPO) algorithms rely on model-based optimizers that learn surrogate models of the target function to guide the search. Gaussian processes are the de facto surrogate model due to their ability to capt
External link:
http://arxiv.org/abs/2305.03623
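The snippet above refers to model-based HPO, where a surrogate model of the target function guides the search. Below is a minimal, generic sketch of that loop with a Gaussian process surrogate and an expected-improvement acquisition function, assuming scikit-learn and scipy are available; it is not the algorithm from the cited paper.

```python
# Minimal GP-surrogate Bayesian optimization sketch over a toy 1-d "hyperparameter".
import numpy as np
from scipy.stats import norm
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import Matern


def objective(x: np.ndarray) -> np.ndarray:
    """Toy target function standing in for a validation loss."""
    return np.sin(3 * x) + 0.1 * x ** 2


def expected_improvement(mu, sigma, best, xi=0.01):
    """EI for minimization, given the posterior mean/std and the incumbent best value."""
    sigma = np.maximum(sigma, 1e-9)
    z = (best - mu - xi) / sigma
    return (best - mu - xi) * norm.cdf(z) + sigma * norm.pdf(z)


rng = np.random.default_rng(0)
grid = np.linspace(-3, 3, 400).reshape(-1, 1)  # candidate hyperparameter values
X = rng.uniform(-3, 3, size=(4, 1))            # initial random evaluations
y = objective(X).ravel()

for _ in range(15):
    gp = GaussianProcessRegressor(kernel=Matern(nu=2.5), normalize_y=True)
    gp.fit(X, y)                                # fit the surrogate model
    mu, sigma = gp.predict(grid, return_std=True)
    x_next = grid[np.argmax(expected_improvement(mu, sigma, y.min()))]
    X = np.vstack([X, x_next])                  # evaluate the most promising point next
    y = np.append(y, objective(x_next))

print("best x:", X[np.argmin(y)].item(), "best value:", y.min())
```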
Author:
Wang, Cheng, Golebiowski, Jacek
Model miscalibration has been frequently identified in modern deep neural networks. Recent work aims to improve model calibration directly through a differentiable calibration proxy. However, the calibration produced is often biased due to the binnin
External link:
http://arxiv.org/abs/2303.15057
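The snippet above mentions that binning introduces bias into calibration measures and proxies. For context only, the sketch below shows the standard binning-based expected calibration error (ECE), which makes the role of the bins explicit; it is a generic illustration, not the differentiable proxy proposed in the paper.

```python
# Minimal binning-based ECE sketch.
import numpy as np


def expected_calibration_error(confidences, correct, n_bins: int = 15) -> float:
    """ECE with equal-width confidence bins; `correct` is a 0/1 array."""
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if mask.any():
            gap = abs(confidences[mask].mean() - correct[mask].mean())
            ece += mask.mean() * gap  # weight each bin by its share of the samples
    return float(ece)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    conf = rng.uniform(0.5, 1.0, size=5000)
    correct = rng.random(5000) < conf ** 2  # a systematically overconfident model
    print(expected_calibration_error(conf, correct))
```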
Published in:
The Ocular Surface, October 2024, 34:381-391