Showing 1 - 10 of 36 for the search: '"Kristiadi, Agustinus"'
Efficiently learning a sequence of related tasks, such as in continual learning, poses a significant challenge for neural nets due to the delicate trade-off between catastrophic forgetting and loss of plasticity. We address this challenge with a…
External link:
http://arxiv.org/abs/2410.06800
Author:
Grosse, Julia, Wu, Ruotian, Rashid, Ahmad, Hennig, Philipp, Poupart, Pascal, Kristiadi, Agustinus
Tree search algorithms such as greedy and beam search are the standard when it comes to finding maximum-likelihood sequences in the decoding processes of large language models (LLMs). However, they are myopic since they do not take the complete… [A minimal sketch of greedy vs. beam decoding follows below.]
External link:
http://arxiv.org/abs/2407.03951
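As a companion to the abstract above, here is a minimal, self-contained sketch of greedy vs. beam-search decoding. The "language model" is a hypothetical fixed table of next-token log-probabilities standing in for a real LLM's scores; the table is chosen so the example also exposes greedy decoding's myopia.

# Toy illustration of greedy vs. beam-search decoding. The "LM" is a
# fixed table of next-token log-probabilities keyed on the last token;
# a real LLM would supply these scores instead.
import math

def next_logprobs(prefix):
    table = {
        None:  {"a": math.log(0.6), "b": math.log(0.35), "<eos>": math.log(0.05)},
        "a":   {"a": math.log(0.1), "b": math.log(0.5),  "<eos>": math.log(0.4)},
        "b":   {"a": math.log(0.3), "b": math.log(0.1),  "<eos>": math.log(0.6)},
    }
    return table[prefix[-1] if prefix else None]

def greedy(max_len=5):
    seq, score = [], 0.0
    for _ in range(max_len):
        lp = next_logprobs(seq)
        tok = max(lp, key=lp.get)            # commits to the locally best token
        score += lp[tok]
        seq.append(tok)
        if tok == "<eos>":
            break
    return seq, score

def beam_search(k=2, max_len=5):
    beams = [([], 0.0)]                      # (sequence, cumulative log-prob)
    for _ in range(max_len):
        candidates = []
        for seq, score in beams:
            if seq and seq[-1] == "<eos>":   # finished beams carry over
                candidates.append((seq, score))
                continue
            for tok, l in next_logprobs(seq).items():
                candidates.append((seq + [tok], score + l))
        beams = sorted(candidates, key=lambda c: c[1], reverse=True)[:k]
        if all(s and s[-1] == "<eos>" for s, _ in beams):
            break
    return beams[0]

print("greedy:", greedy())                   # ['a', 'b', '<eos>'], prob 0.18
print("beam:  ", beam_search())              # ['a', '<eos>'], prob 0.24

On this table, greedy's locally optimal choices yield a sequence with probability 0.18, while beam search with k=2 finds one with probability 0.24, which is exactly the kind of shortsightedness the abstract refers to.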
Large language models (LLMs) can be significantly improved by aligning them to human preferences -- the so-called reinforcement learning from human feedback (RLHF). However, the cost of fine-tuning an LLM is prohibitive for many users. Due to their…
External link:
http://arxiv.org/abs/2406.07780
Author:
Kristiadi, Agustinus, Strieth-Kalthoff, Felix, Subramanian, Sriram Ganapathi, Fortuin, Vincent, Poupart, Pascal, Pleiss, Geoff
Bayesian optimization (BO) is an integral part of automated scientific discovery -- the so-called self-driving lab -- where human inputs are ideally minimal or at least non-blocking. However, scientists often have strong intuition, and thus human… [A minimal sketch of the underlying BO loop follows below.]
External link:
http://arxiv.org/abs/2406.06459
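For context on the loop the abstract builds on, here is a bare-bones Bayesian-optimization sketch: a numpy-only Gaussian-process surrogate with an RBF kernel and an expected-improvement acquisition, maximizing a 1D toy objective. The objective, kernel length-scale, and noise values are illustrative assumptions, and this is the vanilla loop only; the human-feedback mechanism studied in the paper is not modeled.

# Minimal BO loop: GP surrogate + expected-improvement acquisition.
import numpy as np
from math import erf, sqrt, pi

def objective(x):                            # "unknown" function to maximize
    return -np.sin(3 * x) - x**2 + 0.7 * x

def rbf(a, b, ls=0.3):                       # squared-exponential kernel
    return np.exp(-0.5 * (a[:, None] - b[None, :])**2 / ls**2)

def gp_posterior(X, y, Xs, noise=1e-6):
    L = np.linalg.cholesky(rbf(X, X) + noise * np.eye(len(X)))
    Ks = rbf(X, Xs)
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))
    v = np.linalg.solve(L, Ks)
    mu = Ks.T @ alpha                        # posterior mean on the grid
    var = 1.0 - np.sum(v**2, axis=0)         # k(x,x) = 1 for this kernel
    return mu, np.maximum(var, 1e-12)

def expected_improvement(mu, var, best):     # EI for maximization
    sd = np.sqrt(var)
    z = (mu - best) / sd
    Phi = 0.5 * (1 + np.vectorize(erf)(z / sqrt(2)))
    phi = np.exp(-0.5 * z**2) / sqrt(2 * pi)
    return (mu - best) * Phi + sd * phi

rng = np.random.default_rng(0)
X = rng.uniform(-1, 2, size=3)               # initial design
y = objective(X)
grid = np.linspace(-1, 2, 200)               # candidate pool

for _ in range(10):                          # BO iterations
    mu, var = gp_posterior(X, y, grid)
    x_next = grid[np.argmax(expected_improvement(mu, var, y.max()))]
    X, y = np.append(X, x_next), np.append(y, objective(x_next))

print("best x and f(x):", X[np.argmax(y)], y.max())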
Author:
Kristiadi, Agustinus, Strieth-Kalthoff, Felix, Skreta, Marta, Poupart, Pascal, Aspuru-Guzik, Alán, Pleiss, Geoff
Automation is one of the cornerstones of contemporary material discovery. Bayesian optimization (BO) is an essential part of such workflows, enabling scientists to leverage prior domain knowledge into efficient exploration of a large molecular space.
External link:
http://arxiv.org/abs/2402.05015
Author:
Papamarkou, Theodore, Skoularidou, Maria, Palla, Konstantina, Aitchison, Laurence, Arbel, Julyan, Dunson, David, Filippone, Maurizio, Fortuin, Vincent, Hennig, Philipp, Hernández-Lobato, José Miguel, Hubin, Aliaksandr, Immer, Alexander, Karaletsos, Theofanis, Khan, Mohammad Emtiyaz, Kristiadi, Agustinus, Li, Yingzhen, Mandt, Stephan, Nemeth, Christopher, Osborne, Michael A., Rudner, Tim G. J., Rügamer, David, Teh, Yee Whye, Welling, Max, Wilson, Andrew Gordon, Zhang, Ruqi
In the current landscape of deep learning research, there is a predominant emphasis on achieving high predictive accuracy in supervised tasks involving large image and language datasets. However, a broader perspective reveals a multitude of overlooked…
External link:
http://arxiv.org/abs/2402.00809
Author:
Lin, Wu, Dangel, Felix, Eschenhagen, Runa, Neklyudov, Kirill, Kristiadi, Agustinus, Turner, Richard E., Makhzani, Alireza
Second-order methods such as KFAC can be useful for neural net training. However, they are often memory-inefficient, since their preconditioning Kronecker factors are dense, and numerically unstable in low precision, as they require matrix inversion or… [A sketch of KFAC-style preconditioning follows below.]
External link:
http://arxiv.org/abs/2312.05705
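To ground the abstract's point about dense Kronecker factors and matrix inversion, here is a bare-bones sketch of KFAC-style preconditioning for a single linear layer. Shapes and damping are illustrative assumptions, and this shows the standard dense-inverse formulation whose memory and numerical cost the abstract alludes to, not the method proposed in the paper.

# KFAC-style preconditioning for one linear layer y = W a.
# The gradient dW is preconditioned with two dense Kronecker factors:
# A = E[a a^T] (input covariance) and G = E[g g^T] (output-gradient
# covariance). Inverting these dense factors is the cost in question.
import numpy as np

rng = np.random.default_rng(0)
d_in, d_out, batch = 64, 32, 128

a = rng.normal(size=(batch, d_in))           # layer inputs
g = rng.normal(size=(batch, d_out))          # backpropagated output gradients
dW = g.T @ a / batch                         # plain gradient of the layer

A = a.T @ a / batch                          # Kronecker factor, (d_in, d_in)
G = g.T @ g / batch                          # Kronecker factor, (d_out, d_out)

damping = 1e-3                               # keeps the inversions well-posed
A_inv = np.linalg.inv(A + damping * np.eye(d_in))
G_inv = np.linalg.inv(G + damping * np.eye(d_out))

# The Kronecker-factored inverse applied to vec(dW) rearranges to:
precond_dW = G_inv @ dW @ A_inv
print(precond_dW.shape)                      # (d_out, d_in), same as dW

The two factors are only d_in x d_in and d_out x d_out rather than one (d_in * d_out)-squared matrix, but they are still dense and must be inverted, which is precisely the step that becomes fragile in low precision.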
Discriminatively trained, deterministic neural networks are the de facto choice for classification problems. However, even though they achieve state-of-the-art results on in-domain test sets, they tend to be overconfident on out-of-distribution (OOD)… [A small demonstration of this overconfidence follows below.]
External link:
http://arxiv.org/abs/2311.03683
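As a quick illustration of the overconfidence phenomenon with a toy linear softmax classifier (an assumption for exposition, not the paper's setup): logits grow linearly with the input norm, so the predicted confidence saturates to 1 on inputs arbitrarily far from the training data.

# Overconfidence far from the data: with linear logits, scaling an input
# drives the softmax toward a one-hot, maximally confident prediction,
# regardless of whether the point is in-domain.
import numpy as np

rng = np.random.default_rng(0)
W = rng.normal(size=(3, 2))                  # stand-in for trained weights

def confidence(x):                           # max softmax probability
    logits = W @ x
    p = np.exp(logits - logits.max())
    return (p / p.sum()).max()

x = rng.normal(size=2)                       # an arbitrary input direction
for scale in (1, 10, 100):
    print(scale, confidence(scale * x))      # confidence -> 1 as scale grows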
The neural tangent kernel (NTK) has garnered significant attention as a theoretical framework for describing the behavior of large-scale neural networks. Kernel methods are theoretically well understood and as a result enjoy algorithmic benefits… [An empirical-NTK sketch follows below.]
External link:
http://arxiv.org/abs/2310.00137
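To make the empirical NTK concrete: for a network f(x; theta), it is the kernel k(x, x') = grad_theta f(x) . grad_theta f(x'). Below is a hand-rolled sketch for a one-hidden-layer tanh MLP with scalar output; sizes, initialization, and nonlinearity are arbitrary illustrative choices.

# Empirical NTK of f(x) = w2 . tanh(W1 x + b1) + b2 (scalar output):
# k(x, x') = <grad_theta f(x), grad_theta f(x')>.
import numpy as np

rng = np.random.default_rng(0)
d, h = 3, 16
W1 = rng.normal(size=(h, d)) / np.sqrt(d)
b1 = np.zeros(h)
w2 = rng.normal(size=h) / np.sqrt(h)

def param_grad(x):                           # grad of f(x) w.r.t. all params
    a = np.tanh(W1 @ x + b1)
    da = 1 - a**2                            # tanh'
    return np.concatenate([
        np.outer(w2 * da, x).ravel(),        # df/dW1
        w2 * da,                             # df/db1
        a,                                   # df/dw2
        [1.0],                               # df/db2
    ])

def ntk(x, xp):                              # one empirical-NTK entry
    return param_grad(x) @ param_grad(xp)

x, xp = rng.normal(size=d), rng.normal(size=d)
print(ntk(x, xp), ntk(x, x))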
The linearized-Laplace approximation (LLA) has been shown to be effective and efficient in constructing Bayesian neural networks. It is theoretically compelling, since it can be seen as a Gaussian process posterior with the mean function given by the… [A linearized-Laplace sketch follows below.]
External link:
http://arxiv.org/abs/2304.08309
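Finally, a compressed sketch of the linearized-Laplace predictive for a tiny regression net, assuming a Gaussian likelihood and an isotropic Gaussian prior; the "MAP" weights are random stand-ins and all sizes are illustrative. It reuses the parameter-Jacobian idea from the NTK sketch above: the predictive mean is the network's own output, and the variance comes from the linearized model.

# Linearized-Laplace predictive for a tiny regression net:
# mean(x*) = f(x*), var(x*) = J(x*) Sigma J(x*)^T,
# with Sigma = (J^T J / s2 + prior_prec * I)^{-1} over the training set.
import numpy as np

rng = np.random.default_rng(1)
d, h = 2, 8
W1 = rng.normal(size=(h, d)); b1 = np.zeros(h)
w2 = rng.normal(size=h);      b2 = 0.0

def f(x):                                    # the "MAP" network, scalar output
    return w2 @ np.tanh(W1 @ x + b1) + b2

def jac(x):                                  # df/dtheta, flattened
    a = np.tanh(W1 @ x + b1)
    da = 1 - a**2
    return np.concatenate([np.outer(w2 * da, x).ravel(), w2 * da, a, [1.0]])

X = rng.normal(size=(20, d))                 # training inputs
J = np.stack([jac(x) for x in X])            # (n, n_params)
s2, prior_prec = 0.1, 1.0                    # noise variance, prior precision
Sigma = np.linalg.inv(J.T @ J / s2 + prior_prec * np.eye(J.shape[1]))

x_star = rng.normal(size=d)
j = jac(x_star)
print("mean:", f(x_star))                    # GP mean = MAP network output
print("var: ", j @ Sigma @ j)                # from the linearized model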