Search results - "Crabbé, Jonathan"

Report

Authors: van Breugel, Boris; Crabbé, Jonathan; Davis, Rob; van der Schaar, Mihaela

Tabular data is one of the most ubiquitous modalities, yet the literature on tabular generative foundation models is lagging far behind its text and vision counterparts. Creating such a model is hard, due to the heterogeneous feature spaces of differ

External link: http://arxiv.org/abs/2406.17673

Report

DAGnosis: Localized Identification of Data Inconsistencies using Structures

Authors: Huynh, Nicolas; Berrevoets, Jeroen; Seedat, Nabeel; Crabbé, Jonathan; Qian, Zhaozhi; van der Schaar, Mihaela

Identification and appropriate handling of inconsistencies in data at deployment time is crucial to reliably use machine learning models. While recent data-centric methods are able to identify such inconsistencies with respect to the training set, th

External link: http://arxiv.org/abs/2402.17599

Report

Time Series Diffusion in the Frequency Domain

Authors: Crabbé, Jonathan; Huynh, Nicolas; Stanczuk, Jan; van der Schaar, Mihaela

Fourier analysis has been an instrumental tool in the development of signal processing. This leads us to wonder whether this framework could similarly benefit generative modelling. In this paper, we explore this question through the scope of time ser

External link: http://arxiv.org/abs/2402.05933

Report

MatterGen: a generative model for inorganic materials design

The design of functional materials with desired properties is essential in driving technological advances in areas like energy storage, catalysis, and carbon capture. Generative models provide a new paradigm for materials design by directly generatin

External link: http://arxiv.org/abs/2312.03687

Report

TRIAGE: Characterizing and auditing training data for improved regression

Authors: Seedat, Nabeel; Crabbé, Jonathan; Qian, Zhaozhi; van der Schaar, Mihaela

Data quality is crucial for robust machine learning algorithms, with the recent interest in data-centric AI emphasizing the importance of training data characterization. However, current data characterization methods are largely focused on classifica

External link: http://arxiv.org/abs/2310.18970

Report

Robust multimodal models have outlier features and encode more concepts

Authors: Crabbé, Jonathan; Rodríguez, Pau; Shankar, Vaishaal; Zappella, Luca; Blaas, Arno

What distinguishes robust models from non-robust ones? This question has gained traction with the appearance of large-scale multimodal models, such as CLIP. These models have demonstrated unprecedented robustness with respect to natural distribution

External link: http://arxiv.org/abs/2310.13040

Report

Evaluating the Robustness of Interpretability Methods through Explanation Invariance and Equivariance

Authors: Crabbé, Jonathan; van der Schaar, Mihaela

Interpretability methods are valuable only if their explanations faithfully describe the explained model. In this work, we consider neural networks whose predictions are invariant under a specific symmetry group. This includes popular architectures,

External link: http://arxiv.org/abs/2304.06715

Report

TANGOS: Regularizing Tabular Neural Networks through Gradient Orthogonalization and Specialization

Authors: Jeffares, Alan; Liu, Tennison; Crabbé, Jonathan; Imrie, Fergus; van der Schaar, Mihaela

Despite their success with unstructured data, deep neural networks are not yet a panacea for structured tabular data. In the tabular domain, their efficiency crucially relies on various forms of regularization to prevent overfitting and provide stron

External link: http://arxiv.org/abs/2303.05506

Report

Joint Training of Deep Ensembles Fails Due to Learner Collusion

Authors: Jeffares, Alan; Liu, Tennison; Crabbé, Jonathan; van der Schaar, Mihaela

Ensembles of machine learning models have been well established as a powerful method of improving performance over a single model. Traditionally, ensembling algorithms train their base learners independently or sequentially with the goal of optimizin

External link: http://arxiv.org/abs/2301.11323

Report

Data-IQ: Characterizing subgroups with heterogeneous outcomes in tabular data

Authors: Seedat, Nabeel; Crabbé, Jonathan; Bica, Ioana; van der Schaar, Mihaela

High model performance, on average, can hide that models may systematically underperform on subgroups of the data. We consider the tabular setting, which surfaces the unique issue of outcome heterogeneity - this is prevalent in areas such as healthca

External link: http://arxiv.org/abs/2210.13043
