Search results

Report

Interactive Machine Teaching by Labeling Rules and Instances

Authors: Karamanolakis, Giannis; Hsu, Daniel; Gravano, Luis

Weakly supervised learning aims to reduce the cost of labeling data by using expert-designed labeling rules. However, existing methods require experts to design effective rules in a single shot, which is difficult in the absence of proper guidance an

External link: http://arxiv.org/abs/2409.05199

Report

One-layer transformers fail to solve the induction heads task

Authors: Sanford, Clayton; Hsu, Daniel; Telgarsky, Matus

A simple communication complexity argument proves that no one-layer transformer can solve the induction heads task unless its size is exponentially larger than the size sufficient for a two-layer transformer.

External link: http://arxiv.org/abs/2408.14332

Report

Transformers Provably Learn Sparse Token Selection While Fully-Connected Nets Cannot

Authors: Wang, Zixuan; Wei, Stanley; Hsu, Daniel; Lee, Jason D.

The transformer architecture has prevailed in various deep learning settings due to its exceptional capabilities to select and compose structural information. Motivated by these capabilities, Sanford et al. proposed the sparse token selection task, i

External link: http://arxiv.org/abs/2406.06893

Report

Group-wise oracle-efficient algorithms for online multi-group learning

Authors: Deng, Samuel; Hsu, Daniel; Liu, Jingwen

We study the problem of online multi-group learning, a learning model in which an online learner must simultaneously achieve small prediction regret on a large collection of (possibly overlapping) subsequences corresponding to a family of groups. Gro

External link: http://arxiv.org/abs/2406.05287

Report

Seasonality Patterns in 311-Reported Foodborne Illness Cases and Machine Learning-Identified Indications of Foodborne Illnesses from Yelp Reviews, New York City, 2022-2023

Authors: Shaveet, Eden; Su, Crystal; Hsu, Daniel; Gravano, Luis

Restaurants are critical venues at which to investigate foodborne illness outbreaks due to shared sourcing, preparation, and distribution of foods. Formal channels to report illness after food consumption, such as 311, New York City's non-emergency m

External link: http://arxiv.org/abs/2405.06138

Report

Transformers, parallel computation, and logarithmic depth

Authors: Sanford, Clayton; Hsu, Daniel; Telgarsky, Matus

We show that a constant number of self-attention layers can efficiently simulate, and be simulated by, a constant number of communication rounds of Massively Parallel Computation. As a consequence, we show that logarithmic depth is sufficient for tra

External link: http://arxiv.org/abs/2402.09268

Report

Multi-group Learning for Hierarchical Groups

Authors: Deng, Samuel; Hsu, Daniel

The multi-group learning model formalizes the learning scenario in which a single predictor must generalize well on multiple, possibly overlapping subgroups of interest. We extend the study of multi-group learning to the natural case where the groups

External link: http://arxiv.org/abs/2402.00258

Report

Distribution-Specific Auditing For Subgroup Fairness

Authors: Hsu, Daniel; Huang, Jizhou; Juba, Brendan

We study the problem of auditing classifiers with the notion of statistical subgroup fairness. Kearns et al. (2018) have shown that the problem of auditing combinatorial subgroup fairness is as hard as agnostic learning. Essentially all work on remed

External link: http://arxiv.org/abs/2401.16439

Report

Efficient Estimation of the Central Mean Subspace via Smoothed Gradient Outer Products

Authors: Yuan, Gan; Xu, Mingyue; Kpotufe, Samory; Hsu, Daniel

We consider the problem of sufficient dimension reduction (SDR) for multi-index models. The estimators of the central mean subspace in prior works either have slow (non-parametric) convergence rates, or rely on stringent distributional conditions (e.

External link: http://arxiv.org/abs/2312.15469

Report

On the sample complexity of parameter estimation in logistic regression with normal design

Authors: Hsu, Daniel; Mazumdar, Arya

The logistic regression model is one of the most popular data generation models in noisy binary classification problems. In this work, we study the sample complexity of estimating the parameters of the logistic regression model up to a given $\ell_2$

External link: http://arxiv.org/abs/2307.04191
