Instance Based Approximations to Profile Maximum Likelihood
Authors: | Anari, N., Charikar, M., Shiragur, K., Sidford, A. |
---|---|
Year of publication: | 2020 |
Subject: |
FOS: Computer and information sciences
Computer Science - Machine Learning; Statistics - Machine Learning; Computer Science - Information Theory; Information Theory (cs.IT); Computer Science - Data Structures and Algorithms; Data Structures and Algorithms (cs.DS); Machine Learning (stat.ML); Statistics - Computation; Computation (stat.CO); Machine Learning (cs.LG) |
Source: | Scopus-Elsevier |
DOI: | 10.48550/arxiv.2011.02761 |
Description: | In this paper we provide a new efficient algorithm for approximately computing the profile maximum likelihood (PML) distribution, a prominent quantity in symmetric property estimation. Our algorithm matches the previous best-known efficient algorithms for computing approximate PML distributions and improves on them when the number of distinct observed frequencies in the given instance is small. We achieve this result by exploiting new sparsity structure in approximate PML distributions and by providing a new matrix rounding algorithm, which is of independent interest. Leveraging this result, we obtain the first provable, computationally efficient implementation of PseudoPML, a general framework for estimating a broad class of symmetric properties. Additionally, we obtain efficient PML-based estimators for distributions with small profile entropy, a natural instance-based complexity measure. Further, we provide a simpler and more practical PseudoPML implementation that matches the best-known theoretical guarantees of such an estimator, and we evaluate this method empirically. Comment: Accepted at the Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS 2020) |
Database: | OpenAIRE |
External link: |
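
The description refers to the profile of a sample and to the number of distinct observed frequencies. As a rough illustration of those two quantities only, the following minimal sketch computes them for a small sample. It uses the standard definition of a profile (for each frequency j, the number of distinct symbols observed exactly j times). The function names `profile` and `num_distinct_frequencies` are hypothetical and do not come from the paper, and the sketch does not implement the paper's PML approximation or rounding algorithm.

```python
from collections import Counter


def profile(sample):
    """Return the profile of a sample as a dict.

    Keys are observed frequencies j; values are the number of distinct
    symbols that appear exactly j times in the sample.
    (Hypothetical helper, not taken from the paper.)
    """
    symbol_counts = Counter(sample)            # symbol -> frequency
    return dict(Counter(symbol_counts.values()))  # frequency -> number of symbols


def num_distinct_frequencies(sample):
    """Number of distinct frequency values observed in the sample."""
    return len(profile(sample))


# Example: in "abracadabra", a occurs 5 times, b and r twice, c and d once.
sample = list("abracadabra")
phi = profile(sample)
k = num_distinct_frequencies(sample)
print(phi, k)
```

The profile discards symbol labels, which is why it is the natural statistic for symmetric properties: any relabelling of the symbols in `sample` leaves `phi` unchanged.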