Zobrazeno 1 - 10
of 42
pro vyhledávání: '"Perry, Patrick O."'
We consider a particular instance of a common problem in recommender systems: using a database of book reviews to inform user-targeted recommendations. In our dataset, books are categorized into genres and sub-genres. To exploit this nested taxonomy,
Externí odkaz:
http://arxiv.org/abs/1806.02321
Autor:
Perry, Patrick O., Benoit, Kenneth
Probabilistic methods for classifying text form a rich tradition in machine learning and natural language processing. For many important problems, however, class prediction is uninteresting because the class is known, and instead the focus shifts to
Externí odkaz:
http://arxiv.org/abs/1710.08963
Autor:
Fu, Wei, Perry, Patrick O.
Many clustering methods, including k-means, require the user to specify the number of clusters as an input parameter. A variety of methods have been devised to choose the number of clusters automatically, but they often rely on strong modeling assump
Externí odkaz:
http://arxiv.org/abs/1702.02658
Publikováno v:
Political Analysis, 2020 Jul 01. 28(3), 412-434.
Externí odkaz:
https://www.jstor.org/stable/27116018
Autor:
Perry, Patrick O.
Hierarchical models allow for heterogeneous behaviours in a population while simultaneously borrowing estimation strength across all subpopulations. Unfortunately, existing likelihood-based methods for fitting hierarchical models have high computatio
Externí odkaz:
http://arxiv.org/abs/1504.04941
Publikováno v:
The Annals of Applied Statistics, 2019 Dec 01. 13(4), 2260-2288.
Externí odkaz:
https://www.jstor.org/stable/26866723
Autor:
Perry, Patrick O., Pillai, Natesh S.
In the AGEMAP genomics study, researchers were interested in detecting genes related to age in a variety of tissue types. After not finding many age-related genes in some of the analyzed tissue types, the study was criticized for having low power. It
Externí odkaz:
http://arxiv.org/abs/1310.7269
Autor:
Flynn, Cheryl J., Perry, Patrick O.
Publikováno v:
Electron. J. Statist., Volume 14, Number 1 (2020), 731-768
Biclustering, the process of simultaneously clustering the rows and columns of a data matrix, is a popular and effective tool for finding structure in a high-dimensional dataset. Many biclustering procedures appear to work well in practice, but most
Externí odkaz:
http://arxiv.org/abs/1206.6927
Autor:
Perry, Patrick O., Wolfe, Patrick J.
The analysis of datasets taking the form of simple, undirected graphs continues to gain in importance across a variety of disciplines. Two choices of null model, the logistic-linear model and the implicit log-linear model, have come into common use f
Externí odkaz:
http://arxiv.org/abs/1201.5871
Autor:
Perry, Patrick O., Mahoney, Michael W.
Recently, Mahoney and Orecchia demonstrated that popular diffusion-based procedures to compute a quick \emph{approximation} to the first nontrivial eigenvector of a data graph Laplacian \emph{exactly} solve certain regularized Semi-Definite Programs
Externí odkaz:
http://arxiv.org/abs/1110.1757