Zobrazeno 1 - 10
of 198
pro vyhledávání: '"Witten, Daniela M."'
Autor:
Chen, Yiqun T., Witten, Daniela M.
We consider the problem of testing for a difference in means between clusters of observations identified via k-means clustering. In this setting, classical hypothesis tests lead to an inflated Type I error rate. To overcome this problem, we take a se
Externí odkaz:
http://arxiv.org/abs/2203.15267
The graph fused lasso -- which includes as a special case the one-dimensional fused lasso -- is widely used to reconstruct signals that are piecewise constant on a graph, meaning that nodes connected by an edge tend to have identical values. We consi
Externí odkaz:
http://arxiv.org/abs/2109.10451
We consider conducting inference on the output of the Classification and Regression Tree (CART) [Breiman et al., 1984] algorithm. A naive approach to inference that does not account for the fact that the tree was estimated from the data will not achi
Externí odkaz:
http://arxiv.org/abs/2106.07816
In recent years, a number of methods have been proposed to estimate the times at which a neuron spikes on the basis of calcium imaging data. However, quantifying the uncertainty associated with these estimated spikes remains an open problem. We consi
Externí odkaz:
http://arxiv.org/abs/2103.07818
The fused lasso, also known as total-variation denoising, is a locally-adaptive function estimator over a regular grid of design points. In this paper, we extend the fused lasso to settings in which the points do not occur on a regular grid, leading
Externí odkaz:
http://arxiv.org/abs/1807.11641
We consider the task of learning a dynamical system from high-dimensional time-course data. For instance, we might wish to estimate a gene regulatory network from gene expression data measured at discrete time points. We model the dynamical system no
Externí odkaz:
http://arxiv.org/abs/1610.03177
Autor:
Witten, Daniela M.
Publikováno v:
Annals of Applied Statistics 2011, Vol. 5, No. 4, 2493-2518
In recent years, advances in high throughput sequencing technology have led to a need for specialized methods for the analysis of digital gene expression data. While gene expression data measured on a microarray take on continuous values and can be m
Externí odkaz:
http://arxiv.org/abs/1202.6201
We consider the problem of estimating multiple related but distinct graphical models on the basis of a high-dimensional data set with observations that belong to distinct classes. A motivating example occurs in the analysis of gene expression data fo
Externí odkaz:
http://arxiv.org/abs/1111.0324
Autor:
Witten, Daniela M., Tibshirani, Robert
Publikováno v:
Annals of Applied Statistics 2008, Vol. 2, No. 3, 986-1012
We consider the problem of testing the significance of features in high-dimensional settings. In particular, we test for differentially-expressed genes in a microarray experiment. We wish to identify genes that are associated with some type of outcom
Externí odkaz:
http://arxiv.org/abs/0811.1700
Publikováno v:
Journal of the American Statistical Association, 2017 Dec 01. 112(520), 1697-1707.
Externí odkaz:
https://www.jstor.org/stable/45028061