Zobrazeno 1 - 10
of 12
pro vyhledávání: '"Daniel Ahfock"'
Autor:
Daniel Ahfock, Geoffrey J. McLachlan
Publikováno v:
Econometrics and Statistics. 26:124-138
There has been increasing attention to semi-supervised learning (SSL) approaches in machine learning to forming a classifier in situations where the training data for a classifier consists of a limited number of classified observations but a much lar
Publikováno v:
Statistics and Computing. 33
There is an increasing body of work exploring the integration of random projection into algorithms for numerical linear algebra. The primary motivation is to reduce the overall computational cost of processing large datasets. A suitably chosen random
Publikováno v:
SSRN Electronic Journal.
Publikováno v:
Statistics and Computing. 31
Data fusion involves the integration of multiple related datasets. The statistical file-matching problem is a canonical data fusion problem in multivariate analysis, where the objective is to characterise the joint distribution of a set of variables
Publikováno v:
Australian & New Zealand Journal of Statistics. 61:175-188
We present an application study which exemplifies a cutting edge statistical approach for detecting climate regime shifts. The algorithm uses Bayesian computational techniques that make time-efficient analysis of large volumes of climate data possibl
Autor:
Daniel Ahfock, Geoffrey J. McLachlan
Publikováno v:
Data Analysis and Rationality in a Complex World ISBN: 9783030601034
We consider the situation where the observed sample contains some observations whose class of origin is known (that is, they are classified with respect to the g underlying classes of interest), and where the remaining observations in the sample are
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::804660f7e9210024fea102f10e8c5399
https://doi.org/10.1007/978-3-030-60104-1_17
https://doi.org/10.1007/978-3-030-60104-1_17
Publikováno v:
Computational Statistics & Data Analysis. 168:107387
The statistical file-matching problem is a data integration problem with structured missing data. The general form involves the analysis of multiple datasets that only have a strict subset of variables jointly observed across all datasets. Missing-da
Autor:
Daniel Ahfock, Geoffrey J. McLachlan
Publikováno v:
Computational Statistics & Data Analysis. 161:107253
Manual labelling of training examples is common practice in supervised learning. When the labelling task is of non-trivial difficulty, the supplied labels may not be equal to the ground-truth labels, and label noise is introduced into the training da
Publikováno v:
Computational statistics & data analysis
The statistical matching problem involves the integration of multiple datasets where some variables are not observed jointly. This missing data pattern leaves most statistical models unidentifiable. Statistical inference is still possible when operat
Publikováno v:
Biometrika
Sketching is a probabilistic data compression technique that has been largely developed in the computer science community. Numerical operations on big datasets can be intolerably slow; sketching algorithms address this issue by generating a smaller s
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::af061ee9bf98210cfa6aed3081704851
http://arxiv.org/abs/1706.03665
http://arxiv.org/abs/1706.03665