The out-of-source error in multi-source cross validation-type procedures

Autor: Afendras, Georgios, Markatou, Marianthi
Rok vydání: 2016
Předmět:
Zdroj: New Advances in Statistics and Data Science 2017, 27-44
Druh dokumentu: Working Paper
DOI: 10.1007/978-3-319-69416-0_2
Popis: A scientific phenomenon under study may often be manifested by data arising from processes, i.e. sources, that may describe this phenomenon. In this contex of multi-source data, we define the "out-of-source" error, that is the error committed when a new observation of unknown source origin is allocated to one of the sources using a rule that is trained on the known labeled data. We present an unbiased estimator of this error, and discuss its variance. We derive natural and easily verifiable assumptions under which the consistency of our estimator is guaranteed for a broad class of loss functions and data distributions. Finally, we evaluate our theoretical results via a simulation study.
Comment: 16 pages, 4 tables
Databáze: arXiv