The out-of-source error in multi-source cross validation-type procedures
Author: | Afendras, Georgios; Markatou, Marianthi |
---|---|
Year of publication: | 2016 |
Subject: | |
Source: | New Advances in Statistics and Data Science 2017, 27-44 |
Document type: | Working Paper |
DOI: | 10.1007/978-3-319-69416-0_2 |
Description: | A scientific phenomenon under study may often be manifested by data arising from processes, i.e. sources, that may describe this phenomenon. In this context of multi-source data, we define the "out-of-source" error, that is, the error committed when a new observation of unknown source origin is allocated to one of the sources using a rule that is trained on the known labeled data. We present an unbiased estimator of this error and discuss its variance. We derive natural and easily verifiable assumptions under which the consistency of our estimator is guaranteed for a broad class of loss functions and data distributions. Finally, we evaluate our theoretical results via a simulation study. Comment: 16 pages, 4 tables |
Database: | arXiv |
External link: |