Heterogeneous sets in dimensionality reduction and ensemble learning.

Autor: Reeve, Henry W. J., Kabán, Ata, Bootkrajang, Jakramate
Předmět:
Zdroj: Machine Learning; Apr2024, Vol. 113 Issue 4, p1683-1704, 22p
Abstrakt: We present a general framework for dealing with set heterogeneity in data and learning problems, which is able to exploit low complexity components. The main ingredients are (i) A definition of complexity for elements of a convex union that takes into account the complexities of their individual composition – this is used to cover the heterogeneous convex union; and (ii) Upper bounds on the complexities of restricted subsets. We demonstrate this approach in two different application areas, highlighting their conceptual connection. (1) In random projection based dimensionality reduction, we obtain improved bounds on the uniform preservation of Euclidean norms and distances when low complexity components are present in the union. (2) In statistical learning, our generalisation bounds justify heterogeneous ensemble learning methods that were incompletely understood before. We exemplify empirical results with boosting type random subspace and random projection ensembles that implement our bounds. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index