Improving Numerical Stability of Normalized Mutual Information Estimator on High Dimensions

Authors: Tuononen, Marko; Hautamäki, Ville
Publication year: 2024
Subject:
Document type: Working Paper
Description: Mutual information provides a powerful, general-purpose metric for quantifying the amount of shared information between variables. Estimating normalized mutual information using a k-Nearest Neighbor (k-NN) based approach involves the calculation of the scaling-invariant k-NN radius. Calculation of the radius suffers from numerical overflow when the joint dimensionality of the data becomes high, typically in the range of several hundred dimensions. To address this issue, we propose a logarithmic transformation technique that improves the numerical stability of the radius calculation in high-dimensional spaces. By applying the proposed transformation during the calculation of the radius, numerical overflow is avoided, and precision is maintained. The proposed transformation is validated through both theoretical analysis and empirical evaluation, demonstrating its ability to stabilize the calculation without compromising the precision of the results.
Comment: 4+1 pages, 2 figures, 20 equations
Database: arXiv
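
The description attributes the overflow to high joint dimensionality and resolves it with a logarithmic transformation. The record does not give the radius formula, so the following is only a minimal sketch of the general idea, not the authors' method: it assumes a hypothetical quantity, the geometric mean of d per-dimension values, whose direct product exceeds the float64 range once d reaches several hundred, and compares it with the same quantity computed in the log domain. The function names and the test values are illustrative assumptions.

```python
import math


def geometric_mean_naive(values):
    """Multiply all values, then take the d-th root.

    The intermediate product can exceed the float64 range and become inf.
    """
    product = 1.0
    for v in values:
        product *= v  # float multiplication overflows silently to inf
    return product ** (1.0 / len(values))


def geometric_mean_log(values):
    """Compute the same quantity in the log domain: exp(mean(log v)).

    Sums of logarithms stay small even when the product itself would overflow.
    """
    log_sum = math.fsum(math.log(v) for v in values)
    return math.exp(log_sum / len(values))


# Hypothetical data: 500 dimensions, each contributing a factor of 1e3.
# The true product is 1e1500, far above the float64 maximum (about 1.8e308).
d = 500
values = [1e3] * d

naive = geometric_mean_naive(values)
stable = geometric_mean_log(values)

print("naive overflowed:", math.isinf(naive))
print("log-domain result:", stable)
```

In this sketch the naive version loses the result entirely, while the log-domain version recovers a value close to 1e3. Accumulating the logarithms with `math.fsum` is an added design choice here that limits rounding error in the long sum.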