Online Learning Bipartite Matching with Non-stationary Distributions.

Autor: WEIRONG CHEN, JIAQI ZHENG, HAOYU YU, GUIHAI CHEN, YIXIN CHEN, DONGSHENG LI
Předmět:
Zdroj: ACM Transactions on Knowledge Discovery from Data; Mar2022, Vol. 16 Issue 5, p1-22, 22p
Abstrakt: Online bipartite matching has attracted wide interest since it can successfully model the popular online carhailing problem and sharing economy. Existing works consider this problem under either adversary setting or i.i.d. setting. The former is too pessimistic to improve the performance in the general case; the latter is too optimistic to deal with the varying distribution of vertices. In this article, we initiate the study of the nonstationary online bipartite matching problem, which allows the distribution of vertices to vary with time and is more practical. We divide the non-stationary online bipartite matching problem into two subproblems, the matching problem and the selecting problem, and solve them individually. Combining Batch algorithms and deep Q-learning networks, we first construct a candidate algorithm set to solve the matching problem. For the selecting problem, we use a classical online learning algorithm, Exp3, as a selector algorithm and derive a theoretical bound. We further propose CDUCB as a selector algorithm by integrating distribution change detection into UCB. Rigorous theoretical analysis demonstrates that the performance of our proposed algorithms is no worse than that of any candidate algorithms in terms of competitive ratio. Finally, extensive experiments show that our proposed algorithms have much higher performance for the non-stationary online bipartite matching problem comparing to the state-of-the-art. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index