Beyond NDCG: behavioral testing of recommender systems with RecList
Autor: | Chia, Patrick John, Tagliabue, Jacopo, Bianchi, Federico, He, Chloe, Ko, Brian |
---|---|
Rok vydání: | 2021 |
Předmět: | |
Druh dokumentu: | Working Paper |
DOI: | 10.1145/3487553.3524215 |
Popis: | As with most Machine Learning systems, recommender systems are typically evaluated through performance metrics computed over held-out data points. However, real-world behavior is undoubtedly nuanced: ad hoc error analysis and deployment-specific tests must be employed to ensure the desired quality in actual deployments. In this paper, we propose RecList, a behavioral-based testing methodology. RecList organizes recommender systems by use case and introduces a general plug-and-play procedure to scale up behavioral testing. We demonstrate its capabilities by analyzing known algorithms and black-box commercial systems, and we release RecList as an open source, extensible package for the community. Comment: Paper accepted to the WebConf 2022 |
Databáze: | arXiv |
Externí odkaz: |