Showing 1 - 4 of 4 for search: '"Canzhe Zhao"'
Published in:
User Modeling and User-Adapted Interaction.
Published in:
Proceedings of the ACM Web Conference 2022.
Temporal difference (TD) learning is a widely used method to evaluate policies in reinforcement learning. While many TD learning methods have been developed in recent years, little attention has been paid to preserving privacy and most of the existin
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::8571e60c38d027249b80cd6f36c172a8
Published in:
CIKM
Conversational recommender systems elicit user preference via interactive conversational interactions. By introducing conversational key-terms, existing conversational recommenders can effectively reduce the need for extensive exploration in a tradit