Thread Popularity Prediction and Tracking with a Permutation-invariant Model

Authors:	Irwin King, Hou Pong Chan
Year of publication:	2018
Subject:	Theoretical computer science; Computer science; 0502 economics and business; 05 social sciences; Reinforcement learning; Thread (computing); 050207 economics; 010501 environmental sciences; 01 natural sciences; Popularity; 0105 earth and related environmental sciences
Source:	EMNLP
DOI:	10.18653/v1/d18-1376
Description:	The task of thread popularity prediction and tracking aims to recommend a few popular comments to subscribed users when a batch of new comments arrives in a discussion thread. This task has been formulated as a reinforcement learning problem, in which the reward of the agent is the sum of positive responses received by the recommended comments. In this work, we propose a novel approach to tackle this problem. First, we propose a deep neural network architecture to model the expected cumulative reward (Q-value) of a recommendation (action). Unlike the state-of-the-art approach, which treats an action as a sequence, our model uses an attention mechanism to integrate information from a set of comments. Thus, the predicted Q-value is invariant to the permutation of the comments, which leads to more consistent agent behavior. Second, we employ a greedy procedure to approximate the action that maximizes the predicted Q-value over a combinatorial action space. Unlike the state-of-the-art approach, this procedure does not require an additional pre-trained model to generate candidate actions. Experiments on five real-world datasets show that our approach outperforms the state-of-the-art approach.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_________::4ce754a601a2eef6a0b598dddd73ffe1 https://doi.org/10.18653/v1/d18-1376
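
Illustrative sketch:	The description mentions two ideas: an attention mechanism that pools a set of comments so that the Q-value does not depend on their order, and a greedy procedure that builds the recommended set one comment at a time. The following is a minimal sketch of both ideas under strong simplifying assumptions. It is not the authors' architecture: the fixed query vector, the linear readout, and the names `attention_pool`, `q_value`, and `greedy_select` are all hypothetical and stand in for learned network components.

```python
import math


def attention_pool(vectors, query):
    """Softmax-attention weighted sum over a set of vectors.

    Each weight depends only on the vector's own content, and the weighted
    sum is commutative, so the result does not depend on input order.
    """
    scores = [sum(q * x for q, x in zip(query, v)) for v in vectors]
    top = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    return [
        sum(w * v[i] for w, v in zip(weights, vectors)) / total
        for i in range(len(query))
    ]


def q_value(state, action_set, query, out_weights):
    """Toy Q(state, action): attention-pool the set, then a linear readout.

    Assumption: a linear readout replaces the deep network of the paper.
    """
    features = list(state) + attention_pool(action_set, query)
    return sum(w * f for w, f in zip(out_weights, features))


def greedy_select(state, candidates, k, query, out_weights):
    """Grow a set of k comment indices, adding at each step the comment
    that gives the highest toy Q-value for the enlarged set."""
    chosen = []
    remaining = list(range(len(candidates)))
    for _ in range(min(k, len(candidates))):
        best = max(
            remaining,
            key=lambda i: q_value(
                state, [candidates[j] for j in chosen + [i]], query, out_weights
            ),
        )
        chosen.append(best)
        remaining.remove(best)
    return chosen
```

Greedy selection evaluates on the order of k times n candidate sets instead of every k-subset of n comments, which is why it only approximates the maximizing action.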