Summarization of Twitter Events with Deep Neural Network Pre-trained Models

Autor: Swapan Debbarma, Amitava Das, Kunal Chakma
Rok vydání: 2021
Předmět:
Zdroj: Information Management and Big Data ISBN: 9783030762278
SIMBig
DOI: 10.1007/978-3-030-76228-5_4
Popis: Due to the proliferation of online social media services such as Twitter, there is an upsurge in the volume of user-generated textual content. Such voluminous content is difficult to be consumed by users. Therefore, the development of technological solutions to automatically summarise the voluminous texts are essential. The work presented in this paper reports on the development of automatically generating abstractive summaries from a collection of texts from Twitter. Our proposed approach is a two-stage framework which includes: 1) Event detection by clustering and 2) Summarization of the events. We first generated a contextualized vector representation of the tweets and then applied different clustering techniques on the vectors. We evaluated the generated clusters, and based on the evaluation; we chose the best one found suitable for the summarization task. For the summarization task, we used the pre-trained models of two recently developed state-of-the-art deep neural network architectures and evaluated them on the event clusters. Standard measures of ROUGE scores have been used for evaluating the summaries. We obtained best ROUGE-1 score of 46%, ROUGE-2 score of 30%, ROUGE-L score of 41% and ROUGE-SU score of 23% from our experiments.
Databáze: OpenAIRE