Turkish abstractive text document summarization using text to text transfer transformer

Autor: Betul Ay, Fatih Ertam, Guven Fidan, Galip Aydin
Jazyk: angličtina
Rok vydání: 2023
Předmět:
Zdroj: Alexandria Engineering Journal, Vol 68, Iss , Pp 1-13 (2023)
Druh dokumentu: article
ISSN: 1110-0168
DOI: 10.1016/j.aej.2023.01.008
Popis: Text summarization is the process of reducing text size while preserving its key points. Thanks to this process, the reading time of the text is also reduced which contributes to reaching the desired information quickly, especially in today's world where time is much more important. In addition, summarization can be used to create a solution to extract outstanding information from the text. In this study, we focus on abstract summarization, which can draw more human like conclusions from the text. A summarization study was carried out on the data set that was collected from online Turkish news sources. Rouge and Bert-score performance metrics were used to compare the performance of this study using the text to text transfer transformer (T5) method. The precision values ​​of the Rouge-1, Rouge-2, Rouge-L and Bert-score performance metrics obtained in this study were found to be 0.6913, 0.6623, 0.7528 and 0.8718, respectively. Recall values ​​were 0.9210, 0.8917, 0.9183 and 0.9138, respectively. F measure values ​​were 0.7649, 0.7338, 0.8084 and 0.8913 respectively. Considering the success of the results obtained in the study, a method that can obtain successful results for Turkish text summarization is presented and the original dataset is made available to other researchers.
Databáze: Directory of Open Access Journals