Auto-tagging of Short Conversational Sentences using Transformer Methods

Autor: Umut Özdil, Semih Gulum, Sukru Ozan, M. Fatih Akca, Secilay Kutal, D. Emre Tasar, Oguzhan Olmez, Ceren Belhan
Rok vydání: 2021
Předmět:
DOI: 10.48550/arxiv.2106.01735
Popis: The problem of categorizing short speech sentences according to their semantic features with high accuracy is a subject studied in natural language processing. In this study, a data set created with samples classified in 46 different categories was used. Examples consist of sentences taken from chat conversations between a company's customer representatives and the company's website visitors. The primary purpose is to automatically tag questions and requests from visitors in the most accurate way for 46 predetermined categories for use in a chat application to generate meaningful answers to the questions asked by the website visitors. For this, different BERT models and one GPT-2 model, pre-trained in Turkish, were preferred. The classification performances of the relevant models were analyzed in detail and reported accordingly.
Comment: in Turkish language
Databáze: OpenAIRE