Auto-tagging of Short Conversational Sentences using Transformer Methods

Autor:	Umut Özdil, Semih Gulum, Sukru Ozan, M. Fatih Akca, Secilay Kutal, D. Emre Tasar, Oguzhan Olmez, Ceren Belhan
Rok vydání:	2021
Předmět:	FOS: Computer and information sciences Computer Science - Machine Learning Computer Science - Computation and Language business.industry Computer science Turkish Subject (documents) AC power computer.software_genre language.human_language Machine Learning (cs.LG) Unified Modeling Language language Bit error rate Data set (IBM mainframe) Artificial intelligence business computer Computation and Language (cs.CL) Natural language processing Transformer (machine learning model) computer.programming_language
DOI:	10.48550/arxiv.2106.01735
Popis:	The problem of categorizing short speech sentences according to their semantic features with high accuracy is a subject studied in natural language processing. In this study, a data set created with samples classified in 46 different categories was used. Examples consist of sentences taken from chat conversations between a company's customer representatives and the company's website visitors. The primary purpose is to automatically tag questions and requests from visitors in the most accurate way for 46 predetermined categories for use in a chat application to generate meaningful answers to the questions asked by the website visitors. For this, different BERT models and one GPT-2 model, pre-trained in Turkish, were preferred. The classification performances of the relevant models were analyzed in detail and reported accordingly. Comment: in Turkish language
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::cc0423fb0cdd99bee784d629a1c1ee0d Zobrazit plný text záznamu