Posts Quality Prediction for StackOverflow Website

Autor:	Jiawei Hu, Bo Yang
Jazyk:	angličtina
Rok vydání:	2024
Předmět:	Quality-analysis deep-learning machine-learning NLP text classification Electrical engineering. Electronics. Nuclear engineering TK1-9971
Zdroj:	IEEE Access, Vol 12, Pp 135601-135615 (2024)
Druh dokumentu:	article
ISSN:	2169-3536
DOI:	10.1109/ACCESS.2024.3440879
Popis:	The development of the computer industry is closely linked to various question-and-answer websites, whose primary function is to discover and solve problems encountered by users. This paper focuses on the quality prediction of question posts on the StackOverflow(SO) website, which can essentially be considered a text classification problem. Given the large number of users, manual moderation becomes inadequate when faced with a vast quantity of user questions. Reducing the occurrence of low-quality questions can effectively alleviate the operational pressure on the website. We preprocess and vectorize the posts to obtain vector representations of the training and testing sets. After training 5 different machine learning models, including decision trees, random forests, naive Bayes, support vector machines, logistic regression, and 2 deep learning models, Bi-LSTM and BERT, these models are compared through experiments by adjusting the values of different parameters. The results indicate that different parameters have a certain impact on the experimental results, and there are significant differences in the quality prediction performance of different models. The lowest accuracy rate only reaches 54%, while the highest accuracy is 92%. The comparison shows that quality assessment based on the attention mechanism model is effective and can be used to predict post-quality.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/7da9ef64cf19479ab786889c8b8056e9 Zobrazit plný text záznamu View record in DOAJ