A Deep Learning Method to Analyze and Classify Happy Moments: A Comparative Analytic Study (Preprint)

Authors: Qiudan Li, Ruoran Liu, Riheng Yao, Daniel Dajun Zeng
Year of publication: 2019
DOI: 10.2196/preprints.13894
Description: BACKGROUND Happiness is considered an important indicator of users’ mental and physical health. Fostering happiness has gained increasing public attention as one way to decrease health costs in the long run. Understanding what makes users feel happy may help policy makers develop policies and methods that steer users toward behaviors identified as promoting happiness. OBJECTIVE This paper aimed to investigate the use of deep learning methods to analyze happy moments and to compare them with traditional machine learning methods, which may provide a mechanism to accurately classify happy moments and help explain why users feel happy. METHODS A crowdsourced corpus of happy moments, HappyDB, was used. The dataset contained 14,125 posts with category labels describing the sources of and reasons for happy feelings: Achievement, Affection, Bonding, Enjoy the moment, Leisure, Nature, and Exercise. We compared the performance of deep learning methods, namely the convolutional neural network (CNN), bidirectional long short-term memory (Bi-LSTM), and attention Bi-LSTM, with that of traditional machine learning methods, including logistic regression, support vector machine (SVM), and naïve Bayes. Standard measures, including precision, recall, and F1, were adopted for each category. Macro-precision, macro-recall, and macro-F1 were used to evaluate the overall performance of the models. RESULTS CNN achieved the best performance on macro-precision, macro-recall, and macro-F1, with values of 80.8, 79.3, and 80.0, respectively. Among the traditional machine learning methods, logistic regression performed best, with a macro-precision of 80.6, macro-recall of 71.1, and macro-F1 of 75.5. A detailed per-category comparison of CNN and logistic regression showed that CNN improved the F1 score for all categories, by at least 1.8% (Bonding) and by up to 11.3% (Nature).
The performance gains came mainly from substantial improvements in recall, especially for minor categories. For example, the recall of CNN was 80.9 for Nature and 70.9 for Exercise, improvements of 28.5% and 11.6%, respectively, over logistic regression. The reason was that CNN explicitly modeled the relationship between word features and the categories of happy moments by extracting important word features through convolution and pooling operations. CONCLUSIONS This is the first study to analyze happy moments using deep learning methods. Compared with traditional machine learning methods, deep learning methods, especially CNN, were superior at classifying happy moments, which would facilitate understanding of why users feel happy and thus help policy makers formulate targeted policies to promote happiness.
Database: OpenAIRE
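
Note: The abstract evaluates each category with precision, recall, and F1, and the overall model with macro-averaged versions of those measures. The sketch below shows one common way to compute them in plain Python. It is illustrative only: the function name `macro_scores` and the toy labels are hypothetical, and it assumes macro-F1 is the unweighted mean of the per-category F1 scores. The abstract does not state which macro-F1 convention the authors used, and another convention takes the harmonic mean of macro-precision and macro-recall.

```python
from collections import Counter


def macro_scores(y_true, y_pred):
    """Return per-category (precision, recall, F1) and their macro averages.

    Assumption: macro-F1 is the unweighted mean of per-category F1 scores.
    """
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted category gains a false positive
            fn[truth] += 1  # true category gains a false negative

    per_class = {}
    for c in labels:
        predicted = tp[c] + fp[c]
        actual = tp[c] + fn[c]
        precision = tp[c] / predicted if predicted else 0.0
        recall = tp[c] / actual if actual else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        per_class[c] = (precision, recall, f1)

    n = len(labels)
    # Each category counts equally, regardless of how many posts it has.
    macro = tuple(sum(scores[i] for scores in per_class.values()) / n
                  for i in range(3))
    return per_class, macro


# Toy labels (hypothetical, not taken from HappyDB).
y_true = ["Nature", "Nature", "Exercise", "Bonding", "Bonding", "Bonding"]
y_pred = ["Nature", "Bonding", "Exercise", "Bonding", "Bonding", "Nature"]

per_class, macro = macro_scores(y_true, y_pred)
for category, (p, r, f1) in per_class.items():
    print(f"{category}: precision={p:.3f} recall={r:.3f} F1={f1:.3f}")
print("macro precision/recall/F1:", tuple(round(m, 3) for m in macro))
```

Because macro averaging weights every category equally, a recall gain on a small category such as Nature or Exercise moves the macro scores as much as the same gain on a large category, which is consistent with the abstract's emphasis on minor categories.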