Going Big and Deep: Using Convolutional Neural Network to Leverage Training Data from Multiple Domains for Cross-Domain Sentiment Classification on Product Reviews
Autor: | Jasy Liew Suet Yan, Cheah Yu-N, Aditi Gupta |
---|---|
Rok vydání: | 2020 |
Předmět: |
0209 industrial biotechnology
Training set Computer science business.industry Deep learning Sentiment analysis 02 engineering and technology Machine learning computer.software_genre Convolutional neural network 020901 industrial engineering & automation Product reviews 0202 electrical engineering electronic engineering information engineering Leverage (statistics) Labeled data 020201 artificial intelligence & image processing Artificial intelligence business Classifier (UML) computer |
Zdroj: | IICAIET |
DOI: | 10.1109/iicaiet49801.2020.9257815 |
Popis: | Training a classifier for sentiment polarity detection in product reviews when labeled data is not available for a particular domain poses a challenge, which can be addressed through cross-domain sentiment analysis. We experimented with Convolutional Neural Network (CNN) to learn sentiment polarity (positive or negative) from labeled data available in many different source domains and test its performance on a target domain that it is not trained on. Extensive experiments were conducted on 14 different domains using Amazon product reviews. Our preliminary findings show that cross-domain CNN models trained with multiple source domains achieved accuracy of above 80% across all the domains and outperform the in-domain models trained using limited labeled data from the same domain. In fact, the cross-domain CNN models demonstrated better performance when a larger number of source domains are used for training. Therefore, going deep and big is a promising direction to explore for cross-domain sentiment classification. |
Databáze: | OpenAIRE |
Externí odkaz: |