Lexicon Based Sentiment Analysis in Indonesia Languages : A Systematic Literature Review

Autor: Yuli Fauziah, Bambang Yuwono, Agus Sasmito Aribowo
Rok vydání: 2021
Zdroj: RSF Conference Series: Engineering and Technology. 1:363-367
ISSN: 2809-6878
2809-6843
Popis: This systematic literature review aims to determine the trend of lexicon based sentiment analysis research in Indonesian Language in the last two years. The focus of the study is on the understanding of preprocessing used in lexicon-based sentiment analysis studies in the last two years, the lexicon used in these studies, and classification accuracy. The main question in this SLR : what techniques of lexicon based sentiment analysis will provide the highest accuracy. The most widely used preprocessing methods in previous research are tokenization, case conversion, stemming, remove punctuation, remove stop word, remove or replace emoji and emoticons, and normalization or slangword conversion. The sentiment labeling process in previous studies calculated based on the comparison of the number of negative sentiment keywords with positive sentiment keywords in one sentence. The maximum accuracy from previous study is 90%. The most widely used lexicon is NRC and Inset which is a lexicon dictionary in Indonesian. Knowledge of this can be used to propose a better model for lexicon based sentiment analysis in Indonesian Languages.
Databáze: OpenAIRE