Lexicon Based Sentiment Analysis in Indonesia Languages : A Systematic Literature Review

Authors

  • Yuli Fauziah Lecturer from Information System Department, Universitas Pembangunan Nasional “Veteran” Yogyakarta
  • Bambang Yuwono Lecturer from Informatics Department, Universitas Pembangunan Nasional “Veteran” Yogyakarta;
  • Agus Sasmito Aribowo Lecturer from Informatics Department, Universitas Pembangunan Nasional “Veteran” Yogyakarta. PhD Student at FTMK UTeM Malaysia

DOI:

https://doi.org/10.31098/cset.v1i1.397

Abstract

This systematic literature review aims to determine the trend of lexicon based sentiment analysis research in Indonesian Language in the last two years. The focus of the study is on the understanding of preprocessing used in lexicon-based sentiment analysis studies in the last two years, the lexicon used in these studies, and classification accuracy. The main question in this SLR : what techniques of lexicon based sentiment analysis will provide the highest accuracy. The most widely used preprocessing methods in previous research are tokenization, case conversion, stemming, remove punctuation, remove stop word, remove or replace emoji and emoticons, and normalization or slangword conversion. The sentiment labeling process in previous studies calculated based on the comparison of the number of negative sentiment keywords with positive sentiment keywords in one sentence. The maximum accuracy from previous study is 90%. The most widely used lexicon is NRC and Inset which is a lexicon dictionary in Indonesian. Knowledge of this can be used to propose a better model for lexicon based sentiment analysis in Indonesian Languages.

Downloads

Published

2022-11-15

How to Cite

Fauziah, Y. ., Yuwono, B. ., & Aribowo, A. S. . (2022). Lexicon Based Sentiment Analysis in Indonesia Languages : A Systematic Literature Review. RSF Conference Series: Engineering and Technology, 1(1), 363–367. https://doi.org/10.31098/cset.v1i1.397