Lexicon Based Sentiment Analysis in Indonesia Languages : A Systematic Literature Review

Yuli  Fauziah; Bambang  Yuwono; Agus Sasmito  Aribowo

doi:10.31098/cset.v1i1.397

Authors

Yuli Fauziah Lecturer from Information System Department, Universitas Pembangunan Nasional “Veteran” Yogyakarta
Bambang Yuwono Lecturer from Informatics Department, Universitas Pembangunan Nasional “Veteran” Yogyakarta;
Agus Sasmito Aribowo Lecturer from Informatics Department, Universitas Pembangunan Nasional “Veteran” Yogyakarta. PhD Student at FTMK UTeM Malaysia

DOI:

https://doi.org/10.31098/cset.v1i1.397

Abstract

This systematic literature review aims to determine the trend of lexicon based sentiment analysis research in Indonesian Language in the last two years. The focus of the study is on the understanding of preprocessing used in lexicon-based sentiment analysis studies in the last two years, the lexicon used in these studies, and classification accuracy. The main question in this SLR : what techniques of lexicon based sentiment analysis will provide the highest accuracy. The most widely used preprocessing methods in previous research are tokenization, case conversion, stemming, remove punctuation, remove stop word, remove or replace emoji and emoticons, and normalization or slangword conversion. The sentiment labeling process in previous studies calculated based on the comparison of the number of negative sentiment keywords with positive sentiment keywords in one sentence. The maximum accuracy from previous study is 90%. The most widely used lexicon is NRC and Inset which is a lexicon dictionary in Indonesian. Knowledge of this can be used to propose a better model for lexicon based sentiment analysis in Indonesian Languages.

Lexicon Based Sentiment Analysis in Indonesia Languages : A Systematic Literature Review

Authors

DOI:

Abstract

Downloads

Published

Citation Check

How to Cite

Issue

Section

Make a Submission

quickmenu

statecounter