Imbalance Dataset in Aspect-Based Sentiment Analysis on Game Genshin Impact Review

Main Article Content

Prabowo Adi Perwira
Nelly Indriani Widiastuti

Abstract

Sentiment analysis was commonly used to determine the polarity of the review text. However, there is a problem if some reviews have more than one aspect with different polarities, so the reviews have more than one polarity. That has happened in some reviews on the game Genshin Impact. Not merely that, the number of sentiments contained in a review is not always the same as other reviews will cause imbalanced data. So, this study will handle imbalance data with Random Under-Sampling and Random Over-Sampling on aspect-based-sentiment-analysis of Genshin Impact Review with Multinomial Naïve-Bayes, so that the classification prediction does not ignore the minority class due to the dominance of the majority class. The classification process used K-Fold Cross Validation (k=10) validation method and the Laplace smoothing technique on Multinomial Naïve Bayes. As a result, the conclusion is that Random Oversampling had better accuracy than Random Undersampling in handling imbalanced data on aspect-based sentiment analysis of Genshin Impact game Review in Indonesian with Naïve Bayes Multinomial, with the highest accuracy of 85.55%.

Downloads

Download data is not yet available.

Article Details

How to Cite
[1]
P. Perwira and N. Widiastuti, “Imbalance Dataset in Aspect-Based Sentiment Analysis on Game Genshin Impact Review”, INFOTEL, vol. 16, no. 1, pp. 71-81, Feb. 2024.
Section
Informatics