Sentiment Analysis of Health Insurance Application Reviews Using Naïve Bayes, Support Vector Machine, and SMOTE
DOI:
https://doi.org/10.61591/jslhu.26.1087Keywords:
Sentiment analysis;, Naïve Bayes;, Support Vector Machine;, SMOTE ;, Health Insurance ApplicationAbstract
This study explores factors shaping user experience in mobile health insurance application by analyzing user reviews through sentiment classification. Two models such as Naïve Bayes (NB) and Support Vector Machine (SVM) were examined, with NB yielding higher precision and F1-score. Data preparation proved influential: applying the Synthetic Minority Oversampling Technique (SMOTE) improved average accuracy by 7.51%, slang normalization added a modest 0.25%, while including neutral sentiment reduced accuracy by 9.67%. The selected configuration, NB combined with SMOTE and slang replacement, achieved 93.53% accuracy, 93.65% precision, 93.53% recall, and an F1-score of 93.47%. Examination of reviews further shows that discussions about security, ease of use, and timeliness are dominated by negative sentiment.
References
Djohanis, H., et al., (2025). Implementation Of The National Health Insurance Policy Within The Banggai District Health Office. Public Policy Journal, 6(1), 115-132.
Ho, T. (2019). Optimizing healthcare policies through healthcare delivery and insurance design (Doctoral dissertation).
Alawode, G. B., et al., (2025). Optimizing the health workforce for Universal Health Coverage: a framework for analysis and action. Human resources for health, 23(1), 27.
Wisniewski, H., et al., (2019). Understanding the quality, effectiveness and attributes of top-rated smartphone health apps. BMJ Mental Health, 22(1).
Nabasya, O. W., et al., (2024). Analysis of Service Quality Factors Affecting Patient Satisfaction in mHealth Teleconsultation. In 2024 Ninth International Conference on Informatics and Computing (ICIC) (pp. 1-6). IEEE.
Zhu, L., et al., (2020). Sentiment and guest satisfaction with peer-to-peer accommodation: when are online ratings more trustworthy?. International Journal of Hospitality Management, 86, 102369.
GhoshRoy, D., et al., (2022). Explainable AI to predict male fertility using extreme gradient boosting algorithm with SMOTE. Electronics, 12(1), 15.
Hariyadi, H., et al., (2024). Implementasi Algoritma Naïve Bayes dan Support Vector Machine pada Analisis Sentimen Ulasan Aplikasi Canva. Jurnal Minfo Polgan, 13(1), 261-269.
Nurdin, T. A., et al., (2023). Sentiment analysis of user preference for old vs new fintech technology using SVM and NB algorithms. Management Systems in Production Engineering, (4 (31), 373-380.
Cahyono, H. D., et al., (2024). Fast Naïve Bayes classifiers for COVID-19 news in social networks. Indonesian Journal of Electrical Engineering and Computer Science, 34(2), 1033-1041.
Chachoui, Y., et al., (2024). Enhancing algorithmic assessment in education: Equi-fused-data-based SMOTE for balanced learning. Computers and Education: Artificial Intelligence, 6, 100222.
El Massari, H., et al., (2022). An ontological model based on machine learning for predicting breast cancer. International Journal of Advanced Computer Science and Applications, 13(7), 108-15.
Apriliani, D., et al., (2020). Sentiment analysis for Indonesia hotel services review using optimized neural network. In Journal of Physics: Conference Series (Vol. 1538, No. 1, p. 012060). IOP Publishing.
Antonius, R., et al., (2024). Pendekatan TF-IDF, SMOTE, dan SVM dalam Klasifikasi Sentimen Masyarakat terhadap Pemblokiran Judi Online. Buletin Ilmiah Informatika Teknologi, 2(3), 115-122.
Ghaida, D. R., et al., (2024). SVM-Classified FaceNet and Eigenface Models under Lighting and Occlusion Variations for Face Recognition. In 2024 International Seminar on Application for Technology of Information and Communication (iSemantic) (pp. 415-420). IEEE.
Vu, V. H. (2024). Predict customer churn using combination deep learning networks model. Neural Computing and Applications, 36(9), 4867-4883.
Siering, M., et al., (2018). Explaining and predicting online review helpfulness: The role of content and reviewer-related signals. Decision support systems, 108, 1-12.
Ryu, J., & Kim, T. (2025). Enhancing Hospital Data Security: A Blockchain-Based Protocol for Secure Information Sharing and Recovery. Electronics, 14(3), 580.
Maduku, D. K., et al., (2023). Assessing customer passion, commitment, and word-of-mouth intentions in digital assistant usage: the moderating role of technology anxiety. Journal of Retailing and Consumer Services, 71, 103208.
Taj, S., Daudpota, et al., (2025). Aspect-based sentiment analysis for software requirements elicitation using fine-tuned Bidirectional Encoder Representations from Transformers and Explainable Artificial Intelligence. Engineering Applications of Artificial Intelligence, 151, 110632.