Keywords:-

Keywords: Google Play Store, Android apps, sentiment analysis, natural language processing (NLP), machine learning, Random Forest, Logistic Regression, Naïve Bayes, KNN.

Article Content:-

Abstract

The Google Play Store is one of the world’s largest Android app marketplaces, hosting on the order of two million applications. To help developers interpret the vast volume of user feedback, we analyze a publicly available dataset of 9,146 Google Play apps (from Kaggle, covering dozens of categories). We conduct enhanced exploratory data analysis (EDA) of app metadata and then apply Natural Language Processing to user reviews to classify sentiment. Specifically, we train four supervised classifiers (Naïve Bayes, Random Forest, Logistic Regression, and K-Nearest Neighbors) on the preprocessed review text. We evaluate each model using accuracy, precision, and recall. Our experiments show that logistic regression attains the highest accuracy (around 90%), with balanced precision and recall, while Random Forest and KNN perform comparably (≈87-89%) and Naïve Bayes trails behind (~59%). These updated results provide insights into app market trends: for example, Games and Communication apps dominate install totals, and over 97% of apps are free. Most user reviews are positive (as reported in prior app-review studies), reinforcing that sentiment analysis can help developers understand user preferences and improve app quality.

References:-

References

Vijayanarayanan A, Savithiri R, Lekha P, Abbirami RS. Google Playstore Reviews. JSCE Engineering Research 2023; 6(4): 61–67.

Kurniawan R, et al. Analysis of Google Play Reviews. J. SISFOKOM 2024; 13(2): 170–178.

Sagala GJ, Samuel YT. Mobile App Review Insights. Int. J. Eng. Bus. Soc. Sci. 2024; 2(4): 61–69.

Delikkaya Y. Google Play Store Dataset. Kaggle Dataset 2018.

42matters. Google Play Statistics and Trends. 2025.

Samanmali PHC, Rupasingha RAHM. User Review Classification Methods. Multimedia Tools and Applications 2024.

Miller KJ. App Market Insights. Business of Apps 2024.

Liu B. Sentiment Analysis and Opinion Mining. 2012.

Manning CD, Raghavan P, Schütze H. Introduction to Information Retrieval. 2008.

Jurafsky D, Martin JH. Speech and Language Processing. 2023.

Rifat AN, et al. Sentiment Analysis on App Reviews. IJACSA 2020; 11(6): 546–553.

Kowsari S, et al. Text Classification Survey. Information 2019; 10(4): 150.

Choudhary SR, Kumar M. NLP Techniques for Review Mining. Procedia Computer Science 2020; 167: 1821–1830.

Rezwan MA, et al. Big Data Approaches for App Review Analysis. IEEE Big Data 2019.

Kim Y. Convolutional Neural Networks for Sentence Classification. EMNLP 2014.

Islam MT, et al. Review Mining Using Machine Learning. J. Info. & Comm. Tech. 2021; 20(3): 307–329.

Wang H, et al. Data Mining and Knowledge Discovery Techniques. WIREs DMKD 2022; 12(1): e1430.

Bird S, Klein E, Loper E. Natural Language Processing with Python. 2009.

Google Play Developer Docs. App Quality Guidelines. 2024.

Ranshous S, et al. Computational Methods for Large-Scale Networks. IEEE TMC 2022; 21(2): 560–573.

Downloads

Citation Tools

How to Cite
Lembhe, A., Dokrimare, S., Kamthe, R., Swami, A., & Dange, A. (2026). Google Play Store Apps Analysis using NLP. International Journal Of Mathematics And Computer Research, 14(03), 138-143. https://doi.org/10.47191/ijmcr/v14iSPC3.29