Published: 2026-07-01
IndoBERT-Based Natural Language Processing for Early Detection of Mental Disorders among Indonesian Gen-Z Students: A Mobile Application Approach with Logistic Regression Baseline
DOI: 10.35870/jtik.v10i3.6418
Athif Basyar Mussafa, Widi Hastomo
- Athif Basyar Mussafa: Institut Teknologi dan Bisnis Ahmad Dahlan
- Widi Hastomo: Institut Teknologi dan Bisnis Ahmad Dahlan
Article Metrics
- Scopus Citations
- Google Scholar
- Crossref Citations
- Semantic Scholar
- DataCite Metrics
-
If the link doesn't work, copy the DOI or article title for manual search (API Maintenance).
Abstract
Mental health issues have become a growing concern among young adults, while access to professional psychological services remains limited. Most existing digital mental health applications rely mainly on self-report questionnaires and lack the ability to interpret contextual emotional expressions found in user-written text, which reduces their effectiveness for early screening. This study proposes the design and implementation of a mobile-based mental health detection system that integrates contextual natural language processing with interactive assessment features. The system analyzes Indonesian-language textual reflections using an IndoBERT-based classification model and complements the results with a rule-based psychological scoring mechanism derived from questionnaire responses. Logistic Regression with TF–IDF features is employed as a baseline model for comparative evaluation. System performance is assessed using accuracy, precision, recall, and F1-score metrics. Experimental results show that the IndoBERT model outperforms the baseline, achieving an accuracy of 97.79%, compared to 94.17% for Logistic Regression. The proposed system is implemented as a Flutter-based mobile application to improve accessibility to early mental health screening among Indonesian university students. This study integrates two complementary approaches: NLP-based text classification using IndoBERT and rule-based psychological scoring derived from self-report questionnaires.
Keywords
IndoBERT Natural Language Processing; Logistic Regression; Detection Generation Z
Peer Review Process
This article has undergone a double-blind peer review process to ensure quality and impartiality.
Indexing Information
Discover where this journal is indexed at our indexing page.
Open Science Badges
This journal supports transparency in research and encourages authors to meet criteria for Open Science Badges.
How to Cite
Article Information
This article has been peer-reviewed and published in the Jurnal JTIK (Jurnal Teknologi Informasi dan Komunikasi). The content is available under the terms of the Creative Commons Attribution 4.0 International License.
-
Issue: Vol. 10 No. 3 (2026)
-
Section: Computer & Communication Science
-
Published: 2026-07-01
-
License: CC BY 4.0
-
Copyright: © 2026 Authors
-
DOI: 10.35870/jtik.v10i3.6418
AI Research Hub
This article is indexed and available through various AI-powered research tools and citation platforms. Our AI Research Hub ensures that scholarly work is discoverable, accessible, and easily integrated into the global research ecosystem.
Athif Basyar Mussafa, Institut Teknologi dan Bisnis Ahmad Dahlan
Department of Information Technology, Institut Teknologi dan Bisnis Ahmad Dahlan, Kota Jakarta Pusat, Daerah Khusus Ibukota Jakarta, Indonesia.
-
Cahyawijaya, S., et al. (2021). IndoNLG: Benchmark and resources for evaluating Indonesian natural language generation. EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, 8875–8898. https://doi.org/10.18653/v1/2021.emnlp-main.699.
-
Chancellor, S., & De Choudhury, M. (2020). Methods in predictive techniques for mental health status on social media: A critical review. npj Digital Medicine, 3(1). https://doi.org/10.1038/s41746-020-0233-7.
-
Couto, M., Perez, A., Parapar, J., & Losada, D. E. (2025). Temporal word embeddings for early detection of psychological disorders on social media. Journal of Healthcare Informatics Research. https://doi.org/10.1007/s41666-025-00186-9
-
-
-
Guidotti, R., Monreale, A., Ruggieri, S., Turini, F., Giannotti, F., & Pedreschi, D. (2019). A survey of methods for explaining black box models. ACM Computing Surveys, 51(5). https://doi.org/10.1145/3236009.
-
Huang, H., & Savkin, A. V. (2020). Autonomous navigation of a solar-powered UAV for secure communication in urban environments with eavesdropping avoidance. Future Internet, 12(10), 1–14. https://doi.org/10.3390/fi12100170
-
Koto, F., Rahimi, A., Lau, J. H., & Baldwin, T. (2020). IndoLEM and IndoBERT: A benchmark dataset and pre-trained language model for Indonesian NLP. COLING 2020 - 28th International Conference on Computational Linguistics - Proceedings, 757–770. https://doi.org/10.18653/v1/2020.coling-main.66
-
Le Glaz, A., et al. (2021). Machine learning and natural language processing in mental health: Systematic review. Journal of Medical Internet Research, 23(5). https://doi.org/10.2196/15708.
-
Minaee, S., Kalchbrenner, N., Cambria, E., Nikzad, N., Chenaghlu, M., & Gao, J. (2022). Deep learning-based text classification. ACM Computing Surveys, 54(3). https://doi.org/10.1145/3439726
-
Pakray, P., Gelbukh, A., & Bandyopadhyay, S. (2025). Natural language processing applications for low-resource languages. Natural Language Processing Journal, 31(2), 183–197. https://doi.org/10.1017/nlp.2024.33.
-
Scherbakov, D. A., Hubig, N. C., Lenert, L. A., Alekseyenko, A. V., & Obeid, J. S. (2025). Natural language processing and social determinants of health in mental health research: AI-assisted scoping review. JMIR Mental Health, 12, 1–15. https://doi.org/10.2196/67192.
-
Shaw, C., LaCasse, P., & Champagne, L. (2025). Exploring emotion classification of Indonesian tweets using large-scale transfer learning via IndoBERT. Social Network Analysis and Mining, 15(1), 1–12. https://doi.org/10.1007/s13278-025-01439-6.
-
-
Yardley, L., et al. (2016). Understanding and promoting effective engagement with digital behavior change interventions. American Journal of Preventive Medicine, 51(5), 833–842. https://doi.org/10.1016/j.amepre.2016.06.015

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
1. Copyright Retention and Open Access License
Authors retain copyright of their work and grant the journal non-exclusive right of first publication under the Creative Commons Attribution 4.0 International License (CC BY 4.0).
This license allows unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
2. Rights Granted Under CC BY 4.0
Under this license, readers are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material for any purpose, including commercial use
- No additional restrictions — the licensor cannot revoke these freedoms as long as license terms are followed
3. Attribution Requirements
All uses must include:
- Proper citation of the original work
- Link to the Creative Commons license
- Indication if changes were made to the original work
- No suggestion that the licensor endorses the user or their use
4. Additional Distribution Rights
Authors may:
- Deposit the published version in institutional repositories
- Share through academic social networks
- Include in books, monographs, or other publications
- Post on personal or institutional websites
Requirement: All additional distributions must maintain the CC BY 4.0 license and proper attribution.
5. Self-Archiving and Pre-Print Sharing
Authors are encouraged to:
- Share pre-prints and post-prints online
- Deposit in subject-specific repositories (e.g., arXiv, bioRxiv)
- Engage in scholarly communication throughout the publication process
6. Open Access Commitment
This journal provides immediate open access to all content, supporting the global exchange of knowledge without financial, legal, or technical barriers.