IndoBERT-Based Natural Language Processing for Early Detection of Mental Disorders among Indonesian Gen-Z Students: A Mobile Application Approach with Logistic Regression Baseline

Athif Basyar Mussafa; Widi Hastomo

doi:10.35870/jtik.v10i3.6418

Published: 2026-07-01

IndoBERT-Based Natural Language Processing for Early Detection of Mental Disorders among Indonesian Gen-Z Students: A Mobile Application Approach with Logistic Regression Baseline

DOI: 10.35870/jtik.v10i3.6418

Athif Basyar Mussafa, Widi Hastomo

Affiliation Details

Athif Basyar Mussafa: Institut Teknologi dan Bisnis Ahmad Dahlan
Widi Hastomo: Institut Teknologi dan Bisnis Ahmad Dahlan

PDF

Article Metrics

Scopus Citations
Google Scholar
Crossref Citations
Semantic Scholar
DataCite Metrics
If the link doesn't work, copy the DOI or article title for manual search (API Maintenance).

Mental health issues have become a growing concern among young adults, while access to professional psychological services remains limited. Most existing digital mental health applications rely mainly on self-report questionnaires and lack the ability to interpret contextual emotional expressions found in user-written text, which reduces their effectiveness for early screening. This study proposes the design and implementation of a mobile-based mental health detection system that integrates contextual natural language processing with interactive assessment features. The system analyzes Indonesian-language textual reflections using an IndoBERT-based classification model and complements the results with a rule-based psychological scoring mechanism derived from questionnaire responses. Logistic Regression with TF–IDF features is employed as a baseline model for comparative evaluation. System performance is assessed using accuracy, precision, recall, and F1-score metrics. Experimental results show that the IndoBERT model outperforms the baseline, achieving an accuracy of 97.79%, compared to 94.17% for Logistic Regression. The proposed system is implemented as a Flutter-based mobile application to improve accessibility to early mental health screening among Indonesian university students. This study integrates two complementary approaches: NLP-based text classification using IndoBERT and rule-based psychological scoring derived from self-report questionnaires.

Keywords

IndoBERT Natural Language Processing; Logistic Regression; Detection Generation Z

Peer Review Process

This article has undergone a double-blind peer review process to ensure quality and impartiality.

Indexing Information

Discover where this journal is indexed at our indexing page.

Open Science Badges

This journal supports transparency in research and encourages authors to meet criteria for Open Science Badges.

How to Cite

Mussafa, A. B., & Hastomo, W. (2026). IndoBERT-Based Natural Language Processing for Early Detection of Mental Disorders among Indonesian Gen-Z Students: A Mobile Application Approach with Logistic Regression Baseline. Jurnal JTIK (Jurnal Teknologi Informasi Dan Komunikasi), 10(3), 1225-1238. https://doi.org/10.35870/jtik.v10i3.6418

Article Information

This article has been peer-reviewed and published in the Jurnal JTIK (Jurnal Teknologi Informasi dan Komunikasi). The content is available under the terms of the Creative Commons Attribution 4.0 International License.

Issue: Vol. 10 No. 3 (2026)
Section: Computer & Communication Science
Published: 2026-07-01

License: CC BY 4.0
Copyright: © 2026 Authors
DOI: 10.35870/jtik.v10i3.6418

AI Research Hub

This article is indexed and available through various AI-powered research tools and citation platforms. Our AI Research Hub ensures that scholarly work is discoverable, accessible, and easily integrated into the global research ecosystem.

Scholarly Connection Platforms

Dimensions

Connected Papers

Scite

Google Scholar

Semantic Scholar

Garuda

Scilit

Crossref

BASE

Zenodo

Unpaywall

OpenCitations

Author Biographies

Athif Basyar Mussafa, Institut Teknologi dan Bisnis Ahmad Dahlan

Department of Information Technology, Institut Teknologi dan Bisnis Ahmad Dahlan, Kota Jakarta Pusat, Daerah Khusus Ibukota Jakarta, Indonesia.

Widi Hastomo, Institut Teknologi dan Bisnis Ahmad Dahlan

Department of Information Technology, Institut Teknologi dan Bisnis Ahmad Dahlan, Kota Jakarta Pusat, Daerah Khusus Ibukota Jakarta, Indonesia.

References

Cahyawijaya, S., et al. (2021). IndoNLG: Benchmark and resources for evaluating Indonesian natural language generation. EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, 8875–8898. https://doi.org/10.18653/v1/2021.emnlp-main.699.
Chancellor, S., & De Choudhury, M. (2020). Methods in predictive techniques for mental health status on social media: A critical review. npj Digital Medicine, 3(1). https://doi.org/10.1038/s41746-020-0233-7.
Couto, M., Perez, A., Parapar, J., & Losada, D. E. (2025). Temporal word embeddings for early detection of psychological disorders on social media. Journal of Healthcare Informatics Research. https://doi.org/10.1007/s41666-025-00186-9
Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. NAACL HLT 2019 Conference on North American Chapter of the Association for Computational Linguistics - Human Language Technologies - Proceedings, 4171–4186.
Endriyani, S., & Susanti, E. (2024). Android-based application for depression, anxiety, and stress screening at Poltekkes Kemenkes Palembang, South Sumatra Province, Indonesia. Journal of Health Informatics, 17(4), 1486–1492.
Guidotti, R., Monreale, A., Ruggieri, S., Turini, F., Giannotti, F., & Pedreschi, D. (2019). A survey of methods for explaining black box models. ACM Computing Surveys, 51(5). https://doi.org/10.1145/3236009.
Huang, H., & Savkin, A. V. (2020). Autonomous navigation of a solar-powered UAV for secure communication in urban environments with eavesdropping avoidance. Future Internet, 12(10), 1–14. https://doi.org/10.3390/fi12100170
Koto, F., Rahimi, A., Lau, J. H., & Baldwin, T. (2020). IndoLEM and IndoBERT: A benchmark dataset and pre-trained language model for Indonesian NLP. COLING 2020 - 28th International Conference on Computational Linguistics - Proceedings, 757–770. https://doi.org/10.18653/v1/2020.coling-main.66
Le Glaz, A., et al. (2021). Machine learning and natural language processing in mental health: Systematic review. Journal of Medical Internet Research, 23(5). https://doi.org/10.2196/15708.
Minaee, S., Kalchbrenner, N., Cambria, E., Nikzad, N., Chenaghlu, M., & Gao, J. (2022). Deep learning-based text classification. ACM Computing Surveys, 54(3). https://doi.org/10.1145/3439726
Pakray, P., Gelbukh, A., & Bandyopadhyay, S. (2025). Natural language processing applications for low-resource languages. Natural Language Processing Journal, 31(2), 183–197. https://doi.org/10.1017/nlp.2024.33.
Scherbakov, D. A., Hubig, N. C., Lenert, L. A., Alekseyenko, A. V., & Obeid, J. S. (2025). Natural language processing and social determinants of health in mental health research: AI-assisted scoping review. JMIR Mental Health, 12, 1–15. https://doi.org/10.2196/67192.
Shaw, C., LaCasse, P., & Champagne, L. (2025). Exploring emotion classification of Indonesian tweets using large-scale transfer learning via IndoBERT. Social Network Analysis and Mining, 15(1), 1–12. https://doi.org/10.1007/s13278-025-01439-6.
Wolf, T., et al. (2020). Transformers: State-of-the-art natural language processing. Transformers for NLP, 38–45.
Yardley, L., et al. (2016). Understanding and promoting effective engagement with digital behavior change interventions. American Journal of Preventive Medicine, 51(5), 833–842. https://doi.org/10.1016/j.amepre.2016.06.015

License & Copyright

This work is licensed under a Creative Commons Attribution 4.0 International License.

Authors who publish with this journal agree to the following terms:

1. Copyright Retention and Open Access License

Authors retain copyright of their work and grant the journal non-exclusive right of first publication under the Creative Commons Attribution 4.0 International License (CC BY 4.0).

This license allows unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

2. Rights Granted Under CC BY 4.0

Under this license, readers are free to:

Share — copy and redistribute the material in any medium or format
Adapt — remix, transform, and build upon the material for any purpose, including commercial use
No additional restrictions — the licensor cannot revoke these freedoms as long as license terms are followed

3. Attribution Requirements

All uses must include:

Proper citation of the original work
Link to the Creative Commons license
Indication if changes were made to the original work
No suggestion that the licensor endorses the user or their use

4. Additional Distribution Rights

Authors may:

Deposit the published version in institutional repositories
Share through academic social networks
Include in books, monographs, or other publications
Post on personal or institutional websites

Requirement: All additional distributions must maintain the CC BY 4.0 license and proper attribution.

5. Self-Archiving and Pre-Print Sharing

Authors are encouraged to:

Share pre-prints and post-prints online
Deposit in subject-specific repositories (e.g., arXiv, bioRxiv)
Engage in scholarly communication throughout the publication process

6. Open Access Commitment

This journal provides immediate open access to all content, supporting the global exchange of knowledge without financial, legal, or technical barriers.

Published: 2026-07-01

IndoBERT-Based Natural Language Processing for Early Detection of Mental Disorders among Indonesian Gen-Z Students: A Mobile Application Approach with Logistic Regression Baseline

DOI: 10.35870/jtik.v10i3.6418

Athif Basyar Mussafa, Widi Hastomo

Article Metrics

Share:

Abstract

Keywords

Peer Review Process

Indexing Information

Open Science Badges

How to Cite

Article Information

Issue: Vol. 10 No. 3 (2026)

Section: Computer & Communication Science

Published: 2026-07-01

License: CC BY 4.0

Copyright: © 2026 Authors

DOI: 10.35870/jtik.v10i3.6418

AI Research Hub

Athif Basyar Mussafa, Institut Teknologi dan Bisnis Ahmad Dahlan

Widi Hastomo, Institut Teknologi dan Bisnis Ahmad Dahlan

1. Copyright Retention and Open Access License

2. Rights Granted Under CC BY 4.0

3. Attribution Requirements

4. Additional Distribution Rights

5. Self-Archiving and Pre-Print Sharing

6. Open Access Commitment

Powered by Contrimetric

Recommendations