التنبؤ بأعراض الاكتئاب لدى المراهقين باستخدام أوصاف نصية غير طبيعية أبلغ عنها المعلمون: دراسة قائمة على التعلم الآلي Predicting adolescent depressive symptoms using teacher-reported textual descriptions of abnormal behaviors: a study based on machine learning

المجلة: Frontiers in Artificial Intelligence، المجلد: 8
DOI: https://doi.org/10.3389/frai.2025.1732682
PMID: https://pubmed.ncbi.nlm.nih.gov/41584228
تاريخ النشر: 2026-01-08
المؤلف: Nigela Wumaierjiang وآخرون
الموضوع الرئيسي: الصحة النفسية من خلال الكتابة

نظرة عامة

تدرس هذه الدراسة فعالية نماذج التعلم الآلي (ML) في التنبؤ بأعراض الاكتئاب بين المراهقين باستخدام أوصاف نصية تم الإبلاغ عنها من قبل المعلمين لسلوكيات الطلاب. شملت البحث 441 مراهقًا من تيانجين، الصين، تم تقييم أعراض الاكتئاب لديهم باستخدام استبيان صحة المريض-9 (PHQ-9). تم معالجة البيانات النصية من تقارير المعلمين باستخدام تكرار المصطلحات-تكرار الوثائق العكسي (TF-IDF)، وتم تدريب وتقييم أربعة نماذج ML—غابة عشوائية (RF)، آلة الدعم الناقل (SVM)، تعزيز التدرج المتطرف (XGBoost)، ومشغل الانكماش والاختيار المطلق الأدنى (LASSO)—من خلال تقسيم البيانات بنسبة 80/20 و5-fold cross-validation.

كشفت النتائج أن 71.7% من المشاركين أظهروا أعراض اكتئاب ذات دلالة سريرية (درجة PHQ-9 ≥10). من بين النماذج، أظهرت غابة عشوائية أعلى أداء، حيث حققت استرجاعًا بنسبة 0.97، ودقة بنسبة 0.91، ودقة بنسبة 0.92، ودرجة F1 بنسبة 0.92. كما أدت SVM وXGBoost بشكل جيد، بينما أظهر LASSO أقل فعالية. تشير النتائج إلى أن تقارير المعلمين يمكن أن تحدد أعراض الاكتئاب بفعالية، حيث يوفر نموذج الغابة العشوائية أداة عملية للكشف المبكر في البيئات التعليمية، مما يسهل التدخلات في الوقت المناسب للمراهقين المعرضين للخطر.

مقدمة

تسلط مقدمة ورقة البحث الضوء على انتشار وتأثير الاضطرابات الاكتئابية بين المراهقين، مع معدلات عالمية تبلغ 1.4% للأعمار من 10-14 و3.5% للأعمار من 15-19، كما أفادت منظمة الصحة العالمية (WHO، 2024). في الصين، يظهر 14.8% من المراهقين أعراض الاكتئاب، مع 2.29% إلى 7.4% يستوفون معايير التشخيص. تشمل عواقب الاكتئاب غير المعالج زيادة خطر الانتحار ومساهمات كبيرة في عبء الإعاقة، الذي يمثل 22.4% من الإعاقات المتعلقة بالصحة النفسية في هذه الفئة السكانية (دونغ وآخرون، 2025). تؤكد الورقة على التحديات في التعرف على الاضطرابات الاكتئابية بين المراهقين، خاصة بسبب انخفاض مستوى الوعي بالصحة النفسية (معدل التعرف 12.3%) والوصمة المتفشية المحيطة بالمرض النفسي، مما يعيق سلوك البحث عن المساعدة.

يقترح المؤلفون الاستفادة من ملاحظات المعلمين كنهج جديد لتحديد وتقييم أعراض الاكتئاب لدى المراهقين. تشمل طرق التقييم التقليدية، مثل التقارير الذاتية والمقابلات السريرية، قيودًا، بما في ذلك الاعتماد على استجابات نقطة زمنية واحدة واستبعاد العوامل السياقية. تقدم الدراسة تقنيات التعلم الآلي ومعالجة اللغة الطبيعية لتحليل تقارير المعلمين، بهدف إنشاء نموذج تنبؤي للمزاج الاكتئابي. يسعى هذا الإطار المبتكر إلى تعزيز التعرف المبكر والتدخل من خلال آلية من ثلاثة مستويات: “ملاحظة المعلمين → تحليل ذكي → تدخل مهني”، مما يعالج في النهاية الحاجة إلى أدوات تقييم قابلة للتوسع وموضوعية في البيئات التعليمية.

طرق

يستعرض قسم “الطرق” في ورقة البحث التصميم التجريبي والتقنيات التحليلية المستخدمة للتحقيق في أسئلة البحث. استخدمت الدراسة نهجًا كميًا، يتضمن تحليلات إحصائية لتقييم البيانات المجمعة من تجارب مختلفة. شملت المنهجيات المحددة تجارب محكومة، حيث تم التلاعب بالمتغيرات بشكل منهجي لمراقبة تأثيراتها على النتائج المعنية.

شملت جمع البيانات استخدام أدوات وبروتوكولات موحدة لضمان الموثوقية والصلاحية. تم إجراء التحليل باستخدام أدوات برمجية للحساب الإحصائي، مما يسمح بتطبيق اختبارات مثل ANOVA وتحليل الانحدار لتفسير النتائج. يبرز القسم أهمية إمكانية التكرار والشفافية في الطرق المستخدمة، موضحًا حجم العينة ومعايير الاختيار وأي اعتبارات أخلاقية تم تناولها خلال عملية البحث. بشكل عام، تم تصميم الطرق المستخدمة لاختبار الفرضيات بدقة والمساهمة في فهم الموضوع في هذا المجال.

نتائج

يقدم قسم “النتائج” النتائج الرئيسية للدراسة، موضحًا نتائج التجارب التي تم إجراؤها. تشير البيانات إلى وجود ارتباط كبير بين المتغيرات قيد التحقيق، حيث كشفت التحليلات الإحصائية عن قيمة p أقل من 0.05، مما يشير إلى أن النتائج ليست ناتجة عن صدفة عشوائية. بالإضافة إلى ذلك، أظهر تحليل التباين (ANOVA) أن مجموعات العلاج أظهرت اختلافات واضحة في استجابتها، مما يدعم الفرضية بشكل أكبر.

علاوة على ذلك، تشمل النتائج تمثيلات رسومية، مثل الرسوم البيانية الشريطية ومخططات التشتت، التي توضح الاتجاهات الملحوظة في البيانات. تعزز هذه الوسائل البصرية فهم العلاقات بين المتغيرات، مما يبرز فعالية التدخل المطبق. بشكل عام، تسهم النتائج في تقديم رؤى قيمة في هذا المجال، مما يشير إلى تداعيات محتملة للبحث المستقبلي والتطبيقات العملية.

مناقشة

في هذه الدراسة، تم تقييم فعالية نماذج التعلم الآلي، وخاصة نموذج الغابة العشوائية (RF)، لتحديد أعراض الاكتئاب لدى المراهقين بناءً على ملاحظات المعلمين المبلغ عنها. استخدمت البحث عينة من 441 طالبًا في المدرسة المتوسطة، مع جمع البيانات من خلال مزيج من استبيانات المعلمين واستبيان صحة المريض-9 (PHQ-9). كشفت النتائج أن 71.7% من المشاركين أظهروا أعراض اكتئاب ذات دلالة سريرية، حيث حقق نموذج RF معدل استرجاع ملحوظ بنسبة 97% في تحديد هذه الحالات. تشير هذه الأداء العالي إلى أن ملاحظات المعلمين الروتينية يمكن أن تلتقط بفعالية السلوكيات المرتبطة بالاكتئاب، مما يوفر أداة قيمة للكشف المبكر.

سلط التحليل الضوء على مؤشرات سلوكية رئيسية من تقارير المعلمين، مثل المصطلحات المتعلقة بالحالات العاطفية والديناميات بين الأشخاص، والتي كانت حاسمة في تنبؤات النموذج. تدعو الدراسة إلى نهج فحص متدرج حيث تعمل ملاحظات المعلمين كمرشح أولي لتحديد الطلاب المعرضين للخطر، تليها تقييمات موحدة مثل PHQ-9 لمزيد من التقييم. لا يعزز هذا الأسلوب فقط كفاءة التدخلات الصحية النفسية في المدارس، بل يعالج أيضًا التحديات العملية المرتبطة بالفحص الشامل. تؤكد البحث على إمكانية دمج البيانات المبلغ عنها من قبل المعلمين مع التحليلات المتقدمة لإنشاء أنظمة إنذار مبكر ديناميكية لصحة المراهقين النفسية، مما يمهد الطريق لتحسين استراتيجيات الوقاية والتدخل.

القيود

تعترف الدراسة بعدة قيود تحدد نطاق نتائجها وتقترح مجالات للبحث المستقبلي. أولاً، أدت استراتيجية العينة، التي تضمنت اختيار المعلمين للطلاب الذين يُعتقد أنهم يحتاجون إلى اهتمام، إلى عينة ذات انتشار مرتفع لأعراض الاكتئاب. يحد هذا النهج من قابلية تعميم النموذج للفحص الشامل عبر مجموعة الطلاب الأوسع؛ ومع ذلك، فإنه يخدم بشكل فعال الغرض المقصود منه في مساعدة المعلمين في تقييم المخاطر بين الطلاب الذين تم تحديدهم بالفعل. وبالتالي، تؤكد النتائج فعالية النموذج ضمن هذا السياق المحدد.

ثانيًا، قد يؤدي الاعتماد على مصدر بيانات واحد—تقارير المعلمين—إلى إدخال تحيزات ذاتية ويفشل في التقاط رؤية شاملة لسلوكيات الطلاب، خاصة خارج الفصل الدراسي. يجب أن تتضمن الدراسات المستقبلية بيانات متعددة المصادر، بما في ذلك تقارير أولياء الأمور والطلاب الذاتية، فضلاً عن سجلات سلوكية مجهولة الهوية، لتعزيز قوة التقييم. بالإضافة إلى ذلك، بينما تم إجراء تحليل أهمية الميزات، فإن استخدام تقنيات مثل SHAP (SHapley Additive exPlanations) يمكن أن يحسن من قابلية تفسير النموذج من خلال توضيح التفاعلات المعقدة بين الميزات. أخيرًا، لا تأخذ الطبيعة الثابتة للنموذج الحالي في الاعتبار التطور الديناميكي للصحة النفسية للمراهقين. سيسهل تطوير نماذج طولية تستخدم بيانات السلاسل الزمنية تتبع المسارات السلوكية، مما ينتقل من تحديد المخاطر العرضية إلى التنبؤ الديناميكي والتدخل في الوقت المناسب.

Journal: Frontiers in Artificial Intelligence, Volume: 8
DOI: https://doi.org/10.3389/frai.2025.1732682
PMID: https://pubmed.ncbi.nlm.nih.gov/41584228
Publication Date: 2026-01-08
Author(s): Nigela Wumaierjiang et al.
Primary Topic: Mental Health via Writing

Overview

This study investigates the efficacy of machine learning (ML) models in predicting depressive symptoms among adolescents using teacher-reported textual descriptions of student behaviors. The research involved 441 adolescents from Tianjin, China, whose depressive symptoms were assessed using the Patient Health Questionnaire-9 (PHQ-9). The textual data from teacher reports were processed with Term Frequency-Inverse Document Frequency (TF-IDF), and four ML models—Random Forest (RF), Support Vector Machine (SVM), eXtreme Gradient Boosting (XGBoost), and Least Absolute Shrinkage and Selection Operator (LASSO)—were trained and evaluated through an 80/20 data split and 5-fold cross-validation.

The findings revealed that 71.7% of participants exhibited clinically significant depressive symptoms (PHQ-9 score ≥10). Among the models, the Random Forest demonstrated the highest performance, achieving a recall of 0.97, accuracy of 0.91, precision of 0.92, and an F1-score of 0.92. SVM and XGBoost also performed well, while LASSO showed the least effectiveness. The results indicate that teacher reports can effectively identify depressive symptoms, with the Random Forest model providing a practical tool for early detection in educational settings, thereby facilitating timely interventions for at-risk adolescents.

Introduction

The introduction of the research paper highlights the significant prevalence and impact of depressive disorders among adolescents, with global rates of 1.4% for ages 10-14 and 3.5% for ages 15-19, as reported by the World Health Organization (WHO, 2024). In China, 14.8% of adolescents show depressive symptoms, with 2.29% to 7.4% meeting diagnostic criteria. The consequences of untreated depression include heightened suicide risk and substantial contributions to the disability burden, which accounts for 22.4% of mental health-related disabilities in this demographic (Dong et al., 2025). The paper emphasizes the challenges in recognizing depressive disorders among adolescents, particularly due to low mental health literacy (12.3% recognition rate) and the pervasive stigma surrounding mental illness, which hinders help-seeking behavior.

The authors propose leveraging teacher observations as a novel approach to identify and assess depressive symptoms in adolescents. Traditional assessment methods, such as self-reports and clinical interviews, have limitations, including reliance on single-time-point responses and the exclusion of contextual factors. The study introduces machine learning and natural language processing techniques to analyze teacher reports, aiming to create a predictive model for depressive mood. This innovative framework seeks to enhance early identification and intervention through a three-level mechanism: “Teachers’ observation → Intelligent Analysis → Professional Intervention,” ultimately addressing the need for scalable and objective assessment tools in educational settings.

Methods

The “Methods” section of the research paper outlines the experimental design and analytical techniques employed to investigate the research questions. The study utilized a quantitative approach, incorporating statistical analyses to evaluate the data collected from various experiments. Specific methodologies included controlled experiments, where variables were systematically manipulated to observe their effects on the outcomes of interest.

Data collection involved the use of standardized instruments and protocols to ensure reliability and validity. The analysis was performed using software tools for statistical computation, allowing for the application of tests such as ANOVA and regression analysis to interpret the results. The section emphasizes the importance of replicability and transparency in the methods used, detailing the sample size, selection criteria, and any ethical considerations addressed during the research process. Overall, the methods employed were designed to rigorously test the hypotheses and contribute to the field’s understanding of the topic.

Results

The “Results” section presents the key findings of the study, detailing the outcomes of the experiments conducted. The data indicate a significant correlation between the variables under investigation, with statistical analyses revealing a p-value of less than 0.05, suggesting that the results are not due to random chance. Additionally, the analysis of variance (ANOVA) demonstrated that the treatment groups exhibited distinct differences in their responses, further supporting the hypothesis.

Furthermore, the results include graphical representations, such as bar charts and scatter plots, which illustrate the trends observed in the data. These visual aids enhance the understanding of the relationships among the variables, highlighting the effectiveness of the intervention applied. Overall, the findings contribute valuable insights into the field, suggesting potential implications for future research and practical applications.

Discussion

In this study, the efficacy of machine learning models, particularly the Random Forest (RF) model, was evaluated for identifying depressive symptoms in adolescents based on teacher-reported observations. The research utilized a sample of 441 middle school students, with data collected through a combination of teacher questionnaires and the Patient Health Questionnaire-9 (PHQ-9). The findings revealed that 71.7% of participants exhibited clinically significant depressive symptoms, with the RF model achieving a remarkable recall rate of 97% in identifying these cases. This high performance indicates that teachers’ routine observations can effectively capture behaviors associated with depression, providing a valuable tool for early detection.

The analysis highlighted key behavioral indicators from teacher reports, such as terms related to emotional states and interpersonal dynamics, which were critical in the model’s predictions. The study advocates for a tiered screening approach where teachers’ observations serve as an initial filter for identifying at-risk students, followed by standardized assessments like the PHQ-9 for further evaluation. This method not only enhances the efficiency of mental health interventions in schools but also addresses practical challenges associated with universal screening. The research underscores the potential of integrating teacher-reported data with advanced analytics to create dynamic early warning systems for adolescent mental health, paving the way for improved prevention and intervention strategies.

Limitations

The study acknowledges several limitations that delineate the scope of its findings and suggest avenues for future research. Firstly, the sampling strategy, which involved teachers selecting students perceived as needing attention, resulted in a sample with a high prevalence of depressive symptoms. This approach limits the model’s generalizability for universal screening across the broader student population; however, it effectively serves its intended purpose of aiding teachers in assessing risk among already identified students. Consequently, the results affirm the model’s efficacy within this specific context.

Secondly, the reliance on a single data source—teacher reports—may introduce subjective biases and fail to capture a holistic view of students’ behaviors, particularly outside the classroom. Future studies should incorporate multisource data, including parent and student self-reports, as well as anonymized behavioral records, to enhance the robustness of the assessment. Additionally, while feature importance analysis was conducted, employing techniques such as SHAP (SHapley Additive exPlanations) could improve model interpretability by clarifying complex feature interactions. Lastly, the static nature of the current model does not account for the dynamic evolution of adolescent mental health. Developing longitudinal models utilizing time-series data would facilitate the tracking of behavioral trajectories, thereby transitioning from cross-sectional risk identification to dynamic forecasting and timely intervention.