الكشف عن الأفكار الانتحارية وغير الانتحارية المعتمدة على الذكاء الاصطناعي القابل للتفسير من نصوص وسائل التواصل الاجتماعي باستخدام تقنية التجميع المحسنة Explainable AI-based suicidal and non-suicidal ideations detection from social media text with enhanced ensemble technique

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-024-84275-6
PMID: https://pubmed.ncbi.nlm.nih.gov/39774753
تاريخ النشر: 2025-01-07
المؤلف: Daniyal Alghazzawi وآخرون
الموضوع الرئيسي: الصحة النفسية من خلال الكتابة

نظرة عامة

تقدم هذه البحث إطارًا مبتكرًا للتفريق بين الأفكار الانتحارية الحقيقية وغير الانتحارية في تفاعلات وسائل التواصل الاجتماعي من خلال تقنية تجميعية تتضمن الذكاء الاصطناعي القابل للتفسير (XAI). يهدف الإطار إلى تعزيز التعرف الفوري على المشاعر على منصات الشبكات الاجتماعية، وهو أمر حيوي للتدخلات الفعالة في الوقاية من الانتحار. من خلال استخدام مصنف تراكمي يجمع بين نماذج التعلم الآلي المختلفة – وهي آلة الدعم الناقل (SVM)، والانحدار اللوجستي (LR)، وتعزيز التدرج (GB)، وأشجار القرار (DT) – تحقق الدراسة دقة تنبؤية عالية، مع درجات F1 تبلغ 95.5% للمحتوى الانتحاري و99% للمحتوى غير الانتحاري. يسمح التفسير المقدم من XAI بفهم أعمق للميزات التي تؤثر على تصنيفات النموذج، مما يكشف عن أنماط خفية مرتبطة بالأفكار الانتحارية.

تقر الدراسة بحدود الاعتماد فقط على البيانات النصية للتصنيف وتقترح أن دمج البيانات متعددة الوسائط، مثل المحتوى السمعي البصري والرموز التعبيرية، قد يعزز فهم الأفكار الانتحارية. ستركز الأعمال المستقبلية على تحسين قدرة النموذج على التعميم عبر منصات وبيانات وسائل التواصل الاجتماعي المختلفة، ومعالجة عدم توازن الفئات، واستكشاف تقنيات اختيار وتمثيل الميزات المتقدمة. يؤكد المؤلفون على أهمية الإشراف البشري والاعتبارات الأخلاقية في تطبيق نموذجهم، خاصة في السياقات الحساسة. يقترحون مزيدًا من البحث في تعزيز قابلية تفسير الطرق التجميعية ودمج XAI مع الخبرة البشرية لتحسين استراتيجيات التدخل، مع مراعاة الكفاءة الحاسوبية وقابلية التطبيق عبر الثقافات للنموذج.

الطرق

تتضمن المنهجية المقترحة لتمييز المشاركات التي تحتوي على أفكار انتحارية عن المحتوى غير الانتحاري نهجًا منظمًا موضحًا في الشكل 1. في البداية، تخضع البيانات المستمدة من الموارد المرجعية لهندسة الميزات، والمعالجة المسبقة، وإعادة أخذ العينات. تشمل مرحلة المعالجة المسبقة تقنيات مثل تقسيم النص، وإزالة الكلمات الشائعة، وتحويل الكلمات إلى متجهات باستخدام تردد المصطلح – تردد الوثيقة العكسية (TF-IDF) لتحويل النص إلى ميزات عددية ذات معنى، مما يعزز قابلية تفسير البيانات.

في مرحلة التصنيف، يتم استخدام إطار تعلم تجميعي، مستفيدًا من مصنف تراكمي يدمج أربعة مصنفات أساسية: آلة الدعم الناقل (SVM)، والانحدار اللوجستي (LR)، وتعزيز التدرج (GB)، وشجرة القرار (DT). يعمل نموذج الغابة العشوائية كمصنف ميتا لتحسين الأداء. تم تصميم هذا النهج التجميعي لالتقاط الفروق الدقيقة في النص التي تشير إلى إشارات الضيق، مع تحقيق توازن بين دقة الكشف والحاجة إلى الشفافية وقابلية التفسير في سلوكيات المصنف.

النتائج

تقدم قسم النتائج نتائج من النموذج المقترح الذي يهدف إلى تمييز الأفكار الانتحارية الحقيقية عن التعبيرات غير الانتحارية في منشورات وسائل التواصل الاجتماعي، باستخدام وحدة الذكاء الاصطناعي القابل للتفسير (XAI). تشير مقاييس أداء النموذج إلى أن الطريقة التجميعية المعتمدة على التراص حققت أعلى دقة بنسبة 96% مع انحراف معياري قدره 0.02، متفوقة على نماذج أخرى مثل آلة الدعم الناقل (SVM) بنسبة 90% وBERT بنسبة 89%. تسلط التحليل الضوء على أهمية ميزات لغوية معينة، تم تحديدها من خلال قيم SHAP، في التنبؤ بالأفكار الانتحارية. ومن الجدير بالذكر أن تعبيرات اليأس والضمائر الشخصية كانت مؤشرات قوية على الأفكار الانتحارية الحقيقية، بينما اقترحت اللغة الغامضة عدم الإخلاص.

أظهرت الأداء التنبؤي للطريقة التجميعية، المعززة برؤى من XAI، قدرة قوية على التمييز بين الأفكار الانتحارية الحقيقية والزائفة، كما يتضح من قيمة AUC ROC العالية. يُعزى دقة النموذج في الكشف عن الأفكار المصنوعة إلى قدرته على التعرف على الأنماط في اللغة المبالغ فيها. علاوة على ذلك، تؤكد الدراسة على أهمية القابلية للتعميم في نماذج التعلم الآلي، مع معالجة احتمال الإفراط في التكيف من خلال ضبط دقيق للمصنف التراكمي ومصنفاته الأساسية، إلى جانب تطبيق إعادة أخذ العينات العشوائية لإدارة البيانات المنحرفة. تؤكد هذه النتائج على إمكانيات الذكاء الاصطناعي في دعم مبادرات الصحة النفسية من خلال تحسين دقة وقابلية تفسير تقييمات مخاطر الانتحار في سياقات وسائل التواصل الاجتماعي.

المناقشة

تناقش ورقة البحث تطوير نظام تصنيف نصوص ثنائي التسمية يهدف إلى التمييز بين الأفكار الانتحارية وغير الانتحارية في منشورات وسائل التواصل الاجتماعي. تشمل الأهداف الرئيسية استخدام نماذج تعلم آلي متنوعة لتعزيز دقة التصنيف، ودمج تقنيات الذكاء الاصطناعي القابل للتفسير (XAI) للشفافية في اتخاذ القرار، وتقييم أداء النموذج باستخدام مقاييس مثل الدقة، والدقة، والاسترجاع. تؤكد الدراسة على الآثار الأخلاقية للذكاء الاصطناعي في الصحة النفسية، بهدف تسهيل التدخل المبكر وتعزيز الرفاهية النفسية على المنصات الاجتماعية.

تشمل المساهمات الرئيسية للبحث تطبيق تقنيات التعلم التجميعي لتصنيف الأفكار الانتحارية، مما يبني على النماذج السابقة التي استخدمت مصنفات أبسط. تستخدم الطريقة المقترحة مصنفات تراكبية لدمج التنبؤات من نماذج متعددة، مما يحسن الدقة وقابلية التفسير. تشير النتائج إلى أن النظام التجميعي يتفوق على النماذج الحالية الرائدة، مما يوفر رؤى قيمة في تمييز التعبيرات الحقيقية مقابل غير الانتحارية في الاتصالات عبر الإنترنت. يبرز هذا العمل أهمية الشفافية والاعتبارات الأخلاقية في نشر تقنيات الذكاء الاصطناعي في سياقات الصحة النفسية، مما يدعو في النهاية إلى دمج مسؤول للتكنولوجيا والقيم الإنسانية في معالجة الأزمات النفسية.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-024-84275-6
PMID: https://pubmed.ncbi.nlm.nih.gov/39774753
Publication Date: 2025-01-07
Author(s): Daniyal Alghazzawi et al.
Primary Topic: Mental Health via Writing

Overview

This research introduces an innovative framework for differentiating between actual and non-suicidal ideation in social media interactions through an ensemble technique that incorporates Explainable AI (XAI). The framework aims to enhance the timely identification of sentiments on social networking platforms, which is vital for effective suicide prevention interventions. By employing a stacking classifier that combines various machine learning models—namely Support Vector Machine (SVM), Logistic Regression (LR), Gradient Boosting (GB), and Decision Trees (DT)—the study achieves high predictive accuracy, with F1-scores of 95.5% for suicidal content and 99% for non-suicidal content. The interpretability provided by XAI allows for a deeper understanding of the features influencing the model’s classifications, revealing hidden patterns associated with suicidal ideation.

The study acknowledges the limitations of relying solely on textual data for classification and suggests that incorporating multimodal data, such as audio-visual content and emojis, could enhance understanding of suicidal ideation. Future work will focus on improving the model’s generalizability across different social media platforms and datasets, addressing class imbalance, and exploring advanced feature selection and representation techniques. The authors emphasize the importance of human oversight and ethical considerations in the application of their model, particularly in sensitive contexts. They propose further research into enhancing the interpretability of ensemble methods and integrating XAI with human expertise to improve intervention strategies, while also considering computational efficiency and cross-cultural applicability of the model.

Methods

The proposed methodology for distinguishing posts containing suicidal ideation from non-suicidal content involves a structured approach illustrated in Fig. 1. Initially, data obtained from benchmark resources undergoes feature engineering, pre-processing, and resampling. The pre-processing phase includes techniques such as tokenization, stop-word removal, and word vectorization using Term Frequency-Inverse Document Frequency (TF-IDF) to convert text into meaningful numerical features, enhancing the interpretability of the data.

In the classification phase, an ensemble learning framework is employed, utilizing a stacked classifier that integrates four base classifiers: Support Vector Machine (SVM), Logistic Regression (LR), Gradient Boosting (GB), and Decision Tree (DT). A Random Forest model serves as the meta-classifier to optimize performance. This ensemble approach is designed to capture subtle distinctions in text indicative of distress signals, balancing detection accuracy with the need for transparency and explainability in classifier behaviors.

Results

The results section presents findings from the proposed model aimed at distinguishing genuine suicidal thoughts from nonsuicidal expressions in social media posts, utilizing an Explainable AI (XAI) module. The model’s performance metrics indicate that the stacking-based ensemble method achieved the highest accuracy of 96% with a standard deviation of 0.02, outperforming other models such as Support Vector Machine (SVM) at 90% and BERT at 89%. The analysis highlights the importance of specific linguistic features, identified through SHAP values, in predicting suicidal ideation. Notably, expressions of despair and personal pronouns were strong indicators of genuine suicidal thoughts, while vague language suggested insincerity.

The predictive performance of the ensemble method, enhanced by insights from XAI, demonstrated a robust ability to differentiate between genuine and false suicidal thoughts, as evidenced by a high AUC ROC value. The model’s precision in detecting fabricated ideations is attributed to its capability to recognize patterns in exaggerated language. Furthermore, the study emphasizes the importance of generalizability in machine learning models, addressing potential overfitting through careful tuning of the stacking classifier and its base classifiers, alongside the application of Random Oversampling to manage skewed data. These findings underscore the potential of AI in supporting mental health initiatives by improving the accuracy and interpretability of suicide risk assessments in social media contexts.

Discussion

The research paper discusses the development of a binary-label text categorization system aimed at distinguishing between suicidal and non-suicidal thoughts in social media posts. The primary objectives include employing various machine learning models to enhance classification accuracy, integrating Explainable AI (XAI) techniques for transparency in decision-making, and evaluating the model’s performance using metrics such as accuracy, precision, and recall. The study emphasizes the ethical implications of AI in mental health, aiming to facilitate early intervention and promote mental well-being on social platforms.

Key contributions of the research include the application of ensemble learning techniques to classify suicidal ideations, which builds upon previous models that utilized simpler classifiers. The proposed method employs stacking classifiers to combine predictions from multiple models, thereby improving accuracy and interpretability. The findings indicate that the ensemble system outperforms existing state-of-the-art models, providing valuable insights into the differentiation of genuine versus non-suicidal expressions in online communications. This work underscores the importance of transparency and ethical considerations in deploying AI technologies within mental health contexts, ultimately advocating for a responsible integration of technology and human-centered values in addressing mental health crises.