دور خوارزميات التعلم الآلي في توقع مخاطر الانتحار: مراجعة منهجية وتحليل تلوي للدراسات السريرية Role of machine learning algorithms in suicide risk prediction: a systematic review-meta analysis of clinical studies

المجلة: BMC Medical Informatics and Decision Making، المجلد: 24، العدد: 1
DOI: https://doi.org/10.1186/s12911-024-02524-0
PMID: https://pubmed.ncbi.nlm.nih.gov/38802823
تاريخ النشر: 2024-05-27
المؤلف: Houriyeh Ehtemam وآخرون
الموضوع الرئيسي: دراسات الانتحار وإيذاء النفس

نظرة عامة

تتناول ورقة البحث القضية المعقدة للصحة العامة المتعلقة بالانتحار، مع التأكيد على إمكانية تقنيات التعلم الآلي (ML) في التنبؤ بمخاطر الانتحار. تم إجراء مراجعة منهجية لـ 41 دراسة نُشرت بين عامي 2011 و2022 لتقييم أداء خوارزميات ML المختلفة، باستثناء طرق معالجة اللغة الطبيعية ومعالجة الصور. أظهرت النتائج أن خوارزمية الغابة العشوائية حققت أعلى دقة بنسبة 0.94، بينما كانت خوارزمية الشبكة العصبية هي الأدنى عند 0.70. أظهر مصنف XGBoost أفضل قيمة لمنطقة تحت المنحنى (AUC) بلغت 0.97، مما يدل على أداء متفوق في التمييز بين الأفراد المعرضين للخطر وأولئك غير المعرضين للخطر للانتحار.

حددت الدراسة عوامل الخطر الرئيسية المرتبطة بالانتحار، بما في ذلك العمر، الجنس، تعاطي المخدرات، الاكتئاب، القلق، استهلاك الكحول، الحالة الاجتماعية، الدخل، التعليم، والمهنة. تؤكد هذه الرؤى على تعقيد مخاطر الانتحار والحاجة إلى استراتيجيات وقائية مستهدفة. بينما تظهر خوارزميات ML وعدًا في هذا المجال، تدعو الورقة إلى مزيد من البحث لتعزيز قابليتها للتفسير، والتحقق من أدائها عبر مجموعات بيانات متنوعة، ومعالجة الاعتبارات الأخلاقية. التعاون بين الباحثين، والأطباء، وصانعي السياسات أمر ضروري لتطوير أطر شاملة لتنفيذ ML بشكل مسؤول في جهود الوقاية من الانتحار.

مقدمة

تسلط مقدمة هذه الورقة البحثية الضوء على القضية العالمية الملحة المتعلقة بالانتحار، حيث يتم فقدان حياة كل 40 ثانية، وفقًا لما ذكرته منظمة الصحة العالمية (WHO). تؤكد على الطبيعة المعقدة للانتحار، المتأثرة بالعوامل السلوكية، ونمط الحياة، والعوامل السريرية، مثل تعاطي المخدرات، والمشاكل المالية، والاضطرابات النفسية. إن انتشار الانتحار مرتفع بشكل ملحوظ بين المراهقين والشباب، بينما قد يظهر الأفراد الأكبر سنًا معدلات أقل بسبب زيادة المرونة. كما يتم مناقشة الفروق بين الجنسين في معدلات الانتحار، مما يشير إلى أن الرجال أكثر عرضة للموت بالانتحار، على الرغم من أن هذا يتأثر بعوامل بيولوجية وبيئية مختلفة. العبء المالي للانتحار على أنظمة الرعاية الصحية كبير، مع تكاليف تختلف حسب البلد.

تستكشف الورقة أيضًا إمكانية تقنيات التعلم الآلي (ML) في التنبؤ بمخاطر الانتحار، مشيرة إلى قدرتها على التعلم من البيانات والتحسن مع مرور الوقت. يتم تصنيف ML إلى التعلم المراقب وغير المراقب، كل منهما قابل للتطبيق حسب نوع البيانات. أظهرت الأبحاث السابقة أن ML يتفوق على الطرق التقليدية في تحديد محاولات الانتحار. على الرغم من وعد ML، يعترف المؤلفون بالقيود في التدريب، والمعرفة، ودمج قواعد البيانات، حيث تعتمد طرق التقييم الحالية غالبًا على الأسئلة الذاتية. تهدف الدراسة إلى مراجعة منهجية للأبحاث الموجودة حول تطبيقات ML في التنبؤ بالانتحار، وتقييم أداء الخوارزميات المختلفة، وتحديد المتغيرات الرئيسية التي تشير بفعالية إلى مخاطر الانتحار.

الطرق

في قسم الطرق، استخدم الباحثون كل من الأساليب الكمية والنوعية لتحليل البيانات المتعلقة بعوامل خطر الانتحار. بالنسبة للمرحلة الكمية، استخدموا STATA 14.1 لإجراء إجراءات التحليل التلوي، وتطبيق تحويل قوسين مزدوجين من فريمان-توكي لتقدير انتشار مجمع وفترات الثقة 95% (CIs). تم استخدام نموذج التأثيرات العشوائية بناءً على طريقة ديرسيمونيان وليرد لأخذ التباين بين الدراسات في الاعتبار، مع دمج نموذج التأثير الثابت العكسي لتعزيز قوة النتائج.

في المرحلة النوعية، تم تحليل البيانات باستخدام MAXQDA 20، مما يسهل عملية التركيب الشامل لعوامل خطر الانتحار. قام الباحثون بترميز البيانات إلى فئات موضوعية متنوعة، بما في ذلك العوامل الديموغرافية (العمر، الجنس، الحالة الاجتماعية)، والجوانب السريرية والسلوكية (الاندفاع، إيذاء النفس، تشخيصات الصحة النفسية)، وعوامل نمط الحياة (العادات اليومية)، ومؤشرات المختبر والبيوماركر (العلامات الجينية، اختلال التوازن الهرموني)، والاستبيانات الموحدة. مكنت هذه العملية الشاملة من الترميز من تحديد عوامل خطر الانتحار الشائعة عبر الدراسات، مما ساهم في فهم أعمق للتعقيدات المحيطة بمخاطر الانتحار.

النتائج

يستعرض قسم النتائج عملية المراجعة المنهجية كما هو موضح في مخطط تدفق PRISMA (الشكل 1). حدد البحث الأولي 7,529 دراسة منشورة، تم إزالة 569 منها كنسخ مكررة، مما ترك 6,624 عنوانًا وملخصًا للفحص. بعد ذلك، تم استبعاد 5,624 ورقة لعدم استيفاء معايير الإدراج. أدى التقييم الشامل للنصوص الكاملة للدراسات الـ 369 المتبقية إلى استبعاد 328، مما أسفر عن 41 دراسة مؤهلة للتحليل التلوي والتركيب الشامل بناءً على معايير تقييم الجودة.

أظهرت الدراسات المختارة جودة مرضية، مع أحجام عينات تتراوح من 159 إلى 13,980,570 مشاركًا، مما أسفر عن متوسط حجم عينة يبلغ حوالي 549,944.51 (SD = 2,242,858.719). يبرز هذا التباين في أحجام العينات، مع وسيلة قدرها 13,420، تباين النتائج ويؤكد أهمية مراعاة تنوع حجم العينة في تفسير النتائج. تم إجراء معظم الدراسات في الولايات المتحدة وكوريا الجنوبية، باستخدام تصميمات المجموعات والدراسات الحالة، حيث يمثل المشاركون بشكل أساسي السكان العامين. يتم مناقشة معايير قياس النتائج ونتائج جمع البيانات في الأقسام اللاحقة.

المناقشة

في هذه الدراسة، تم إجراء مراجعة منهجية وتحليل تلوي لتقييم فعالية خوارزميات التعلم الآلي (ML) في التنبؤ بمخاطر الانتحار وتحديد عوامل خطر الانتحار المهمة. وفقًا لإرشادات PRISMA، شمل البحث بحثًا شاملاً في الأدبيات عبر عدة قواعد بيانات، مما أسفر عن اختيار 41 دراسة استوفت معايير الإدراج والاستبعاد الصارمة. كشفت التحليلات أن خوارزمية الغابة العشوائية أظهرت أعلى دقة (AUC = 0.94)، بينما حقق مصنف XGBoost AUC مثير للإعجاب قدره 0.97، مما يدل على قدرات تنبؤية قوية. تؤكد هذه النتائج على إمكانية تقنيات ML في تعزيز نماذج التنبؤ بمخاطر الانتحار من خلال دمج مجموعة أوسع من عوامل الخطر.

كما حددت الدراسة 261 عامل خطر للانتحار، وتم تضييقها إلى 55 مؤشرًا يتم الاستشهاد بها بشكل متكرر من خلال عملية استخراج صارمة. تم إبلاغ هذا التصنيف من خلال مدخلات الخبراء من مجموعة تركيز من الأطباء النفسيين وأطباء الطوارئ، مما يضمن صلة العوامل المختارة. بينما تتماشى النتائج مع الأبحاث السابقة التي تدعو إلى فائدة ML في التنبؤ بالصحة النفسية، فإنها أيضًا تبرز التباين في أداء الخوارزميات بناءً على جودة وحجم مجموعة البيانات. بشكل عام، تشير النتائج إلى أن ML يحمل وعدًا لتحسين التنبؤ بالانتحار، على الرغم من أن التحديات لا تزال قائمة في ترجمة هذه النماذج إلى الممارسة السريرية بشكل فعال.

Journal: BMC Medical Informatics and Decision Making, Volume: 24, Issue: 1
DOI: https://doi.org/10.1186/s12911-024-02524-0
PMID: https://pubmed.ncbi.nlm.nih.gov/38802823
Publication Date: 2024-05-27
Author(s): Houriyeh Ehtemam et al.
Primary Topic: Suicide and Self-Harm Studies

Overview

The research paper addresses the multifaceted public health issue of suicide, emphasizing the potential of machine learning (ML) techniques in predicting suicide risk. A systematic review of 41 studies published between 2011 and 2022 was conducted to evaluate the performance of various ML algorithms, excluding natural language processing and image processing methods. The findings revealed that the random forest algorithm achieved the highest accuracy of 0.94, while the neural network algorithm had the lowest at 0.70. The XGBoost classifier demonstrated the best area under the curve (AUC) value of 0.97, indicating superior performance in distinguishing between individuals at risk and those not at risk for suicide.

The study identified key risk factors associated with suicide, including age, gender, substance abuse, depression, anxiety, alcohol consumption, marital status, income, education, and occupation. These insights underscore the complexity of suicide risk and the need for targeted prevention strategies. While ML algorithms show promise in this domain, the paper calls for further research to enhance their interpretability, validate their performance across diverse datasets, and address ethical considerations. Collaboration among researchers, clinicians, and policymakers is essential to develop comprehensive frameworks for responsibly implementing ML in suicide prevention efforts.

Introduction

The introduction of this research paper highlights the urgent global issue of suicide, with a life lost every 40 seconds, as reported by the World Health Organization (WHO). It emphasizes the multifaceted nature of suicide, influenced by behavioral, lifestyle, and clinical factors, such as substance use, financial problems, and mental disorders. The prevalence of suicide is notably high among adolescents and young adults, while older individuals may exhibit lower rates due to enhanced resilience. Gender differences in suicide rates are also discussed, indicating that men are more likely to die by suicide, although this is influenced by various biological and environmental factors. The financial burden of suicide on healthcare systems is significant, with costs varying by country.

The paper further explores the potential of machine learning (ML) techniques in predicting suicide risk, noting their ability to learn from data and improve over time. ML is categorized into supervised and unsupervised learning, each applicable depending on the data type. Previous research has shown that ML outperforms traditional methods in identifying suicide attempts. Despite the promise of ML, the authors acknowledge limitations in training, knowledge, and database integration, as current assessment methods often rely on subjective questioning. The study aims to systematically review existing research on ML applications in suicide prediction, evaluate the performance of various algorithms, and identify key variables that effectively indicate suicide risk.

Methods

In the methods section, the researchers employed both quantitative and qualitative approaches to analyze data related to suicide risk factors. For the quantitative phase, they utilized STATA 14.1 to conduct meta-analytic procedures, applying the Freeman-Tukey double arcsine transformation to estimate pooled prevalence and 95% confidence intervals (CIs). A random-effects model based on DerSimonian and Laird’s method was used to account for variability among studies, incorporating an inverse-variance fixed-effect model to enhance the robustness of the findings.

In the qualitative phase, data were analyzed using MAXQDA 20, facilitating a metasynthesis of suicide risk factors. The researchers coded the data into various thematic categories, including demographic factors (age, gender, marital status), clinical and behavioral aspects (impulsivity, self-harm, mental health diagnoses), lifestyle factors (daily habits), laboratory and biomarker indicators (genetic markers, hormonal imbalances), and standardized questionnaires. This comprehensive coding process enabled the identification of common suicide risk factors across the studies, contributing to a deeper understanding of the complexities surrounding suicide risk.

Results

The results section outlines the systematic review process as depicted in the PRISMA flow chart (Figure 1). An initial search identified 7,529 published studies, from which 569 duplicates were removed, leaving 6,624 titles and abstracts for screening. Following this, 5,624 papers were excluded for not meeting the inclusion criteria. A thorough assessment of the full texts of the remaining 369 studies led to the exclusion of 328, resulting in 41 studies that qualified for meta-analysis and meta-synthesis based on quality assessment criteria.

The selected studies exhibited satisfactory quality, with sample sizes ranging from 159 to 13,980,570 participants, yielding a mean sample size of approximately 549,944.51 (SD = 2,242,858.719). This variability in sample sizes, with a median of 13,420, highlights the heterogeneity of the findings and emphasizes the importance of considering sample size diversity in result interpretation. The majority of studies were conducted in the United States and South Korea, utilizing cohort and case-control designs, with participants primarily representing the general population. The outcome measurement criteria and data collection results are discussed in subsequent sections.

Discussion

In this study, a systematic review and meta-analysis were conducted to evaluate the effectiveness of machine learning (ML) algorithms in predicting suicide risk and to identify significant suicide risk factors. Following PRISMA guidelines, the research included a comprehensive literature search across multiple databases, resulting in the selection of 41 studies that met stringent inclusion and exclusion criteria. The analysis revealed that the random forest algorithm exhibited the highest accuracy (AUC = 0.94), while the XGBoost classifier achieved an impressive AUC of 0.97, indicating strong predictive capabilities. These findings underscore the potential of ML techniques in enhancing suicide risk prediction models by integrating a broader range of risk factors.

The study also identified 261 suicide risk factors, narrowing down to 55 frequently cited indicators through a rigorous extraction process. This categorization was informed by expert input from a focus group of psychiatrists and emergency physicians, ensuring the relevance of the selected factors. While the results align with previous research advocating for the utility of ML in mental health predictions, they also highlight the variability in algorithm performance based on dataset quality and size. Overall, the findings suggest that ML holds promise for improving suicide prediction, although challenges remain in translating these models into clinical practice effectively.