تحسين اكتشاف الاكتئاب في المقابلات السريرية بين الأطباء والمرضى باستخدام إطار تعلم متعدد الحالات Optimizing depression detection in clinical doctor-patient interviews using a multi-instance learning framework

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-90117-w
PMID: https://pubmed.ncbi.nlm.nih.gov/39994325
تاريخ النشر: 2025-02-24
المؤلف: Xu Zhang وآخرون
الموضوع الرئيسي: الصحة النفسية من خلال الكتابة

نظرة عامة

تتناول ورقة البحث الزيادة المتزايدة في انتشار الاكتئاب والحاجة إلى تحسين طرق الكشف. تعاني الأساليب التقليدية، مثل مقياس الاكتئاب الذاتي (SDS) والمقابلات، غالبًا من الذاتية والتحيزات البيئية. لتعزيز الموضوعية وكفاءة الكشف التلقائي عن الاكتئاب (ADD)، يقترح المؤلفون تطبيقًا جديدًا لإطار التعلم متعدد الحالات (MIL) على بيانات المقابلات النصية. يسمح هذا النهج باستخراج الميزات بشكل مستقل من كل حالة، مما يحسن تمثيل النص ويعالج عدم توازن العينة. النموذج التجميعي، الذي يجمع بين MT5 و RoBERTa (المسمى multi-MTRB)، يحدد المحتوى الاكتئابي بفعالية، محققًا درجات F1 تبلغ 0.88 و 0.86 على مجموعتي بيانات DAIC-WOZ و E-DAIC، على التوالي.

في الختام، تُظهر الدراسة أن دمج إطار MIL مع النماذج المدربة مسبقًا يعزز بشكل كبير من التقاط المعلومات العاطفية في النصوص الطويلة. إن إدخال المعلمات الفائقة للتخفيف من ضوضاء الحالات السلبية يعزز أيضًا أداء النموذج. ستركز الأعمال المستقبلية على دمج الاعتماد الزمني ودمج بيانات تخطيط الدماغ الكهربائي (EEG) مع الميزات النصية لتحليل أكثر شمولاً للاكتئاب. بالإضافة إلى ذلك، يهدف المؤلفون إلى تحسين قابلية تفسير النموذج من خلال دمج معرفة الاكتئاب مع تقنيات التعلم العميق، مما يمهد الطريق للتقدم العملي في ADD وتحسين نتائج المرضى.

الطرق

في هذه الدراسة، تم إجراء تجارب في بيئة مسيطر عليها باستخدام نظام تشغيل ويندوز وبطاقة رسومات NVIDIA RTX 4070Ti، مع تنفيذ تم باستخدام إطار PyTorch. تم تكوين مجموعة التدريب بحجم دفعة يبلغ 128، ومعدل تعلم قدره 0.001، وإجمالي 100 دورة. لتعزيز قدرة النموذج على تقييم عدم اليقين في التنبؤ، استخدم الباحثون تقنية Monte Carlo Dropout، التي تتضمن تنفيذ 10 تمريرات أمامية مستقلة لكل ميزة إدخال. يسمح هذا الأسلوب بحساب التباين بين التنبؤات، مما يوفر تقديرًا قويًا لعدم اليقين.

يهدف دمج Monte Carlo Dropout بشكل خاص إلى تحسين دقة الكشف وموثوقيته، خاصةً للحالات غير النمطية أو الصعبة. من خلال الاستفادة من هذه التقنية، تسعى الدراسة إلى معالجة قيود أساليب الإسقاط التقليدية في التعلم العميق، مما يعزز في النهاية أداء النموذج في التطبيقات الواقعية.

النتائج

يقدم قسم “النتائج” في ورقة البحث النتائج الرئيسية المستمدة من التجارب أو التحليلات التي تم إجراؤها. عادةً ما يتضمن بيانات كمية، وتحليلات إحصائية، وتمثيلات بصرية مثل الرسوم البيانية أو الجداول التي توضح النتائج. غالبًا ما تتم مقارنة النتائج مع الفرضيات أو الدراسات السابقة لتسليط الضوء على الفروق أو التأكيدات الهامة.

في هذا القسم، قد يبلغ المؤلفون عن مقاييس محددة، مثل القيم المتوسطة، والانحرافات المعيارية، أو معاملات الارتباط، التي توفر رؤى حول العلاقات أو التأثيرات الملحوظة في الدراسة. بالإضافة إلى ذلك، يتم مناقشة أي اتجاهات أو أنماط ملحوظة تم تحديدها في البيانات، مع الإشارة إلى آثارها على أسئلة البحث المطروحة. بشكل عام، يخدم هذا القسم لنقل الأدلة التجريبية التي تدعم استنتاجات الدراسة.

المناقشة

في قسم المناقشة من ورقة البحث، يستكشف المؤلفون التقدم في الكشف عن الاكتئاب القائم على النص، مع التأكيد على دمج الذكاء الاصطناعي (AI) في الرعاية الصحية. يبرزون تعقيد تجليات الاكتئاب وأهمية التعبير اللفظي في فهم الحالات النفسية للمرضى. استخدمت الدراسات السابقة أنماطًا مختلفة، مثل الفيديو والصوت، لكن المؤلفين يجادلون بأن النص لا يزال وسيلة حاسمة لتحليل المشاعر. يصنفون الأساليب القائمة على النص الموجودة إلى أساليب قائمة على قاموس العواطف، والتعلم الآلي، وتقنيات التعلم العميق، مشيرين إلى فعالية تقنيات التعلم العميق، خاصة في تحليل بيانات وسائل التواصل الاجتماعي. ومع ذلك، يشيرون إلى قيود استخدام النصوص القصيرة من منصات مثل تويتر بسبب مخاوف جودة البيانات ويقترحون نهجًا جديدًا يستفيد من بيانات المقابلات السريرية من خلال إطار التعلم متعدد الحالات (MIL)، الذي يعامل النصوص الطويلة كأكياس من الحالات لتحسين دقة الكشف.

يقدم المؤلفون إطار MIL الخاص بهم لكشف الاكتئاب، معالجين تحدي مجموعات البيانات غير المتوازنة حيث تكون العينات الإيجابية (الأفراد المكتئبين) أقل من العينات السلبية. يقومون بتنقيح منطق الحكم التقليدي لـ MIL من خلال وضع معايير لتشخيص الاكتئاب بناءً على وجود حالات اكتئابية داخل كيس من الردود. كما تتناول الورقة مجموعة البيانات المستخدمة، والتي تتكون من مقابلات سريرية من مجموعات DAIC-WOZ و E-DAIC، وتوضح منهجيتهم، بما في ذلك دمج نماذج MT5 و RoBERTa لأداء تصنيف محسّن. تظهر النتائج أن نموذجهم المدمج Multi-MTRB يتفوق على الأساليب الحالية من حيث الدقة، والدقة، والاسترجاع، ودرجة F1، مما يشير إلى إمكانيته للكشف الموثوق عن الاكتئاب في البيئات السريرية. يختتم المؤلفون بالتأكيد على قابلية تفسير نموذجهم، مع عرض تمثيلات بصرية لدرجات الاكتئاب لتعزيز فهم معالجة البيانات الأساسية وآليات اتخاذ القرار.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-90117-w
PMID: https://pubmed.ncbi.nlm.nih.gov/39994325
Publication Date: 2025-02-24
Author(s): Xu Zhang et al.
Primary Topic: Mental Health via Writing

Overview

The research paper addresses the increasing prevalence of depression and the need for improved detection methods. Traditional approaches, such as the self-rating depression scale (SDS) and interviews, often suffer from subjectivity and environmental biases. To enhance the objectivity and efficiency of automatic depression detection (ADD), the authors propose a novel application of the multiple instance learning (MIL) framework to textual interview data. This approach allows for independent feature extraction from each instance, improving text representation and addressing sample imbalance. The ensemble model, combining MT5 and RoBERTa (termed multi-MTRB), effectively identifies depressive content, achieving F1 scores of 0.88 and 0.86 on the DAIC-WOZ and E-DAIC datasets, respectively.

In conclusion, the study demonstrates that integrating the MIL framework with pre-trained models significantly enhances the capture of emotional information in lengthy texts. The introduction of hyperparameters to mitigate negative instance noise further strengthens the model’s performance. Future work will focus on incorporating temporal dependencies and integrating electroencephalogram (EEG) data with textual features for a more comprehensive analysis of depression. Additionally, the authors aim to improve model interpretability by combining depression knowledge with deep learning techniques, paving the way for practical advancements in ADD and enhancing patient outcomes.

Methods

In this study, experiments were conducted in a controlled environment utilizing a Windows operating system and an NVIDIA RTX 4070Ti GPU, with the implementation carried out using the PyTorch framework. The training subset was configured with a batch size of 128, a learning rate of 0.001, and a total of 100 epochs. To enhance the model’s ability to assess prediction uncertainty, the researchers employed Monte Carlo Dropout, which involves executing 10 independent forward passes for each input feature. This method allows for the computation of variance among the predictions, thereby providing a robust estimate of uncertainty.

The incorporation of Monte Carlo Dropout is particularly aimed at improving detection accuracy and reliability, especially for atypical or challenging cases. By leveraging this technique, the study seeks to address the limitations of traditional dropout methods in deep learning, ultimately enhancing the model’s performance in real-world applications.

Results

The “Results” section of the research paper presents the key findings derived from the conducted experiments or analyses. It typically includes quantitative data, statistical analyses, and visual representations such as graphs or tables that illustrate the outcomes. The results are often compared against the hypotheses or previous studies to highlight significant differences or confirmations.

In this section, the authors may report on specific metrics, such as mean values, standard deviations, or correlation coefficients, which provide insight into the relationships or effects observed in the study. Additionally, any notable trends or patterns identified in the data are discussed, along with their implications for the research questions posed. Overall, this section serves to convey the empirical evidence supporting the study’s conclusions.

Discussion

In the discussion section of the research paper, the authors explore advancements in text-based depression detection, emphasizing the integration of artificial intelligence (AI) in healthcare. They highlight the complexity of depression manifestations and the importance of verbal expression in understanding patients’ psychological states. Previous studies have utilized various modalities, such as video and audio, but the authors argue that text remains a crucial medium for sentiment analysis. They categorize existing text-based methods into emotion dictionary-based, machine learning, and deep learning approaches, noting the effectiveness of deep learning techniques, particularly in analyzing social media data. However, they point out the limitations of using short texts from platforms like Twitter due to data quality concerns and propose a novel approach that leverages clinical interview data through a Multi-Instance Learning (MIL) framework, which treats long texts as bags of instances to improve detection accuracy.

The authors introduce their MIL framework for depression detection, addressing the challenge of imbalanced datasets where positive samples (depressed individuals) are fewer than negative samples. They refine traditional MIL judgment logic by establishing criteria for diagnosing depression based on the presence of depressive instances within a bag of responses. The paper also details the dataset used, comprising clinical interviews from the DAIC-WOZ and E-DAIC corpora, and outlines their methodology, including the integration of the MT5 and RoBERTa models for enhanced classification performance. The results demonstrate that their fused Multi-MTRB model outperforms existing methods in terms of accuracy, precision, recall, and F1-score, indicating its potential for reliable depression detection in clinical settings. The authors conclude by emphasizing the interpretability of their model, showcasing visual representations of depression scores to enhance understanding of the underlying data processing and decision-making mechanisms.