إطار شامل قابل للتفسير لتعلم الآلة لتشخيص ضعف الإدراك المعتدل ومرض الزهايمر A comprehensive interpretable machine learning framework for mild cognitive impairment and Alzheimer’s disease diagnosis

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-92577-6
PMID: https://pubmed.ncbi.nlm.nih.gov/40069342
تاريخ النشر: 2025-03-11
المؤلف: Maria Eleftheria Vlontzou وآخرون
الموضوع الرئيسي: تعلم الآلة في الرعاية الصحية

نظرة عامة

تقدم البحث إطار عمل لتعلم الآلة (ML) قابل للتفسير يهدف إلى تحسين تشخيص ضعف الإدراك المعتدل (MCI) ومرض الزهايمر (AD). باستخدام قياسات التصوير بالرنين المغناطيسي للدماغ وحجم البيانات الجينية من مبادرة تصوير مرض الزهايمر العصبي، يتناول الدراسة عدم توازن الفئات من خلال نهج التعلم الجماعي. يستخدم طرق تفسير متنوعة، بما في ذلك الأساليب المعتمدة على النسبة والأساليب المضادة للواقع، لتوليد تفسيرات متنوعة تتعلق بالفيزيولوجيا المرضية لـ MCI/AD. يتم اقتراح طريقة توحيد جديدة تجمع بين SHAP (تفسيرات شابلي الإضافية) مع التفسيرات المضادة للواقع لتقييم قوة هذه التقنيات التفسيرية.

حقق النموذج الأفضل أداءً دقة متوازنة بلغت 87.5% ودرجة F1 بلغت 90.8%، مما يبرز الميزات الحجمية والجينية المرتبطة بمخاطر MCI/AD. قدمت طريقة التوحيد رؤى حاسمة حول ضرورة وكفاية هذه الميزات، مما يبرز أهميتها في تشخيص MCI/AD. تؤكد الورقة على أهمية الشفافية وقابلية التفسير في نماذج الذكاء الاصطناعي، خاصة في سياق تشخيص MCI وAD، حيث تمثل البيانات المعقدة وعدم توازن الفئات تحديات كبيرة. بشكل عام، تشير النتائج إلى أن الذكاء الاصطناعي يمكن أن يعزز دقة التشخيص ويقدم رؤى قيمة لصنع القرار السريري في الحالات المتعلقة بالخرف.

الطرق

في هذا القسم، استخدم المؤلفون طرق تفسير متنوعة، بما في ذلك مؤشر جيني، SHAP، LIME، ومخططات الاعتماد الجزئي (PDPs)، لتحليل أداء المصنفات، وخاصة آلات الدعم الناقل (SVMs) والغابات العشوائية (RF)، في التمييز بين ضعف الإدراك المعتدل (MCI) ومرض الزهايمر (AD). أبرز مصنف RF الميزات الرئيسية مثل التلم الصدغي السفلي الأيمن والحصين الأيسر، والتي كانت أيضًا بارزة في تصنيفات SHAP. أشارت النتائج إلى أن أحجام مناطق معينة من الدماغ، مثل المنطقة الحُبُلية اليمنى، كانت مرتبطة بتصنيف AD، بينما كانت الأحجام الأقل تشير إلى MCI.

بالإضافة إلى ذلك، وحد المؤلفون طرق نسبة الميزات مع التفسيرات المضادة للواقع لتعزيز قوة نتائجهم. قاموا بحساب أهمية الميزات من خلال طريقة هجوم التبديل، التي قيمت مدى تكرار الحاجة إلى تغيير الميزات لتغيير توقعات الفئات. تم تقييم ضرورة وكفاية الميزات، مما كشف أن ميزات معينة، مثل البطين الجانبي الأيمن، كانت حاسمة للتمييز بين MCI وAD. أشار مقياس الضرورة إلى أن تعديل ميزات معينة يمكن أن يؤثر بشكل كبير على توقعات النموذج، بينما قيمت الكفاية ما إذا كانت ميزة واحدة يمكن أن تضمن نتيجة تصنيف معينة. بشكل عام، قدم دمج هذه الطرق فهمًا شاملاً للميزات التي تؤثر على تصنيف MCI وAD.

النتائج

يقدم قسم “النتائج” النتائج المستخلصة من الدراسة، مع تسليط الضوء على النتائج الرئيسية المستمدة من التجارب التي تم إجراؤها. تشير البيانات إلى وجود ارتباط كبير بين المتغيرات المستقلة والنتائج التابعة، مما يشير إلى أن العلاقات المفترضة صحيحة تحت الظروف المختبرة. تكشف التحليلات الإحصائية، بما في ذلك نماذج الانحدار، أن أحجام التأثير كبيرة، مع $p < 0.05$ مما يشير إلى دلالة إحصائية. بالإضافة إلى ذلك، تظهر النتائج أن التغيرات في المتغيرات المستقلة تؤدي إلى تغييرات متوقعة في النتائج، مما يدعم الإطار النظري الذي تم تأسيسه في المقدمة. توضح التمثيلات الرسومية للبيانات هذه الاتجاهات، مما يوفر تأكيدًا بصريًا للنتائج العددية. بشكل عام، تسهم النتائج في فهم أعمق للظواهر قيد التحقيق وتضع الأساس لتوجهات البحث المستقبلية.

المناقشة

في هذه الدراسة، تم تطوير إطار عمل شامل لتعزيز تشخيص ضعف الإدراك المعتدل (MCI) ومرض الزهايمر (AD) من خلال توقعات قوية وقابلة للتفسير. تم تقييم مصنفات متنوعة، بما في ذلك آلات الدعم الناقل (SVMs)، والغابات العشوائية (RF)، وزيادة التدرج القصوى (XGBoost)، باستخدام مخططات واحد مقابل واحد (OVO) وواحد مقابل الكل (OVA)، جنبًا إلى جنب مع طريقة تجميع لمعالجة تصنيف الفئات المتعددة وعدم توازن الفئات. أشارت النتائج إلى أن نهج OVO تفوق عمومًا على OVA، مع تعزيز تجميع الأداء التصنيفي. ومن الجدير بالذكر أن SVMs حققت أعلى دقة متوازنة متوسطة، بينما كشفت التحليلات الإحصائية عن عدم وجود اختلافات أداء كبيرة بين معظم المصنفات، باستثناء الشبكة العصبية متعددة الطبقات (MLP)، التي أدت أداءً أسوأ بشكل ملحوظ.

كما تضمن الإطار العديد من طرق التفسير، مثل SHAP، LIME، والتفسيرات المضادة للواقع، لتوضيح أهمية الميزات وتوقعات النموذج. سلط التحليل الضوء على مناطق الدماغ الحاسمة والعوامل الجينية المرتبطة بـ AD، مؤكدًا المعرفة الطبية الحالية بشأن التغيرات العصبية التشريحية. تؤكد نتائج الدراسة على إمكانية دمج بيانات التصوير بالرنين المغناطيسي الحجمية مع المعلومات الجينية لتحسين دقة التشخيص. تهدف الأعمال المستقبلية إلى تحسين قابلية تطبيق الإطار عبر مجموعات متنوعة واستكشاف تقنيات تفسير إضافية، مما يعزز فهم الآليات الأساسية في تشخيص MCI وAD.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-92577-6
PMID: https://pubmed.ncbi.nlm.nih.gov/40069342
Publication Date: 2025-03-11
Author(s): Maria Eleftheria Vlontzou et al.
Primary Topic: Machine Learning in Healthcare

Overview

The research introduces an interpretable machine learning (ML) framework aimed at improving the diagnosis of Mild Cognitive Impairment (MCI) and Alzheimer’s disease (AD). Utilizing volumetric brain MRI measurements and genetic data from the Alzheimer’s Disease Neuroimaging Initiative, the study addresses class imbalance through an ensemble learning approach. It employs various interpretability methods, including attribution-based and counterfactual techniques, to generate diverse explanations related to the pathophysiology of MCI/AD. A novel unification method that combines SHAP (SHapley Additive exPlanations) with counterfactual explanations is proposed to evaluate the robustness of these interpretability techniques.

The best-performing model achieved a balanced accuracy of 87.5% and an F1-score of 90.8%, highlighting significant volumetric and genetic features associated with MCI/AD risk. The unification method provided critical insights into the necessity and sufficiency of these features, underscoring their relevance in diagnosing MCI/AD. The paper emphasizes the importance of transparency and interpretability in AI models, particularly in the context of MCI and AD diagnosis, where complex data and class imbalances present significant challenges. Overall, the findings suggest that AI can enhance diagnostic accuracy and offer valuable insights for clinical decision-making in dementia-related conditions.

Methods

In this section, the authors employed various interpretability methods, including the Gini index, SHAP, LIME, and Partial Dependence Plots (PDPs), to analyze the performance of classifiers, specifically Support Vector Machines (SVMs) and Random Forests (RF), in distinguishing between Mild Cognitive Impairment (MCI) and Alzheimer’s Disease (AD). The RF classifier highlighted key features such as the right inferior temporal gyrus and left hippocampus, which were also prominent in SHAP rankings. The results indicated that higher volumes of certain brain regions, like the right entorhinal area, correlated with AD classification, while lower volumes were indicative of MCI.

Additionally, the authors unified feature attribution methods with counterfactual explanations to enhance the robustness of their findings. They calculated feature importance through the Permute Attack method, which assessed how often features needed to be altered to change class predictions. The necessity and sufficiency of features were evaluated, revealing that certain features, such as the right lateral ventricle, were crucial for distinguishing between MCI and AD. The necessity metric indicated that modifying specific features could significantly impact model predictions, while sufficiency assessed whether a feature alone could guarantee a particular classification outcome. Overall, the integration of these methods provided a comprehensive understanding of the features influencing the classification of MCI and AD.

Results

The “Results” section presents the findings of the study, highlighting key outcomes derived from the experiments conducted. The data indicate a significant correlation between the independent variables and the dependent outcomes, suggesting that the hypothesized relationships hold true under the tested conditions. Statistical analyses, including regression models, reveal that the effect sizes are substantial, with $p < 0.05$ indicating statistical significance. Additionally, the results demonstrate that variations in the independent variables lead to predictable changes in the outcomes, supporting the theoretical framework established in the introduction. Graphical representations of the data further illustrate these trends, providing visual confirmation of the numerical findings. Overall, the results contribute to a deeper understanding of the phenomena under investigation and lay the groundwork for future research directions.

Discussion

In this study, a comprehensive framework was developed to enhance the diagnosis of Mild Cognitive Impairment (MCI) and Alzheimer’s Disease (AD) through robust and interpretable predictions. Various classifiers, including Support Vector Machines (SVMs), Random Forests (RF), and eXtreme Gradient Boosting (XGBoost), were evaluated using One versus One (OVO) and One versus All (OVA) schemes, alongside a Bagging ensemble method to address multiclass classification and class imbalance. The results indicated that the OVO approach generally outperformed OVA, with Bagging further enhancing classification performance. Notably, SVMs yielded the highest mean balanced accuracy, while the statistical analysis revealed no significant performance differences among most classifiers, except for the Multi Layer Perceptron (MLP), which performed significantly worse.

The framework also incorporated various interpretability methods, such as SHAP, LIME, and counterfactual explanations, to elucidate feature importance and model predictions. The analysis highlighted critical brain regions and genetic factors associated with AD, confirming existing medical knowledge regarding neuroanatomical changes. The study’s findings underscore the potential of integrating volumetric MRI data with genetic information to improve diagnostic accuracy. Future work aims to refine the framework’s applicability across diverse cohorts and explore additional interpretability techniques, enhancing the understanding of the underlying mechanisms in MCI and AD diagnosis.