مصنف التصويت الناعم القابل للتفسير لتوقع أمراض القلب باستخدام SHAP و LIME Explainable soft-voting classifier for heart disease prediction using SHAP and LIME

المجلة: Discover Computing، المجلد: 29، العدد: 1
DOI: https://doi.org/10.1007/s10791-026-09972-4
تاريخ النشر: 2026-02-10
المؤلف: Samiksha Walia وآخرون
الموضوع الرئيسي: الذكاء الاصطناعي في الرعاية الصحية

نظرة عامة

يعتبر قطاع الرعاية الصحية حيويًا في مواجهة التحديات المتعددة الأوجه التي تطرحها الأمراض القلبية الوعائية (CVDs)، وخاصة النوبات القلبية، التي تمثل حوالي 17.9 مليون حالة وفاة سنويًا. يسلط هذا البحث الضوء على العبء الكبير للأمراض القلبية الوعائية، مؤكدًا أن أكثر من 80% من هذه الوفيات ناتجة عن النوبات القلبية. يقترح البحث نهجًا مبتكرًا يجمع بين التعلم الجماعي الذكي ومنهجيات الذكاء الاصطناعي القابل للتفسير (XAI) لتعزيز كل من دقة التنبؤ وقابلية التفسير في تشخيص أمراض القلب.

أظهر المصنف الناعم القائم على التصويت أداءً ملحوظًا، حيث حقق دقة بنسبة 89.10%، ودقة بنسبة 83.90%، واسترجاع بنسبة 95.20%، ودرجة F1 بنسبة 89.90%، متفوقًا على النماذج الأساسية الفردية. قدم دمج منهجيات SHAP وLIME رؤى قيمة حول الميزات الحرجة التي تؤثر على التنبؤات، مثل العمر، ومستويات الكوليسترول، وضغط الدم أثناء الراحة، مما يعزز الثقة السريرية واتخاذ القرارات المستنيرة. بينما يشير متوسط وقت التنبؤ للنموذج الذي يقل عن ثلاث مللي ثانية إلى عمليته في الإعدادات السريرية المعتدلة، لا تزال هناك تحديات، بما في ذلك العبء الحسابي لتفسيرات SHAP والحاجة إلى حماية خصوصية المرضى. يبرز البحث أهمية معالجة التحيزات المحتملة في تمثيل البيانات وتنفيذ ممارسات الذكاء الاصطناعي المسؤولة لتسهيل انتقال هذه النماذج التنبؤية إلى التطبيقات السريرية في العالم الحقيقي.

النتائج

يقدم قسم النتائج في ورقة البحث نتائج من تجارب القابلية للتفسير باستخدام SHAP (SHapley Additive exPlanations) لتحليل تنبؤات النموذج المتعلقة بتشخيص أمراض القلب. تم تقسيم النتائج إلى فئات محلية وعالمية.

في التحليل المحلي، يتم فحص التنبؤات الفردية، موضحًا كيف تؤثر ميزات معينة على نتيجة التصنيف للمرضى. على سبيل المثال، تزيد مستويات ‘الكوليسترول’ المرتفعة وقيم ‘oldpeak’ غير الطبيعية من خطر الإصابة بأمراض القلب، بينما تقلل ‘restecg’ الطبيعية وغياب ‘الذبحة الصدرية الناتجة عن التمارين’ من هذا الخطر. يوضح مخطط شلال SHAP هذا بشكل أكبر من خلال عرض المساهمات التراكمية للميزات في التنبؤ النهائي، مما يعزز شفافية النموذج ويعزز الثقة بين الممارسين الصحيين. تعتبر هذه القابلية للتفسير المحلية حاسمة للأطباء حيث تتماشى مع تفكيرهم التشخيصي.

توفر النتائج العالمية منظورًا أوسع حول أهمية الميزات عبر مجموعة البيانات. يصنف مخطط الملخص العالمي لـ SHAP الميزات حسب تأثيرها المتوسط على تنبؤات النموذج، كاشفًا أن ‘cp’ و’oldpeak’ و’thal’ و’ca’ هي مؤشرات مهمة لخطر الإصابة بأمراض القلب. يساعد هذا التصنيف العالمي الأطباء في تحديد عوامل الخطر الرئيسية ويبرز أهمية بعض الميزات التي قد لا تؤخذ عادةً في الاعتبار. بالإضافة إلى ذلك، يسمح مخطط الملخص المعقد بتجميع المرضى ذوي الملفات الشخصية المماثلة، مما يعزز فهم كيفية تعميم النموذج عبر مجموعات سكانية متنوعة. بشكل عام، تؤكد النتائج فعالية منهجيات SHAP وLIME في تقديم تفسيرات متسقة لأهمية المتغيرات، مما يدعم النشر الأخلاقي للذكاء الاصطناعي في الإعدادات السريرية.

المناقشة

تقدم ورقة البحث مصنفًا جديدًا قائمًا على التصويت الناعم يجمع بين خوارزميات الغابة العشوائية (RF) وشجرة القرار (DT) وآلة الدعم الناقل (SVM) لاكتشاف أمراض القلب. يستخدم النموذج تقنيات الذكاء الاصطناعي القابل للتفسير (XAI)، وتحديدًا SHAP وLIME، لتعزيز قدرات اتخاذ القرار للمهنيين الطبيين، مما يحسن دقة التشخيص. تشير تقييمات النموذج المقترح باستخدام مجموعة بيانات قياسية لأمراض القلب إلى أنه يتفوق على النماذج الأساسية التقليدية، محققًا دقة قصوى تبلغ 89.10%، إلى جانب دقة ملحوظة، واسترجاع، ودرجات F1.

تم هيكلة الورقة لمراجعة الأعمال ذات الصلة في تطبيقات الذكاء الاصطناعي وXAI ضمن الرعاية الصحية، مع تسليط الضوء على التقدم في استخراج الميزات من خلال التعلم العميق ودمج آليات الانتباه. تناقش الورقة نماذج مختلفة استخدمت بنجاح XAI للقابلية للتفسير في التشخيصات الطبية. يجمع النموذج المقترح بين شبكة عصبية تلافيفية عميقة (DCNN) ووحدة متكررة بوابة (GRU) لالتقاط الأنماط المحلية والعالمية في صور الرئة، مما يعزز القابلية للتفسير من خلال تقنيات XAI. يتم تفصيل النهج المنهجي لمعالجة البيانات وخط الأنابيب الحسابي، مما يضمن قوة تنبؤات النموذج مع معالجة تحديات الشفافية في تطبيقات التعلم الآلي في الإعدادات السريرية.

Journal: Discover Computing, Volume: 29, Issue: 1
DOI: https://doi.org/10.1007/s10791-026-09972-4
Publication Date: 2026-02-10
Author(s): Samiksha Walia et al.
Primary Topic: Artificial Intelligence in Healthcare

Overview

The healthcare sector is crucial in tackling the multifaceted challenges posed by cardiovascular diseases (CVDs), particularly heart attacks, which account for approximately 17.9 million deaths annually. This paper highlights the significant burden of CVDs, emphasizing that over 80% of these fatalities stem from heart attacks. The research proposes an innovative approach that combines intelligent ensemble learning with explainable artificial intelligence (XAI) methodologies to enhance both predictive accuracy and interpretability in heart disease diagnosis.

The proposed soft voting classifier demonstrated remarkable performance metrics, achieving an accuracy of 89.10%, precision of 83.90%, recall of 95.20%, and an F1-score of 89.90%, outperforming individual baseline models. The integration of SHAP and LIME methodologies provided valuable insights into critical features influencing predictions, such as age, cholesterol levels, and resting blood pressure, thereby fostering clinical trust and informed decision-making. While the model’s average prediction time of less than three milliseconds suggests practicality for moderate clinical settings, challenges remain, including the computational overhead of SHAP explanations and the need to safeguard patient confidentiality. The study underscores the importance of addressing potential biases in data representation and implementing responsible AI practices to facilitate the transition of these predictive models into real-world clinical applications.

Results

The results section of the research paper presents findings from interpretability experiments using SHAP (SHapley Additive exPlanations) to analyze model predictions related to heart disease diagnosis. The findings are divided into local and global categories.

In the local analysis, individual predictions are examined, illustrating how specific features influence the classification outcome for patients. For instance, elevated levels of ‘cholesterol’ and abnormal ‘oldpeak’ values increase the risk of heart disease, while normal ‘restecg’ and absence of ‘exercise-induced angina’ decrease it. The SHAP waterfall plot further elucidates this by showing the cumulative contributions of features to the final prediction, enhancing the model’s transparency and fostering trust among healthcare practitioners. This local interpretability is crucial for clinicians as it aligns with their diagnostic reasoning.

The global findings provide a broader perspective on feature importance across the dataset. The SHAP global summary plot ranks features by their average influence on model predictions, revealing that ‘cp’, ‘oldpeak’, ‘thal’, and ‘ca’ are significant predictors of cardiovascular risk. This global ranking aids clinicians in identifying key risk factors and emphasizes the importance of certain features that may not typically be considered. Additionally, a nuanced summary plot allows for clustering of patients with similar profiles, enhancing understanding of how the model generalizes across diverse populations. Overall, the results underscore the effectiveness of SHAP and LIME methodologies in providing consistent interpretations of variable importance, thereby supporting the ethical deployment of AI in clinical settings.

Discussion

The research paper presents a novel ensemble soft voting classifier that integrates Random Forest (RF), Decision Tree (DT), and Support Vector Machine (SVM) algorithms for heart disease detection. The model employs explainable artificial intelligence (XAI) techniques, specifically SHAP and LIME, to enhance decision-making capabilities for medical professionals, thereby improving diagnostic accuracy. Evaluation of the proposed model using a standard heart disease dataset indicates that it outperforms traditional baseline models, achieving a maximum accuracy of 89.10%, along with notable precision, recall, and F1 score metrics.

The paper is structured to first review related work in AI and XAI applications within healthcare, highlighting advancements in feature extraction through deep learning and the integration of attention mechanisms. It discusses various models that have successfully utilized XAI for interpretability in medical diagnostics. The proposed model combines a deep convolutional neural network (DCNN) with a gated recurrent unit (GRU) to capture both local and global patterns in lung images, further enhancing interpretability through XAI techniques. The systematic approach to data preprocessing and the computational pipeline is detailed, ensuring the robustness of the model’s predictions while addressing the challenges of transparency in machine learning applications in clinical settings.