UbiQTree: تحديد عدم اليقين في XAI باستخدام مجموعات الأشجار UbiQTree: Uncertainty quantification in XAI with tree ensembles

المجلة: Patterns، المجلد: 7، العدد: 4
DOI: https://doi.org/10.1016/j.patter.2025.101454
PMID: https://pubmed.ncbi.nlm.nih.gov/42005390
تاريخ النشر: 2026-02-04
المؤلف: Akshat Dubey وآخرون
الموضوع الرئيسي: تحليل السلاسل الزمنية والتنبؤ

نظرة عامة

تناقش هذه القسم أهمية تقنيات الذكاء الاصطناعي القابل للتفسير (XAI)، وبشكل خاص تفسيرات شابلي الإضافية (SHAP)، في تفسير نماذج الأشجار التجميعية، وخاصة في مجال الرعاية الصحية. يبرز المؤلفون قيدًا حاسمًا لقيم SHAP، التي غالبًا ما تُقدم كتقديرات نقطية تتجاهل عدم اليقين الكامن الناجم عن مصادر عشوائية (ضوضاء غير قابلة للتقليل) ومعرفية (ندرة البيانات). لمعالجة ذلك، يقدم البحث نهجًا جديدًا يقوم بتفكيك عدم اليقين في قيم SHAP إلى مكونات متميزة – عشوائية، معرفية، وتشابك – باستخدام نظرية أدلة ديمبستر-شيفر وعينة فرضيات عملية ديريشليت.

تشير النتائج إلى أن هذا الإطار المحسن يحسن من تقارير عدم اليقين من خلال توفير قيم SHAP مع فترات ثقة بنسبة 95% بدلاً من مجرد تقديرات نقطية. هذا ذو قيمة خاصة في الرعاية الصحية، حيث يساعد في تحديد الميزات التي تتطلب مراجعة من الخبراء. يؤكد المؤلفون أن القياس الفعال لعدم اليقين، خاصة من خلال نماذج الأشجار مثل التجميع، أمر حاسم لاتخاذ قرارات قوية وتحسين النماذج. كما يبرز البحث أهمية مقاييس عدم اليقين القابلة للتدقيق في تنظيمات الذكاء الاصطناعي، مقترحًا أن التباين، والانتروبيا، وثبات الإشارة تقدم معًا فهمًا شاملاً لعدم اليقين في نسب SHAP.

مقدمة

تناقش مقدمة ورقة البحث الدور الحاسم لتعلم الآلة (ML) في تعزيز تحليلات الرعاية الصحية، وخاصة في مجالات مثل تخطيط الموارد، تشخيص الأمراض، التنبؤ، وتصنيف المخاطر. على الرغم من دقة وكفاءة نماذج الأشجار التجميعية – مثل الغابة العشوائية (RF)، وآلات تعزيز التدرج (GBM)، وتعزيز التدرج المتطرف (XGBoost) – إلا أنها تتأثر بطبيعتها بعدم اليقين بسبب عوامل مثل البيانات الضوضائية وقيود النموذج. تتفوق هذه النماذج في إدارة مجموعات البيانات المعقدة والمهيكلة التي تعتبر نموذجية في الرعاية الصحية، لكن طبيعتها “الصندوق الأسود” تعقد من قابلية التفسير، وهو أمر ضروري لاتخاذ القرارات في البيئات ذات المخاطر العالية.

لتخفيف هذه التحديات في قابلية التفسير، يبرز البحث ظهور تقنيات الذكاء الاصطناعي القابل للتفسير (XAI)، وخاصة تفسيرات شابلي الإضافية (SHAP)، التي توفر إطارًا لنسب مساهمات الميزات في التنبؤات. تهدف التطورات الأخيرة في XAI، بما في ذلك طرق مثل المسارات المضادة (CPATH) وSHAP الكامن، إلى تعزيز الشفافية وفهم تنبؤات النموذج. ومع ذلك، يشير البحث إلى أن قيم SHAP غالبًا ما تُعامل كتقديرات نقطية، متجاهلة عدم اليقين المعرفي الناجم عن التباين في تدريب النموذج. يمكن أن يؤدي هذا الإغفال إلى تفسيرات خاطئة في التطبيقات الحرجة، مثل تشخيصات الرعاية الصحية. يقترح المؤلفون إطارًا جديدًا يقوم بتفكيك تباين SHAP إلى مكونات عشوائية، معرفية، وتشابك، باستخدام وظائف الاعتقاد وعمليات ديريشليت لأخذ عينات من فضاء الفرضيات، مما يعزز من موثوقية وقابلية تفسير نماذج ML في البيئات غير المؤكدة.

طرق

في هذه الدراسة، يقترح المؤلفون إطارًا متكاملًا يجمع بين أخذ عينات فرضيات عملية ديريشليت، ونظرية عدم اليقين ليو، ونظرية ديمبستر-شيفر لتفكيك وعدم قياس عدم اليقين في قيم SHAP (تفسيرات شابلي الإضافية) بشكل فعال. يعالج هذا النهج تعقيدات كل من عدم اليقين العشوائي والمعرفي في نسب الميزات، والتي غالبًا ما يتم التعامل معها بشكل غير كافٍ من قبل الطرق التقليدية. يسهل الإطار استكشافًا مفصلًا لمصادر عدم اليقين، كما هو موضح في الخوارزمية 5.

لضمان إمكانية تكرار التجارب، استخدم المؤلفون مجموعة بيانات MIMIC-III، حيث تمت إزالة الأعمدة غير المفيدة وتصنيف مدة الإقامة (LOS) إلى ثلاث مجموعات: إقامة قصيرة (<3 أيام)، إقامة متوسطة (3-7 أيام)، وإقامة طويلة (≥7 أيام). شملت إعداد البيانات استخدام تعويض الوسيط للميزات العددية والتوحيد باستخدام StandardScaler. تم تدريب مصنف الغابة العشوائية مع 100 مُقدّر، محققًا درجة F1 متوسطة قدرها 94.70% على تقسيم تدريبي-اختباري متوازن بنسبة 70:30. شمل تحليل عدم اليقين توليد مخططات أهمية الميزات العالمية، وتوزيعات قيم SHAP مع فترات موثوقة بنسبة 95%، وتفكيك عدم اليقين إلى مكوناته، جميعها مرئية من خلال إطار UbiQTree.

نقاش

يؤكد قسم النقاش في ورقة البحث على أهمية قيم SHAP (تفسيرات شابلي الإضافية) في قياس وتفسير مساهمات الميزات في تعلم الآلة (ML)، خاصة في تطبيقات الرعاية الصحية. تُعرف SHAP بطبيعتها المستقلة عن النموذج، مما يعزز من قابلية تفسير نماذج ML المعقدة المستخدمة في التنبؤ بنتائج الصحة، مثل مخاطر سرطان الثدي وتشخيص الأمراض المزمنة. يبرز البحث أنه بينما توفر SHAP رؤى قيمة، فإن الطرق التقليدية لحساب درجات SHAP يمكن أن تكون محدودة بسبب اعتمادها على بيانات إدخال محددة جيدًا وتوزيعات احتمالية أساسية، والتي غالبًا ما تكون غير معروفة أو مقدرة بشكل سيء. أدت التطورات الأخيرة في قياس عدم اليقين لقيم SHAP إلى تقديم أطر تحسب فترات الثقة والتوزيعات، مما يعالج عدم اليقين المعرفي الكامن في تقديرات أهمية الميزات.

يقترح المؤلفون تفكيكًا جديدًا لتباين SHAP إلى مكونات عشوائية، معرفية، وتشابك، مما يسمح بفهم أكثر دقة لعدم اليقين في نسب الميزات. يكشف هذا التفكيك أن التباين الكلي لقيم SHAP يتأثر بالتغاير بين التوقعات المرتكزة على النموذج وتلك المرتكزة على البيانات، خاصة في نماذج الأشجار التجميعية. تشير النتائج إلى أنه مع زيادة عدم اليقين المعرفي، يمكن أن تنخفض تقديرات عدم اليقين العشوائي، مما قد يؤدي إلى تحيزات منهجية في تنبؤات النموذج. يناقش البحث أيضًا تطبيق نظرية ديمبستر-شيفر وعمليات ديريشليت لتعزيز تمثيل عدم اليقين في قيم SHAP، مما يوفر إطارًا قويًا لإدارة الغموض في أنظمة الذكاء الاصطناعي ذات المخاطر العالية. بشكل عام، يبرز البحث الحاجة إلى مزيد من الاستكشاف لعدم اليقين في قيم SHAP لتحسين موثوقية وشفافية نماذج ML في سياقات الرعاية الصحية الحرجة.

Journal: Patterns, Volume: 7, Issue: 4
DOI: https://doi.org/10.1016/j.patter.2025.101454
PMID: https://pubmed.ncbi.nlm.nih.gov/42005390
Publication Date: 2026-02-04
Author(s): Akshat Dubey et al.
Primary Topic: Time Series Analysis and Forecasting

Overview

This section discusses the significance of Explainable Artificial Intelligence (XAI) techniques, specifically SHapley Additive exPlanations (SHAP), in interpreting ensemble tree-based models, particularly within the healthcare domain. The authors highlight a critical limitation of SHAP values, which are often presented as point estimates that overlook the inherent uncertainties arising from aleatoric (irreducible noise) and epistemic (data scarcity) sources. To address this, the paper introduces a novel approach that decomposes SHAP value uncertainty into distinct components—aleatoric, epistemic, and entanglement—utilizing Dempster-Shafer evidence theory and Dirichlet process hypothesis sampling.

The findings indicate that this enhanced framework improves uncertainty reporting by providing SHAP values with 95% confidence intervals instead of mere point estimates. This is particularly valuable in healthcare, where it aids in identifying features that necessitate expert review. The authors emphasize that effective quantification of uncertainties, especially through tree-based models like bagging, is crucial for robust decision-making and model refinement. The study also underscores the importance of auditable uncertainty metrics in AI regulations, suggesting that variance, entropy, and sign stability collectively offer a comprehensive understanding of uncertainty in SHAP attributions.

Introduction

The introduction of the research paper discusses the critical role of machine learning (ML) in enhancing healthcare analytics, particularly in areas such as resource planning, disease diagnosis, prognosis, and risk stratification. Despite the accuracy and efficiency of ensemble tree-based models—like Random Forest (RF), Gradient Boosting Machines (GBM), and Extreme Gradient Boosting (XGBoost)—they are inherently affected by uncertainty due to factors such as noisy data and model limitations. These models excel in managing complex, structured datasets typical in healthcare, but their “black box” nature complicates interpretability, which is essential for decision-making in high-stakes environments.

To mitigate these interpretability challenges, the paper highlights the emergence of explainable AI (XAI) techniques, particularly SHapley Additive exPlanations (SHAP), which provide a framework for attributing feature contributions to predictions. Recent advancements in XAI, including methods like Counterfactual Paths (CPATH) and Latent SHAP, aim to enhance transparency and understanding of model predictions. However, the paper notes that SHAP values are often treated as point estimates, neglecting the epistemic uncertainty that arises from variability in model training. This oversight can lead to misinterpretations in critical applications, such as healthcare diagnostics. The authors propose a new framework that decomposes SHAP variance into aleatoric, epistemic, and entanglement components, utilizing belief functions and Dirichlet processes for hypothesis space sampling, thereby enhancing the reliability and interpretability of ML models in uncertain environments.

Methods

In this study, the authors propose an integrated framework that combines Dirichlet process hypothesis sampling, Liu’s uncertainty theory, and Dempster-Shafer theory to effectively decompose and quantify uncertainty in SHAP (SHapley Additive exPlanations) values. This approach addresses the complexities of both aleatoric and epistemic uncertainties in feature attribution, which are often inadequately handled by traditional methods. The framework facilitates a detailed exploration of uncertainty sources, as outlined in Algorithm 5.

For experimental reproducibility, the authors utilized the MIMIC-III dataset, removing non-informative columns and categorizing the length of stay (LOS) into three groups: short stay (<3 days), medium stay (3-7 days), and long stay (≥7 days). The data preparation involved median imputation for numerical features and standardization using StandardScaler. A Random Forest classifier was trained with 100 estimators, achieving a micro-averaged F1 score of 94.70% on a stratified 70:30 train-test split. The uncertainty analysis encompassed generating global feature importance plots, SHAP value distributions with 95% credible intervals, and decomposing uncertainty into its components, all visualized through the UbiQTree framework.

Discussion

The discussion section of the research paper emphasizes the significance of SHAP (SHapley Additive exPlanations) values in quantifying and interpreting feature contributions in machine learning (ML), particularly within healthcare applications. SHAP is recognized for its model-agnostic nature, enhancing the interpretability of complex ML models used in predicting health outcomes, such as breast cancer risk and chronic disease diagnosis. The paper highlights that while SHAP provides valuable insights, traditional methods of calculating SHAP scores can be limited by their reliance on well-specified input data and underlying probability distributions, which are often unknown or poorly estimated. Recent advancements in uncertainty quantification for SHAP values have introduced frameworks that calculate confidence intervals and distributions, addressing the inherent epistemic uncertainty in feature importance estimates.

The authors propose a novel decomposition of SHAP variance into aleatoric, epistemic, and entanglement components, which allows for a more nuanced understanding of uncertainty in feature attributions. This decomposition reveals that the total variance of SHAP values is influenced by the covariance between model-centric and data-centric expectations, particularly in tree ensemble models. The findings indicate that as epistemic uncertainty increases, estimates of aleatoric uncertainty can decrease, potentially leading to systematic biases in model predictions. The paper also discusses the application of Dempster-Shafer theory and Dirichlet processes to enhance the representation of uncertainty in SHAP values, providing a robust framework for managing ambiguity in high-stakes AI systems. Overall, the research underscores the need for further exploration of uncertainty in SHAP values to improve the reliability and transparency of ML models in critical healthcare contexts.