التحقيق في نماذج التعلم الآلي القابلة للتفسير للتنبؤ بأمراض الكلى المزمنة Investigation on explainable machine learning models to predict chronic kidney diseases

المجلة: Scientific Reports، المجلد: 14، العدد: 1
DOI: https://doi.org/10.1038/s41598-024-54375-4
PMID: https://pubmed.ncbi.nlm.nih.gov/38355876
تاريخ النشر: 2024-02-14
المؤلف: Samit Kumar Ghosh وآخرون
الموضوع الرئيسي: الذكاء الاصطناعي في الرعاية الصحية

نظرة عامة

تعتبر مرض الكلى المزمن (CKD) تحديًا صحيًا عالميًا كبيرًا، وغالبًا ما يتقدم بدون أعراض في مراحله المبكرة، مما يعقد التشخيص والعلاج في الوقت المناسب. تستكشف هذه الدراسة نهج الذكاء الاصطناعي القابل للتفسير (XAI) للتنبؤ بمرض الكلى المزمن باستخدام الخصائص السريرية من مجموعة بيانات تضم 491 مريضًا، بما في ذلك 56 تم تشخيصهم بمرض الكلى المزمن. تم استخدام خمسة خوارزميات تعلم آلي (ML) – الانحدار اللوجستي (LR)، وغابة عشوائية (RF)، وشجرة قرار (DT)، ونايف بايز (NB)، وتعزيز التدرج المتطرف (XGBoost) – لتطوير نماذج تنبؤية، حيث حقق نموذج XGBoost أعلى أداء، محققًا منطقة تحت المنحنى (AUC) تبلغ 0.9689 ودقة 93.29%.

حدد التحليل الكرياتينين، والهيموجلوبين السكري من النوع A1C (HgbA1C)، والعمر كأهم المتنبئين بمرض الكلى المزمن ضمن نموذج XGBoost. استخدمت الدراسة SHAP (تفسيرات شابلي الإضافية) وLIME (تفسيرات محلية قابلة للتفسير وغير مرتبطة بالنموذج) لتعزيز قابلية تفسير النموذج، مما يسمح للأطباء بفهم الأسباب وراء التنبؤات. تؤكد النتائج على إمكانية وجود إطار عمل قوي لتعلم الآلة للتشخيص المبكر لمرض الكلى المزمن وتبرز أهمية دمج تقنيات تعلم الآلة القابلة للتفسير لتحسين الشفافية في التحليلات التنبؤية. قد تركز الأعمال المستقبلية على توسيع مجموعة البيانات ودمج خوارزميات التعلم العميق المتقدمة لتعزيز دقة التنبؤ بشكل أكبر.

طرق

تحدد قسم “المواد والطرق” التصميم التجريبي والإجراءات المستخدمة في الدراسة. يوضح المواد المحددة المستخدمة، بما في ذلك أي مواد كيميائية أو معدات أو عينات بيولوجية، مما يضمن إمكانية تكرار التجارب. يتم وصف المنهجية بطريقة خطوة بخطوة، مع تسليط الضوء على التقنيات والبروتوكولات المتبعة لجمع البيانات، مثل التحليلات الإحصائية أو النماذج الحسابية.

بالإضافة إلى ذلك، قد يتضمن القسم معلومات عن حجم العينة، والضوابط، وأي معلمات ذات صلة تم قياسها. يضمن هذا النهج الشامل أن يمكن التحقق من النتائج ومقارنتها مع أبحاث أخرى في هذا المجال. بشكل عام، فإن وضوح ودقة الطرق أمران حاسمان لسلامة استنتاجات الدراسة.

نتائج

حللت الدراسة مجموعة من 491 مريضًا بمعدل تصفية كبيبات الكلى المقدّر (eGFR) ≥ 60 مل/دقيقة/1.73 م²، تضم 250 ذكرًا و241 أنثى، بمتوسط عمر يبلغ 53.20 عامًا. من بين هؤلاء، تم تصنيف 435 مريضًا (88.59%) على أنهم غير مصابين بمرض الكلى المزمن (المراحل 1-2)، بينما تم تحديد 56 مريضًا (11.41%) على أنهم مصابون بمرض الكلى المزمن (المراحل 3-5). لوحظت اختلافات كبيرة بين المجموعتين، حيث أظهرت مجموعة مرضى الكلى المزمن انتشارًا أعلى للأمراض المصاحبة مثل السكري (83.93% مقابل 38.62%)، وأمراض القلب التاجية (30.36% مقابل 6.44%)، وارتفاع ضغط الدم (91.07% مقابل 65.29%). كشفت التحليلات الإحصائية أن العمر والجنس وحالات صحية متنوعة أثرت بشكل كبير على مراحل مرض الكلى المزمن، مع قيمة p < 0.05 تشير إلى دلالة إحصائية. تم تطوير نماذج تعلم الآلة (ML) للتنبؤ بمرض الكلى المزمن، حيث أظهر خوارزم XGBoost أعلى دقة (93.29%) ودرجة منطقة تحت المنحنى (AUC) تبلغ 0.9689، متفوقًا على نماذج أخرى مثل الانحدار اللوجستي (LR) وغابة عشوائية (RF). حدد تحليل أهمية الميزات باستخدام قيم SHAP الكرياتينين وHbA1C والعمر كأهم المتنبئين. استخدمت الدراسة تقنيات مثل SMOTE لزيادة البيانات لمعالجة عدم التوازن في الفئات وطبقت التحقق المتقاطع لضمان تقييم قوي للنموذج. تؤكد النتائج على إمكانية خوارزميات تعلم الآلة، وخاصة XGBoost، في تعزيز التنبؤ بمرض الكلى المزمن وتسهيل التدخلات السريرية المستهدفة بناءً على ملفات المخاطر الفردية للمرضى.

مناقشة

تستكشف الدراسة تطبيق خوارزميات تعلم الآلة (ML) للتنبؤ بمراحل مرض الكلى المزمن (CKD) بين المرضى في مستشفى توام في مدينة العين، الإمارات العربية المتحدة. باستخدام مجموعة بيانات تضم 544 مريضًا تم جمعها في عام 2008، استخدمت الأبحاث نماذج ML متنوعة، بما في ذلك الانحدار اللوجستي (LR)، وغابة عشوائية (RF)، وأشجار القرار (DT)، ونايف بايز (NB)، وXGBoost. أظهر نموذج XGBoost أداءً متفوقًا، محققًا منطقة تحت المنحنى (AUC) تبلغ 0.9689 ودقة 93.29% في تصنيف المرضى إلى مراحل CKD 1-2 أو 3-5. من الجدير بالذكر أن الدراسة دمجت تقنيات الذكاء الاصطناعي القابل للتفسير (XAI)، وتحديدًا SHAP وLIME، لتعزيز قابلية تفسير تنبؤات النموذج، مما يكشف أن مستويات الكرياتينين، والهيموجلوبين السكري A1C (HgbA1C)، والعمر كانت من أهم المتنبئين بمرض الكلى المزمن.

تؤكد النتائج على أهمية دمج القابلية للتفسير في نماذج تعلم الآلة، خاصة في الرعاية الصحية، حيث يعد فهم الأسباب وراء التنبؤات أمرًا حيويًا. من خلال توضيح كيفية تأثير المتغيرات المحددة على مخاطر مرض الكلى المزمن، تقدم الدراسة رؤى قيمة يمكن أن تساعد الأطباء في اتخاذ قرارات مستنيرة بشأن إدارة المرضى. لا تعزز دمج الميزات السريرية الرئيسية دقة التشخيص فحسب، بل تؤكد أيضًا على إمكانية أطر تعلم الآلة لتحسين نتائج المرضى في إدارة مرض الكلى المزمن. قد تشمل اتجاهات البحث المستقبلية توسيع مجموعة البيانات واستكشاف خوارزميات التعلم العميق المتقدمة لتحسين القدرات التنبؤية بشكل أكبر.

Journal: Scientific Reports, Volume: 14, Issue: 1
DOI: https://doi.org/10.1038/s41598-024-54375-4
PMID: https://pubmed.ncbi.nlm.nih.gov/38355876
Publication Date: 2024-02-14
Author(s): Samit Kumar Ghosh et al.
Primary Topic: Artificial Intelligence in Healthcare

Overview

Chronic kidney disease (CKD) poses a significant global health challenge, often progressing asymptomatically in its early stages, which complicates timely diagnosis and treatment. This study explores an explainable artificial intelligence (XAI) approach to predict CKD using clinical characteristics from a dataset of 491 patients, including 56 diagnosed with CKD. Five machine learning (ML) algorithms—logistic regression (LR), random forest (RF), decision tree (DT), Naïve Bayes (NB), and extreme gradient boosting (XGBoost)—were employed to develop predictive models, with the XGBoost model yielding the highest performance, achieving an area under the curve (AUC) of 0.9689 and an accuracy of 93.29%.

The analysis identified creatinine, glycosylated hemoglobin type A1C (HgbA1C), and age as the most significant predictors of CKD within the XGBoost model. The study utilized SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations) to enhance model interpretability, allowing clinicians to understand the rationale behind predictions. The findings underscore the potential of a robust ML framework for early CKD diagnosis and highlight the importance of integrating interpretable ML techniques to improve transparency in predictive analytics. Future work may focus on expanding the dataset and incorporating advanced deep learning algorithms to further enhance predictive accuracy.

Methods

The section on “Materials and Methods” outlines the experimental design and procedures employed in the study. It details the specific materials used, including any reagents, equipment, or biological samples, ensuring reproducibility of the experiments. The methodology is described in a step-by-step manner, highlighting the techniques and protocols followed to gather data, such as statistical analyses or computational models.

Additionally, the section may include information on the sample size, controls, and any relevant parameters that were measured. This comprehensive approach ensures that the findings can be validated and compared with other research in the field. Overall, the clarity and rigor of the methods are crucial for the integrity of the study’s conclusions.

Results

The study analyzed a cohort of 491 patients with an estimated glomerular filtration rate (eGFR) ≥ 60 mL/min/1.73 m², comprising 250 males and 241 females, with a mean age of 53.20 years. Among these, 435 patients (88.59%) were classified as non-CKD (stages 1-2), while 56 patients (11.41%) were identified as CKD (stages 3-5). Significant differences were observed between the two groups, with the CKD group exhibiting a higher prevalence of comorbidities such as diabetes (83.93% vs. 38.62%), coronary heart disease (30.36% vs. 6.44%), and hypertension (91.07% vs. 65.29%). Statistical analysis revealed that age, gender, and various health conditions significantly impacted CKD stages, with a p-value < 0.05 indicating statistical significance. Machine learning (ML) models were developed to predict CKD, with the XGBoost algorithm demonstrating the highest accuracy (93.29%) and an area under the curve (AUC) score of 0.9689, outperforming other models like logistic regression (LR) and random forest (RF). Feature importance analysis using SHAP values identified creatinine, HbA1C, and age as the most influential predictors. The study utilized techniques such as SMOTE for data augmentation to address class imbalance and employed cross-validation to ensure robust model evaluation. The findings underscore the potential of ML algorithms, particularly XGBoost, in enhancing CKD prediction and facilitating targeted clinical interventions based on individual patient risk profiles.

Discussion

The study investigates the application of machine learning (ML) algorithms for predicting chronic kidney disease (CKD) stages among patients at Tawam Hospital in Al-Ain City, UAE. Utilizing a dataset of 544 patients collected in 2008, the research employed various ML models, including logistic regression (LR), random forest (RF), decision trees (DT), naive Bayes (NB), and XGBoost. The XGBoost model demonstrated superior performance, achieving an area under the curve (AUC) of 0.9689 and an accuracy of 93.29% in classifying patients into CKD stages 1-2 or 3-5. Notably, the study incorporated explainable artificial intelligence (XAI) techniques, specifically SHAP and LIME, to enhance the interpretability of the model’s predictions, revealing that creatinine levels, glycosylated hemoglobin A1C (HgbA1C), and age were the most significant predictors of CKD.

The findings underscore the importance of integrating explainability into ML models, particularly in healthcare, where understanding the rationale behind predictions is critical. By elucidating how specific variables influence CKD risk, the study provides valuable insights that can aid clinicians in making informed decisions regarding patient management. The incorporation of key clinical features not only enhances diagnostic accuracy but also emphasizes the potential for ML frameworks to improve patient outcomes in CKD management. Future research directions may include expanding the dataset and exploring advanced deep learning algorithms to further refine predictive capabilities.