نظام خبير معزز بالتعلم الآلي للكشف عن تفاقم فشل القلب باستخدام العلامات الحيوية المبلغ عنها من قبل المرضى والسجلات الصحية الإلكترونية Machine learning enhanced expert system for detecting heart failure decompensation using patient reported vitals and electronic health records

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-16376-9
PMID: https://pubmed.ncbi.nlm.nih.gov/40846740
تاريخ النشر: 2025-08-22
المؤلف: Shumit Saha وآخرون
الموضوع الرئيسي: الذكاء الاصطناعي في الرعاية الصحية

نظرة عامة

فشل القلب (HF) يتميز بفترات متناوبة من الاستقرار ونوبات من تفاقم الأعراض، المعروفة باسم عدم التعويض. لتحسين إدارة HF، طورت الأبحاث Medly، وهو برنامج علاجي رقمي يستخدم خوارزمية قائمة على القواعد لتحليل بيانات المرضى – تحديدًا الوزن، ضغط الدم، ومعدل ضربات القلب – لإرسال تنبيهات تلقائية إلى الأطباء والمرضى عند اكتشاف علامات التدهور. ومع ذلك، فإن وضع القواعد المحافظة الذي يهدف إلى تقليل الإيجابيات الكاذبة زاد عن غير قصد من الإيجابيات الكاذبة، مما أدى إلى عبء عمل سريري غير ضروري. لم تتضمن الخوارزمية الأولية سجلات الصحة الإلكترونية (EHR) للمرضى، مما دفع هذه الدراسة لتعزيز قدرات Medly التنبؤية باستخدام تقنيات التعلم الآلي.

في دراسة استعادية تستخدم خوارزمية XGBoost للتصنيف الثنائي، دمج الباحثون مجموعة بيانات أوسع، بما في ذلك معلومات EHR مثل نتائج التحاليل وسجل الأدوية. أظهر النموذج المحسن تحسينات كبيرة، حيث حقق دقة بنسبة 98.08%، وحساسية بنسبة 95.26%، ونوعية بنسبة 98.86%، وقيمة تنبؤية إيجابية (PPV) بنسبة 88.18%، مقارنة بدقة الخوارزمية الأصلية التي كانت 55.8%. ومن الجدير بالذكر أن بيانات EHR، وخاصة مستويات الببتيد الناتريوتي B (BNP) والكوليسترول الكلي، أثبتت أنها ضرورية في تحسين التنبؤات بعدم التعويض وتقليل التنبيهات الإيجابية الكاذبة، مما يعزز الفعالية العامة لاستراتيجيات إدارة HF.

الطرق

توضح قسم “الطرق” في ورقة البحث التصميم التجريبي والتقنيات التحليلية المستخدمة للتحقيق في سؤال البحث. استخدمت الدراسة نهجًا كميًا، يتضمن تحليلات إحصائية لتقييم البيانات المجمعة من تجارب مختلفة. تضمنت المنهجيات المحددة تجارب مختبرية محكومة، حيث تم التلاعب بالمتغيرات بشكل منهجي لمراقبة تأثيراتها على النتائج ذات الصلة.

شملت جمع البيانات استخدام أدوات وبروتوكولات موحدة لضمان الموثوقية والصلاحية. تم إجراء التحليل باستخدام أدوات برمجية للحساب الإحصائي، مما يسمح بتطبيق الاختبارات ذات الصلة مثل ANOVA وتحليل الانحدار لتفسير النتائج. يبرز القسم أهمية القابلية للتكرار والصرامة في الطرق المستخدمة، مما يضمن إمكانية التحقق من النتائج والبناء عليها في الأبحاث المستقبلية.

النتائج

تكشف نتائج الدراسة عن اختلافات كبيرة بين أداء الخوارزمية الأصلية القائمة على القواعد وخوارزمية XGBoost في التنبؤ بنوبات عدم التعويض. حققت الخوارزمية القائمة على القواعد دقة بنسبة 93.6%، مع حساسية بنسبة 94% ونوعية بنسبة 93%. ومع ذلك، أظهرت قيمة تنبؤية إيجابية (PPV) تبلغ 55.8% فقط، مما يعكس تصميمها المحافظ الذي يهدف إلى تقليل الأحداث المفقودة من عدم التعويض.

في المقابل، أظهرت خوارزمية XGBoost أداءً متفوقًا، حيث حققت دقة بنسبة 98.08%، وحساسية بنسبة 95.26%، ونوعية بنسبة 98.86%، إلى جانب قيمة PPV تبلغ 88.18%. أشار مصفوفة الالتباس إلى انخفاض ملحوظ في الإيجابيات الكاذبة، حيث سجلت خوارزمية XGBoost 116 إيجابية كاذبة مقارنة بـ 684 في النهج القائم على القواعد. يُعزى هذا التحسن إلى دمج بيانات السجلات الصحية الإلكترونية (EHR) والقياسات من اليومين السابقين، مما يبرز فعالية خوارزمية XGBoost في تحسين دقة التنبؤ.

المناقشة

ناقشت الدراسة دمج نظام خبير معزز بالتعلم الآلي مع بيانات السجلات الصحية الإلكترونية (EHR) ضمن برنامج Medly لتحسين التنبؤ بنوبات عدم التعويض لفشل القلب (HF). استخدمت الدراسة الاستعادية، التي وافق عليها مجلس أخلاقيات البحث في شبكة الصحة الجامعية، بيانات تاريخية من برنامج Medly، الذي يجمع العلامات الحيوية والأعراض من مرضى HF. أنشأت الخوارزمية القائمة على القواعد، التي تم تطويرها بمشاركة من أطباء HF، تنبيهات بناءً على القياسات اليومية. وجدت الدراسة أن دمج بيانات EHR عزز بشكل كبير دقة التنبؤ للخوارزمية، مما حسن القيمة التنبؤية الإيجابية (PPV) من 0.55 إلى 0.88 دون زيادة الإيجابيات الكاذبة، مما يقلل من استهلاك الموارد الصحية غير الضرورية.

أشارت النتائج الرئيسية إلى أن المتغيرات مثل الببتيد الناتريوتي B (BNP)، مستويات الفوسفات، والقياسات اليومية للوزن، ضغط الدم، ومعدل ضربات القلب كانت حاسمة في التنبؤ بنوبات عدم التعويض. كشفت تحليل SHAP أن الخوارزمية القائمة على القواعد في Medly لعبت دورًا كبيرًا في تنبؤات النموذج، بينما ساهمت بيانات EHR في تصحيح الإيجابيات الكاذبة. سلطت الدراسة الضوء على أهمية دمج المعرفة المتخصصة مع التعلم الآلي لتعزيز اتخاذ القرار السريري في إدارة HF، مشيرة إلى أن هذا النهج يمكن أن يؤدي إلى نتائج أفضل للمرضى وتقليل تكاليف الرعاية الصحية. ومع ذلك، اعترف المؤلفون بالحاجة إلى التحقق المستقبلي من الخوارزمية قبل تنفيذها في الممارسة السريرية.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-16376-9
PMID: https://pubmed.ncbi.nlm.nih.gov/40846740
Publication Date: 2025-08-22
Author(s): Shumit Saha et al.
Primary Topic: Artificial Intelligence in Healthcare

Overview

Heart failure (HF) is characterized by alternating periods of stability and episodes of symptom exacerbation, known as decompensation. To improve HF management, the research developed Medly, a digital therapeutic program that utilizes a rules-based algorithm to analyze patient data—specifically weight, blood pressure, and heart rate—to send automated alerts to clinicians and patients upon detecting signs of deterioration. However, the conservative rule-setting aimed at minimizing false negatives inadvertently increased false positives, leading to unnecessary clinical workload. The initial algorithm did not incorporate patients’ electronic health records (EHR), which prompted this study to enhance Medly’s predictive capabilities using machine learning techniques.

In a retrospective study employing the XGBoost algorithm for binary classification, the researchers integrated a broader dataset, including EHR information such as blood work and medication history. The enhanced model demonstrated significant improvements, achieving an accuracy of 98.08%, sensitivity of 95.26%, specificity of 98.86%, and a positive predictive value (PPV) of 88.18%, compared to the original algorithm’s 55.8% accuracy. Notably, EHR data, particularly B-type natriuretic peptide (BNP) levels and total cholesterol, proved essential in refining predictions of decompensation and reducing false-positive alerts, thereby enhancing the overall effectiveness of HF management strategies.

Methods

The “Methods” section of the research paper outlines the experimental design and analytical techniques employed to investigate the research question. The study utilized a quantitative approach, incorporating statistical analyses to evaluate the data collected from various experiments. Specific methodologies included controlled laboratory experiments, where variables were systematically manipulated to observe their effects on the outcomes of interest.

Data collection involved the use of standardized instruments and protocols to ensure reliability and validity. The analysis was conducted using software tools for statistical computation, allowing for the application of relevant tests such as ANOVA and regression analysis to interpret the results. The section emphasizes the importance of replicability and rigor in the methods employed, ensuring that findings can be verified and built upon in future research.

Results

The results of the study reveal significant differences between the performance of the original rule-based algorithm and the XGBoost algorithm in predicting decompensation episodes. The rule-based algorithm achieved an accuracy of 93.6%, with a sensitivity of 94% and specificity of 93%. However, it exhibited a positive predictive value (PPV) of only 55.8%, reflecting its conservative design aimed at minimizing missed decompensation events.

In contrast, the XGBoost algorithm demonstrated superior performance, attaining an accuracy of 98.08%, a sensitivity of 95.26%, and a specificity of 98.86%, alongside a PPV of 88.18%. The confusion matrix indicated a notable reduction in false positives, with the XGBoost algorithm recording 116 false positives compared to 684 in the rule-based approach. This improvement is attributed to the incorporation of electronic health record (EHR) data and measurements from the preceding two days, highlighting the efficacy of the XGBoost algorithm in enhancing predictive accuracy.

Discussion

The study discussed the integration of a machine learning-enhanced expert system with electronic health record (EHR) data within the Medly program to improve the prediction of heart failure (HF) decompensation episodes. The retrospective cohort study, approved by the University Health Network’s Research Ethics Board, utilized historical data from the Medly program, which collects vital signs and symptoms from HF patients. The rules-based algorithm, developed with input from HF clinicians, generated alerts based on daily measurements. The study found that incorporating EHR data significantly enhanced the predictive accuracy of the algorithm, improving the positive predictive value (PPV) from 0.55 to 0.88 without increasing false negatives, thereby reducing unnecessary healthcare resource consumption.

Key findings indicated that variables such as B-type natriuretic peptide (BNP), phosphate levels, and daily measurements of weight, blood pressure, and heart rate were critical in predicting decompensation episodes. The SHAP analysis revealed that the Medly rules-based algorithm played a significant role in the model’s predictions, while EHR data contributed to correcting false positives. The study highlighted the importance of integrating domain knowledge with machine learning to enhance clinical decision-making in HF management, suggesting that this approach could lead to better patient outcomes and reduced healthcare costs. However, the authors acknowledged the need for prospective validation of the algorithm before its implementation in clinical practice.