إطار تعلم آلي محسن لتشخيص أمراض القلب والأوعية الدموية: منظور أخلاقي جديد Optimized machine learning framework for cardiovascular disease diagnosis: a novel ethical perspective

المجلة: BMC Cardiovascular Disorders، المجلد: 25، العدد: 1
DOI: https://doi.org/10.1186/s12872-025-04550-w
PMID: https://pubmed.ncbi.nlm.nih.gov/39979842
تاريخ النشر: 2025-02-20
المؤلف: Ghadah Naif Alwakid وآخرون
الموضوع الرئيسي: الذكاء الاصطناعي في الرعاية الصحية

نظرة عامة

تستكشف هذه الورقة البحثية تطبيق تقنيات الذكاء الاصطناعي المتقدمة (AI) للتشخيص الدقيق وفي الوقت المناسب لأمراض القلب والأوعية الدموية (CVDs). تتناول التحديات المتعلقة بتحقيق دقة تشخيص عالية بسبب تعقيد البيانات السريرية وعملية اختيار الميزات. تستخدم الدراسة منهجيات مختلفة لاختيار الميزات، بما في ذلك اختبار كاي، واكتساب المعلومات، والاختيار الأمامي، والإزالة الخلفية، لتقطير مجموعة مصفاة من ثمانية مؤشرات صحية للقلب والأوعية الدموية. يتم إعطاء الأولوية للاعتبارات الأخلاقية مثل الشفافية، وقابلية التفسير، والتخفيف من التحيز من خلال استخدام مجموعات بيانات غير متحيزة ومقاييس تحقق صارمة. يكشف دمج نماذج التعلم الآلي المتعددة، ولا سيما الغابة العشوائية (RF)، وXGBoost، وأشجار القرار (DT)، والانحدار اللوجستي (LR)، أن XGBoost هو النموذج الأكثر فعالية، حيث حقق دقة ملحوظة تبلغ 99%، واسترجاع 100%، ودقة 99%.

تخلص الدراسة إلى أنه بينما تظل أمراض القلب سببًا رئيسيًا للوفيات، فإن خوارزميات التعلم الآلي التي تم تقييمها، وخاصة الطرق التجميعية مثل XGBoost، تظهر أداءً قويًا في تصنيف CVD. ومع ذلك، تشمل القيود حجم مجموعة البيانات الصغيرة نسبيًا، مما قد يؤثر على قابلية تعميم النتائج، واستبعاد أنماط البيانات المتنوعة مثل التصوير الطبي وعلم الجينوم. يؤكد المؤلفون على الحاجة إلى التحقق السريري في العالم الحقيقي لنماذجهم ويقترحون أن تشمل الأبحاث المستقبلية مجموعات بيانات أكبر ومتنوعة لتعزيز قابلية تطبيق منهجياتهم عبر مختلف الإعدادات السريرية عالميًا.

مقدمة

تؤكد مقدمة هذه الورقة البحثية على الدور الحاسم للقلب في صحة الإنسان وانتشار أمراض القلب والأوعية الدموية (CVD)، التي تودي بحياة حوالي 18 مليون شخص سنويًا. تسلط الضوء على أنواع مختلفة من CVD، بما في ذلك فشل القلب الاحتقاني ومرض الشريان التاجي، وتحدد عوامل الخطر مثل خيارات نمط الحياة غير الصحية والحالات الأيضية. تدعو الورقة إلى دمج الذكاء الاصطناعي (AI) والتعلم الآلي (ML) في الرعاية الصحية لتعزيز كفاءة تشخيص CVD، مقترحة أن الأنظمة الآلية يمكن أن تساعد بشكل كبير في عمليات اتخاذ القرار للمهنيين الصحيين.

يتناول المؤلفون الفجوات الموجودة في تطبيقات ML الحالية لتصنيف CVD، لا سيما فيما يتعلق بمبادئ الذكاء الاصطناعي الأخلاقي، وقابلية تعميم النموذج، واختيار الميزات الشامل. يقترحون منهجية جديدة تتضمن اعتبارات أخلاقية أثناء استخدام خوارزميات ML المختلفة، بما في ذلك الجيران الأقرب (KNN)، والانحدار اللوجستي (LR)، والشبكات العصبية (NN)، لتصنيف CVD بشكل فعال. تؤكد الدراسة على أهمية تحديد الميزات المهمة من بيانات المرضى لتحسين دقة النموذج وقابلية التفسير. من خلال مواءمة تقنيات الذكاء الاصطناعي مع الأطر الأخلاقية، تهدف الأبحاث إلى تعزيز دقة التشخيص وتعزيز التدخلات في الوقت المناسب، مما يحسن في النهاية نتائج المرضى ويقلل من تكاليف الرعاية الصحية. توضح الورقة مساهماتها، بما في ذلك عملية اختيار ميزات قوية وتقييم شامل لأداء النموذج عبر مجموعات بيانات متنوعة، مما يمهد الطريق لمزيد من الاستكشاف في الأقسام اللاحقة.

الطرق

تشمل المنهجية المقترحة لتصنيف أمراض القلب والأوعية الدموية (CVDs) نهجًا منهجيًا موضحًا في مخطط انسيابي. في البداية، يتم الحصول على مجموعة بيانات أمراض القلب من مستودع التعلم الآلي UCI، والتي تتكون من سجلات من أربعة قواعد بيانات: هنغاريا، كليفلاند، سويسرا، وVA لونغ بيتش. يتم استخدام مجموعة بيانات كليفلاند، التي تحتوي على 303 عينة مع 14 سمة رئيسية، بشكل أساسي بسبب اكتمالها، بينما تتطلب مجموعات البيانات الأخرى معالجة مسبقة لمعالجة القيم المفقودة. تشمل السمات الميزات الديموغرافية والسريرية مثل مستويات الكوليسترول، والحد الأقصى لمعدل ضربات القلب، وضغط الدم أثناء الراحة، وحالة الثلاسيميا.

تشمل المنهجية عدة خطوات رئيسية: معالجة البيانات المسبقة، واختيار الميزات، وتحديد الميزات المهمة، وبناء النموذج، وتطبيق مصنفات التعلم الآلي (ML). يتم استخدام تحليل المكونات الرئيسية (PCA) لتقليل الأبعاد قبل إدخال الميزات المستخرجة في نماذج ML. يتم إجراء تحسين المعلمات باستخدام استراتيجية البحث الشبكي لتعزيز أداء النموذج، مع التركيز على معلمات مثل معدل التعلم، والعمق الأقصى، وعدد المقدرين لـ XGBoost، ومعلمات مماثلة لـ Random Forest (RF). يتم تقييم النماذج باستخدام التحقق المتقاطع 5-fold، مع استخدام درجة F1 كمقياس رئيسي لتحقيق التوازن بين الدقة والاسترجاع. يتم تفصيل المعلمات النهائية المحسنة ونتائج النموذج في قسم التقييم.

النتائج

في هذا القسم، تقدم الدراسة تقييمًا شاملاً لمختلف نماذج التعلم الآلي (ML) لتصنيف أمراض القلب والأوعية الدموية (CVD) باستخدام مجموعات بيانات من مستودع UCI. تم إجراء التجارب في بيئة محكومة باستخدام وحدة معالجة الرسوميات Intel Core، وPython 3.8، والمكتبات ذات الصلة مثل Scikit-learn وXGBoost. شملت معالجة البيانات المسبقة قياس Min-Max والترميز الأحادي، مع تقسيم 80/20 للتدريب والاختبار وأخذ عينات مصنفة للحفاظ على نسب الفئات. تم استخدام مقاييس الأداء بما في ذلك الدقة، والدقة، والاسترجاع، ودرجة F1 لتقييم فعالية النموذج، حيث برز XGBoost كأكثر المصنفات فعالية، محققًا دقة 99% واسترجاع 100% على مجموعة فرعية من ثمانية ميزات تم تحديدها من خلال اكتساب المعلومات.

استكشفت الدراسة أيضًا تقنيات اختيار الميزات، بما في ذلك الاختيار الأمامي، والإزالة الخلفية، واختبارات كاي، التي أبرزت باستمرار تفوق XGBoost على نماذج أخرى مثل Random Forest (RF) وأشجار القرار (DT). من الجدير بالذكر أن أداء XGBoost قد تم تحسينه من خلال تقليل الأبعاد عبر تحليل المكونات الرئيسية (PCA)، مع الحفاظ على دقة وموثوقية عالية مع تقليل الإفراط في التخصيص. أظهرت المقارنات مع النماذج الحالية الرائدة أن نموذج XGBoost المقترح مع اكتساب المعلومات (IG_XGB) تفوق على الأعمال السابقة من حيث الدقة، والدقة، والاسترجاع، ودرجة F1، مما يبرز أهمية اختيار الميزات الفعالة في تحسين اتخاذ القرار السريري لتشخيص CVD. من المتوقع أن تستكشف الأبحاث المستقبلية هذه النتائج وآثارها على الممارسة السريرية.

المناقشة

في قسم المناقشة، تستعرض الورقة العديد من الدراسات التي تدمج تقنيات التعلم الآلي (ML) والذكاء الاصطناعي (AI) لتصنيف أمراض القلب والأوعية الدموية (CVD). تشمل النتائج الملحوظة استخدام تحليل المكونات الرئيسية (PCA) واختبار كاي لاختيار الميزات وتقليل الأبعاد، حيث حقق Random Forest (RF) أعلى دقة تبلغ 98% في دراسة واحدة، على الرغم من أنه على مجموعة بيانات صغيرة. استكشفت دراسات أخرى نماذج هجينة وتقنيات اختيار ميزات متنوعة، مثل الخوارزميات الجينية (GA) والإزالة التكرارية للميزات (RFE)، حيث حققت تركيبات SVM وGA دقة تبلغ 88.34%. تسلط الورقة الضوء على فعالية النماذج التجميعية والشبكات العصبية العميقة (DNNs) في تحقيق دقة تنبؤية عالية، مع الإشارة أيضًا إلى القيود المتعلقة بحجم مجموعة البيانات والإفراط المحتمل في التخصيص.

يؤكد المؤلفون على أهمية معالجة البيانات المسبقة، بما في ذلك التنظيف، والتطبيع، والتعامل مع المتغيرات الفئوية، لضمان فعالية النموذج. يوضحون الخطوات المتخذة للحفاظ على مجموعة بيانات متوازنة والتخفيف من التحيزات أثناء تدريب النموذج وتقييمه. تم استخدام تقنيات اختيار الميزات، بما في ذلك طرق الفلترة والتغليف، لتحديد الميزات المهمة، مما أدى في النهاية إلى مجموعة بيانات مصفاة لبناء النموذج. تجمع المنهجية المقترحة بشكل منهجي بين هذه التقنيات المتقدمة لتعزيز أداء النموذج مع معالجة الاعتبارات الأخلاقية وقضايا القابلية للتعميم، بهدف تقديم تنبؤات موثوقة وقوية لتصنيف أمراض القلب.

القيود

تسلط قسم القيود في الدراسة الضوء على عدة جوانب حاسمة تتعلق بنموذج تصنيف أمراض القلب والأوعية الدموية (CVD) المقترح. بينما يظهر النموذج قدرات واعدة في اختيار الميزات والتوافق الأخلاقي، فإن قابليته للتعميم مقيدة بحجم مجموعة البيانات الصغيرة والمحددة ديموغرافيًا المستخدمة، والتي قد لا تمثل بشكل كاف تنوع السكان العالمي. تظهر المنهجيات المستخدمة، بما في ذلك تقنيات اختيار الميزات المتقدمة وخوارزمية XGBoost، إمكانات لأداء قوي؛ ومع ذلك، قد تتجاهل العلاقات المعقدة المتأصلة في مجموعات البيانات الأكبر أو متعددة الأنماط. بالإضافة إلى ذلك، تشكل تعقيدات النموذج وحساسيته لتعديل المعلمات تحديات، لا سيما في البيئات ذات الموارد المحدودة حيث قد تكون البيانات غير مكتملة أو مشوشة.

تُبرز اتجاهات البحث المستقبلية، داعية إلى التحقق من النموذج على مجموعات بيانات أكثر تنوعًا تشمل نطاقًا أوسع من الديموغرافيات والحالات القلبية الوعائية. يُوصى بدمج البيانات متعددة الأنماط، مثل التصوير الطبي والمعلومات الجينومية، لتعزيز دقة التشخيص وقابلية التطبيق. علاوة على ذلك، تقترح الدراسة استكشاف تقنيات متقدمة مثل التعلم الفيدرالي لمعالجة مخاوف خصوصية البيانات مع تعزيز الوصول العادل إلى التشخيصات المدفوعة بالذكاء الاصطناعي. بشكل عام، بينما تعتبر النتائج الحالية واعدة، سيكون من الضروري معالجة هذه القيود من أجل التنفيذ الناجح للنموذج في الإعدادات السريرية ومساهمته في تحسين نتائج الرعاية الصحية.

Journal: BMC Cardiovascular Disorders, Volume: 25, Issue: 1
DOI: https://doi.org/10.1186/s12872-025-04550-w
PMID: https://pubmed.ncbi.nlm.nih.gov/39979842
Publication Date: 2025-02-20
Author(s): Ghadah Naif Alwakid et al.
Primary Topic: Artificial Intelligence in Healthcare

Overview

This research paper investigates the application of advanced Artificial Intelligence (AI) techniques for the accurate and timely diagnosis of cardiovascular diseases (CVDs). It addresses the challenges of achieving high diagnostic accuracy due to the complexity of clinical data and the feature selection process. The study employs various feature selection methodologies, including Chi-square, Information Gain, Forward Selection, and Backward Elimination, to distill a refined subset of eight cardiovascular health indicators. Ethical considerations such as transparency, interpretability, and bias mitigation are prioritized through the use of unbiased datasets and rigorous validation metrics. The integration of multiple Machine Learning (ML) models, notably Random Forest (RF), XGBoost, Decision Trees (DT), and Logistic Regression (LR), reveals XGBoost as the most effective model, achieving a remarkable 99% accuracy, 100% recall, and 99% precision.

The study concludes that while heart disease remains a leading cause of mortality, the evaluated ML algorithms, particularly ensemble methods like XGBoost, demonstrate robust performance in CVD classification. However, limitations include the relatively small dataset size, which may affect the generalizability of the findings, and the exclusion of diverse data modalities such as medical imaging and genomics. The authors emphasize the need for real-world clinical validation of their models and suggest future research should incorporate larger, heterogeneous datasets to enhance the applicability of their methodologies across various clinical settings globally.

Introduction

The introduction of this research paper emphasizes the critical role of the heart in human health and the prevalence of cardiovascular diseases (CVD), which claim approximately 18 million lives annually. It highlights various types of CVD, including congestive heart failure and coronary disease, and identifies risk factors such as unhealthy lifestyle choices and metabolic conditions. The paper advocates for the integration of Artificial Intelligence (AI) and Machine Learning (ML) in healthcare to enhance the efficiency of CVD diagnostics, proposing that automated systems can significantly aid in decision-making processes for healthcare professionals.

The authors address existing gaps in current ML applications for CVD classification, particularly concerning ethical AI principles, model generalizability, and comprehensive feature selection. They propose a novel methodology that incorporates ethical considerations while employing various ML algorithms, including K-Nearest Neighbours (KNN), Logistic Regression (LR), and Neural Networks (NN), to classify CVD effectively. The study emphasizes the importance of identifying significant features from patient data to improve model accuracy and interpretability. By aligning AI technologies with ethical frameworks, the research aims to enhance diagnostic precision and promote timely interventions, ultimately improving patient outcomes and reducing healthcare costs. The paper outlines its contributions, including a robust feature selection process and a comprehensive evaluation of model performance across diverse datasets, setting the stage for further exploration in subsequent sections.

Methods

The proposed methodology for classifying cardiovascular diseases (CVDs) involves a systematic approach illustrated in a flowchart. Initially, the Heart Disease dataset is sourced from the UCI Machine Learning Repository, comprising records from four databases: Hungary, Cleveland, Switzerland, and VA Long Beach. The Cleveland dataset, which contains 303 samples with 14 key attributes, is primarily utilized due to its completeness, while other datasets require preprocessing to address missing values. The attributes include demographic and clinical features such as cholesterol levels, maximum heart rate, resting blood pressure, and thalassemia status.

The methodology encompasses several key steps: data preprocessing, feature selection, identification of significant features, model building, and application of machine learning (ML) classifiers. Principal Component Analysis (PCA) is employed for dimensionality reduction before feeding the extracted features into ML models. Hyperparameter optimization is conducted using a grid search strategy to enhance model performance, focusing on parameters like learning rate, maximum depth, and number of estimators for XGBoost, and similar parameters for Random Forest (RF). The models are evaluated using 5-fold cross-validation, with the F1 score serving as the primary metric to balance precision and recall. The final optimized hyperparameters and model results are detailed in the evaluation section.

Results

In this section, the research presents a comprehensive evaluation of various machine learning (ML) models for classifying cardiovascular disease (CVD) using datasets from the UCI repository. The experiments were conducted in a controlled environment utilizing an Intel Core GPU, Python 3.8, and relevant libraries such as Scikit-learn and XGBoost. Data preprocessing involved Min-Max scaling and one-hot encoding, with an 80/20 train-test split and stratified sampling to maintain class ratios. Performance metrics including accuracy, precision, recall, and F1-score were employed to assess model efficacy, with XGBoost emerging as the most effective classifier, achieving 99% accuracy and 100% recall on a subset of eight features identified through information gain.

The study further explored feature selection techniques, including forward selection, backward elimination, and chi-square tests, which consistently highlighted the superiority of XGBoost over other models like Random Forest (RF) and Decision Trees (DT). Notably, XGBoost’s performance was enhanced through dimensionality reduction via Principal Component Analysis (PCA), maintaining high accuracy and reliability while minimizing overfitting. Comparisons with existing state-of-the-art models demonstrated that the proposed XGBoost model with information gain (IG_XGB) outperformed previous works in accuracy, precision, recall, and F1-score, underscoring the importance of effective feature selection in improving clinical decision-making for CVD diagnosis. Future research is anticipated to further investigate these findings and their implications for clinical practice.

Discussion

In the discussion section, the paper reviews various studies that integrate machine learning (ML) and artificial intelligence (AI) techniques for cardiovascular disease (CVD) classification. Notable findings include the use of Principal Component Analysis (PCA) and Chi-Square for feature selection and dimensionality reduction, with Random Forest (RF) achieving the highest accuracy of 98% in one study, albeit on a small dataset. Other studies explored hybrid models and various feature selection techniques, such as Genetic Algorithms (GA) and Recursive Feature Elimination (RFE), with SVM and GA combinations yielding an accuracy of 88.34%. The paper highlights the effectiveness of ensemble models and deep neural networks (DNNs) in achieving high prediction accuracy, while also noting limitations related to dataset size and potential overfitting.

The authors emphasize the importance of data preprocessing, including cleaning, normalization, and handling categorical variables, to ensure model efficacy. They detail the steps taken to maintain a balanced dataset and mitigate biases during model training and evaluation. Feature selection techniques, including filter and wrapper methods, were employed to identify significant features, ultimately leading to a refined dataset for model building. The proposed methodology systematically combines these advanced techniques to enhance model performance while addressing ethical considerations and generalizability issues, aiming to provide robust and trustworthy predictions for heart disease classification.

Limitations

The limitations section of the study highlights several critical aspects regarding the proposed cardiovascular disease (CVD) classification model. While the model demonstrates promising capabilities in feature selection and ethical alignment, its generalizability is constrained by the relatively small and demographically specific dataset utilized, which may not adequately represent global population diversity. The methodologies employed, including advanced feature selection techniques and the XGBoost algorithm, show potential for robust performance; however, they may overlook complex relationships inherent in larger or multimodal datasets. Additionally, the model’s complexity and sensitivity to hyperparameter tuning pose challenges, particularly in resource-limited settings where data may be incomplete or noisy.

Future research directions are emphasized, advocating for the validation of the model on more diverse datasets that encompass a broader range of demographics and cardiovascular conditions. The integration of multimodal data, such as medical imaging and genomic information, is recommended to enhance diagnostic accuracy and applicability. Furthermore, the study suggests exploring advanced techniques like federated learning to address data privacy concerns while promoting equitable access to AI-driven diagnostics. Overall, while the current findings are promising, addressing these limitations will be essential for the model’s successful implementation in clinical settings and its contribution to improved healthcare outcomes.