تعزيز تصنيف أمراض القلب استنادًا إلى خوارزمية تحسين الإوز الرمادي وذاكرة المدى الطويل والقصير Enhancing heart disease classification based on greylag goose optimization algorithm and long short-term memory

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-024-83592-0
PMID: https://pubmed.ncbi.nlm.nih.gov/39779779
تاريخ النشر: 2025-01-08
المؤلف: Ahmed M. Elshewey وآخرون
الموضوع الرئيسي: الذكاء الاصطناعي في الرعاية الصحية

نظرة عامة

تقدم هذه الدراسة خوارزمية تحسين الإوز الرمادي (GGO) التي تهدف إلى تعزيز دقة تصنيف أمراض القلب. تشمل أمراض القلب حالات مختلفة تؤثر على هيكل القلب ووظيفته، بما في ذلك مرض الشريان التاجي، وعدم انتظام ضربات القلب، واعتلال عضلة القلب. تستخدم الدراسة نسخة ثنائية من GGO (bGGO) لاختيار الميزات، مع تقييمها مقابل ستة خوارزميات تحسين ثنائية أخرى—BER، DTO، PSO، WAO، GWO، وFOA. تستخدم مرحلة التصنيف عدة مصنفات تعلم آلي، حيث يظهر نموذج الذاكرة طويلة وقصيرة الأمد (LSTM) كالأكثر فعالية، محققًا دقة تبلغ 91.79%. بعد ضبط المعلمات الفائقة باستخدام GGO، تتحسن دقة نموذج LSTM إلى 99.58%. تؤكد التحليلات الإحصائية، بما في ذلك اختبار ويلكوكسون واختبار ANOVA، على أهمية نتائج اختيار الميزات والتصنيف.

تؤكد النتائج فعالية خوارزمية GGO في تحسين اختيار الميزات وضبط المعلمات الفائقة، مما يعزز بشكل كبير الدقة التنبؤية مع التخفيف من الإفراط في التكيف في تحليل أمراض القلب. ستركز الأعمال المستقبلية على تحسين معدلات التنبؤ المبكر من خلال تحليل بيانات المستشعر واستكشاف خوارزميات تحسين إضافية، مثل الخوارزميات الجينية والتطور التفاضلي. تهدف الدراسة إلى تطوير أدوات تشخيصية عملية لتصنيف أمراض القلب من خلال الاستفادة من مجموعات بيانات أكبر وأكثر تنوعًا وتنفيذ تقنيات التعلم الانتقالي.

طرق

تحدد قسم الطرق في الدراسة إطار عمل شامل يهدف إلى تحسين تصنيف أمراض القلب من خلال دمج خوارزمية تحسين الإوز الرمادي (GGO) وشبكات الذاكرة طويلة وقصيرة الأمد (LSTM). يتم تنظيم الهيكل في مراحل رئيسية، تشمل معالجة البيانات بالتفصيل، واستخراج الميزات باستخدام آلية الترميز الثنائي لخوارزمية GGO، وتنفيذ مصنفات متنوعة لتعزيز دقة التصنيف.

في النتائج التجريبية، تقيم الدراسة خوارزمية GGO المقترحة عبر سيناريوهات متعددة، باستخدام وظائف رياضية راسخة كمعايير لتقييم أداء التحسين ضمن مجالات البحث المحددة. تُعرف هذه الوظائف في الأدبيات بفعاليتها في تقييم تقنيات التحسين. توضح تحليل مقارن ضد سبع خوارزميات تحسين بارزة—GGO، تمثيل الترميز الثنائي (BER)، تحسين التعليم التفاضلي (DTO)، تحسين سرب الجسيمات (PSO)، خوارزمية تحسين الحوت (WAO)، تحسين الذئب الرمادي (GWO)، وخوارزمية تحسين اليراعة (FOA)—الأداء المتفوق والفعالية لخوارزمية GGO، مما يثبت أنها قاعدة قوية لتقييم الأداء في سياق تصنيف أمراض القلب.

نتائج

في هذا القسم، تقدم البحث نتائج منهجيات اختيار الميزات والتصنيف المطبقة على توقع أمراض القلب، مع التأكيد على فعالية طريقة تحسين الإوز الرمادي الثنائية المقترحة (bGGO). يتم تفصيل التحليل المقارن لسبع خوارزميات تحسين، بما في ذلك bGGO، في الجدول 3، حيث تظهر bGGO أداءً متفوقًا باستمرار عبر مقاييس مختلفة، مثل متوسط معدلات الخطأ وقيم اللياقة. توضح الأشكال 5 و6 مزيدًا من القوة لتقنية bGGO من خلال تمثيلات رسومية، بما في ذلك مخططات QQ وخرائط الحرارة، التي تؤكد موثوقيتها في اختيار الميزات. تؤكد التحليلات الإحصائية، بما في ذلك ANOVA واختبارات ويلكوكسون، على الأهمية الإحصائية لأداء bGGO، مع قيم p التي تشير إلى مزايا قوية على الطرق المنافسة (p < 0.005). تسلط نتائج التصنيف، الملخصة في الجدول 6، الضوء على الأداء المحسن لمصنفات التعلم الآلي، وخاصة نموذج الذاكرة طويلة وقصيرة الأمد (LSTM)، عند استخدام الميزات المختارة بواسطة إطار bGGO. حقق الجمع بين GGO وLSTM مقاييس ملحوظة، بما في ذلك دقة تبلغ 0.995763 وحساسية تبلغ 0.996086، متفوقًا على استراتيجيات تحسين أخرى. توفر الأشكال من 10 إلى 15 رؤى بصرية حول مقاييس الأداء، بما في ذلك الدقة والحساسية والنوعية، مما يسهل تقييمًا شاملاً لنهج GGO + LSTM مقارنة بالخوارزميات الأخرى. تؤكد هذه النتائج على الدور الحاسم لاختيار الميزات الفعال في تحسين دقة التصنيف لتوقع أمراض القلب، مما يثبت أن الطريقة المقترحة هي تقنية رائدة في هذا المجال.

مناقشة

تسلط قسم المناقشة في ورقة البحث الضوء على التقدم الكبير في تصنيف أمراض القلب من خلال تقنيات التعلم الآلي (ML) والتعلم العميق (DL) المختلفة. يتم مراجعة مجموعة من الدراسات، مما يظهر فعالية خوارزميات وأساليب اختيار الميزات المختلفة. على سبيل المثال، حقق هارشيت جيندال وآخرون دقة تبلغ 87.5% باستخدام مزيج من الغابة العشوائية، والانحدار اللوجستي، وجيران الأقرب (KNN) على مجموعة بيانات تحتوي على 304 عناصر و13 ميزة. أفاد أوتوم وآخرون بدقة تبلغ 88.3% باستخدام آلة الدعم الناقل (SVM) على مجموعة بيانات أمراض القلب في كليفلاند، بينما وجد أبورب راجدان وآخرون أن الغابة العشوائية كانت فعالة بشكل خاص، محققة دقة تبلغ 90.16%. تشير المراجعة أيضًا إلى أن استراتيجيات التحسين الهجينة، مثل تلك التي تجمع بين تحسين سرب الجسيمات (PSO) وتحسين مستعمرة النمل (ACO)، تعزز الأداء التنبؤي بشكل كبير، حيث تصل الدقة إلى 99.65%.

تؤكد هذه القسم على أهمية اختيار الميزات وتحسين المعلمات الفائقة في تحسين نتائج التصنيف. تم تقديم خوارزمية تحسين الإوز الرمادي (GGO) كنهج جديد لاختيار الميزات، مما يظهر قدرتها على تحقيق توازن فعال بين الاستكشاف والاستغلال. يدمج الإطار المقترح GGO لكل من اختيار الميزات وضبط المعلمات الفائقة لشبكات الذاكرة طويلة وقصيرة الأمد (LSTM)، مما يؤدي إلى تحقيق دقة قصوى تبلغ 99.578%. تؤكد النتائج على أن خوارزمية GGO لا تعزز فقط الدقة التنبؤية ولكنها أيضًا تخفف من الإفراط في التكيف، مما يمهد الطريق للأبحاث المستقبلية لاستكشاف تقنيات تحسين إضافية ومجموعات بيانات أكبر لتحسين أدوات تشخيص أمراض القلب بشكل أكبر.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-024-83592-0
PMID: https://pubmed.ncbi.nlm.nih.gov/39779779
Publication Date: 2025-01-08
Author(s): Ahmed M. Elshewey et al.
Primary Topic: Artificial Intelligence in Healthcare

Overview

This research presents the Greylag Goose Optimization (GGO) algorithm aimed at enhancing the accuracy of heart disease classification. Heart disease encompasses various conditions affecting the heart’s structure and function, including coronary artery disease, arrhythmias, and cardiomyopathies. The study employs a binary version of GGO (bGGO) for feature selection, benchmarking it against six other binary optimization algorithms—BER, DTO, PSO, WAO, GWO, and FOA. The classification phase utilizes multiple machine learning classifiers, with Long Short-Term Memory (LSTM) emerging as the most effective, achieving an accuracy of 91.79%. After hyperparameter tuning with GGO, the LSTM model’s accuracy improves to 99.58%. Statistical analyses, including the Wilcoxon signed-rank test and ANOVA, validate the significance of the feature selection and classification results.

The findings underscore the GGO algorithm’s effectiveness in optimizing feature selection and hyperparameter tuning, significantly enhancing predictive accuracy while mitigating overfitting in heart disease analysis. Future work will focus on improving early prediction rates through sensor data analysis and exploring additional optimization algorithms, such as Genetic Algorithms and Differential Evolution. The study aims to develop practical diagnostic tools for heart disease classification by leveraging larger, more diverse datasets and implementing transfer learning techniques.

Methods

The methods section of the study outlines a comprehensive framework aimed at improving heart disease classification through the integration of the Greylag Goose Optimization (GGO) algorithm and Long Short-Term Memory (LSTM) networks. The architecture is organized into key stages, which include detailed data preprocessing, feature extraction using the binary encoding mechanism of the GGO algorithm, and the implementation of various classifiers to enhance classification accuracy.

In the experimental results, the study evaluates the proposed GGO algorithm across multiple scenarios, utilizing well-established mathematical functions as benchmarks to assess optimization performance within defined search domains. These functions are recognized in the literature for their effectiveness in evaluating optimization techniques. A comparative analysis against seven notable optimization algorithms—GGO, Binary Encoding Representation (BER), Differential Teaching Optimization (DTO), Particle Swarm Optimization (PSO), Whale Optimization Algorithm (WAO), Grey Wolf Optimization (GWO), and Firefly Optimization Algorithm (FOA)—demonstrates the superior performance and efficacy of the GGO algorithm, establishing it as a robust baseline for performance evaluation in the context of heart disease classification.

Results

In this section, the research presents the results of feature selection and classification methodologies applied to heart disease prediction, emphasizing the effectiveness of the proposed binary Greylag Goose Optimization (bGGO) method. The comparative analysis of seven optimization algorithms, including bGGO, is detailed in Table 3, where bGGO consistently demonstrates superior performance across various metrics, such as average error rates and fitness values. Figures 5 and 6 further illustrate the robustness of the bGGO technique through graphical representations, including QQ plots and heatmaps, which affirm its reliability in feature selection. Statistical analyses, including ANOVA and Wilcoxon signed-rank tests, confirm the statistical significance of bGGO’s performance, with p-values indicating strong advantages over competing methods (p < 0.005). The classification results, summarized in Table 6, highlight the enhanced performance of machine learning classifiers, particularly the Long Short-Term Memory (LSTM) model, when utilizing features selected by the bGGO framework. The GGO-LSTM combination achieved remarkable metrics, including an accuracy of 0.995763 and sensitivity of 0.996086, outperforming other optimization strategies. Figures 10 through 15 provide visual insights into the performance metrics, including accuracy, sensitivity, and specificity, facilitating a comprehensive evaluation of the GGO + LSTM approach against other algorithms. These findings underscore the critical role of effective feature selection in improving classification accuracy for heart disease prediction, establishing the proposed method as a leading technique in this domain.

Discussion

The discussion section of the research paper highlights significant advancements in heart disease classification through various machine learning (ML) and deep learning (DL) techniques. A range of studies is reviewed, showcasing the effectiveness of different algorithms and feature selection methods. For instance, Harshit Jindal et al. achieved an accuracy of 87.5% using a combination of Random Forest, Logistic Regression, and K-Nearest Neighbors (KNN) on a dataset with 304 items and 13 features. Otoom et al. reported an accuracy of 88.3% with Support Vector Machine (SVM) on the Cleveland Heart Disease dataset, while Apurb Rajdhan et al. found Random Forest to be particularly effective, achieving 90.16% accuracy. The review also notes that hybrid optimization strategies, such as those combining Particle Swarm Optimization (PSO) and Ant Colony Optimization (ACO), significantly enhance predictive performance, with accuracies reaching up to 99.65%.

The section underscores the importance of feature selection and hyperparameter optimization in improving classification outcomes. The Greylag Goose Optimization (GGO) algorithm is introduced as a novel approach for feature selection, demonstrating its capability to balance exploration and exploitation effectively. The proposed framework integrates GGO for both feature selection and hyperparameter tuning of Long Short-Term Memory (LSTM) networks, resulting in a peak accuracy of 99.578%. The findings emphasize that the GGO algorithm not only enhances predictive accuracy but also mitigates overfitting, paving the way for future research to explore additional optimization techniques and larger datasets to further improve heart disease diagnostic tools.