تعزيز توقعات أمراض القلب باستخدام نموذج تحويل قائم على الانتباه الذاتي Enhancing heart disease prediction using a self-attention-based transformer model

المجلة: Scientific Reports، المجلد: 14، العدد: 1
DOI: https://doi.org/10.1038/s41598-024-51184-7
PMID: https://pubmed.ncbi.nlm.nih.gov/38177293
تاريخ النشر: 2024-01-04
المؤلف: Atta Rahman وآخرون
الموضوع الرئيسي: الذكاء الاصطناعي في الرعاية الصحية

نظرة عامة

تتناول الأبحاث القضية الحرجة للأمراض القلبية الوعائية (CVDs)، التي تمثل أكثر من 17 مليون حالة وفاة على مستوى العالم. إن الكشف المبكر والدقيق عن فشل القلب أمر ضروري للتدخل السريري الفعال. تقدم هذه الدراسة نموذجًا جديدًا قائمًا على الانتباه الذاتي مصممًا للتنبؤ بمخاطر الأمراض القلبية الوعائية من خلال تحليل خصائص المرضى مثل ضغط الدم، مستويات الكوليسترول، ومعدل ضربات القلب. يستفيد النموذج من آليات الانتباه الذاتي لالتقاط المعلومات السياقية وتوليد تمثيلات تعكس أنماط البيانات المعقدة بشكل فعال. ومن الجدير بالذكر أنه يعزز من قابلية التفسير من خلال تخصيص أوزان الانتباه لميزات الإدخال، مما يمكّن الأطباء من فهم مساهمات عناصر البيانات المختلفة في تنبؤات النموذج. تم اختبار النموذج على مجموعة بيانات كليفلاند من مستودع التعلم الآلي UCI، محققًا دقة مثيرة للإعجاب تبلغ 96.51%، متجاوزًا العديد من الأساليب الأساسية.

في الختام، يمثل نموذج المحول القائم على الانتباه المقترح تقدمًا كبيرًا في التنبؤ بأمراض القلب، حيث يستخدم آليات الانتباه على مستوى الموضع لتحسين تعلم التمثيل مقارنة بالشبكات العصبية التكرارية التقليدية (RNNs). يسمح تصميمه بمعالجة متوازية فعالة، مما يقلل من أوقات التدريب والاستدلال مع الحفاظ على دقة عالية في التطبيقات الواقعية. تشير مرونة هذا النموذج إلى إمكانية تطبيقه على مهام التنبؤ بمخاطر سريرية متنوعة تتجاوز أمراض القلب. تهدف الأعمال المستقبلية إلى دمج التعلم الانتقالي لتعزيز الأداء بشكل أكبر، خاصة في السيناريوهات التي تحتوي على بيانات مصنفة محدودة، مما يوسع من تأثيره المحتمل في اتخاذ القرارات الصحية.

النتائج

تظهر نتائج الدراسة أن نموذج المحول القائم على الانتباه الذاتي المقترح يتفوق بشكل كبير على التقنيات الأساسية، بما في ذلك RNN وRETAIN، في مهام التنبؤ بأمراض القلب. حقق النموذج دقة مثيرة للإعجاب تبلغ 95.2% على مجموعة بيانات الأمراض القلبية الوعائية التي تحتوي على 70,000 حالة مع 11 ميزة، كما هو موضح في الجدول 4. ومن الجدير بالذكر أن نموذج المحول أظهر أوقات تدريب أسرع مقارنةً بنماذج RNN، التي تواجه صعوبات في معالجة البيانات التسلسلية، مما يؤدي إلى فترات تدريب أطول. يسمح تصميم النموذج بالتوازي الفعال، مما يعزز كل من الأداء وقابلية التفسير، وهو أمر حاسم في التطبيقات الصحية لفهم العوامل المؤثرة في تنبؤات فشل القلب.

بالإضافة إلى ذلك، تسلط الدراسة الضوء على دقة تدريب النموذج واختباره التي بلغت 97.17% و96.51%، على التوالي، بعد 90 دورة، مع خسائر تدريب واختبار ضئيلة تبلغ 0.10 و0.12. لا يتجاوز النموذج المقترح الأساليب الحديثة فحسب، بل يوفر أيضًا رؤى أوضح في عملية التنبؤ من خلال آلية الانتباه الذاتي الخاصة به، التي تخصص أوزان الانتباه لمختلف ميزات الإدخال. بينما تشكل تعقيدات بنية المحول تحديات في فهم عملية اتخاذ القرار، تم استخدام تقنيات تصور الانتباه لتعزيز قابلية التفسير. بشكل عام، تؤكد النتائج على إمكانيات النموذج في تحسين تشخيص أمراض القلب من خلال منهجيات التعلم الآلي المتقدمة.

المناقشة

تسلط قسم المناقشة في ورقة البحث الضوء على الدور الحاسم للذكاء الاصطناعي (AI) في الكشف المبكر والتنبؤ بأمراض القلب، وهي سبب رئيسي للوفيات على مستوى العالم. تم استكشاف تقنيات متعددة من التعلم الآلي (ML) والتعلم العميق (DL)، حيث أظهرت طرق التجميع مثل الغابة العشوائية (RF) نتائج واعدة. ومن الجدير بالذكر أن نموذج الانتباه العكسي (RETAIN) وطرق التصنيف الأخرى قد تم استخدامها لتعزيز دقة التنبؤ من خلال التركيز على الميزات المهمة والتسلسلات الزمنية. ومع ذلك، غالبًا ما تواجه النماذج الحالية صعوبات في استخراج الميزات من مجموعات البيانات المعقدة، مما يستدعي أساليب مبتكرة لتحسين موثوقية التنبؤ.

تقدم الورقة نموذجًا جديدًا قائمًا على الانتباه الذاتي مصممًا لمعالجة قيود الأساليب السابقة. لا يعزز هذا النموذج دقة التنبؤ فحسب، بل يقدم أيضًا تحسينًا في التكيف وقابلية التفسير، مما يجعله مناسبًا لتطبيقات الرعاية الصحية المتنوعة. تشمل المساهمات الرئيسية تحديد عوامل الخطر الحرجة لأمراض القلب وإقامة إطار عمل قوي يستفيد من آليات الانتباه الذاتي لالتقاط الأنماط المعقدة داخل البيانات الطبية. يهدف النموذج المقترح إلى تسهيل اتخاذ قرارات سريرية أفضل من خلال توفير أداة أكثر كفاءة وفعالية لتنبؤ مخاطر الأمراض القلبية الوعائية، مما يسهم في تحسين نتائج المرضى.

القيود

في قسم “القيود”، يناقش المؤلفون التحديات المرتبطة بأساليب التنبؤ الحالية بأمراض القلب، لا سيما في معالجة البيانات التسلسلية وتحسين نتائج المرضى. يحددون النقائص الرئيسية في الأساليب السابقة، بما في ذلك النمذجة غير الكافية لسمات مجموعة البيانات المدخلة، والحساب غير الفعال لعوامل الخطر، ونقص الاعتبار للتكاليف المتغيرة للخطأ في التصنيف، خاصة بالنسبة للفئات الأقل تمثيلًا في الرعاية الصحية. ومن الجدير بالذكر أن المصنفات التقليدية، مثل Naive Bayes (NB)، تعالج الميزات بشكل مستقل، مما يؤدي إلى أنظمة دعم القرار دون المستوى الأمثل.

بالإضافة إلى ذلك، يسلط المؤلفون الضوء على قيود الشبكات العصبية التكرارية (RNNs)، التي تواجه صعوبات مع تسلسلات البيانات الطويلة بسبب مشكلات مثل تلاشي التدرجات وعدم قدرة الشبكات القياسية للذاكرة طويلة وقصيرة المدى (LSTM) على إدارة الفواصل الزمنية غير المنتظمة الشائعة في بيانات الرعاية الصحية. لمعالجة هذه القيود، يقترح المؤلفون نموذجًا يدمج آلية قائمة على الانتباه، تهدف إلى تعزيز التقاط الاعتماد، وتحسين قابلية التفسير، وتسهيل الحساب المتوازي، مما يساهم في تقدم مجال التنبؤ بأمراض القلب.

Journal: Scientific Reports, Volume: 14, Issue: 1
DOI: https://doi.org/10.1038/s41598-024-51184-7
PMID: https://pubmed.ncbi.nlm.nih.gov/38177293
Publication Date: 2024-01-04
Author(s): Atta Rahman et al.
Primary Topic: Artificial Intelligence in Healthcare

Overview

The research addresses the critical issue of cardiovascular diseases (CVDs), which account for over 17 million deaths globally. Early and accurate detection of heart failure is essential for effective clinical intervention. This study introduces a novel self-attention-based transformer model designed to predict CVD risk by analyzing patient characteristics such as blood pressure, cholesterol levels, and heart rate. The model leverages self-attention mechanisms to capture contextual information and generate representations that effectively model complex data patterns. Notably, it enhances interpretability by assigning attention weights to input features, enabling physicians to understand the contributions of various data elements to the model’s predictions. The model was tested on the Cleveland dataset from the UCI machine learning repository, achieving an impressive accuracy of 96.51%, surpassing several baseline approaches.

In conclusion, the proposed attention-based transformer model represents a significant advancement in heart disease prediction, utilizing position-level attention mechanisms to improve representation learning compared to traditional recurrent neural networks (RNNs). Its architecture allows for efficient parallel processing, reducing training and inference times while maintaining high accuracy in real-world applications. The versatility of this model suggests its applicability to various clinical risk prediction tasks beyond heart disease. Future work aims to integrate transfer learning to further enhance performance, particularly in scenarios with limited labeled data, thereby broadening its potential impact in healthcare decision-making.

Results

The results of the study demonstrate that the proposed Self-Attention-based Transformer Model significantly outperforms baseline techniques, including RNN and RETAIN, in heart disease prediction tasks. The model achieved an impressive accuracy of 95.2% on a cardiovascular disease dataset containing 70,000 instances with 11 features, as detailed in Table 4. Notably, the Transformer Model exhibited faster training times compared to RNN models, which struggle with sequential data processing, leading to longer training durations. The model’s architecture allows for efficient parallelization, enhancing both performance and interpretability, which is crucial in healthcare applications for understanding the factors influencing heart failure predictions.

Additionally, the study highlights the model’s training and testing accuracies of 97.17% and 96.51%, respectively, after 90 epochs, with minimal training and testing losses of 0.10 and 0.12. The proposed model not only surpasses state-of-the-art approaches but also provides clearer insights into the prediction process through its self-attention mechanism, which allocates attention weights to various input features. While the complexity of the Transformer architecture poses challenges in understanding its decision-making, attention visualization techniques were employed to enhance interpretability. Overall, the findings underscore the model’s potential in improving heart disease diagnosis through advanced machine learning methodologies.

Discussion

The discussion section of the research paper highlights the critical role of artificial intelligence (AI) in the early detection and prediction of heart disease, a leading cause of mortality globally. Various machine learning (ML) and deep learning (DL) techniques have been explored, with ensemble methods like Random Forest (RF) demonstrating promising results. Notably, the Reverse Time Attention model (RETAIN) and other classification methods have been employed to enhance prediction accuracy by focusing on significant features and temporal sequences. However, existing models often struggle with feature extraction in complex datasets, necessitating innovative approaches to improve prediction reliability.

The paper introduces a novel self-attention-based transformer model designed to address the limitations of previous methods. This model not only enhances prediction accuracy but also offers improved adaptability and interpretability, making it suitable for diverse healthcare applications. Key contributions include the identification of critical risk factors for heart disease and the establishment of a robust framework that leverages self-attention mechanisms to capture intricate patterns within medical data. The proposed model aims to facilitate better clinical decision-making by providing a more efficient and effective tool for predicting cardiovascular disease risk, ultimately contributing to improved patient outcomes.

Limitations

In the “Limitations” section, the authors discuss the challenges associated with existing heart disease prediction methods, particularly in processing sequential data and improving patient outcomes. They identify key shortcomings in prior approaches, including inadequate modeling of input dataset attributes, ineffective computation of risk factors, and a lack of consideration for the varying costs of misclassification, especially for minority classes in healthcare. Notably, traditional classifiers, such as Naive Bayes (NB), treat features independently, leading to suboptimal decision support systems.

Additionally, the authors highlight the limitations of recurrent neural networks (RNNs), which struggle with long data sequences due to issues like vanishing gradients and the inability of standard Long Short-Term Memory (LSTM) networks to manage irregular time intervals common in healthcare data. To address these limitations, the authors propose a model that incorporates an attention-based mechanism, aiming to enhance dependency capture, improve interpretability, and facilitate parallel computation, thereby advancing the field of heart disease prediction.