إطار عمل هجيني قابل للتفسير يعتمد على التحويلات البصرية الفيدرالية لتوقع سرطان الثدي عبر عوامل الخطر A hybrid explainable federated-based vision transformer framework for breast cancer prediction via risk factors

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-96527-0
PMID: https://pubmed.ncbi.nlm.nih.gov/40419634
تاريخ النشر: 2025-05-27
المؤلف: Aymen M. Al-Hejri وآخرون
الموضوع الرئيسي: الذكاء الاصطناعي في اكتشاف السرطان

نظرة عامة

تقدم هذه الورقة البحثية إطار عمل جديد قابل للتفسير للتعلم الفيدرالي يهدف إلى تحسين توقعات سرطان الثدي، وهي قضية حاسمة نظرًا لوضعها كأحد الأسباب الرئيسية للوفاة بين النساء. تقيم الدراسة ثلاث منهجيات تعلم عميق – نماذج الذكاء الاصطناعي الفردية، ونماذج تجميع الفضاءات المميزة عالية المستوى، ونموذج هجين يجمع بين ميزات المحول البصري العالمي (ViT) وميزات الشبكة العصبية التلافيفية المحلية (CNN) – عبر كل من الإعدادات المركزية والفيدرالية. تم اختبار النماذج على مهام التصنيف الثنائي، والتصنيف متعدد الفئات، ونظام تقرير بيانات تصوير الثدي (BI-RADS) باستخدام مجموعة بيانات فريدة تشمل عوامل الخطر من العالم الحقيقي. يظهر نهج التعلم الفيدرالي، الذي يجمع التوقعات من ثلاثة عملاء، أداءً متفوقًا بدقة 98.65%، 97.30%، و95.59% للمهام المعنية، بينما يتضمن أيضًا تقنيات الذكاء الاصطناعي القابل للتفسير (XAI) لتعزيز شفافية النموذج.

في الخاتمة، تبرز الورقة تنفيذ كل من سيناريوهات التعلم المركزي والفيدرالي لتوقع سرطان الثدي بناءً على عوامل الخطر المستمدة من السجلات الطبية للمرضى. تحدد الدراسة عوامل الخطر من خلال السجلات الصحية وتصنفها وفقًا لدرجات BI-RADS. استخدم النهج المركزي نماذج التعلم الآلي والتعلم التجميعي، بينما حقق النموذج المقترح CEET-Fed، الذي يجمع بين التعلم التجميعي مع مشفر المحول، دقة قدرها 97.30%. لا يحافظ إعداد التعلم الفيدرالي على خصوصية العملاء فيما يتعلق ببيانات خطر الثدي الحساسة فحسب، بل يعالج أيضًا عدم توازن البيانات بشكل فعال. بشكل عام، تفوق النموذج الفيدرالي على السيناريو المركزي، محققًا معدلات دقة ملحوظة ويظهر إمكانيات XAI في توضيح نتائج التوقع.

الطرق

في هذا القسم، يقدم المؤلفون النتائج التجريبية والمناقشات المتعلقة بقدرات التوقع لنماذج قائمة على عوامل الخطر لسرطان الثدي في كل من سيناريوهات التعلم المركزي والفيدرالي. تشمل التقييمات ثلاث طرق تصنيف: التصنيف الثنائي، وتصنيف ثلاث فئات، وتصنيف BI-RADS.

تستخدم الدراسة مجموعة من المنهجيات، بما في ذلك نماذج التعلم الآلي (ML)، والتعلم الانتقالي التجميعي، ونماذج المحولات، والنماذج الفيدرالية، لتعزيز توقع سرطان الثدي بناءً على عوامل الخطر والسجلات الصحية. تشير النتائج إلى فعالية هذه التقنيات النمذجة المتنوعة في توقع خطر سرطان الثدي بدقة، مما يبرز إمكانيات التعلم الفيدرالي في الاستفادة من البيانات الموزعة مع الحفاظ على الخصوصية.

النتائج

يقدم قسم النتائج تقييمًا مقارنًا لأساليب توقع خطر سرطان الثدي، مقارنًا الدراسات السابقة بالنموذج المقترح. يبرز استخدام مصادر بيانات متنوعة، مثل السجلات الصحية المرتبطة، والأشعة السينية للثدي، والسجلات الطبية الإلكترونية، وميزات العلامات الحيوية السريرية، عبر مختلف أساليب التعلم الآلي (ML) والتعلم العميق (DL). تم تقييم مقاييس الأداء بشكل أساسي باستخدام المساحة تحت المنحنى (AUC) مع فترة ثقة 95% (CI).

حقق النموذج المقترح، الذي يدمج الشبكات العصبية التلافيفية (CNN) مع المحولات البصرية (ViT) ويستخدم التعلم الفيدرالي، AUC قدره 0.970 (95% CI: 0.929-1) على مجموعة بيانات تضم 734 مشاركًا. تفوق هذا النموذج بشكل كبير على الطرق الحالية من خلال دمج عوامل الخطر من السجلات الصحية الإلكترونية (EHR) والأعراض بشكل فعال. تؤكد النتائج على إمكانيات تقنيات التعلم العميق المتقدمة والتعلم الفيدرالي في تعزيز دقة التوقع في تقييم خطر سرطان الثدي، مما يبرز قيمة استخدام مصادر بيانات متنوعة وخوارزميات متطورة.

المناقشة

تسلط قسم المناقشة في الورقة البحثية الضوء على منهجيات ونتائج مختلفة تتعلق بتوقع سرطان الثدي، مع التركيز على دمج عوامل الخطر والبيانات السريرية وتقنيات التعلم الآلي المتقدمة. تحدد عوامل الخطر غير القابلة للتعديل، مثل الجنس وتاريخ العائلة، إلى جانب العوامل القابلة للتعديل مثل مؤشر كتلة الجسم، والعلاج الهرموني، وخيارات نمط الحياة. تشير الورقة إلى عدة نماذج، بما في ذلك الانحدار اللوجستي (RF-LR) والشبكات العصبية التلافيفية (CNN)، التي تم تقييمها على مجموعات بيانات كبيرة. من الجدير بالذكر أن نموذج التعلم العميق الهجين الذي يجمع بين عوامل الخطر وصور الماموجرام حقق AUC قدره 0.70، بينما وصل نموذج آخر يستخدم تقنيات التعلم الآلي والتعلم العميق إلى AUC قدره 0.91، مما يظهر إمكانيات هذه الأساليب في تعزيز دقة التوقع.

بالإضافة إلى ذلك، يناقش القسم تطبيق التعلم الانتقالي، والتعلم التجميعي، والتعلم الفيدرالي في الكشف عن سرطان الثدي. أظهرت نماذج التعلم الانتقالي، مثل VGG16 وResNet50، نتائج واعدة، مع تحسين طرق التجميع لدقة التصنيف بشكل كبير. يتم تسليط الضوء على التعلم الفيدرالي كنهج متقدم يسمح بتدريب النماذج بشكل تعاوني مع الحفاظ على خصوصية البيانات، محققًا معدلات دقة عالية في الكشف عن سرطان الثدي. بشكل عام، تؤكد النتائج على أهمية دمج مصادر بيانات متنوعة وتقنيات حسابية متقدمة لتحسين توقع وتصنيف خطر سرطان الثدي.

القيود

ت stem القيود في هذا البحث بشكل أساسي من التحديات المرتبطة بجمع البيانات في المجال الطبي، والذي يتطلب موارد كبيرة ويستغرق وقتًا طويلاً. قامت الدراسة بتحليل عوامل خطر سرطان الثدي والسجلات الطبية لـ 734 حالة، لكن فعالية نماذج الذكاء الاصطناعي تعتمد على توفر مجموعات بيانات واسعة. بالإضافة إلى ذلك، تعيق الغموض في الحالات الحدية، مثل التمييز بين تصنيفات Bi2 وBi3، دقة التوقع. أظهر استخدام تفسيرات نموذجية قابلة للتفسير (LIME) هذه التحديات، مما يشير إلى الحاجة إلى مزيد من تحسين النماذج.

يتم اقتراح العمل المستقبلي لتعزيز نتائج البحث، بما في ذلك تطوير نموذج هجين يدمج صور الماموجرام والأشعة فوق الصوتية مع مجموعة بيانات عوامل خطر سرطان الثدي. يقترح المؤلفون أيضًا إنشاء نظام شامل للتجزئة والكشف باستخدام نماذج متقدمة مثل YOLO، جنبًا إلى جنب مع توليد تقارير الذكاء الاصطناعي. علاوة على ذلك، يُوصى باستكشاف تقنيات ضغط النماذج ونشر حلول الحوسبة الطرفية لتحسين الوصول والكفاءة في التطبيقات العملية.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-96527-0
PMID: https://pubmed.ncbi.nlm.nih.gov/40419634
Publication Date: 2025-05-27
Author(s): Aymen M. Al-Hejri et al.
Primary Topic: AI in cancer detection

Overview

This research paper presents a novel explainable federated learning framework aimed at improving breast cancer prediction, a critical issue given its status as a leading cause of mortality among women. The study evaluates three deep learning methodologies—individual AI models, high-level feature space ensemble models, and a hybrid model that integrates global Vision Transformer (ViT) and local convolutional neural network (CNN) features—across both centralized and federated settings. The models are tested on binary, multi-class, and Breast Imaging Reporting and Data System (BI-RADS) classification tasks using a unique dataset that includes real-world risk factors. The federated learning approach, which aggregates predictions from three clients, demonstrates superior performance with accuracies of 98.65%, 97.30%, and 95.59% for the respective tasks, while also incorporating Explainable AI (XAI) techniques to enhance model transparency.

In the conclusion, the paper highlights the implementation of both centralized and federated learning scenarios to predict breast cancer based on risk factors derived from patients’ medical records. The study identifies risk factors through health records and classifies them according to BI-RADS scores. The centralized approach utilized machine learning models and ensemble learning, while the proposed CEET-Fed model, which combines ensemble learning with a transformer encoder, achieved an accuracy of 97.30%. The federated learning setting not only preserves client privacy regarding sensitive breast risk data but also effectively addresses data imbalance. Overall, the federated model outperformed the centralized scenario, achieving notable accuracy rates and demonstrating the potential of XAI in elucidating prediction outcomes.

Methods

In this section, the authors present the experimental results and discussions regarding the prediction capabilities of risk factor-based models for breast cancer in both centralized and federated learning scenarios. The evaluation encompasses three classification approaches: binary classification, three-class classification, and BI-RADS classification.

The study employs a range of methodologies, including machine learning (ML) models, ensemble transfer learning, transformer models, and federated models, to enhance the prediction of breast cancer based on risk factors and health records. The findings indicate the effectiveness of these diverse modeling techniques in accurately predicting breast cancer risk, highlighting the potential of federated learning in leveraging distributed data while maintaining privacy.

Results

The results section presents a comparative evaluation of breast cancer risk prediction methodologies, contrasting previous studies with the proposed model. It highlights the use of various data sources, such as linked health records, mammograms, electronic medical records, and biomarker clinical features, across different machine learning (ML) and deep learning (DL) approaches. Performance metrics were primarily assessed using the area under the curve (AUC) with a 95% confidence interval (CI).

The proposed model, which integrates Convolutional Neural Networks (CNN) with Vision Transformers (ViT) and employs federated learning, achieved an AUC of 0.970 (95% CI: 0.929-1) on a dataset comprising 734 participants. This model significantly outperformed existing methods by effectively incorporating risk factors from electronic health records (EHR) and symptoms. The findings underscore the potential of advanced deep learning techniques and federated learning to enhance predictive accuracy in breast cancer risk assessment, emphasizing the value of utilizing diverse data sources and cutting-edge algorithms.

Discussion

The discussion section of the research paper highlights various methodologies and findings related to breast cancer prediction, emphasizing the integration of risk factors, clinical data, and advanced machine learning techniques. It identifies significant non-modifiable risk factors, such as gender and family history, alongside modifiable factors like BMI, hormonal therapy, and lifestyle choices. The paper references several models, including logistic regression (RF-LR) and convolutional neural networks (CNN), which were evaluated on large datasets. Notably, a hybrid deep learning model combining risk factors and mammography images achieved an AUC of 0.70, while another model utilizing machine learning and deep learning techniques reached an AUC of 0.91, demonstrating the potential of these approaches in enhancing predictive accuracy.

Additionally, the section discusses the application of transfer learning, ensemble learning, and federated learning in breast cancer detection. Transfer learning models, such as VGG16 and ResNet50, have shown promising results, with ensemble methods improving classification accuracy significantly. Federated learning is highlighted as a cutting-edge approach that allows for collaborative model training while preserving data privacy, achieving high accuracy rates in breast cancer detection. Overall, the findings underscore the importance of integrating diverse data sources and advanced computational techniques to improve breast cancer risk prediction and classification.

Limitations

The limitations of this research primarily stem from the challenges associated with data collection in the medical field, which is both resource-intensive and time-consuming. The study analyzed breast cancer risk factors and medical records for 734 cases, but the effectiveness of AI models is contingent upon the availability of extensive datasets. Additionally, ambiguities in borderline cases, such as distinguishing between Bi2 and Bi3 classifications, hinder prediction accuracy. The use of Local Interpretable Model-agnostic Explanations (LIME) has revealed these challenges, indicating a need for further refinement of the models.

Future work is suggested to enhance the research outcomes, including the development of a hybrid model that integrates mammogram and ultrasound images with the breast cancer risk factor dataset. The authors also propose creating an end-to-end system for segmentation and detection using advanced models like YOLO, alongside AI report generation. Furthermore, exploring model compression techniques and deploying edge computing solutions are recommended to improve accessibility and efficiency in practical applications.