نموذج شبكة عصبية تلافيفية هجينة وbi-LSTM مع EfficientNet-B0 للكشف عن سرطان الثدي وتصنيفه بدقة عالية Hybrid convolutional neural network and bi-LSTM model with EfficientNet-B0 for high-accuracy breast cancer detection and classification

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-95311-4
PMID: https://pubmed.ncbi.nlm.nih.gov/40204759
تاريخ النشر: 2025-04-09
المؤلف: Umesh Kumar Lilhore وآخرون
الموضوع الرئيسي: الذكاء الاصطناعي في اكتشاف السرطان

نظرة عامة

تقدم البحث نموذج هجين جديد لاكتشاف سرطان الثدي يجمع بين الشبكات العصبية التلافيفية (CNNs) وشبكات الذاكرة طويلة وقصيرة المدى ثنائية الاتجاه (Bi-LSTM) وهيكل EfficientNet-B0 المدرب مسبقًا. من خلال الاستفادة من EfficientNet-B0، الذي تم تدريبه على مجموعة بيانات ImageNet الواسعة، يستفيد النموذج من التعلم الانتقالي، مما يسمح باستخراج ميزات أكثر فعالية من الصور الماموجرافية مقارنةً بالشبكات العصبية التلافيفية التقليدية التي تتطلب التدريب من الصفر. يعزز دمج Bi-LSTM قدرة النموذج على معالجة الاعتماد الزمني في البيانات، وهو أمر حاسم لتحديد الأنماط المعقدة في تصوير سرطان الثدي. حقق النموذج دقة مثيرة للإعجاب بلغت 99.2% في التمييز بين الأورام الحميدة والخبيثة عند تقييمه على مجموعات بيانات مثل CBIS-DDSM وMIAS، متفوقًا على هياكل أخرى مثل VGG-16 وResNet-50.

تخلص الدراسة إلى أن النموذج الهجين المقترح يحسن بشكل كبير من الدقة التنبؤية وقابلية التفسير في اكتشاف سرطان الثدي. تم تحسين أداء النموذج باستخدام مُحسِّن آدم، الذي حقق باستمرار أعلى دقة وأقل خسارة عبر مراحل التدريب والتحقق المختلفة. أظهرت دراسة الإزالة أهمية كل مكون، كاشفة أن النموذج الكامل (CNN + EfficientNet-B0 + Bi-LSTM) تفوق على التكوينات التي تفتقر إما إلى المعالجة الزمنية لـ Bi-LSTM أو قدرات استخراج الميزات لـ EfficientNet-B0. يعزز دمج تقنيات تصور الميزات من شفافية النموذج، مما يوفر رؤى حول عملية اتخاذ القرار الخاصة به، وهو أمر حيوي للتطبيقات السريرية. يضع هذا النهج الهجين معيارًا جديدًا في الرعاية الصحية التنبؤية لاكتشاف سرطان الثدي، مما يبرز إمكانيته في تحويل الممارسات السريرية.

الطرق

في قسم الطرق، يوضح المؤلفون مجموعة البيانات المستخدمة وهيكل النموذج الهجين المقترح لتشخيص سرطان الثدي. يتم تقديم تحليل مقارن في الجدول 18، الذي يقيم نماذج التعلم العميق المختلفة من 13 دراسة نُشرت في عام 2024. يتم تسليط الضوء على مقاييس الأداء الرئيسية مثل الدقة والحساسية والنوعية والدقة والمساحة تحت المنحنى (AUC) ودرجة F1 وكابا كوهين، مما يُظهر فعالية النماذج. يحقق النموذج الهجين المقترح دقة تبلغ 99.00%، وحساسية تبلغ 95.00%، ونوعية تبلغ 99.50%، مما يشير إلى قدراته الفائقة في الكشف والتصنيف مقارنةً بالتقنيات الحالية.

تؤكد النتائج المستخلصة من مجموعات بيانات MIAS وCBIS-DDSM على دقة النموذج المقترح وموثوقيته المحسّنة في تصنيف سرطان الثدي، مما يمثل تقدمًا كبيرًا على المنهجيات الحالية. يبرز هذا البحث إمكانيات تقنيات التعلم العميق المتقدمة لتكون مدمجة بشكل فعال في الممارسات السريرية، مما يحسن في النهاية نتائج المرضى في تشخيص سرطان الثدي.

النتائج

تظهر نتائج الدراسة فعالية النموذج الهجين المقترح، الذي يدمج CNN مع EfficientNet-B0 لاستخراج الميزات وBi-LSTM لنمذجة التسلسل، في تصنيف سرطان الثدي باستخدام مجموعات بيانات CBIS-DDSM وMIAS. في التصنيف الثنائي على مجموعة بيانات CBIS-DDSM، حقق النموذج الهجين دقة تبلغ 99.30%، وحساسية تبلغ 97.85%، ونوعية تبلغ 99.27%، ودقة تبلغ 98.54%، وAUC قدره 0.99، متفوقًا بشكل كبير على النماذج التقليدية مثل DenseNet201 وResNet-50. بالنسبة للتصنيف متعدد الفئات، حافظ النموذج على أداء عالٍ بدقة تبلغ 99.08% وحساسية تبلغ 96.05%. وبالمثل، على مجموعة بيانات MIAS، سجل النموذج الهجين دقة تبلغ 89.20% دون معالجة مسبقة، والتي تحسنت إلى 99.00% بعد المعالجة المسبقة، مما يبرز الدور الحاسم لمعالجة البيانات المسبقة في تحسين أداء النموذج.

كما كشفت التحليلات أن مُحسِّن آدم تفوق باستمرار على المحسنات الأخرى، مما يعزز تقارب النموذج واستقراره، خاصة في مهام تصنيف الصور الطبية. أكدت دراسة الإزالة أن كل مكون من مكونات النموذج الهجين يساهم في فعاليته العامة، محققًا مقاييس ملحوظة مثل 99.2% دقة و99.1% درجة F1. تؤكد هذه النتائج على إمكانيات النموذج الهجين كأداة قوية لاكتشاف سرطان الثدي، قادرة على تحسين التشخيص المبكر وتوفير نتائج موثوقة في البيئات السريرية.

المناقشة

يقدم البحث نموذج CNN-LSTM هجين محسّن باستخدام تحسين آدم يهدف إلى تعزيز توقع صور سرطان الثدي من خلال دمج الميزات المكانية والزمنية بشكل فعال. تشمل المساهمات الرئيسية تحسين هيكل CNN مع طبقات تلافيفية إضافية لتحسين التعرف على الأنماط المكانية، وهيكل Bi-LSTM المحسن لتمثيل بيانات التسلسل بشكل أفضل، وتطبيق التعلم الانتقالي باستخدام نموذج EfficientNet-B0 المدرب مسبقًا على ImageNet. تعالج ضبط المعلمات عبر تحسين آدم مشكلة الإفراط في التكيف ونقص التكيف، مما يؤدي إلى توقعات أسرع وأكثر موثوقية. يظهر النموذج المقترح دقة محسّنة على مجموعات بيانات سرطان الثدي CBIS-DDSM وMIAS، متفوقًا على النماذج الحالية مثل VGG-16 وResNet-50.

تسلط المناقشة الضوء على أهمية منهجيات التعلم العميق في تشخيص سرطان الثدي، مع التأكيد على الحاجة إلى نماذج قوية يمكن أن تعمم عبر مجموعات سكانية وطرق تصوير متنوعة. أظهرت الدراسات السابقة درجات متفاوتة من النجاح في اكتشاف سرطان الثدي باستخدام التعلم العميق، مع التركيز على دمج البيانات متعددة الوسائط وتحسين قابلية تفسير النموذج. يبرز البحث أهمية الذكاء الاصطناعي القابل للتفسير لمساعدة المتخصصين في الرعاية الصحية في اتخاذ القرارات السريرية ويقترح اتجاهات مستقبلية تشمل معالجة عدم توازن البيانات، وتعزيز قابلية تعميم النموذج، ودمج تقنيات معالجة البيانات في الوقت الحقيقي مثل إنترنت الأشياء الطبية (IoMT). بشكل عام، بينما تعتبر التقدمات في التعلم العميق لاكتشاف سرطان الثدي ملحوظة، فإن الجهود المستمرة مطلوبة لتعزيز قوة النموذج وقابلية تفسيره.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-95311-4
PMID: https://pubmed.ncbi.nlm.nih.gov/40204759
Publication Date: 2025-04-09
Author(s): Umesh Kumar Lilhore et al.
Primary Topic: AI in cancer detection

Overview

The research presents a novel hybrid model for breast cancer detection that combines Convolutional Neural Networks (CNNs), Bidirectional Long Short-Term Memory (Bi-LSTM) networks, and the pre-trained EfficientNet-B0 architecture. By utilizing EfficientNet-B0, which is trained on the extensive ImageNet dataset, the model benefits from transfer learning, allowing for more effective feature extraction from mammographic images compared to traditional CNNs that require training from scratch. The incorporation of Bi-LSTM enhances the model’s ability to process temporal dependencies in the data, crucial for identifying complex patterns in breast cancer imaging. The model achieved an impressive accuracy of 99.2% in distinguishing between benign and malignant tumors when evaluated on datasets such as CBIS-DDSM and MIAS, outperforming other architectures like VGG-16 and ResNet-50.

The study concludes that the proposed hybrid model significantly improves predictive accuracy and interpretability in breast cancer detection. The model’s performance was optimized using the Adam optimizer, which consistently yielded the highest accuracy and lowest loss across various training and validation phases. An ablation study demonstrated the importance of each component, revealing that the full model (CNN + EfficientNet-B0 + Bi-LSTM) outperformed configurations lacking either the temporal processing of Bi-LSTM or the feature extraction capabilities of EfficientNet-B0. The integration of feature visualization techniques further enhances the model’s transparency, providing insights into its decision-making process, which is vital for clinical applications. This hybrid approach sets a new standard in predictive healthcare for breast cancer detection, highlighting its potential to transform clinical practices.

Methods

In the Methods section, the authors detail the dataset utilized and the architecture of the proposed hybrid model for breast cancer diagnosis. A comparative analysis is presented in Table 18, which evaluates various deep learning models from 13 studies published in 2024. Key performance metrics such as accuracy, sensitivity, specificity, precision, area under the curve (AUC), F1 score, and Cohen’s Kappa are highlighted, demonstrating the efficacy of the models. The proposed hybrid model achieves an accuracy of 99.00%, sensitivity of 95.00%, and specificity of 99.50%, indicating its superior detection and classification capabilities compared to existing techniques.

The findings from the MIAS and CBIS-DDSM datasets further validate the proposed model’s enhanced accuracy and reliability in breast cancer classification, marking a significant advancement over current methodologies. This research underscores the potential of advanced deep learning techniques to be effectively integrated into clinical practices, ultimately improving patient outcomes in breast cancer diagnosis.

Results

The results of the study demonstrate the effectiveness of the proposed Hybrid Model, which integrates CNN with EfficientNet-B0 for feature extraction and Bi-LSTM for sequence modeling, in classifying breast cancer using the CBIS-DDSM and MIAS datasets. In binary classification on the CBIS-DDSM dataset, the Hybrid Model achieved an accuracy of 99.30%, sensitivity of 97.85%, specificity of 99.27%, precision of 98.54%, and an AUC of 0.99, significantly outperforming traditional models such as DenseNet201 and ResNet-50. For multi-class classification, the model maintained high performance with an accuracy of 99.08% and sensitivity of 96.05%. Similarly, on the MIAS dataset, the Hybrid Model recorded an accuracy of 89.20% without preprocessing, which improved to 99.00% post-preprocessing, highlighting the critical role of data preprocessing in enhancing model performance.

The analysis also revealed that the Adam optimizer consistently outperformed other optimizers, enhancing the model’s convergence and stability, particularly in medical image classification tasks. The ablation study confirmed that each component of the Hybrid Model contributes to its overall effectiveness, achieving notable metrics such as 99.2% accuracy and 99.1% F1-score. These findings underscore the Hybrid Model’s potential as a robust tool for breast cancer detection, capable of improving early diagnosis and providing reliable outcomes in clinical settings.

Discussion

The research presents an Improved Adam optimization-optimized hybrid CNN-LSTM model aimed at enhancing breast cancer image prediction by effectively integrating spatial and temporal features. Key contributions include an improved CNN architecture with additional convolutional layers for better spatial pattern recognition, an enhanced Bi-LSTM structure for superior representation of sequential data, and the application of transfer learning using the EfficientNet-B0 model pre-trained on ImageNet. Hyperparameter tuning via Adam optimization addresses overfitting and underfitting, leading to faster and more reliable predictions. The proposed model demonstrates improved accuracy on breast cancer datasets CBIS-DDSM and MIAS, outperforming existing models such as VGG-16 and ResNet-50.

The discussion highlights the significance of deep learning methodologies in breast cancer diagnosis, emphasizing the need for robust models that can generalize across diverse populations and imaging modalities. Previous studies have shown varying degrees of success in breast cancer detection using deep learning, with a focus on integrating multimodal data and improving model interpretability. The research underscores the importance of explainable AI to assist healthcare professionals in clinical decision-making and suggests future directions that include addressing data imbalance, enhancing model generalizability, and incorporating real-time data processing technologies like the Internet of Medical Things (IoMT). Overall, while advancements in deep learning for breast cancer detection are notable, ongoing efforts are required to enhance model robustness and interpretability.