شبكة عصبية عميقة مدفوعة بالذكاء الاصطناعي قابلة للتفسير للكشف الدقيق عن سرطان الثدي من الصور النسيجية والصور بالموجات فوق الصوتية An explainable AI-driven deep neural network for accurate breast cancer detection from histopathological and ultrasound images

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-97718-5
PMID: https://pubmed.ncbi.nlm.nih.gov/40394112
تاريخ النشر: 2025-05-20
المؤلف: Md. Romzan Alom وآخرون
الموضوع الرئيسي: الذكاء الاصطناعي في اكتشاف السرطان

نظرة عامة

تقدم البحث نموذج الكشف عن سرطان الثدي باستخدام الشبكات العصبية العميقة (DNBCD)، وهو إطار مبتكر مصمم لتعزيز الكشف المبكر والدقيق عن سرطان الثدي من خلال التحليل الآلي للصور الهستوباثولوجية والأشعة فوق الصوتية. باستخدام بنية تعلم عميق تعتمد على Densenet121، يتضمن نموذج DNBCD طبقات شبكة عصبية تلافيفية (CNN) مخصصة، بما في ذلك GlobalAveragePooling2D وDense وDropout، إلى جانب تقنيات التعلم الانتقالي. لا يحقق هذا النهج دقة تصنيف عالية فحسب – 93.97% على مجموعة بيانات Breakhis-400x و89.87% على مجموعة بيانات صور الأشعة فوق الصوتية للثدي – بل يبرز أيضًا القابلية للتفسير من خلال دمج Grad-CAM (تخطيط تنشيط الفئة المعتمد على التدرج)، الذي يبرز بصريًا المناطق التي تؤثر على توقعات النموذج.

تشير نتائج الدراسة إلى أن DNBCD يتفوق بشكل كبير على الأساليب الحالية الرائدة، مما يجعله أداة واعدة للتطبيقات السريرية في تشخيص سرطان الثدي. من خلال تقديم تبريرات بصرية لتوقعاته، يعزز النموذج الثقة والشفافية بين مقدمي الرعاية الصحية، مما يعالج قيدًا حاسمًا لنماذج التعلم العميق التقليدية في السياقات الطبية. تشير النتائج إلى أن DNBCD لا يعمل فقط كمصدر موثوق للتشخيص ولكن لديه أيضًا القدرة على تحسين نتائج المرضى من خلال الكشف المبكر والدقيق عن سرطان الثدي. تهدف الأعمال المستقبلية إلى تطوير هذه الأداة بشكل أكبر للاستخدام السريري الواسع، مما يساهم في التقدم في تكنولوجيا الصحة الرقمية.

طرق

تم تصميم المنهجية المقترحة لنظام DNBCD لتحليل وتصنيف صور سرطان الثدي بفعالية. تبدأ العملية بالحصول على صور الإدخال من مجموعة بيانات، تليها معالجة مسبقة لتحسين جودة الصورة. ثم يتم تقسيم مجموعة البيانات إلى مجموعات تدريب واختبار وتحقق باستخدام استراتيجيات تقسيم عشوائية أو طبقية. يتم تدريب نماذج مختلفة، بما في ذلك Densenet121 وMobilenet وResnet50 وVGG19 وغيرها، مع التركيز على Densenet121 وMobilenet وResnet50 وVGG19 لتطوير النموذج النهائي. يتم تعديل هذه النماذج من خلال إضافة طبقات شبكة عصبية تلافيفية (CNN) واستخدام تقنيات التعلم الانتقالي، مما يؤدي إلى اختيار تكوين يعتمد على Densenet121 كنموذج DNBCD لتصنيف سرطان الثدي. يوفر النظام أيضًا تفسيرات قابلة للتفسير لنتائج تصنيفه.

تم إجراء الإعداد التجريبي لهذا البحث على Kaggle، باستخدام نظام مزود بوحدات معالجة الرسوميات P100 (ذاكرة 8 جيجابايت) ومعالج Intel Xeon Platinum 8259CL، مما يضمن قوة حسابية كافية لمهام التعلم العميق. تم تنفيذ العمل باستخدام Python 3.7، مع TensorFlow وKeras كإطارات عمل رئيسية. يتم تلخيص المعلمات التفصيلية المستخدمة خلال التجارب في الجدول 5، مما يوفر رؤى حول إعدادات التكوين التي أبلغت عملية التدريب.

نقاش

في قسم النقاش هذا، يتم مراجعة دراسات مختلفة تستخدم تقنيات التعلم العميق والتعلم الآلي لتشخيص سرطان الثدي، مع التركيز على منهجياتها ونتائجها وقيودها. اقترح أوندكون وآخرون نموذج CNN-ANN هجين حقق دقة 89.47% على مجموعة بيانات BreakHis-400x، على الرغم من أنه واجه تحديات في الاسترجاع والتكيف الزائد. قدم أحمد وآخرون نموذج AlexNet المحسن كميًا (QOA) الذي حسن الدقة إلى 93.67% عند تكبير 400×، مما يبرز إمكانيات الحوسبة الكمومية في التصوير الطبي، ومع ذلك أثار مخاوف بشأن قابلية التوسع والتطبيق السريري. تشمل المساهمات البارزة الأخرى نماذج التعلم العميق الهجينة التي اقترحها غوبتا وآخرون، والتي دمجت Xception مع مصنفات مثل SVM وRandom Forest، محققة دقتين قدرهما 90.17% و88.98% على التوالي، مع معالجة قضايا عدم توازن مجموعة البيانات.

بالإضافة إلى ذلك، أظهرت الدراسات حول صور الأشعة فوق الصوتية للثدي، مثل تلك التي أجراها مونتيانو وآخرون وبكال وآخرون، فعالية نماذج التعلم العميق، حيث حقق نموذج Vision Transformer دقة قدرها 88.6%. ومع ذلك، كانت القيود مثل أحجام مجموعات البيانات الصغيرة والتكاليف الحسابية العالية شائعة عبر العديد من الأساليب. يؤكد النقاش على أهمية جودة مجموعة البيانات وتقنيات المعالجة المسبقة والحاجة إلى مجموعات بيانات أكبر وأكثر تنوعًا لتعزيز قابلية تعميم النموذج وأدائه. بشكل عام، يبرز البحث التقدم في الكشف عن سرطان الثدي من خلال هياكل نماذج مبتكرة وتقنيات تحسين، بينما يشير أيضًا إلى المجالات الحرجة للاستكشاف المستقبلي، بما في ذلك قابلية تفسير النموذج والتكامل السريري العملي.

القيود

تسلط القيود المفروضة على نموذج DNBCD لتشخيص سرطان الثدي الضوء على عدة تحديات حاسمة تؤثر على نشره وقابلية تعميمه. تشمل القضايا الرئيسية تباين جودة الصورة عبر مجموعات البيانات، المتأثرة بعوامل مثل الإضاءة والدقة، مما يمكن أن يعيق أداء النموذج. قد يقتصر تدريب النموذج على مجموعات بيانات محددة، وهي Breakhis-400x وBUSI، على قابليته للتطبيق على نطاق أوسع من سيناريوهات التصوير السريري. لتعزيز القابلية للتعميم، يجب أن تركز الأبحاث المستقبلية على دمج ظروف تصوير متنوعة والتحقق من النموذج على مجموعة أوسع من مجموعات البيانات التي تعكس مختلف الفئات السكانية وطرائق التصوير.

بالإضافة إلى ذلك، على الرغم من الجهود المبذولة للتخفيف من عدم توازن الفئات من خلال الوزن، لا تزال هناك تفاوتات كبيرة، خاصة بالنسبة للفئات الممثلة تمثيلاً ناقصًا، مما يؤثر على مقاييس الأداء مثل الاسترجاع والدقة. على سبيل المثال، بينما يحقق نموذج T_Resnet استرجاعًا عاليًا على مجموعة بيانات Breakhis-400x، فإن دقته منخفضة نسبيًا، مما يشير إلى أداء غير متساوٍ عبر الفئات. يجب استكشاف استراتيجيات مستقبلية، مثل زيادة العينة، وتوليد بيانات اصطناعية، أو طرق التجميع، لمعالجة هذا التوازن بشكل أفضل. أخيرًا، تؤدي تعقيدات النموذج إلى زيادة الوقت الحسابي، مما يطرح تحديات للتطبيق السريري في الوقت الحقيقي. سيكون من الضروري تحقيق توازن بين الدقة والكفاءة الحسابية في النسخ المستقبلية من نظام DNBCD، مما يحسن في النهاية من متانته وقابليته للتطبيق في الإعدادات السريرية.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-97718-5
PMID: https://pubmed.ncbi.nlm.nih.gov/40394112
Publication Date: 2025-05-20
Author(s): Md. Romzan Alom et al.
Primary Topic: AI in cancer detection

Overview

The research introduces the Deep Neural Breast Cancer Detection (DNBCD) model, an innovative framework designed to enhance the early and accurate detection of breast cancer through automated analysis of histopathological and ultrasound images. Utilizing a deep learning architecture based on Densenet121, the DNBCD model incorporates customized Convolutional Neural Network (CNN) layers, including GlobalAveragePooling2D, Dense, and Dropout layers, alongside transfer learning techniques. This approach not only achieves high classification accuracy—93.97% on the Breakhis-400x dataset and 89.87% on the Breast Ultrasound Images Dataset—but also emphasizes interpretability through the integration of Grad-CAM (Gradient-weighted Class Activation Mapping), which visually highlights the regions influencing the model’s predictions.

The study’s findings indicate that DNBCD significantly outperforms existing state-of-the-art methods, making it a promising tool for clinical applications in breast cancer diagnostics. By providing visual justifications for its predictions, the model enhances trust and transparency among healthcare providers, addressing a critical limitation of traditional deep learning models in medical contexts. The results suggest that DNBCD not only serves as a reliable diagnostic resource but also has the potential to improve patient outcomes through early and precise detection of breast cancer. Future work aims to further develop this tool for widespread clinical use, ultimately contributing to advancements in digital health technology.

Methods

The proposed methodology for the DNBCD system is designed to analyze and classify breast cancer images effectively. The process begins with the acquisition of input images from a dataset, followed by preprocessing to enhance image quality. The dataset is then divided into training, testing, and validation sets using either random or stratified splitting strategies. Various models, including Densenet121, Mobilenet, Resnet50, VGG19, and others, are trained, with a focus on Densenet121, Mobilenet, Resnet50, and VGG19 for the final model development. These models are modified by adding convolutional neural network (CNN) layers and utilizing transfer learning techniques, culminating in the selection of a Densenet121-based configuration as the DNBCD model for breast cancer classification. The system also provides interpretable explanations for its classification results.

The experimental setup for this research was conducted on Kaggle, utilizing a system with P100 GPUs (8 GB RAM) and an Intel Xeon Platinum 8259CL processor, ensuring sufficient computational power for deep learning tasks. The implementation was carried out using Python 3.7, with TensorFlow and Keras as the primary frameworks. Detailed hyperparameters used during the experiments are summarized in Table 5, offering insights into the configuration settings that informed the training process.

Discussion

In this discussion section, various studies utilizing deep learning and machine learning techniques for breast cancer diagnosis are reviewed, focusing on their methodologies, results, and limitations. Ogundokun et al. proposed a hybrid CNN-ANN model that achieved an accuracy of 89.47% on the BreakHis-400x dataset, although it faced challenges with recall and overfitting. Ahmed et al. introduced a Quantum-Optimized AlexNet (QOA) model that improved accuracy to 93.67% at 400× magnification, highlighting the potential of quantum computing in medical imaging, yet it raised concerns regarding scalability and clinical applicability. Other notable contributions include Gupta et al.’s hybrid deep transfer learning models, which combined Xception with classifiers like SVM and Random Forest, achieving accuracies of 90.17% and 88.98%, respectively, while also addressing dataset imbalance issues.

Additionally, studies on breast ultrasound images, such as those by Munteanu et al. and Pacal et al., demonstrated the effectiveness of deep learning models, with the Vision Transformer achieving an accuracy of 88.6%. However, limitations such as small dataset sizes and high computational costs were common across many approaches. The discussion emphasizes the importance of dataset quality, preprocessing techniques, and the need for larger, more diverse datasets to enhance model generalizability and performance. Overall, the research highlights the advancements in breast cancer detection through innovative model architectures and optimization techniques, while also pointing out the critical areas for future exploration, including model interpretability and practical clinical integration.

Limitations

The limitations of the DNBCD model for breast cancer diagnostics highlight several critical challenges that affect its deployment and generalizability. Key issues include variability in image quality across datasets, influenced by factors such as lighting and resolution, which can hinder model performance. The model’s training on specific datasets, namely Breakhis-400x and BUSI, may restrict its applicability to a broader range of clinical imaging scenarios. To enhance generalizability, future research should focus on incorporating diverse imaging conditions and validating the model on a wider array of datasets that reflect various demographics and imaging modalities.

Additionally, despite efforts to mitigate class imbalance through weighting, significant disparities remain, particularly for underrepresented classes, impacting performance metrics like recall and precision. For example, while the T_Resnet model achieves high recall on the Breakhis-400x dataset, its precision is comparatively low, indicating uneven performance across classes. Future strategies, such as oversampling, synthetic data generation, or ensemble methods, should be explored to better address this imbalance. Lastly, the model’s complexity leads to increased computational time, posing challenges for real-time clinical application. Balancing accuracy with computational efficiency will be essential for the future iterations of the DNBCD system, ultimately improving its robustness and applicability in clinical settings.