إطار التعلم الهجين ونظام الانتباه الذاتي لتصنيف أورام الدماغ المستندة إلى MRI بشكل قوي Hybrid transfer learning and self-attention framework for robust MRI-based brain tumor classification

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-09311-5
PMID: https://pubmed.ncbi.nlm.nih.gov/40595383
تاريخ النشر: 2025-07-01
المؤلف: Soumyarashmi Panigrahi وآخرون
الموضوع الرئيسي: كشف وتصنيف أورام الدماغ

نظرة عامة

تقدم ورقة البحث نموذجًا هجينًا جديدًا، يسمى DenseTransformer، يهدف إلى تعزيز دقة تصنيف أورام الدماغ باستخدام التصوير بالرنين المغناطيسي (MRI). يدمج النموذج ميزات من شبكة DenseNet201 العصبية التلافيفية المدربة مسبقًا مع بنية Transformer، مع استخدام خاص لآليات الانتباه الذاتي متعدد الرؤوس (MHSA) وكتل الانتباه للضغط والإثارة (SEA). تتناول هذه الطريقة التحديات المتعلقة بكثافة الحساب، واكتشاف التفاصيل، وحساسية الضوضاء، محققة دقة تصنيف ملحوظة تبلغ 99.41% على مجموعة بيانات Br35H المكونة من 3,000 صورة MRI. يتم دعم موثوقية النموذج من خلال تحليلات إحصائية شاملة، بما في ذلك درجة كابا لكوهين، ودرجة AUC، وتقييمات F1-score.

في الختام، يظهر نموذج DenseTransformer أداءً متفوقًا مقارنةً بنماذج CNN المدربة مسبقًا الأخرى، مثل VGG19 وInceptionV3، في مهام التصنيف الثنائي. بينما يظهر النموذج وعدًا للتطبيقات السريرية، قد تؤثر القيود مثل حجم مجموعة البيانات وتنوع جودة MRI على قابليته للتعميم. ستركز الأعمال المستقبلية على دمج البيانات السريرية مع صور MRI، واستكشاف تقنيات الذكاء الاصطناعي القابلة للتفسير (XAI) الإضافية، وتكييف النموذج لمهام التصنيف المتعدد والفصل. تؤكد الدراسة على إمكانيات نماذج التعلم العميق الهجينة في تحسين تشخيص أورام الدماغ ونتائج المرضى، مع الاعتراف أيضًا بالحاجة إلى تعليقات موثقة من الخبراء للتحقق من نتائج النموذج بشكل أكبر.

الطرق

في هذا القسم، يقدم المؤلفون بنية هجينة جديدة، تسمى DenseTransformer، والتي تدمج نقاط القوة في الشبكات العصبية التلافيفية (CNN) وTransformers بهدف التنبؤ بدقة بالتشخيصات المستندة إلى MRI لأورام الدماغ والأدمغة السليمة. يتم تمثيل المنهجية المقترحة بصريًا في مخطط (الشكل 4)، يوضح تصميم البنية.

تتم مناقشة تفاصيل تنفيذ نموذج DenseTransformer بشكل شامل، بما في ذلك النتائج التجريبية والمعايير الإحصائية التي تقيم فعاليته. يقوم المؤلفون بمقارنة نموذجهم مع أحدث المنهجيات والنماذج المدربة مسبقًا لوضع أدائه في سياق المشهد الحالي. بالإضافة إلى ذلك، تم إجراء دراسات إلغاء لتقييم مساهمات المكونات الفردية للنموذج الهجين، مما يوفر رؤى حول أدائه العام وقوته.

النتائج

تتناول قسم النتائج تقييم أداء نموذج DenseTransformer الهجين لتصنيف صور MRI لأورام الدماغ مقابل الأدمغة الطبيعية. تم التحقق من فعالية النموذج من خلال تكوينات تجريبية متنوعة، مع اختيار تكوين إعادة تشكيل الميزات (16،32) لتحقيق توازن مثالي بين طول التسلسل وكفاءة الحساب. حقق هذا التكوين أعلى دقة بلغت 99.41% عند تدريبه على مجموعة بيانات BT-800، متفوقًا على جميع التكوينات والنماذج الأخرى التي تم اختبارها، كما هو ملخص في الجداول 7 و8.

تم الإبلاغ عن مقاييس تقييم إضافية، بما في ذلك الدقة، والمساحة تحت المنحنى (AUC)، والقيمة التنبؤية الإيجابية (PPV)، ومعدل الإيجابيات الحقيقية (TPR)، ودرجة F1، في الجدول 9، مما يظهر قوة النموذج. ومن الجدير بالذكر أن نموذج DenseTransformer أظهر أداءً استثنائيًا عبر جميع المقاييس، بما في ذلك درجة Brier المنخفضة التي تشير إلى توقعات جيدة المعايرة. كما شملت التحليلات تمثيلات بصرية لمقاييس متنوعة مثل درجة كابا لكوهين، والحساسية، والنوعية، ومنحنى خصائص التشغيل المستقبلية (ROC)، والتي تؤكد مجتمعة قدرات النموذج الفائقة في التصنيف. يشير منحنى المعايرة الموضح في الشكل 20 إلى أن احتمالات النموذج المتوقعة موثوقة، مما يدعم فعاليته في مهام تصنيف الصور الطبية.

المناقشة

في قسم المناقشة من ورقة البحث، يقوم المؤلفون بتقييم نقدي لتطور منهجيات اكتشاف الأورام في صور MRI، مع تسليط الضوء على قيود تقنيات معالجة الصور الطبية التقليدية مثل تحديد العتبات واكتشاف الحواف. كانت هذه الطرق تكافح لتحديد الحدود المعقدة لأورام الدماغ بدقة بسبب الطبيعة المعقدة لتشريح الدماغ وشكل الورم. لقد عزز ظهور التعلم العميق (DL)، وخاصة الشبكات العصبية التلافيفية (CNNs)، بشكل كبير من دقة وكفاءة تحليل الصور الطبية. حققت الهياكل البارزة مثل VGG وResNet50 وInceptionV3 دقة تتجاوز 95%، مع وصول بعض النماذج الهجينة والأساليب المعززة بـ Transformer إلى حوالي 97%. لقد أظهر دمج Transformers وعدًا في نمذجة الاعتماديات المكانية المعقدة، على الرغم من أن CNNs لا تزال تنافسية، خاصة عندما يتم تحسينها باستخدام آليات الانتباه.

يقدم المؤلفون بنية DenseTransformer جديدة تتكامل بشكل متآزر بين CNNs وTransformers لمعالجة قيود الأساليب الحالية. يهدف هذا الإطار الهجين إلى الاستفادة من نقاط القوة في كلا النموذجين: CNNs لاستخراج الميزات المحلية مكانيًا وTransformers لالتقاط الاعتماديات بعيدة المدى والعلاقات السياقية. من خلال القيام بذلك، يهدف النموذج المقترح إلى تحسين دقة تصنيف أورام الدماغ والمساهمة في التقدم في التشخيصات الطبية. يؤكد المؤلفون على أهمية عملهم في تحسين النتائج الواقعية للمهنيين الصحيين والمرضى، مما يعالج في النهاية التحديات المتطورة في تشخيص أورام الدماغ.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-09311-5
PMID: https://pubmed.ncbi.nlm.nih.gov/40595383
Publication Date: 2025-07-01
Author(s): Soumyarashmi Panigrahi et al.
Primary Topic: Brain Tumor Detection and Classification

Overview

The research paper presents a novel hybrid model, termed DenseTransformer, aimed at enhancing the classification accuracy of brain tumors using Magnetic Resonance Imaging (MRI). The model integrates features from the pre-trained DenseNet201 Convolutional Neural Network (CNN) with a Transformer architecture, specifically employing Multi-Head Self-Attention (MHSA) and Squeeze-and-Excitation Attention (SEA) blocks. This approach addresses challenges related to computational intensity, detail detection, and noise sensitivity, achieving a remarkable classification accuracy of 99.41% on the Br35H dataset of 3,000 MRI images. The model’s reliability is further supported by comprehensive statistical analyses, including Cohen’s Kappa Score, AUC Score, and F1-score assessments.

In conclusion, the DenseTransformer model demonstrates superior performance compared to other pre-trained CNN models, such as VGG19 and InceptionV3, in binary classification tasks. While the model shows promise for clinical applications, limitations such as dataset size and variability in MRI quality may affect its generalizability. Future work will focus on integrating clinical data with MRI images, exploring additional Explainable AI (XAI) techniques, and adapting the model for multi-class classification and segmentation tasks. The study emphasizes the potential of hybrid Deep Learning models in improving brain tumor diagnosis and patient outcomes, while also acknowledging the need for expert-verified annotations to validate the model’s findings further.

Methods

In this section, the authors introduce a novel hybrid architecture, termed DenseTransformer, which integrates the strengths of Convolutional Neural Networks (CNN) and Transformers for the purpose of accurately forecasting MRI-based diagnoses of brain tumors and healthy brains. The proposed methodology is visually represented in a diagram (Fig. 4), illustrating the architecture’s design.

The implementation details of the DenseTransformer model are thoroughly discussed, including empirical results and statistical metrics that evaluate its efficacy. The authors benchmark their model against state-of-the-art methodologies and pre-trained models to contextualize its performance within the existing landscape. Additionally, ablation studies are conducted to assess the contributions of individual components of the hybrid model, offering insights into its overall performance and robustness.

Results

The results section details the performance evaluation of the DenseTransformer hybrid model for classifying brain MRI images of tumors versus normal brains. The model’s effectiveness was validated through various experimental configurations, with the feature reshaping configuration of (16,32) selected for its optimal balance of sequence length and computational efficiency. This configuration achieved the highest accuracy of 99.41% when trained on the BT-800 dataset, outperforming all other tested configurations and models, as summarized in Tables 7 and 8.

Further evaluation metrics, including accuracy, area under the curve (AUC), positive predictive value (PPV), true positive rate (TPR), and F1 score, were reported in Table 9, demonstrating the model’s robustness. Notably, the DenseTransformer model exhibited exceptional performance across all metrics, including a low Brier score indicating well-calibrated predictions. The analysis also included visual representations of various metrics such as Cohen’s Kappa Score, sensitivity, specificity, and the Receiver Operating Characteristic (ROC) curve, which collectively affirm the model’s superior classification capabilities. The calibration curve illustrated in Figure 20 indicates that the model’s predicted probabilities are reliable, further supporting its efficacy in medical image classification tasks.

Discussion

In the discussion section of the research paper, the authors critically evaluate the evolution of tumor detection methodologies in MRI images, highlighting the limitations of traditional medical image processing techniques such as thresholding and edge detection. These methods struggled to accurately delineate the complex boundaries of brain tumors due to the intricate nature of brain anatomy and tumor morphology. The advent of deep learning (DL), particularly convolutional neural networks (CNNs), has significantly enhanced the accuracy and efficiency of medical image analysis. Notable architectures like VGG, ResNet50, and InceptionV3 have achieved accuracies exceeding 95%, with some hybrid models and Transformer-enhanced approaches reaching around 97%. The integration of Transformers has shown promise in modeling complex spatial dependencies, although CNNs remain competitive, especially when optimized with attention mechanisms.

The authors introduce a novel DenseTransformer architecture that synergistically combines CNNs and Transformers to address the limitations of existing methods. This hybrid framework aims to leverage the strengths of both paradigms: CNNs for extracting spatially localized features and Transformers for capturing long-range dependencies and contextual relationships. By doing so, the proposed model aims to improve brain tumor classification accuracy and contribute to advancements in medical diagnostics. The authors emphasize the importance of their work in enhancing real-world outcomes for healthcare professionals and patients, ultimately addressing the evolving challenges in brain tumor diagnostics.