تقسيم أورام الدماغ باستخدام شبكة U-Net متعددة المقاييس مع مشفر EfficientNetB4 لتحليل الرنين المغناطيسي المحسن Brain tumor segmentation using multi-scale attention U-Net with EfficientNetB4 encoder for enhanced MRI analysis

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-94267-9
PMID: https://pubmed.ncbi.nlm.nih.gov/40121246
تاريخ النشر: 2025-03-22
المؤلف: R. Preetha وآخرون
الموضوع الرئيسي: كشف وتصنيف أورام الدماغ

نظرة عامة

تقدم هذه الدراسة إطار عمل جديد لتقسيم أورام الدماغ يدمج بين شبكة U-Net متعددة المقاييس مع مشفر EfficientNetB4، بهدف تحسين دقة وكفاءة التقسيم. يستخدم النموذج توسيع EfficientNetB4 المركب لتحسين استخراج الميزات عبر دقات متعددة مع تقليل المتطلبات الحسابية. يعزز آلية الانتباه متعددة المقاييس، التي تستخدم نوى بأحجام 1 × 1، 3 × 3، و5 × 5، تمثيل الميزات من خلال التقاط حدود الورم بفعالية على مقاييس مختلفة. تتضمن البنية اتصالات تخطي معززة بالانتباه وكتل انتباه متبقية لقمع المناطق غير ذات الصلة وتحسين تحديد موقع الورم.

تظهر التجارب الواسعة التي أجريت على مجموعة بيانات أورام الدماغ من Figshare أن النموذج المقترح يتفوق على متغيرات EfficientNet الأخرى وطرق التقسيم الحديثة، محققًا دقة تصل إلى 99.79%، ومعدل تصنيف خاطئ (MCR) يبلغ 0.21%، ومعامل Dice يبلغ 0.9339، وتداخل على اتحاد (IoU) يبلغ 0.8795. أظهر عملية التدريب تقاربًا مستقرًا عبر مقاييس رئيسية، بما في ذلك الدقة، والاسترجاع، والخصوصية، ومتوسط IoU. تؤكد هذه النتائج على قوة النموذج وفعاليته في تحديد مناطق الورم بدقة، مما يجعله أداة قيمة للتطبيقات السريرية في تشخيص أورام الدماغ وتخطيط العلاج، بينما يضع أيضًا الأساس للبحث المستقبلي في تصنيف الأورام وتدرجها.

الطرق

في قسم الطرق، توضح الدراسة إعدادها التجريبي، الذي يتضمن مجموعة بيانات مختارة بعناية من صور MRI للدماغ عالية الجودة، الضرورية للتدريب والتقييم الفعال. تحدد تفاصيل التنفيذ بيئة البرمجيات، والمكتبات، والأجهزة المستخدمة، بينما يتم تعريف مقاييس تقييم الأداء مثل معامل Dice، وتداخل على اتحاد (IoU)، والاسترجاع، والدقة لتقييم فعالية النموذج. يتم توضيح هذه المكونات في الأقسام التالية لتوضيح سير العمل التجريبي.

تم تقييم النموذج المقترح، EfficientNetB4 مع شبكة U-Net متعددة المقاييس، مقارنةً بالطرق الحديثة باستخدام مجموعة بيانات أورام الدماغ من Figshare. تشير النتائج إلى أن الطريقة المقترحة تفوقت على الأساليب الحالية، محققة دقة تصل إلى 99.79% ومعامل Dice يبلغ 0.9339، متجاوزة أفضل طريقة أداء من Akshya Kumar Sahoo وآخرين. بالإضافة إلى ذلك، أظهرت IoU أعلى يبلغ 0.8795 مقارنةً بالطرق الأخرى، مما يبرز فعالية آلية الانتباه متعددة المقاييس في تحديد حدود الورم بدقة. تؤكد دقة النموذج البالغة 0.9657 واسترجاعه البالغ 0.9103 قدرته في تقسيم الأورام، مع متوسط IoU يبلغ 0.9121 مما يشير إلى تعميم قوي عبر أشكال الأورام المختلفة. تشير النتائج إلى أن الطريقة المقترحة لا تعزز فقط جودة التقسيم ولكنها تحمل أيضًا وعدًا للتطبيقات السريرية، مع تحقيق توازن بين دقة عالية ووقت استدلال متوسط يبلغ 24 مللي ثانية لكل صورة، مما يجعلها مناسبة للاستخدام في الوقت الحقيقي. يمكن أن تستكشف الأبحاث المستقبلية قابليتها للتكيف مع أوضاع التصوير الأخرى وتحقق من أدائها على مجموعات بيانات متعددة المؤسسات.

النتائج

في هذا القسم، يتم تقديم نتائج التجربة التي تقيم أداء EfficientNetB4 المقترح مع شبكة U-Net متعددة المقاييس لتقسيم أورام الدماغ. تسلط مقارنة شاملة بين متغيرات EfficientNet المختلفة الضوء على مساهماتها في أداء التقسيم. يتم فحص مقاييس التدريب للنموذج المقترح لتأكيد استقراره وفعاليته. بالإضافة إلى ذلك، تظهر النتائج أن الطريقة المقترحة تتفوق على الأساليب الحديثة من حيث دقة التقسيم ومقاييس حيوية أخرى.

تؤكد الدراسة على أهمية التقسيم على التصنيف في تحليل أورام الدماغ لعدة أسباب. يعد التقسيم الدقيق ضروريًا لتحديد موقع الورم بدقة، مما يؤثر بشكل كبير على تخطيط العلاج والإرشاد الجراحي. علاوة على ذلك، تعالج الطريقة المقترحة بفعالية التحديات المرتبطة بالأورام التي لها حدود غير منتظمة أو كثافات متداخلة، والتي غالبًا ما يتم تصنيفها بشكل خاطئ في طرق التصنيف المباشر. كما تضع نتائج التقسيم الأساس للبحث المستقبلي في تصنيف الأورام أو تدرجها، مما قد يعزز دقة ووضوح النتائج.

المناقشة

تسلط قسم المناقشة في ورقة البحث الضوء على التقدم الكبير في تقسيم أورام الدماغ، مع التركيز على تطور هياكل التعلم العميق (DL) من النماذج المبكرة مثل الشبكات الالتفافية الكاملة (FCNs) إلى أطر أكثر تعقيدًا مثل U-Net ومشتقاتها. تشمل المساهمات الرئيسية تقديم U-Net من قبل Ronneberger وآخرين، الذي أسس معيارًا لتقسيم الصور الطبية من خلال هيكله المشفر-المفكك مع اتصالات تخطي. استكشفت دراسات مختلفة الابتكارات المعمارية، مثل دمج آليات الانتباه والتعلم المتبقي، لتعزيز دقة التقسيم ومعالجة التحديات مثل الكثافات المتداخلة وحدود الورم غير الواضحة. تشمل النماذج البارزة RR-U-Net، الذي يدمج وحدات متبقية، والنهج الهجينة التي تجمع بين التقسيم ومهام التصنيف، مما يظهر أداءً محسنًا عبر مجموعات بيانات متعددة.

على الرغم من هذه التقدمات، لا تزال هناك تحديات، بما في ذلك الحاجة إلى هياكل خفيفة الوزن لتحقيق توازن بين الأداء والكفاءة الحسابية، بالإضافة إلى دمج بيانات التصوير متعددة الأوضاع. تؤكد الورقة على إمكانيات التعلم المنقول والهياكل المخصصة لتحسين دقة التقسيم، مع الإشارة أيضًا إلى القيود التي تفرضها مجموعات البيانات المعلّمة غير الكافية والمتطلبات الحسابية العالية. يهدف النموذج المقترح في هذه الدراسة، الذي يجمع بين مشفر EfficientNetB4 ومفكك شبكة U-Net متعددة المقاييس، إلى معالجة هذه التحديات من خلال الاستفادة من استخراج الميزات الفعال وآليات الانتباه المتقدمة لتحقيق تقسيم قوي لأورام الدماغ من صور MRI. يتم وضع هذا النهج المبتكر كخطوة كبيرة إلى الأمام في هذا المجال، مما يعزز دقة وكفاءة أدوات التشخيص الآلي في البيئات السريرية.

القيود

تقدم الدراسة عدة قيود قد تؤثر على قابلية تطبيق النموذج المقترح وموثوقيته. أولاً، تقتصر مقارنات وقت الاستدلال مع النماذج الحديثة الأخرى على نقص المعايير القياسية، مما يجعل من الصعب وضع النموذج مقارنةً بالهياكل المعاصرة بخلاف اتساقه مع نماذج EfficientNet في تقسيم الصور الطبية. ثانيًا، تم إجراء تقييم أداء النموذج فقط على مجموعة بيانات أورام الدماغ من Figshare، والتي قد لا تشمل التنوع الكامل للسيناريوهات السريرية، مما يثير القلق بشأن تعميمه على مجموعات بيانات أو أوضاع تصوير أخرى.

بالإضافة إلى ذلك، تم تصميم النموذج حاليًا لمهام التقسيم الثنائي، ولا تزال فعاليته في تقسيم الفئات المتعددة – مثل التمييز بين أنواع الأورام المختلفة – غير مختبرة. أخيرًا، على الرغم من أن النموذج حقق دقة عالية، إلا أن جوانب التفسير والتحقق السريري لم يتم فحصها بدقة، مما قد يعيق دمجه في سير العمل السريري حيث يكون فهم قرارات النموذج أمرًا ضروريًا للممارسين.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-94267-9
PMID: https://pubmed.ncbi.nlm.nih.gov/40121246
Publication Date: 2025-03-22
Author(s): R. Preetha et al.
Primary Topic: Brain Tumor Detection and Classification

Overview

This study presents a novel brain tumor segmentation framework that integrates Multiscale Attention U-Net with the EfficientNetB4 encoder, aimed at improving segmentation accuracy and efficiency. The model utilizes EfficientNetB4’s compound scaling to optimize feature extraction across multiple resolutions while minimizing computational demands. A Multi-Scale Attention Mechanism, employing kernels of sizes 1 × 1, 3 × 3, and 5 × 5, enhances feature representation by effectively capturing tumor boundaries at various scales. The architecture incorporates attention-enhanced skip connections and residual attention blocks to suppress irrelevant regions and improve tumor localization.

Extensive experiments conducted on the Figshare brain tumor dataset reveal that the proposed model outperforms other EfficientNet variants and state-of-the-art segmentation methods, achieving an accuracy of 99.79%, a misclassification rate (MCR) of 0.21%, a Dice Coefficient of 0.9339, and an Intersection over Union (IoU) of 0.8795. The training process demonstrated stable convergence across key metrics, including precision, recall, specificity, and mean IoU. These results underscore the model’s robustness and effectiveness in accurately delineating tumor regions, positioning it as a valuable tool for clinical applications in brain tumor diagnosis and treatment planning, while also laying the groundwork for future research in tumor classification and grading.

Methods

In the Methods section, the study outlines its experimental setup, which includes a carefully selected dataset of high-quality brain MRI images, essential for effective training and validation. The implementation details specify the software environment, libraries, and hardware used, while performance evaluation metrics such as the Dice Coefficient, Intersection over Union (IoU), recall, and precision are defined to assess the model’s efficacy. These components are elaborated upon in subsequent sections to clarify the experimental workflow.

The proposed model, EfficientNetB4 with Multiscale Attention U-Net, was benchmarked against state-of-the-art methods using the Figshare brain tumor dataset. Results indicate that the proposed method outperformed existing approaches, achieving an accuracy of 99.79% and a Dice Coefficient of 0.9339, surpassing the best-performing method by Akshya Kumar Sahoo et al. Additionally, it demonstrated a higher IoU of 0.8795 compared to other methods, underscoring the effectiveness of the multiscale attention mechanism in accurately delineating tumor boundaries. The model’s precision of 0.9657 and recall of 0.9103 further emphasize its capability in tumor segmentation, with a Mean IoU of 0.9121 indicating robust generalization across various tumor morphologies. The findings suggest that the proposed method not only enhances segmentation quality but also holds promise for clinical applications, balancing high accuracy with an average inference time of 24 ms per image, suitable for real-time use. Future research could explore its adaptability to other imaging modalities and validate its performance on multi-institutional datasets.

Results

In this section, the results of the experiment evaluating the performance of the proposed EfficientNetB4 combined with Multiscale Attention U-Net for brain tumor segmentation are presented. A comprehensive comparison among various EfficientNet variants highlights their respective contributions to segmentation performance. The training metrics of the proposed model are scrutinized to confirm its stability and effectiveness. Additionally, the results demonstrate that the proposed method outperforms state-of-the-art approaches in terms of segmentation accuracy and other critical metrics.

The study emphasizes the importance of segmentation over classification in brain tumor analysis for several reasons. Accurate segmentation is essential for precise tumor localization, which significantly influences treatment planning and surgical guidance. Furthermore, the proposed method effectively addresses challenges associated with tumors that have irregular boundaries or overlapping intensities, which are often misclassified in direct classification methods. The segmentation outcomes also lay the groundwork for future research in tumor classification or grading, potentially enhancing the accuracy and interpretability of results.

Discussion

The discussion section of the research paper highlights the significant advancements in brain tumor segmentation, emphasizing the evolution of deep learning (DL) architectures from early models like Fully Convolutional Networks (FCNs) to more sophisticated frameworks such as U-Net and its derivatives. Key contributions include the introduction of U-Net by Ronneberger et al., which established a standard for biomedical image segmentation through its encoder-decoder architecture with skip connections. Various studies have explored architectural innovations, such as the integration of attention mechanisms and residual learning, to enhance segmentation accuracy and address challenges like overlapping intensities and unclear tumor boundaries. Notable models include the RR-U-Net, which incorporates residual modules, and hybrid approaches that combine segmentation with classification tasks, demonstrating improved performance across multiple datasets.

Despite these advancements, challenges remain, including the need for lightweight architectures to balance performance and computational efficiency, as well as the integration of multimodal imaging data. The paper emphasizes the potential of transfer learning and customized architectures to improve segmentation accuracy, while also noting the limitations posed by insufficient annotated datasets and high computational demands. The proposed model in this study, which combines an EfficientNetB4 encoder with a Multi-Scale Attention U-Net decoder, aims to address these challenges by leveraging efficient feature extraction and advanced attention mechanisms to achieve robust segmentation of brain tumors from MRI images. This innovative approach is positioned as a significant advancement in the field, enhancing the accuracy and efficiency of automated diagnostic tools in clinical settings.

Limitations

The study presents several limitations that may affect the applicability and robustness of the proposed model. Firstly, the inference time comparisons with other state-of-the-art models are constrained by the lack of standardized benchmarks, making it difficult to position the model against contemporary architectures beyond its consistency with EfficientNet-based models in medical image segmentation. Secondly, the validation of the model’s performance was conducted solely on the Figshare brain tumor dataset, which may not encompass the full diversity of clinical scenarios, thereby raising concerns about its generalization to other datasets or imaging modalities.

Additionally, the model is currently tailored for binary segmentation tasks, and its efficacy in multiclass segmentation—such as differentiating between various tumor types—remains untested. Lastly, while the model achieved high accuracy, the aspects of interpretability and clinical validation were not thoroughly examined, which could hinder its integration into clinical workflows where understanding model decisions is essential for practitioners.