MUNet: إطار عمل جديد لتقسيم دقيق لأورام الدماغ يجمع بين شبكات UNet و mamba MUNet: a novel framework for accurate brain tumor segmentation combining UNet and mamba networks

المجلة: Frontiers in Computational Neuroscience، المجلد: 19
DOI: https://doi.org/10.3389/fncom.2025.1513059
PMID: https://pubmed.ncbi.nlm.nih.gov/39944950
تاريخ النشر: 2025-01-29
المؤلف: Lijuan Yang وآخرون
الموضوع الرئيسي: كشف وتصنيف أورام الدماغ

نظرة عامة

تقدم ورقة البحث MUNet، وهو إطار شبكة جديد مصمم لت segmentation الأورام الدماغية بكفاءة ودقة من خلال دمج نقاط القوة في UNet و Mamba. يتناول MUNet قيود النماذج الحالية، مثل Transformers والشبكات العصبية التلافيفية (CNNs)، من خلال دمج وحدة SD-SSM، التي تستخدم المسح الانتقائي ونمذجة الفضاء الحالى لالتقاط ميزات الصورة العالمية والمحلية بفعالية. تعزز هذه القدرة المزدوجة بشكل كبير دقة التقسيم. علاوة على ذلك، تم تقديم هيكل SD-Conv لتقليل تكرار الميزات دون زيادة معلمات النموذج، مما يحسن الكفاءة الحسابية.

تظهر النتائج التجريبية أن MUNet يحقق أداءً متفوقًا على مجموعات بيانات BraTS2020 و BraTS2018، مع تحسينات ملحوظة في قيم معامل تشابه Dice (DSC) ودرجات Hausdorff لتعزيز تقسيم الورم (ET) والورم الكلي (WT) ونواة الورم (TC). بالإضافة إلى ذلك، يظهر MUNet قدرات تعميم قوية عند اختباره على مجموعة بيانات LGG المستقلة، مما يؤكد فعاليته عبر سياقات التصوير الطبي المختلفة. تقترح الورقة أن البحث المستقبلي يمكن أن يستكشف تطبيق MUNet على أنماط تصوير أخرى واستراتيجيات تعلم متقدمة لتعزيز فائدته السريرية. الشيفرة الخاصة بـ MUNet متاحة للجمهور على https://github.com/Dalin/MUNet.

مقدمة

تناقش مقدمة الورقة التهديد الصحي الحرج الذي تمثله الأورام الدماغية، والتي يمكن أن تكون خبيثة أو حميدة وغالبًا ما تؤدي إلى إعاقات عصبية شديدة. يؤكد المؤلفون على أهمية التشخيص والعلاج المبكر والدقيق بسبب تباين الأورام ومواقعها التشريحية المعقدة. لقد مكنت التقدمات الأخيرة في التعلم العميق، وخاصة في تحليل الصور الطبية، من التقسيم التلقائي واكتشاف الأورام الدماغية باستخدام مجموعات بيانات MRI الكبيرة. من الجدير بالذكر أن الهياكل مثل UNet قد أظهرت نجاحًا كبيرًا في هذه المهام، متفوقة على الطرق اليدوية التقليدية من خلال التقاط خصائص الورم بدقة أكبر.

تسلط الورقة الضوء على التقدمات في نماذج التعلم العميق، وخاصة تلك المعتمدة على Transformers والشبكات العصبية التلافيفية (CNNs)، في تقسيم الصور. بينما تتفوق Transformers في التقاط الميزات العالمية، تواجه تحديات تتعلق بالتعقيد الحسابي ومتطلبات البيانات، بينما تكافح CNNs مع الارتباطات المكانية بعيدة المدى. يقدم المؤلفون شبكة Mamba كبديل واعد بسبب استخراج الميزات الفعال وتصميمها المعياري، الذي يعزز دمج الميزات متعددة المقاييس مع تقليل المتطلبات الحسابية. ومع ذلك، فإن تعقيد صور أورام الدماغ MRI يمثل تحديات للتقسيم الدقيق. لمعالجة هذه القضايا، يقترح المؤلفون إطار شبكة جديد، MUNet، الذي يدمج نقاط القوة في UNet و Mamba. يستخدم MUNet كتلة SD-SSM جديدة لالتقاط الميزات العالمية والمحلية، ويقدم هيكل SD-Conv فعال لتقليل التكرار، ويستخدم اتصالات تخطي للاحتفاظ بمعلومات متعددة المقاييس. بالإضافة إلى ذلك، يتم اقتراح دالة خسارة جديدة تجمع بين خسائر المIoU وDice وحدود لت优化 دقة التقسيم.

طرق

في هذا القسم، يحدد المؤلفون المنهجيات المستخدمة في دراستهم، مع التركيز على نماذج الفضاء الحالى (SSM) لنمذجة البيانات التسلسلية، وخاصة في المهام البصرية التي تتضمن تسلسلات الصور. يتميز إطار SSM بقدرته على التقاط الاعتماديات بعيدة المدى من خلال تحويل تسلسلات الإدخال إلى فضاء حالة مخفية ونمذجتها بشكل تكراري. يتضمن الإعداد التجريبي ثلاث مجموعات بيانات رئيسية: BraTS2020 و BraTS2018 ومجموعة بيانات تقسيم LGG المستمدة من أرشيف تصوير السرطان (TCIA)، والتي تشمل مجتمعة مجموعة متنوعة من صور MRI المستخدمة في مهام تقسيم الأورام الدماغية. تم إجراء التجارب على خادم عالي الأداء مزود بأجهزة وأدوات برمجية متقدمة، بما في ذلك PyTorch ومكتبات الحوسبة العلمية المختلفة، مما يضمن إمكانية إعادة الإنتاج والكفاءة.

تشمل مقاييس التقييم المستخدمة في الدراسة Kappa ومعامل تشابه Dice (DSC) والتقاطع على الاتحاد (IoU) والحساسية والدقة والخصوصية والدقة العامة والدقة المتوازنة، والمسافة Hausdorff95. توفر هذه المقاييس تقييمًا شاملاً لأداء النموذج في مهام التقسيم، حيث تركز Hausdorff95 بشكل خاص على المسافة المكانية بين الحدود المتوقعة والحدود الحقيقية، مما يوفر إطار تقييم قوي لتقسيم الصور الطبية. تشير النتائج إلى أن نموذج MUNet يتفوق بشكل كبير على النماذج الحالية على كل من مجموعات بيانات BraTS، محققًا درجات DSC أعلى وتقليل المسافات Hausdorff95، مما يدل على دقة متفوقة في تقسيم الورم وتحديد الحدود. تؤكد النتائج فعالية MUNet في التقاط أشكال الأورام المعقدة، مما يبرز إمكانياته للتحليل الدقيق في تطبيقات التصوير الطبي.

مناقشة

تقدم ورقة البحث MUNet، وهو إطار مبتكر لتقسيم الأورام الدماغية يدمج نقاط القوة في هياكل U-Net و Mamba. تشمل المساهمات الرئيسية تقديم كتلة SD-SSM، التي تستخدم المسح الانتقائي واستخراج الميزات ثنائية القناة لالتقاط الميزات متعددة المقاييس، مما يعزز دقة التقسيم. بالإضافة إلى ذلك، تم تصميم هيكل SD-Conv لضغط المعلومات الزائدة بين الميزات دون زيادة معلمات النموذج، مما يحسن الكفاءة الحسابية. كما تم اقتراح دالة خسارة جديدة تجمع بين خسارة المIoU المتوسطة، وخسارة Dice، وخسارة الحدود، مما يحسن أداء التقسيم من عدة جوانب.

تناقش الورقة تطور U-Net وتكيفاته، مثل TransUNet و SwinUNet، التي حسنت استخراج الميزات ونمذجة السياق العالمي. يميز MUNet نفسه من خلال تحقيق توازن فعال بين تمثيل الميزات المحلية والعالمية مع الحفاظ على تعقيد حسابي منخفض. تظهر النتائج التجريبية أن MUNet يتفوق على النماذج الحالية على مجموعات بيانات BraTS2020 و BraTS2018، محققًا تحسينات كبيرة في درجات Dice وتقليل المسافات Hausdorff، خاصة في دقة الحدود. تؤكد النتائج فعالية المكونات المقترحة ودالة الخسارة في معالجة تحديات تقسيم الأورام الدماغية، مما يمثل تقدمًا كبيرًا في هذا المجال.

القيود

تسلط قسم القيود في MUNet الضوء على عدة مجالات للتحسين في مهام تقسيم الأورام الدماغية. أولاً، بينما تدمج كتلة SD-SSM بشكل فعال الميزات العالمية والمحلية، فإن دقة النموذج تتناقص في وجود مناطق ورم معقدة أو غير متجانسة، حيث يمكن أن تعيق الحدود غير الواضحة والأشكال غير المنتظمة دقة التقسيم. يجب أن يهدف البحث المستقبلي إلى تعزيز آليات الكشف عن الحدود لمعالجة هذه التحديات بشكل أفضل.

ثانيًا، تحد طبيعة MUNet كصندوق أسود من قابليته للتفسير، وهو أمر حاسم للتطبيقات السريرية حيث يكون فهم عملية اتخاذ القرار للنموذج أمرًا أساسيًا. يمكن أن يركز العمل المستقبلي على تطوير عمليات تفكير شفافة أو وحدات تفسيرية لتوضيح خطوات القرار وتوزيعات الأوزان، مما يعزز الثقة بين المهنيين الطبيين. بالإضافة إلى ذلك، يواجه النموذج مشاكل الإفراط في التكيف، خاصة مع مجموعات البيانات المحدودة أو غير المتوازنة، مما قد يؤدي إلى تباينات بين الأداء في التدريب والأداء في العالم الحقيقي. للتخفيف من ذلك، يجب أن يأخذ البحث المستقبلي في الاعتبار زيادة تنوع مجموعة البيانات من خلال بيانات مشروحة إضافية أو تقنيات تعزيز البيانات، بالإضافة إلى استخدام طرق التنظيم والتعلم الانتقالي لتعزيز قدرات تعميم النموذج.

Journal: Frontiers in Computational Neuroscience, Volume: 19
DOI: https://doi.org/10.3389/fncom.2025.1513059
PMID: https://pubmed.ncbi.nlm.nih.gov/39944950
Publication Date: 2025-01-29
Author(s): Lijuan Yang et al.
Primary Topic: Brain Tumor Detection and Classification

Overview

The research paper introduces MUNet, a novel network framework designed for efficient and accurate brain tumor segmentation by integrating the strengths of UNet and Mamba. MUNet addresses the limitations of existing models, such as Transformers and Convolutional Neural Networks (CNNs), by incorporating the SD-SSM module, which employs selective scanning and state-space modeling to effectively capture both global and local image features. This dual capability significantly enhances segmentation accuracy. Furthermore, the SD-Conv structure is introduced to minimize feature redundancy without increasing model parameters, thereby improving computational efficiency.

Experimental results demonstrate that MUNet achieves superior performance on the BraTS2020 and BraTS2018 datasets, with notable improvements in Dice Similarity Coefficient (DSC) values and Hausdorff scores for enhancing tumor (ET), whole tumor (WT), and tumor core (TC) segmentation. Additionally, MUNet exhibits strong generalization capabilities when tested on the independent LGG dataset, affirming its effectiveness across various medical imaging contexts. The paper suggests that future research could explore the application of MUNet to other imaging modalities and advanced learning strategies to further enhance its clinical utility. The code for MUNet is publicly available at https://github.com/Dalin/MUNet.

Introduction

The introduction of the paper discusses the critical health threat posed by brain tumors, which can be either malignant or benign and often lead to severe neurological impairments. The authors emphasize the importance of early and accurate diagnosis and treatment due to the tumors’ heterogeneity and complex anatomical locations. Recent advancements in deep learning, particularly in medical image analysis, have enabled automatic segmentation and detection of brain tumors using large MRI datasets. Notably, architectures like UNet have shown significant success in these tasks, outperforming traditional manual methods by more accurately capturing tumor characteristics.

The paper highlights the advancements in deep learning models, particularly those based on Transformers and Convolutional Neural Networks (CNNs), in image segmentation. While Transformers excel at capturing global features, they face challenges related to computational complexity and data requirements, whereas CNNs struggle with long-range spatial correlations. The authors introduce the Mamba network as a promising alternative due to its efficient feature extraction and modular design, which enhances multi-scale feature fusion while reducing computational demands. However, the complexity of MRI brain tumor images presents challenges for accurate segmentation. To address these issues, the authors propose a novel network framework, MUNet, which integrates the strengths of UNet and Mamba. MUNet employs a new SD-SSM block for capturing both global and local features, introduces an efficient SD-Conv structure to minimize redundancy, and utilizes skip connections for multi-scale information retention. Additionally, a new loss function combining mIoU, Dice, and Boundary losses is proposed to optimize segmentation accuracy.

Methods

In this section, the authors outline the methodologies employed in their study, focusing on State Space Models (SSM) for modeling sequential data, particularly in visual tasks involving image sequences. The SSM framework is adept at capturing long-range dependencies by mapping input sequences into a hidden state space and modeling them recursively. The experimental setup involves three primary datasets: BraTS2020, BraTS2018, and an LGG segmentation dataset sourced from The Cancer Imaging Archive (TCIA), which collectively encompass a variety of MRI images used for brain tumor segmentation tasks. The experiments were conducted on a high-performance server equipped with advanced hardware and software tools, including PyTorch and various scientific computing libraries, ensuring reproducibility and efficiency.

The evaluation metrics employed in the study include Kappa, Dice Similarity Coefficient (DSC), Intersection over Union (IoU), Sensitivity, Precision, Specificity, Accuracy, Balanced Accuracy, and Hausdorff95 distance. These metrics provide a comprehensive assessment of the model’s performance in segmentation tasks, with Hausdorff95 specifically focusing on the spatial distance between predicted and ground truth boundaries, thus offering a robust evaluation framework for medical image segmentation. The results indicate that the MUNet model significantly outperforms existing models on both BraTS datasets, achieving higher DSC scores and reduced Hausdorff95 distances, thereby demonstrating superior accuracy in tumor segmentation and boundary delineation. The findings underscore MUNet’s effectiveness in capturing complex tumor morphologies, highlighting its potential for precise analysis in medical imaging applications.

Discussion

The research paper presents MUNet, an innovative framework for brain tumor segmentation that integrates the strengths of U-Net and Mamba architectures. Key contributions include the introduction of the SD-SSM Block, which employs selective scanning and dual-channel feature extraction to capture multi-scale features, thereby enhancing segmentation accuracy. Additionally, the SD-Conv structure is designed to compress redundant information between features without increasing model parameters, improving computational efficiency. A novel loss function combining mean Intersection over Union (mIoU) loss, Dice loss, and Boundary loss is also proposed, optimizing segmentation performance from multiple perspectives.

The paper discusses the evolution of U-Net and its adaptations, such as TransUNet and SwinUNet, which have improved feature extraction and global context modeling. MUNet distinguishes itself by effectively balancing local and global feature representation while maintaining low computational complexity. Experimental results demonstrate that MUNet outperforms existing models on the BraTS2020 and BraTS2018 datasets, achieving significant improvements in Dice scores and reductions in Hausdorff distances, particularly in boundary accuracy. The findings underscore the effectiveness of the proposed components and loss function in addressing the challenges of brain tumor segmentation, marking a substantial advancement in the field.

Limitations

The section on limitations of MUNet highlights several areas for improvement in brain tumor segmentation tasks. Firstly, while the SD-SSM block effectively integrates global and local features, the model’s accuracy diminishes in the presence of complex or heterogeneous tumor regions, where unclear boundaries and irregular shapes can hinder segmentation precision. Future research should aim to enhance boundary detection mechanisms to better address these challenges.

Secondly, MUNet’s black-box nature limits its interpretability, which is crucial for clinical applications where understanding the model’s decision-making process is essential. Future work could focus on developing transparent reasoning processes or explanation modules to clarify decision steps and weight distributions, thereby fostering trust among medical professionals. Additionally, the model faces overfitting issues, particularly with limited or imbalanced datasets, which can lead to discrepancies between training and real-world performance. To mitigate this, future research should consider increasing dataset diversity through additional annotated data or data augmentation techniques, as well as employing regularization methods and transfer learning to enhance the model’s generalization capabilities.