إطار عمل متتابع لتفسير أورام الدماغ باستخدام متغيرات Yolo A successive framework for brain tumor interpretation using Yolo variants

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-13155-4
PMID: https://pubmed.ncbi.nlm.nih.gov/40745471
تاريخ النشر: 2025-07-31
المؤلف: S. Priyadharshini وآخرون
الموضوع الرئيسي: كشف وتصنيف أورام الدماغ

نظرة عامة

تقدم ورقة البحث إطارًا جديدًا لتحديد وتقسيم أورام الدماغ في التصوير بالرنين المغناطيسي (MRI) باستخدام تقنيات التعلم العميق المتقدمة. مع الاعتراف بالتحديات التي يواجهها المتخصصون الطبيون في تقييم صور الرنين المغناطيسي يدويًا بسبب قيود الوقت والتباين، تقيم الدراسة أداء YOLOv9 و YOLOv10 و YOLOv11 على مجموعة بيانات أورام الدماغ من Figshare ومجموعة بيانات BraTS2020. تشمل خطوات المعالجة المسبقة المستخدمة تحويل اللوغاريتم لتطبيع الكثافة، وتساوي الهيستوغرام لتعزيز التباين، واستخراج منطقة الاهتمام (ROI) المعتمدة على الحواف.

من بين النماذج التي تم تقييمها، برز YOLOv11 كالأكثر فعالية، حيث حقق دقة تصنيف تبلغ 96.22% على BraTS2020 و 96.41% على Figshare، بالإضافة إلى درجة F1 تبلغ 0.990، واسترجاع قدره 0.984، ومتوسط دقة (mAP) عند 0.5 يبلغ 0.993، وmAP عبر [0.5:0.95] يبلغ 0.801. كما أظهر النموذج وقت استدلال سريع قدره 5.3 مللي ثانية، مما يدل على إمكانيته للتطبيقات السريرية في الوقت الحقيقي. تؤكد النتائج على قدرة الإطار على تعزيز دقة وسرعة تحديد الأورام، مما يجعله أداة واعدة لتحسين عمليات التشخيص في البيئات السريرية.

الطرق

تنطوي منهجية البحث على نهج منظم لتحديد الأورام المعتمدة على الرنين المغناطيسي باستخدام نموذج YOLO، كما هو موضح في الأشكال المرفقة. تبدأ العملية بمراجعة شاملة للأدبيات وتنتقل عبر جمع البيانات، والاختيار، والإعداد، والمعالجة المسبقة. تشمل تقنيات المعالجة المسبقة الرئيسية تحويل اللوغاريتم، وتساوي الهيستوغرام، وكشف حواف كاني، والتي تعتبر أساسية لتعزيز منطقة الاهتمام (ROI) في صور الرنين المغناطيسي. تشمل المخرجات النهائية للنموذج تحديد موقع الصندوق المحيط وتقسيم منطقة الورم، مما يظهر تأثير المعالجة المسبقة على جودة الصورة.

استخدمت الإعدادات التجريبية Google Colab Pro، مستفيدة من بنية تحتية سحابية قوية مع معالج Intel Xeon بسرعة 2.2 جيجاهرتز، وذاكرة وصول عشوائي سعة 64 جيجابايت، وGPU من نوع V100. تم تحسين نماذج YOLO (v9 و v10 و v11) لهذا البيئة، مع معلمات ومتطلبات حسابية متغيرة. تم تصميم YOLOv11، الذي يحتوي على 2.83 مليون معلمة و113 طبقة، لفحص الأورام في الوقت الحقيقي، بينما يوازن YOLOv10 (8.04 مليون معلمة) بين السرعة والدقة للتطبيقات السريرية. بالمقابل، يتفوق YOLOv9، الذي يحتوي على 27.63 مليون معلمة، في الدقة ولكنه يتطلب موارد حسابية كبيرة. استخدم التدريب مُحسّن SGD مع معلمات فرعية محددة، بما في ذلك معدل تعلم قدره 0.0001 وحجم دفعة قدره 32، محققًا تعميمًا مثاليًا خلال 50 دورة. يعتمد اختيار النموذج على الاحتياجات المحددة للدقة أو الأداء أو القابلية للتكيف في سيناريوهات الفحص.

النتائج

تسلط النتائج المقدمة الضوء على فعالية نماذج YOLO في الكشف عن أورام الدماغ، مع التأكيد على التقدم المحرز من خلال التقنيات الهجينة، والتوليف بدون علامات، ونماذج التصنيف القوية. يمثل إطار YOLOv11 المقترح هذه التقدمات من خلال تقديم أداء في الوقت الحقيقي دون المساس بالدقة.

بالإضافة إلى ذلك، تؤكد النتائج على أهمية طرق الكشف المبتكرة ونُهج التقسيم متعددة الأنماط، التي تدعو إلى دمج أوضاع التصوير المختلفة. هذا الدمج ضروري لتحسين النتائج التشخيصية في الكشف عن أورام الدماغ، مما يتماشى مع الاتجاهات المعاصرة في هذا المجال.

المناقشة

تسلط المناقشة الضوء على التقدم في خوارزميات التعلم العميق (DL) لتشخيص أورام الدماغ في الوقت الحقيقي، مع التركيز بشكل خاص على فعالية الشبكات العصبية التلافيفية (CNNs) ومتغيراتها. أظهرت دراسات مختلفة أن النماذج المدربة على صور الرنين المغناطيسي والأشعة المقطعية يمكن أن تحقق معدلات دقة عالية، مع نتائج ملحوظة مثل نموذج حقق دقة 96% على مجموعة بيانات BrainMRNet ونموذج CNN ثلاثي الأبعاد حقق دقة تصل إلى 98% عبر مجموعات بيانات متعددة. يؤكد البحث على أهمية النماذج الهجينة والهياكل المتقدمة، مثل YOLOv9 و YOLOv10 و YOLOv11، التي تدمج ميزات مثل الكشف بدون مرساة وآليات الانتباه متعددة المقاييس لتعزيز قدرات الكشف عن الأورام وتقسيمها.

تناقش الورقة أيضًا التحديات التي تطرحها عدم توازن الفئات في مجموعات بيانات التدريب، داعية إلى تقنيات مثل زيادة البيانات ودوال خسارة مع وزن الفئة لتحسين أداء النموذج عبر جميع أنواع الأورام. تُبرز خطوات المعالجة المسبقة، بما في ذلك تساوي الهيستوغرام وتحويل اللوغاريتم، كعوامل حاسمة لتعزيز وضوح مناطق الأورام في صور الرنين المغناطيسي. تشير النتائج إلى أن YOLOv11 يتفوق على أسلافه من حيث الدقة والاسترجاع ومتوسط الدقة (mAP)، محققًا mAP@0.5 قدره 0.993 ويظهر قدرات تعميم متفوقة. بشكل عام، تشير النتائج إلى أن النظام القائم على YOLOv11 المقترح مناسب تمامًا للتطبيقات السريرية في الوقت الحقيقي، حيث يوازن بشكل فعال بين السرعة والدقة وجودة التقسيم في تحليل أورام الدماغ.

القيود

بينما يظهر الإطار أداءً واعدًا، إلا أن لديه عدة قيود ملحوظة. أولاً، اعتمد تقييمه فقط على مجموعتين من البيانات المتاحة للجمهور، مما قد لا يعكس بشكل كافٍ التباين الموجود في البيانات السريرية الواقعية عبر مختلف السكان وظروف التصوير. ثانيًا، يقتصر النموذج على تحليل شرائح الرنين المغناطيسي ثنائية الأبعاد، مما يفتقر إلى القدرة على استخدام بيانات الحجم ثلاثية الأبعاد، والتي يمكن أن تعزز فهمه السياقي المكاني. علاوة على ذلك، لم يتم اختبار النموذج على أوضاع تصوير أخرى مثل الأشعة المقطعية أو PET، والتي تعتبر أساسية في البيئات التشخيصية متعددة التخصصات.

بالإضافة إلى ذلك، قد تتطلب تقنيات المعالجة المسبقة المستخدمة، على الرغم من فعاليتها في الرنين المغناطيسي للدماغ، تعديلات كبيرة عند تطبيقها على مناطق تشريحية أو أنواع أورام أخرى. أخيرًا، فإن قابلية تفسير النموذج وشرح قراراته محدودة، مما قد يعيق اعتماده في الممارسة السريرية حيث يكون فهم قرارات النموذج أمرًا حاسمًا.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-13155-4
PMID: https://pubmed.ncbi.nlm.nih.gov/40745471
Publication Date: 2025-07-31
Author(s): S. Priyadharshini et al.
Primary Topic: Brain Tumor Detection and Classification

Overview

The research paper presents a novel framework for the identification and segmentation of brain tumors in Magnetic Resonance Imaging (MRI) using advanced deep learning techniques. Recognizing the challenges faced by medical professionals in manually evaluating MRI images due to time constraints and variability, the study evaluates the performance of YOLOv9, YOLOv10, and YOLOv11 on the Figshare Brain Tumor dataset and the BraTS2020 dataset. The preprocessing steps employed include log transformation for intensity normalization, histogram equalization for contrast enhancement, and edge-based region of interest (ROI) extraction.

Among the models assessed, YOLOv11 emerged as the most effective, achieving a classification accuracy of 96.22% on BraTS2020 and 96.41% on Figshare, alongside an F1-score of 0.990, recall of 0.984, mean Average Precision (mAP) at 0.5 of 0.993, and mAP across [0.5:0.95] of 0.801. The model also demonstrated a rapid inference time of 5.3 ms, indicating its potential for real-time clinical applications. The findings underscore the framework’s capability to enhance tumor localization accuracy and speed, making it a promising tool for improving diagnostic processes in clinical settings.

Methods

The methodology of the research involves a structured approach to MRI-based tumor identification using a YOLO model, as illustrated in the accompanying figures. The process begins with a thorough literature review and progresses through data collection, selection, preparation, and preprocessing. Key preprocessing techniques include log transformation, histogram equalization, and Canny edge detection, which are essential for enhancing the region of interest (ROI) in MRI images. The final model output includes bounding box localization and tumor area segmentation, demonstrating the impact of preprocessing on image quality.

The experimental setup utilized Google Colab Pro, leveraging a robust cloud infrastructure with a 2.2 GHz Intel Xeon CPU, 64 GB RAM, and a V100 GPU. The YOLO models (v9, v10, and v11) were optimized for this environment, with varying parameters and computational demands. YOLOv11, featuring 2.83 million parameters and 113 layers, is tailored for real-time tumor screening, while YOLOv10 (8.04 million parameters) balances speed and accuracy for clinical applications. In contrast, YOLOv9, with 27.63 million parameters, excels in accuracy but requires significant computational resources. The training utilized the SGD optimizer with specific hyperparameters, including a learning rate of 0.0001 and a batch size of 32, achieving optimal generalization within 50 epochs. The choice of model depends on the specific needs for precision, performance, or adaptability in screening scenarios.

Results

The results presented highlight the effectiveness of the YOLO models in brain tumor detection, emphasizing the advancements achieved through hybrid techniques, label-free synthesis, and robust classification models. The proposed YOLOv11 framework exemplifies these advancements by delivering real-time performance without compromising accuracy.

Additionally, the findings underscore the significance of innovative detection methods and multimodal segmentation approaches, which advocate for the integration of various imaging modalities. This integration is crucial for enhancing diagnostic outcomes in brain tumor detection, aligning with contemporary trends in the field.

Discussion

The discussion highlights the advancements in deep learning (DL) algorithms for real-time brain tumor diagnosis, particularly emphasizing the efficacy of convolutional neural networks (CNNs) and their variants. Various studies have demonstrated that models trained on MRI and CT scans can achieve high accuracy rates, with notable results such as a model reaching 96% accuracy on the BrainMRNet dataset and a 3D CNN achieving up to 98% accuracy across multiple datasets. The research underscores the importance of hybrid models and advanced architectures, such as YOLOv9, YOLOv10, and YOLOv11, which integrate features like anchor-free detection and multi-scale attention mechanisms to enhance tumor detection and segmentation capabilities.

The paper also addresses the challenges posed by class imbalance in training datasets, advocating for techniques such as data augmentation and class-weighted loss functions to improve model performance across all tumor types. The preprocessing steps, including histogram equalization and log transformation, are highlighted as crucial for enhancing the visibility of tumor regions in MRI scans. The results indicate that YOLOv11 outperforms its predecessors in terms of precision, recall, and mean Average Precision (mAP), achieving a mAP@0.5 of 0.993 and demonstrating superior generalization capabilities. Overall, the findings suggest that the proposed YOLOv11-based system is well-suited for real-time clinical applications, effectively balancing speed, accuracy, and segmentation quality in brain tumor analysis.

Limitations

The framework, while demonstrating promising performance, has several notable limitations. Firstly, its evaluation relied solely on two publicly available datasets, which may not adequately capture the variability present in real-world clinical data across different populations and imaging conditions. Secondly, the model is restricted to analyzing 2D MRI slices, thereby lacking the ability to utilize 3D volumetric data, which could enhance its spatial contextual understanding. Furthermore, the model has not been tested on other imaging modalities such as CT or PET, which are essential in multi-disciplinary diagnostic environments.

Additionally, the preprocessing techniques employed, though effective for brain MRI, may necessitate considerable adjustments when applied to other anatomical regions or tumor types. Lastly, the model’s clinical interpretability and explainability are limited, which could hinder its adoption in clinical practice where understanding model decisions is crucial.