نموذج ذكي لاكتشاف أورام الدماغ باستخدام شبكة هجين خفيفة الوزن من نوع التوأم الانتباهية الهرمية An intelligent brain tumor detection model using lightweight hybrid twin attentive pyramid convolutional network

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-23813-2
PMID: https://pubmed.ncbi.nlm.nih.gov/41249242
تاريخ النشر: 2025-11-17
المؤلف: S. Lincy Jemina وآخرون
الموضوع الرئيسي: كشف وتصنيف أورام الدماغ

نظرة عامة

تقدم هذه البحث نظامًا تلقائيًا لاكتشاف أورام الدماغ (BT) يظهر فعالية وموثوقية في تصنيف صور الرنين المغناطيسي إلى فئات الأورام وغير الأورام. يستخدم النموذج بنية هجينة خفيفة الوزن تعرف باسم LHTA-PCNet، والتي تعزز دقة التصنيف ضمن إطار HybLwDL. إن دمج نهج تصفية قائم على شبكة الخصومة التوليدية (GANF) يحسن جودة الصورة، مما يعزز بشكل كبير أداء الاكتشاف.

يتم تقييم فعالية المصنف باستخدام مجموعة بيانات اكتشاف BT 2020، حيث يحقق إطار HybLwDL دقة تصنيف مثيرة للإعجاب تبلغ 99.5%. علاوة على ذلك، يظهر حساسية بنسبة 100%، ودرجة F1 تبلغ 99.6%، ومعدلات أقصى خصوصية ودقة تبلغ 99%. ستركز الأعمال المستقبلية على توسيع إطار HybLwDL لتصنيف الأورام متعددة الفئات لتعزيز قابليته للتطبيق في التشخيصات الطبية. بالإضافة إلى ذلك، تم التخطيط لاختبارات التحقق والتجارب السريرية الشاملة لتأكيد كفاءة الإطار عبر مجموعات مرضى متنوعة.

طرق

تحدد المنهجية المقترحة نموذج HybLwDL المصمم لتصنيف أورام الدماغ (BT) باستخدام تقنيات التصوير المتقدمة. في البداية، يتم معالجة صور الرنين المغناطيسي مسبقًا باستخدام مرشح شبكة ثنائية الاتجاه غاوسي (GANF) لتحسين جودة الصورة. بعد ذلك، يتم تحليل هذه الصور باستخدام شبكة الالتفاف الهرمية المزدوجة الانتباه الخفيفة الوزن (LHTA-PCNet)، والتي تعتمد على بنية ResNet-50. يستبدل هذا النموذج الاختناقات القياسية بكتل شبكة الالتفاف عالية الأداء (HPCNet)، مما يمكّن من استخراج الميزات متعددة المقاييس من خلال الالتفافات المجمعة المتوسعة. يتم دمج وحدات الانتباه على مستوى مزدوج (TwinL-A) للتركيز على الميزات الحرجة عبر الأبعاد المكانية والقناة.

لتحسين القابلية للتفسير، يدمج أسلوب HybLwDL Grad-CAM، مما يسمح بتصور المناطق المهمة داخل صور الرنين المغناطيسي. يتم تقييم فعالية نموذج HybLwDL مقابل المنهجيات الموجودة، مما يظهر إمكانيته في تصنيف أورام الدماغ بدقة والمساعدة في اتخاذ قرارات علاجية مستقبلية. يوضح الإطار العام، كما هو موضح في الشكل 1، العملية الشاملة من معالجة الصور المسبقة إلى التصنيف والتصور، مع التأكيد على قدرات النموذج في تقليل الضوضاء، وتنقيح الميزات، واكتشاف الأورام بدقة.

نتائج

في هذا القسم، يتم تقديم نتائج التحليلات التجريبية التي تقارن نموذج HybLwDL مع النماذج الموجودة لاكتشاف أورام الدماغ. تستخدم الدراسة مجموعة بيانات اكتشاف أورام الدماغ 2020 وتحسب مؤشرات أداء متنوعة لتقييم فعالية تقنية HybLwDL مقابل نماذج مثل ResNet50 و VGG19 و EfficientNetB0 و CNN المعتمد على الانتباه (ACNN) و Vision Transformer (ViT). يتم تعزيز قابلية تفسير نموذج HybLwDL من خلال تطبيق Grad-CAM.

تم تحسين بنية HPCNet المقترحة لتحقيق توازن بين استخراج الميزات وكفاءة الحساب، باستخدام حجم نواة 3 × 3 لاكتشاف المعلومات المكانية المحلية بشكل فعال في صور الرنين المغناطيسي. يتضمن النموذج التواءات مجمعة لتقليل تعقيد الحساب مع تعزيز تنوع الميزات، ويستخدم التواءات موسعة (DilC) بمعدلات توسع متغيرة لالتقاط الميزات متعددة المقاييس. لتخفيف الإفراط في التكيف، خاصة في سياق مجموعات بيانات الصور الطبية المحدودة، يتم تطبيق تقنيات زيادة البيانات مثل التدوير، والانعكاس، وتحولات الشدة أثناء التدريب. يتضمن القسم أيضًا تفاصيل حول بنية النظام وقيم المعلمات الفائقة، إلى جانب وصف لمرحلة المعالجة المسبقة، التي تستخدم GBNF لتحسين جودة صور الرنين المغناطيسي قبل التصنيف.

مناقشة

في هذا القسم، يناقش البحث تطبيق مرشح شبكة ثنائية الاتجاه غاوسي (GBNF) لتحسين صور الرنين المغناطيسي، خاصة في سياق تصنيف أورام الدماغ (BT). يقلل GBNF بشكل فعال من الضوضاء مع الحفاظ على المعلومات الحرجة للحواف والبنية، متفوقًا على المرشحات التقليدية التي إما تضعف الحواف أو تعالج بشكل غير كافٍ العيوب الخاصة بالمقياس. من خلال استخدام تقنيات التنظيم المحلي والتحسين العالمي، يعزز GBNF تباين الصورة ونعومتها دون المساس بموثوقية حدود الورم، مما يوفر مدخلات أنظف لنماذج التصنيف ويحسن دقة التشخيص. يتم تفصيل الصيغ الرياضية لنواة الفضاء الغاوسي وعملية التصفية، مع التأكيد على أهمية المعلمات مثل الانحرافات المعيارية المكانية والنطاق لتحقيق نتائج مثالية.

يتناول القسم أيضًا تطوير شبكة الالتفاف الهرمية المزدوجة الانتباه الخفيفة الوزن (LHTA-PCNet) لتصنيف BT، والتي تدمج هيكل ResNet مع آليات الانتباه على مستوى مزدوج وشبكة الالتفاف الهرمية الهجينة. تم تصميم هذه البنية لاستخراج الميزات متعددة المقاييس بكفاءة وتحسين حساسية النموذج تجاه التغيرات العالمية والمحلية في مناطق الورم. يسمح استخدام آليات الانتباه بتركيز أكثر دقة على المناطق والميزات المهمة داخل صور الرنين المغناطيسي، مما يعزز قدرة النموذج على التمييز بين الأنسجة الورمية وغير الورمية. يتم تقييم أداء LHTA-PCNet مقابل النماذج الموجودة، مما يظهر دقة وقدرات تشخيصية متفوقة، مدعومة بتصورات مثل Grad-CAM التي توضح عملية اتخاذ القرار للنموذج.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-23813-2
PMID: https://pubmed.ncbi.nlm.nih.gov/41249242
Publication Date: 2025-11-17
Author(s): S. Lincy Jemina et al.
Primary Topic: Brain Tumor Detection and Classification

Overview

This research presents an automatic brain tumor (BT) detection system that demonstrates both effectiveness and reliability in classifying MRI images into tumor and non-tumor categories. The model employs a lightweight hybrid architecture known as LHTA-PCNet, which enhances classification accuracy within the HybLwDL framework. The integration of a Generative Adversarial Network-based Filtering (GANF) approach improves image quality, thereby significantly boosting detection performance.

The evaluation of the classifier’s effectiveness is conducted using the BT Detection 2020 dataset, where the HybLwDL framework achieves an impressive classification accuracy of 99.5%. Furthermore, it demonstrates a sensitivity of 100%, an F1-score of 99.6%, and maximum specificity and precision rates of 99%. Future work will focus on extending the HybLwDL framework to multi-class tumor classification to enhance its applicability in medical diagnostics. Additionally, validation tests and comprehensive clinical trials are planned to confirm the framework’s efficiency across diverse patient populations.

Methods

The proposed methodology outlines the HybLwDL model designed for brain tumor (BT) classification using advanced imaging techniques. Initially, MRI images are pre-processed with a Gaussian Bilateral Network Filter (GANF) to enhance image quality. Subsequently, these images are analyzed using the lightweight hybrid twin-attentive pyramid convolutional network (LHTA-PCNet), which is based on ResNet-50 architecture. This model replaces standard bottlenecks with high-performance convolutional network (HPCNet) blocks, enabling multi-scale feature extraction through dilated grouped convolutions. Twin-level attention modules (TwinL-A) are integrated to focus on critical features across both spatial and channel dimensions.

To enhance interpretability, the HybLwDL method incorporates Grad-CAM, allowing for visualization of significant regions within the MRI images. The effectiveness of the HybLwDL model is evaluated against existing methodologies, demonstrating its potential in accurately classifying brain tumors and aiding in future therapeutic decisions. The overall framework, as depicted in Figure 1, illustrates the comprehensive process from image pre-processing to classification and visualization, emphasizing the model’s capabilities in noise reduction, feature refinement, and precise tumor detection.

Results

In this section, the results of the experimental analyses comparing the HybLwDL model with existing models for brain tumor detection are presented. The study utilizes the Brain Tumor Detection 2020 dataset and computes various performance indicators to evaluate the effectiveness of the HybLwDL technique against models such as ResNet50, VGG19, EfficientNetB0, Attention-based CNN (ACNN), and Vision Transformer (ViT). The interpretability of the HybLwDL model is enhanced through the application of Grad-CAM.

The architecture of the proposed HPCNet is optimized to balance feature extraction and computational efficiency, employing a 3 × 3 kernel size for effective local spatial information detection in MRI images. The model incorporates grouped convolutions to reduce computational complexity while promoting feature diversity, and utilizes dilated convolutions (DilC) with varying dilation rates to capture multi-scale features. To mitigate overfitting, particularly in the context of limited medical image datasets, data augmentation techniques such as rotations, flipping, and intensity transformations are applied during training. The section also includes details on the system architecture and hyperparameter values, alongside a description of the pre-processing stage, which employs GBNF to enhance the quality of MRI images before classification.

Discussion

In this section, the research discusses the application of the Gaussian Bilateral Network Filter (GBNF) for enhancing MRI images, particularly in the context of brain tumor (BT) classification. The GBNF effectively reduces noise while preserving critical edge and structural information, outperforming traditional filters that either blur edges or inadequately address scale-specific artifacts. By employing local regularization and global optimization techniques, GBNF enhances image contrast and smoothness without compromising tumor boundary fidelity, thus providing cleaner inputs for classification models and improving diagnostic accuracy. The mathematical formulations for the Gaussian spatial kernel and the filtering process are detailed, emphasizing the importance of parameters such as spatial and range standard deviations in achieving optimal results.

The section further elaborates on the development of a lightweight hybrid twin-attentive pyramid convolutional network (LHTA-PCNet) for BT classification, which integrates a ResNet backbone with twin-level attention mechanisms and a hybrid pyramid convolution network. This architecture is designed to efficiently extract multi-scale features and improve the model’s sensitivity to both global and local changes in tumor areas. The use of attention mechanisms allows for a more refined focus on significant regions and features within the MRI images, enhancing the model’s ability to distinguish between tumor and non-tumor tissues. The performance of the LHTA-PCNet is evaluated against existing models, demonstrating superior accuracy and diagnostic capabilities, supported by visualizations such as Grad-CAM that elucidate the model’s decision-making process.