M-C&M-BL: نموذج تصنيف جديد لتصنيف أورام الدماغ: متعدد CNN ومتعدد BiLSTM M-C&M-BL: a novel classification model for brain tumor classification: multi-CNN and multi-BiLSTM

المجلة: The Journal of Supercomputing، المجلد: 81، العدد: 3
DOI: https://doi.org/10.1007/s11227-025-06964-x
تاريخ النشر: 2025-02-14
المؤلف: Muhammet Sinan Başarslan
الموضوع الرئيسي: كشف وتصنيف أورام الدماغ

نظرة عامة

تقدم الدراسة نهجًا جديدًا للكشف عن أورام الدماغ باستخدام نموذج تعلم عميق، M-C&M-BL، الذي يجمع بين شبكة عصبية تلافيفية (CNN) لاستخراج ميزات الصورة مع شبكة ذاكرة طويلة وقصيرة المدى ثنائية الاتجاه (BiLSTM) لمعالجة البيانات المتسلسلة. تم تقييم النموذج على مجموعة بيانات تصوير الرنين المغناطيسي للدماغ Br35H وحقق مقاييس أداء مثيرة للإعجاب، بما في ذلك دقة 99.33% ودرجة F1 تبلغ 99.35%. تتجاوز هذه الأداءات تلك الخاصة بالنماذج الحالية المعتمدة على CNN، مثل BMRI-Net وAlexNet، وتتنافس مع MobileNetv2، الذي سجل دقة أعلى قليلاً تبلغ 99.67%.

تشير النتائج إلى أن النموذج المقترح لديه إمكانات كبيرة للتكامل في أنظمة دعم القرار السريري ومنصات التشخيص، مما يعزز التشخيص المبكر ويحسن دقة التشخيص مع تقليل وقت التقييم. ومع ذلك، تعترف الدراسة أيضًا بالتحديات المتعلقة بالخصوصية، والعمومية عبر مجموعات بيانات متنوعة، وقيود البنية التحتية التي يجب معالجتها من أجل نشر فعال. بشكل عام، تؤكد هذه الأبحاث على وعد دمج الهياكل المتقدمة للتعلم العميق لتعزيز أدوات الرعاية الصحية المدفوعة بالذكاء الاصطناعي، بهدف تحسين سير العمل السريري ونتائج المرضى.

مقدمة

تتناول مقدمة هذه الورقة البحثية القضية الحرجة لأورام الدماغ، التي تعتبر عدوانية وشائعة بين الأطفال والبالغين، وتشكل جزءًا كبيرًا من أورام الجهاز العصبي المركزي الأولية. مع حوالي 11,700 تشخيص جديد سنويًا، يتم التأكيد على أهمية الكشف المبكر والدقيق، خاصة من خلال التصوير بالرنين المغناطيسي، الذي، على الرغم من فعاليته، ينتج كميات هائلة من البيانات التي تتحدى عمليات المراجعة اليدوية. يمكن أن تؤدي هذه التعقيدات إلى أخطاء تشخيصية، مما يبرز ضرورة وجود أدوات تحليل آلي تستفيد من التقدم في الذكاء الاصطناعي (AI) لتعزيز دقة التشخيص وتخطيط العلاج.

تقدم الدراسة بنية هجينة جديدة للتعلم العميق، تُسمى M-C&M-BL (Multi-CNN وMulti-BiLSTM)، مصممة لتصنيف أورام الدماغ من صور الرنين المغناطيسي. تدمج هذه البنية بين عدة شبكات عصبية تلافيفية (CNNs) لاستخراج الميزات المكانية مع عدة شبكات ذاكرة طويلة وقصيرة المدى ثنائية الاتجاه (BiLSTM) لنمذجة الاعتماديات التسلسلية. من خلال دمج هذه الأساليب، يلتقط نموذج M-C&M-BL بشكل فعال كل من الميزات المكانية المحلية والاعتماديات بعيدة المدى، مما يؤدي إلى تحسين أداء التصنيف مقارنة بالنماذج التقليدية ذات الطبقة الواحدة. توضح الورقة المنهجية، بما في ذلك تحسين المعلمات الفائقة وتقييم مقاييس الأداء المختلفة، بينما تتناول أيضًا تحديات الكفاءة الحسابية والتكيف الزائد في تطبيقات التعلم العميق. ستفصل الأقسام اللاحقة من الورقة مراجعة الأدبيات، ومصادر البيانات، والمنهجيات، والنتائج التجريبية، وتقييمًا شاملاً لأداء النموذج المقترح.

طرق

في هذه الدراسة، استخدم المؤلفون اثنين من هياكل التعلم العميق، الشبكات العصبية التلافيفية (CNN) وذاكرة طويلة وقصيرة المدى ثنائية الاتجاه (BiLSTM)، لتطوير نماذج ذات طبقة واحدة وطبقات ثلاثية لمهام التصنيف. تم تقديم نموذج مركب جديد، يُشار إليه باسم M-C&M-BL، يجمع بين CNN وBiLSTM مع ثلاث طبقات. تم إجراء التجارب على Google Colab، باستخدام KerasTuner لتحسين المعلمات الفائقة، والتي شملت ضبط أحجام الدفعات، وأحجام نواة CNN، والوحدات. تم فصل مجموعات بيانات التدريب والاختبار باستخدام استراتيجية الاحتفاظ بنسبة 80%-20%، مع تنفيذ إنهاء مبكر واستدعاءات تقليل معدل التعلم للتخفيف من التكيف الزائد.

أشارت النتائج إلى أن زيادة عدد الطبقات في كل من هياكل CNN وBiLSTM أثرت بشكل إيجابي على أداء النموذج، حيث حقق نموذج M-C&M-BL أعلى دقة عبر جميع المقاييس. على وجه التحديد، أدى إدخال الطبقات الثلاثية إلى تحسين بنسبة 2% في الدقة لـ CNN وتحسين بنسبة 1% لـ BiLSTM. تم توثيق المقاييس، بما في ذلك الدقة (Acc)، ودرجة F1، والدقة (Pre)، والاسترجاع (Rec)، والخصوصية (Spe)، ومعامل ارتباط ماثيو (MCC)، بشكل منهجي، مع تفوق نموذج M-C&M-BL على جميع التكوينات. كما أبرزت الدراسة أن النماذج لم تظهر أي علامات على التكيف الزائد، كما يتضح من استقرار مقاييس الأداء وخصائص منحنى ROC المواتية، مما يشير إلى حساسية عالية ومعدلات إيجابية كاذبة منخفضة.

مناقشة

في هذا القسم، تركز المناقشة على التقدم في تصنيف أورام الدماغ من خلال دمج الذكاء الاصطناعي (AI) وتقنيات التعلم العميق، مع التركيز بشكل خاص على فعالية الشبكات العصبية التلافيفية (CNN) وهياكل الذاكرة طويلة وقصيرة المدى ثنائية الاتجاه (BiLSTM). تكشف الأدبيات عن اتجاه كبير نحو استخدام طرق التعلم العميق للتشخيص الدقيق والسريع لأورام الدماغ، حيث أفادت العديد من الدراسات بمعدلات دقة عالية، غالبًا ما تتجاوز 98%. على سبيل المثال، حققت نماذج مثل BMRI-Net ومختلف هياكل CNN دقة تصل إلى 99.51% على مجموعة بيانات Br35H. تسلط التحليلات المقارنة لهذه الدراسات الضوء على مزايا الأساليب متعددة الطبقات والهجينة على النماذج ذات الطبقة الواحدة، التي قد تعاني من قيود في استخراج الميزات والصلابة ضد الضوضاء.

يهدف نموذج M-C&M-BL المقترح، الذي يجمع بين CNN وBiLSTM، إلى الاستفادة من نقاط القوة في كلا الهيكلين لتعزيز أداء التصنيف. أظهر هذا النموذج نتائج تنافسية، حيث حقق دقة تزيد عن 99% على مجموعة بيانات Br35H، مما يبرز إمكانيته في الاستخدام في البيئات السريرية للكشف المبكر عن الأورام. تضمن الطبيعة المتوازنة لمجموعة البيانات المستخدمة في هذه الدراسة، التي تتكون من 3,000 صورة MRI، أن أداء النموذج ليس متحيزًا، مما يوفر أساسًا موثوقًا لمزيد من البحث والتطبيق في أدوات التشخيص المدفوعة بالذكاء الاصطناعي. تشير النتائج إلى أن دمج تقنيات التعلم العميق المتقدمة يمكن أن يحسن بشكل كبير من دقة وكفاءة تصنيف أورام الدماغ، مما يساعد في النهاية المهنيين الصحيين في اتخاذ قرارات مستنيرة.

Journal: The Journal of Supercomputing, Volume: 81, Issue: 3
DOI: https://doi.org/10.1007/s11227-025-06964-x
Publication Date: 2025-02-14
Author(s): Muhammet Sinan Başarslan
Primary Topic: Brain Tumor Detection and Classification

Overview

The study presents a novel approach for detecting brain tumors using a deep learning model, M-C&M-BL, which combines a Convolutional Neural Network (CNN) for image feature extraction with a Bidirectional Long Short-Term Memory (BiLSTM) Network for processing sequential data. The model was evaluated on the Br35H brain MRI dataset and achieved impressive performance metrics, including an accuracy of 99.33% and an F1 score of 99.35%. This performance surpasses that of existing CNN-based models, such as BMRI-Net and AlexNet, and is competitive with MobileNetv2, which recorded a slightly higher accuracy of 99.67%.

The findings suggest that the proposed model has significant potential for integration into clinical decision support systems and diagnostic platforms, enhancing early diagnosis and improving diagnostic accuracy while reducing evaluation time. However, the study also acknowledges challenges related to privacy, generalizability across diverse datasets, and infrastructure constraints that must be addressed for effective deployment. Overall, this research underscores the promise of combining advanced deep learning architectures to enhance AI-driven healthcare tools, ultimately aiming to improve clinical workflows and patient outcomes.

Introduction

The introduction of this research paper addresses the critical issue of brain tumors, which are aggressive and prevalent in both children and adults, constituting a significant portion of primary central nervous system tumors. With approximately 11,700 new diagnoses annually, the importance of early and accurate detection is emphasized, particularly through MRI, which, despite its effectiveness, generates vast amounts of data that challenge manual review processes. This complexity can lead to diagnostic errors, underscoring the necessity for automated analysis tools that leverage advancements in artificial intelligence (AI) to enhance diagnostic accuracy and treatment planning.

The study introduces a novel hybrid deep learning architecture, termed M-C&M-BL (Multi-CNN and Multi-BiLSTM), designed to classify brain tumors from MRI images. This architecture integrates multiple Convolutional Neural Networks (CNNs) for spatial feature extraction with multiple Bidirectional Long Short-Term Memory (BiLSTM) networks for modeling sequential dependencies. By combining these approaches, the M-C&M-BL model effectively captures both local spatial features and long-range dependencies, resulting in improved classification performance compared to traditional single-layer models. The paper outlines the methodology, including hyperparameter optimization and the evaluation of various performance metrics, while also addressing the challenges of computational efficiency and overfitting in deep learning applications. The subsequent sections of the paper will detail the literature review, data sources, methodologies, experimental results, and a comprehensive evaluation of the proposed model’s performance.

Methods

In this study, the authors employed two deep learning architectures, Convolutional Neural Networks (CNN) and Bidirectional Long Short-Term Memory (BiLSTM), to develop single-layer and triple-layer models for classification tasks. A novel combined model, referred to as M-C&M-BL, was also introduced, integrating both CNN and BiLSTM with triple layers. The experiments were conducted on Google Colab, utilizing KerasTuner for hyperparameter optimization, which included tuning batch sizes, CNN kernel sizes, and units. The training and testing datasets were separated using an 80%-20% holdout strategy, with early termination and learning rate reduction callbacks implemented to mitigate overfitting.

The results indicated that increasing the number of layers in both CNN and BiLSTM architectures positively impacted model performance, with the M-C&M-BL model achieving the highest accuracy across all metrics. Specifically, the introduction of triple layers resulted in a 2% improvement in accuracy for CNN and a 1% improvement for BiLSTM. The metrics, including accuracy (Acc), F1 score, precision (Pre), recall (Rec), specificity (Spe), and Matthews correlation coefficient (MCC), were systematically documented, with the M-C&M-BL model outperforming all configurations. The study also highlighted that the models exhibited no signs of overfitting, as evidenced by stable performance metrics and favorable ROC curve characteristics, indicating high sensitivity and low false positive rates.

Discussion

In this section, the discussion focuses on the advancements in brain tumor classification through the integration of artificial intelligence (AI) and deep learning techniques, particularly emphasizing the effectiveness of convolutional neural networks (CNN) and bi-directional long short-term memory (BiLSTM) architectures. The literature reveals a significant trend towards using deep learning methods for accurate and rapid diagnosis of brain tumors, with numerous studies reporting high accuracy rates, often exceeding 98%. For instance, models such as BMRI-Net and various CNN architectures have achieved accuracies of up to 99.51% on the Br35H dataset. The comparative analysis of these studies highlights the advantages of multi-layer and hybrid approaches over single-layer models, which may suffer from limitations in feature extraction and robustness against noise.

The proposed M-C&M-BL model, which combines CNN and BiLSTM, aims to leverage the strengths of both architectures to enhance classification performance. This model demonstrated competitive results, achieving over 99% accuracy on the Br35H dataset, underscoring its potential utility in clinical settings for early tumor detection. The balanced nature of the dataset used in this study, consisting of 3,000 MRI images, ensures that the model’s performance is not biased, thus providing a reliable foundation for further research and application in AI-driven diagnostic tools. The findings suggest that integrating advanced deep learning techniques can significantly improve the accuracy and efficiency of brain tumor classification, ultimately aiding healthcare professionals in making informed decisions.