كشف أورام الدماغ من الصور والمقارنة مع طرق التعلم بالنقل وشبكة الأعصاب التلافيفية ذات الثلاث طبقات Brain tumor detection from images and comparison with transfer learning methods and 3-layer CNN

المجلة: Scientific Reports، المجلد: 14، العدد: 1
DOI: https://doi.org/10.1038/s41598-024-52823-9
PMID: https://pubmed.ncbi.nlm.nih.gov/38302604
تاريخ النشر: 2024-02-01
المؤلف: Mohammad Zafer Khaliki وآخرون
الموضوع الرئيسي: كشف وتصنيف أورام الدماغ

نظرة عامة

تتناول ورقة البحث الدور الحاسم لصحة الدماغ واستخدام التصوير بالرنين المغناطيسي (MRI) لتشخيص أورام الدماغ، والتي تشمل الورم الدبقي، والورم السحائي، وأورام الغدة النخامية. تسلط الضوء على أهمية الذكاء الاصطناعي (AI) في معالجة مجموعات البيانات الكبيرة الناتجة عن MRI، لا سيما من خلال تقنيات تصنيف الصور. تستخدم الدراسة نماذج مختلفة من الشبكات العصبية التلافيفية (CNN)، بما في ذلك VGG16 وInception-V3 وEfficientNetB4 وVGG19، لتصنيف أورام الدماغ. حقق نموذج VGG16 أعلى دقة بنسبة 98%، مع درجة F تبلغ 97%، ومنطقة تحت المنحنى (AUC) تبلغ 99%، واسترجاع بنسبة 98%، ودقة بنسبة 98%.

تؤكد الورقة على التأثير التحويلي للذكاء الاصطناعي في قطاع الرعاية الصحية، لا سيما في تشخيص وعلاج أورام الدماغ. تعزز تقنيات الذكاء الاصطناعي، من خلال تحليلات البيانات الكبيرة والتعلم العميق، دقة اكتشاف وتصنيف الأورام، مما يساعد الأطباء في تطوير خطط علاج فعالة. تستكشف الدراسة أيضًا أداء CNN وطرق التعلم الانتقالي على صور الدماغ، بما في ذلك التحديات التي تطرحها مجموعات البيانات المنحرفة ومنخفضة الجودة. تؤكد النتائج على إمكانية الذكاء الاصطناعي في تحسين التشخيص المبكر واستراتيجيات العلاج الشخصية لمرضى أورام الدماغ.

طرق

في هذه الدراسة، يبحث المؤلفون في تصنيف الصور في مجال الصور الصحية، مع التركيز بشكل خاص على تصنيف أورام الدماغ باستخدام تقنيات التعلم العميق. تم استخدام خمسة نماذج متميزة لهذه المهمة: InceptionV3 وEfficientNetB4 وVGG16 وVGG19، وشبكة عصبية تلافيفية متعددة الطبقات (CNN). تم تقسيم مجموعة البيانات إلى ثلاثة مجموعات فرعية: 75% للتدريب، و15% للتحقق، و10% للاختبار، مما يضمن تقييمًا قويًا لأداء النموذج.

تم إجراء التجارب باستخدام Google Colab، مما يوفر منصة قائمة على السحابة لتنفيذ النماذج. يهدف التحليل المقارن لهذه النماذج إلى تحديد الهيكل الأكثر فعالية لتصنيف أورام الدماغ بدقة، مما يساهم في تقديم رؤى قيمة في مجال تحليل الصور الطبية.

نتائج

في هذه الدراسة، تم تدريب نماذج مختلفة من الشبكات العصبية التلافيفية (CNN)، بما في ذلك هياكل التعلم الانتقالي مثل InceptionV3 وEfficientNetB4 وVGG19، على مجموعات بيانات مفتوحة المصدر لمرضى أورام الدماغ. أشارت النتائج إلى أن EfficientNetB4 حقق أعلى دقة بنسبة 95%. تم تقديم تحليل مقارن لهذه النتائج مع الأدبيات الحالية في الجدول 4، مما يظهر أن نماذج التعلم الانتقالي المعتمدة على CNN تفوقت على الدراسات السابقة.

تؤكد الأبحاث على الدور المهم للذكاء الاصطناعي (AI) في الرعاية الصحية، لا سيما في تشخيص وإدارة الحالات المعقدة مثل أورام الدماغ. بينما يعزز الذكاء الاصطناعي سرعة ودقة العمليات التشخيصية، يتم التأكيد على أنه يجب أن يكمل، بدلاً من استبدال، خبرة المتخصصين في الرعاية الصحية. تعتبر الاعتبارات الأخلاقية، وقضايا الأمان، والخصوصية أمورًا حاسمة لنجاح دمج الذكاء الاصطناعي في التطبيقات الطبية. تدعو الدراسة إلى اعتماد طرق التعلم الانتقالي في مهام معالجة الصور لمساعدة صانعي القرار في الرعاية الصحية، مقترحة أن بيانات MRI وCT يمكن أن تعمل كنظم إنذار مبكر لاتخاذ قرارات سريرية دقيقة وفي الوقت المناسب. ستركز التحقيقات المستقبلية على تشخيص الأمراض باستخدام تقنيات التصوير في الوقت الحقيقي.

مناقشة

في هذه الدراسة، تم استخدام مجموعة بيانات تتكون من 2870 صورة MRI لدماغ الإنسان لتصنيف الأورام إلى أربع فئات: الورم الدبقي، الورم السحائي، عدم وجود ورم، وورم الغدة النخامية. تم إجراء التصنيف باستخدام الشبكات العصبية التلافيفية (CNNs) وهياكل التعلم الانتقالي، وبشكل خاص VGG16 وVGG19 وInceptionV3 وEfficientNetB4. قامت بنية CNN باستخراج الميزات الهرمية من الصور بشكل فعال، مما عزز دقة التصنيف دون الحاجة إلى هندسة ميزات يدوية. تم تدريب النماذج على صور تم تغيير حجمها إلى 64 × 64 بكسل، مع تقييم مقاييس الأداء بما في ذلك الدقة، والدقة، والاسترجاع، ودرجة F من خلال مصفوفات الارتباك ومنحنيات ROC.

أشارت النتائج إلى أن نموذج VGG16 حقق أعلى دقة بنسبة 97%، مع درجة F تبلغ 97%، ومنطقة تحت المنحنى (AUC) تبلغ 99%، وقيم استرجاع ودقة تبلغ 98%. أظهرت الدراسة فعالية التعلم الانتقالي في تحسين أداء النموذج عبر هياكل مختلفة، حيث أظهرت جميع النماذج قدرات تصنيف قوية لأنواع الأورام. أكدت مصفوفة الارتباك أيضًا الأداء العالي للتصنيف عبر النماذج، مما يبرز إمكانية تقنيات التعلم العميق في تحليل الصور الطبية.

القيود

تتمثل القيود الرئيسية لهذه الدراسة في معالجة مجموعة البيانات أثناء عملية التصنيف. على وجه التحديد، ركزت التحقيقات على الشبكات العصبية التلافيفية (CNNs) الفردية ونماذج التعلم الانتقالي المعتمدة على CNN متعددة الطبقات دون تنفيذ تقنيات المعالجة المسبقة الأساسية مثل التدوير والقص. قد تؤثر هذه الإغفالات على قوة وعمومية أداء النموذج، حيث تعتبر هذه العمليات حاسمة لتعزيز تنوع مجموعة البيانات وتحسين دقة التصنيف. وبالتالي، قد لا تمثل النتائج بشكل كامل الفعالية المحتملة للنماذج المقترحة في ظل ظروف أكثر تنوعًا.

Journal: Scientific Reports, Volume: 14, Issue: 1
DOI: https://doi.org/10.1038/s41598-024-52823-9
PMID: https://pubmed.ncbi.nlm.nih.gov/38302604
Publication Date: 2024-02-01
Author(s): Mohammad Zafer Khaliki et al.
Primary Topic: Brain Tumor Detection and Classification

Overview

The research paper focuses on the critical role of brain health and the utilization of magnetic resonance imaging (MRI) for diagnosing brain tumors, which include glioma, meningioma, and pituitary tumors. It highlights the significance of artificial intelligence (AI) in processing the large datasets generated by MRI, particularly through image classification techniques. The study employs various convolutional neural network (CNN) architectures, including VGG16, Inception-V3, EfficientNetB4, and VGG19, to classify brain tumors. The VGG16 model achieved the highest accuracy of 98%, with an F-score of 97%, an Area Under the Curve (AUC) of 99%, a recall of 98%, and a precision of 98%.

The paper emphasizes the transformative impact of AI in the healthcare sector, particularly in the diagnosis and treatment of brain tumors. AI technologies, through big data analytics and deep learning, enhance the accuracy of tumor detection and classification, aiding physicians in developing effective treatment plans. The study also explores the performance of CNN and transfer learning methods on brain images, including the challenges posed by skewed and low-quality datasets. The findings underscore the potential of AI in improving early diagnosis and personalized treatment strategies for brain tumor patients.

Methods

In this study, the authors investigate image classification in the domain of health images, specifically focusing on brain tumor classification using deep learning techniques. Five distinct models were employed for this task: InceptionV3, EfficientNetB4, VGG16, VGG19, and a Multi-Layer Convolutional Neural Network (CNN). The dataset was partitioned into three subsets: 75% for training, 15% for validation, and 10% for testing, ensuring a robust evaluation of model performance.

The experiments were conducted using Google Colab, providing a cloud-based platform for the implementation of the models. The comparative analysis of these models aims to identify the most effective architecture for accurately classifying brain tumors, contributing valuable insights to the field of medical image analysis.

Results

In this study, various convolutional neural network (CNN) models, including transfer learning architectures such as InceptionV3, EfficientNetB4, and VGG19, were trained on open-source datasets of brain tumor patients. The results indicated that EfficientNetB4 achieved the highest accuracy of 95%. A comparative analysis of these findings with existing literature is presented in Table 4, demonstrating that the CNN-based transfer learning models outperformed previous studies.

The research underscores the significant role of artificial intelligence (AI) in healthcare, particularly in the diagnosis and management of complex conditions like brain tumors. While AI enhances the speed and accuracy of diagnostic processes, it is emphasized that it should complement, rather than replace, the expertise of healthcare professionals. Ethical considerations, security, and privacy issues are crucial for the successful integration of AI in medical applications. The study advocates for the adoption of transfer learning methods in image processing tasks to aid healthcare decision-makers, suggesting that MRI and CT data could serve as early warning systems for timely and precise clinical decisions. Future investigations will focus on the diagnosis of diseases using real-time imaging technologies.

Discussion

In this study, a dataset of 2870 human brain MRI images was utilized to classify tumors into four categories: glioma, meningioma, no tumor, and pituitary tumor. The classification was performed using convolutional neural networks (CNNs) and transfer learning architectures, specifically VGG16, VGG19, InceptionV3, and EfficientNetB4. The CNN architecture effectively extracted hierarchical features from the images, enhancing classification accuracy without the need for manual feature engineering. The models were trained on images resized to 64 × 64 pixels, with performance metrics including accuracy, precision, recall, and F-score evaluated through confusion matrices and ROC curves.

The results indicated that the VGG16 model achieved the highest accuracy of 97%, along with an F-score of 97%, an AUC of 99%, and recall and precision values of 98%. The study demonstrated the effectiveness of transfer learning in improving model performance across various architectures, with all models exhibiting strong classification capabilities for the tumor types. The confusion matrix further confirmed the high classification performance across the models, underscoring the potential of deep learning techniques in medical image analysis.

Limitations

The primary limitation of this study lies in the dataset’s treatment during the classification process. Specifically, the investigation focused on single Convolutional Neural Networks (CNNs) and multilayer CNN-based transfer learning models without implementing essential preprocessing techniques such as rotation and cropping. This omission may affect the robustness and generalizability of the model’s performance, as these operations are critical for enhancing the dataset’s variability and improving classification accuracy. Consequently, the findings may not fully represent the potential effectiveness of the proposed models under more diverse conditions.