تحسين تشخيص الملاريا من خلال هياكل الشبكات العصبية المخصصة القابلة للتفسير Improving Malaria diagnosis through interpretable customized CNNs architectures

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-90851-1
PMID: https://pubmed.ncbi.nlm.nih.gov/39987229
تاريخ النشر: 2025-02-22
المؤلف: Md. Faysal Ahamed وآخرون
الموضوع الرئيسي: التصوير الرقمي لأمراض الدم

نظرة عامة

تبحث الدراسة في فعالية ثلاث هياكل لشبكات الأعصاب التلافيفية (CNN) – الشبكة التلافيفية المتوازية (PCNN)، والشبكات التلافيفية المتوازية مع الانتباه الناعم (SPCNN)، والانتباه الناعم بعد الكتلة الوظيفية للشبكات التلافيفية المتوازية (SFPCNN) – لتشخيص طفيليات الملاريا في مسحات خلايا الدم الحمراء (RBC). تفوقت نموذج SPCNN على الآخرين، محققة دقة قدرها 99.38 ± 0.21%، واسترجاع قدره 99.37 ± 0.21%، ودرجة F1 قدرها 99.37 ± 0.21%، ودقة قدرها 99.37 ± 0.30%، ومنطقة تحت منحنى التشغيل (AUC) قدرها 99.95 ± 0.01%. بالإضافة إلى ذلك، أظهرت SPCNN كفاءة حسابية متفوقة مع أسرع وقت اختبار قدره 0.00252 ثانية، على الرغم من كونها أكثر تعقيدًا من PCNN وأبسط من SFPCNN.

تؤكد الدراسة على إمكانيات SPCNN كأداة تشخيصية قوية للملاريا، خاصة في البيئات ذات الموارد المحدودة. ومع ذلك، تعترف بالقيود، بما في ذلك الاعتماد على مجموعة بيانات محدودة قد لا تمثل تنوع عينات مسحات الدم عبر مختلف السكان والمناطق الجغرافية. علاوة على ذلك، قد تشكل المتطلبات الحاسوبية للنموذج تحديات للنشر على الأجهزة ذات الموارد المنخفضة المستخدمة عادة في البيئات الميدانية. بشكل عام، تسلط النتائج الضوء على وعد الهياكل المتقدمة لشبكات CNN في تعزيز تشخيص الملاريا، بينما تشير أيضًا إلى الحاجة لمزيد من البحث لمعالجة القيود المحددة.

الطرق

تحدد المنهجية المقترحة إطارًا شاملاً لتصنيف طفيليات الملاريا من مسحات خلايا الدم الحمراء (RBC)، مع تضمين صور من الأفراد المصابين والأصحاء. تتضمن الخطوة الأولية تجميع مجموعة بيانات كبيرة من المكتبة الوطنية للطب، تليها مرحلة معالجة مسبقة تهدف إلى تحسين جودة الصورة. تشمل تقنيات المعالجة المسبقة الرئيسية تغيير حجم الصورة لتحقيق التوحيد، وتعديل تباين الهيستوجرام التكيفي المحدود (CLAHE) لتحسين التباين المحلي، والتطبيع لتوسيع قيم البكسل بين 0 و 1. هذه الخطوات ضرورية لضمان الاتساق عبر مجموعة البيانات وتحسين أداء هياكل الشبكات العصبية التلافيفية (CNN).

يستخدم الإطار عدة نماذج CNN، بما في ذلك SFPCNN وSPCNN وPCNN، جنبًا إلى جنب مع نماذج التعلم الانتقالي (TL) مثل VGG16 وResNet152. يتم تطبيق توحيد الميزات باستخدام الصيغة \( z = \frac{x – \mu}{\sigma} \)، مما يضمن أن الميزات لها متوسط قدره 0 وانحراف معياري قدره 1، مما يعزز استقرار التدريب وأداء النموذج. توضح النتائج التجريبية أداء النماذج المقترحة، مع تسليط الضوء على فعالية PCNN، التي تعالج بيانات الإدخال بشكل متوازي لاستخراج ميزات معقدة، وSPCNN، التي تركز على مناطق معينة من الصورة. بالإضافة إلى ذلك، يتم دمج آلية الانتباه الناعم لإعطاء الأولوية للميزات المهمة، مما يحسن دقة تصنيف النموذج بشكل أكبر.

النتائج

تشير نتائج نموذج PCNN إلى أداء قوي في تصنيف طفيليات الملاريا، كما هو موضح في الجدول 7، الذي يقدم مقاييس الأداء حسب الفئة على مجموعة بيانات الاختبار. خضع النموذج لعملية تحقق صارمة من خلال التحقق المتقاطع بخمس طيات لضمان موثوقية دقته. في مجموعة التحقق، كما هو ملخص في الجدول 8، حقق PCNN دقة مثيرة للإعجاب، وحساسية، ودرجة F1 قدرها 99.31% ± 0.18، إلى جانب معدل تعرف قدره 99.31%. تم تسجيل معدلات الدقة خلال الاختبار والتحقق عند 99.31 ± 0.18 و99.56 ± 0.13، على التوالي، مما يظهر قدرة النموذج على تصنيف طفيليات الملاريا بدقة.

توضح الأشكال 8 و9 منحنيات الدقة والخسارة على مدى 100 دورة، كاشفة عن أداء متسق طوال مراحل التدريب والتحقق. تسلط منحنى الخسارة الضوء على التقلبات في الخسارة، بينما توفر الأشكال 10 و11 رؤى حول نتائج التصنيف من خلال مصفوفات الالتباس (CMs)، التي توضح الدقة والحساسية ودرجات F1 لكل فئة. بالإضافة إلى ذلك، يتم إثبات قدرة النموذج على التمييز بين الفئات من خلال منحنى ROC المقدم في الشكل 12، محققًا درجة منطقة تحت المنحنى (AUC) قدرها 99.92% ± 0.05، مما يبرز الأداء الاستثنائي للنموذج في التصنيف.

المناقشة

تسلط قسم المناقشة في ورقة البحث الضوء على التقدم في تقنيات التعلم العميق (DL) والتعلم الآلي (ML) لتشخيص الملاريا، مع التأكيد على فعالية الشبكات العصبية التلافيفية (CNNs) في اكتشاف وتصنيف طفيليات الملاريا. تتم مقارنة دراسات مختلفة، مما يظهر نماذج مثل هيكل VGG، الذي حقق دقة مثيرة للإعجاب قدرها 98.57% في تحديد الخلايا المصابة بالملاريا من خلال طريقة التحقق المتقاطع بخمس طيات. تشمل الأساليب البارزة الأخرى استخدام CNNs المدربة مسبقًا مثل ResNet50 وMobileNetV2، التي حسنت الأداء على مجموعات بيانات أصغر من خلال التعلم الانتقالي. بالإضافة إلى ذلك، أظهرت النماذج الهجينة التي تجمع بين CNNs وآلات الدعم الناقل (SVMs) والشبكات العميقة (DBNs) دقة وحساسية متفوقة مقارنة بالطرق التقليدية.

على الرغم من هذه التقدمات، تحدد القسم قيودًا كبيرة في الأبحاث الحالية، بما في ذلك الاعتماد على مجموعات بيانات مصنفة يدويًا، مما يمكن أن يعيق القابلية للتوسع والعمومية في التطبيقات الواقعية. العديد من النماذج تتطلب حسابات مكثفة، مما يشكل تحديات للتنفيذ في البيئات ذات الموارد المحدودة. علاوة على ذلك، تشير المناقشة إلى نقص القابلية للتفسير في العديد من النماذج، مع قلة من الدراسات التي تقيم منهجيات الذكاء الاصطناعي القابلة للتفسير (XAI) مثل SHAP وGrad-CAM بشكل منهجي. تثير هذه الفجوة مخاوف بشأن القابلية العملية لهذه النماذج في البيئات السريرية، حيث يعد فهم قرارات النموذج أمرًا حيويًا. يدعو المؤلفون إلى مزيد من الاستكشاف لكفاءة التدريب، والصلابة ضد البيانات الضوضائية، والتعلم التكيفي لتعزيز قابلية الاستخدام وفعالية طرق تشخيص الملاريا المعتمدة على DL وML.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-90851-1
PMID: https://pubmed.ncbi.nlm.nih.gov/39987229
Publication Date: 2025-02-22
Author(s): Md. Faysal Ahamed et al.
Primary Topic: Digital Imaging for Blood Diseases

Overview

The research investigates the effectiveness of three convolutional neural network (CNN) architectures—Parallel Convolutional Neural Network (PCNN), Soft Attention Parallel Convolutional Neural Networks (SPCNN), and Soft Attention after Functional Block Parallel Convolutional Neural Networks (SFPCNN)—for diagnosing malaria parasites in red blood cell (RBC) smears. The SPCNN model outperformed the others, achieving a precision of 99.38 ± 0.21%, recall of 99.37 ± 0.21%, F1 score of 99.37 ± 0.21%, accuracy of 99.37 ± 0.30%, and an area under the receiver operating characteristic curve (AUC) of 99.95 ± 0.01%. Additionally, SPCNN demonstrated superior computational efficiency with the fastest testing time of 0.00252 seconds, despite being more complex than PCNN and simpler than SFPCNN.

The study emphasizes the potential of SPCNN as a robust diagnostic tool for malaria, particularly in resource-limited settings. However, it acknowledges limitations, including the reliance on a limited dataset that may not represent the diversity of blood smear samples across different populations and geographic regions. Furthermore, the computational demands of the model may pose challenges for deployment on low-resource devices commonly used in field settings. Overall, the findings highlight the promise of advanced CNN architectures in enhancing malaria diagnosis, while also indicating the need for further research to address the identified limitations.

Methods

The proposed methodology outlines a comprehensive framework for classifying malaria parasites from red blood cell (RBC) smears, incorporating images from both infected and healthy individuals. The initial step involves assembling a substantial dataset from the National Library of Medicine, followed by a preprocessing phase aimed at enhancing image quality. Key preprocessing techniques include image resizing for uniformity, Contrast Limited Adaptive Histogram Equalization (CLAHE) for improving local contrast, and normalization to scale pixel values between 0 and 1. These steps are crucial for ensuring consistency across the dataset and optimizing the performance of convolutional neural network (CNN) architectures.

The framework employs several CNN models, including SFPCNN, SPCNN, and PCNN, alongside transfer learning (TL) models such as VGG16 and ResNet152. Feature standardization is applied using the formula \( z = \frac{x – \mu}{\sigma} \), ensuring that features have a mean of 0 and a standard deviation of 1, which enhances training stability and model performance. The experimental results detail the performance of the proposed models, highlighting the effectiveness of PCNN, which processes input data in parallel to extract complex features, and SPCNN, which focuses on specific image regions. Additionally, a soft attention mechanism is integrated to prioritize significant features, further improving the model’s classification accuracy.

Results

The results of the PCNN model indicate a robust performance in classifying malaria parasites, as detailed in Table 7, which presents class-wise performance metrics on the test dataset. The model underwent a rigorous five-fold cross-validation to ensure the reliability of its accuracy. On the validation set, as summarized in Table 8, the PCNN achieved an impressive precision, sensitivity, and F1-score of 99.31% ± 0.18, alongside a recognition rate of 99.31%. The accuracy rates during testing and validation were recorded at 99.31 ± 0.18 and 99.56 ± 0.13, respectively, demonstrating the model’s capability to accurately categorize malaria parasites.

Figures 8 and 9 illustrate the accuracy and loss curves over 100 epochs, revealing consistent performance throughout the training and validation phases. The loss curve highlights fluctuations in loss, while Figures 10 and 11 provide insights into classification outcomes through Confusion Matrices (CMs), which detail precision, sensitivity, and F1-scores for each class. Additionally, the model’s ability to distinguish between classes is further evidenced by the ROC curve presented in Figure 12, achieving an Area Under the Curve (AUC) score of 99.92% ± 0.05, underscoring the model’s exceptional classification performance.

Discussion

The discussion section of the research paper highlights the advancements in deep learning (DL) and machine learning (ML) techniques for malaria diagnosis, emphasizing the effectiveness of convolutional neural networks (CNNs) in detecting and classifying malaria parasites. Various studies are compared, showcasing models like the VGG architecture, which achieved an impressive accuracy of 98.57% in identifying malaria-infected cells through a five-fold cross-validation method. Other notable approaches include the use of pre-trained CNNs such as ResNet50 and MobileNetV2, which improved performance on smaller datasets through transfer learning. Additionally, hybrid models combining CNNs with support vector machines (SVMs) and deep belief networks (DBNs) demonstrated superior accuracy and sensitivity compared to traditional methods.

Despite these advancements, the section identifies significant limitations in existing research, including the reliance on manually labeled datasets, which can hinder scalability and generalizability in real-world applications. Many models are computationally intensive, posing challenges for implementation in resource-constrained environments. Furthermore, the discussion points out the lack of interpretability in many models, with few studies systematically evaluating explainable AI (XAI) methods like SHAP and Grad-CAM. This gap raises concerns about the practical applicability of these models in clinical settings, where understanding model decisions is crucial. The authors call for further exploration of training efficiency, robustness against noisy data, and adaptive learning to enhance the usability and effectiveness of DL and ML-based malaria diagnostic methods.