دمج تقنيات التعلم العميق المتقدمة لتحسين الكشف وتصنيف أمراض أوراق وفواكه الحمضيات Integrating advanced deep learning techniques for enhanced detection and classification of citrus leaf and fruit diseases

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-97159-0
PMID: https://pubmed.ncbi.nlm.nih.gov/40221550
تاريخ النشر: 2025-04-12
المؤلف: Archna Goyal وآخرون
الموضوع الرئيسي: الزراعة الذكية والذكاء الاصطناعي

نظرة عامة

في هذه الدراسة، تم تقييم أربعة نماذج متقدمة من التعلم العميق—EfficientNetB0 وResNet50 وDenseNet121 وInceptionV3—لفعاليتها في تصنيف أمراض الحمضيات مع تحسين المعلمات الفائقة. كشفت النتائج أن كل من InceptionV3 وDenseNet121 حققتا أعلى دقة تصنيف، حوالي 99.12%، مع تسجيل InceptionV3 لمتوسط F1-score ماكرو يبلغ حوالي 0.986 ومتوسط F1-score مرجح بالقرب من 0.991، مما يدل على دقة واسترجاع استثنائي لكل من فئات الفاكهة والأوراق. أظهر DenseNet121 أيضًا مقاييس أداء قوية مماثلة، مما يبرز قدراته القوية في استخراج الميزات.

كما أظهرت ResNet50 وEfficientNetB0 أداءً محسّنًا، حيث حققت دقة تبلغ 84.58% و80.18%، على التوالي، على الرغم من أنها كانت أقل اتساقًا عبر فئات الأمراض مقارنة بالنماذج الرائدة. أكدت الدراسة على أهمية ضبط معدلات التعلم والمُحسِّنات، بالإضافة إلى فعالية تعزيز Mixup في تحسين أداء التصنيف. تؤكد هذه النتائج على إمكانيات الشبكات العصبية التلافيفية في الكشف الدقيق عن أمراض الحمضيات، وهو أمر حاسم لتقليل خسائر المحاصيل وضمان إنتاجية عالية الجودة. علاوة على ذلك، تدعو الأبحاث إلى ضرورة معالجة قيود البيانات وتعزيز الكفاءة الحاسوبية للتطبيقات العملية في الزراعة، مما يساهم في الفهم الأوسع للحلول المدفوعة بالذكاء الاصطناعي في تصنيف أمراض النباتات.

الطرق

في هذه الدراسة، تم إجراء استخراج الميزات باستخدام الشبكات العصبية التلافيفية (CNNs) لاشتقاق تمثيلات ميزات هرمية تلقائيًا من بيانات الصور الخام. قام الباحثون بتقييم عدة هياكل CNN مدربة مسبقًا، بما في ذلك VGG16 وResNet50 وInceptionV3 وEfficientNetB0، باستخدام التعلم بالنقل للاستفادة من المعرفة من النماذج المدربة سابقًا على مجموعة بيانات ImageNet.

خضعت النماذج المدربة مسبقًا لتعديلات لتكييفها مع مهمة تصنيف أمراض الحمضيات. شملت التغييرات المعمارية الرئيسية إدخال طبقة تجميع متوسط عالمي لتقليل الأبعاد المكانية مع الحفاظ على المعلومات الأساسية، تليها طبقة أو أكثر من الطبقات الكثيفة المتصلة بالكامل مع دوال تنشيط ReLU لالتقاط التفاعلات غير الخطية للميزات. لتخفيف الإفراط في التكيف، تم دمج طبقات إسقاط بمعدل 0.5، واستخدمت الطبقة النهائية دالة تنشيط softmax لتوليد توزيعات احتمالية عبر الفئات المحددة.

النتائج

قيمت نتائج الدراسة أداء أربعة نماذج من التعلم العميق—EfficientNetB0 وResNet50 وInceptionV3 وDenseNet121—في تصنيف أمراض الحمضيات. تم تحليل مقاييس الأداء بشكل منهجي لتقييم فعالية كل نموذج، مع تسليط الضوء على نقاط القوة والضعف في سياق تصنيف الأمراض. كشفت التحليلات المقارنة عن اختلافات كبيرة في الدقة والموثوقية بين النماذج، مما يوفر رؤى حول قابليتها للتطبيق في إدارة أمراض الحمضيات.

تستكشف المناقشة أيضًا تداعيات هذه النتائج لتحسين استراتيجيات الكشف عن الأمراض وإدارتها في زراعة الحمضيات. بالإضافة إلى ذلك، تتناول القسم القيود التي تم مواجهتها خلال الدراسة وتقترح طرقًا محتملة للبحث المستقبلي، مع التأكيد على الحاجة إلى الاستمرار في استكشاف تقنيات التعلم العميق في التطبيقات الزراعية.

المناقشة

تحدد قسم المناقشة من ورقة البحث المنهجية والنتائج المتعلقة بالكشف وتصنيف أمراض الحمضيات باستخدام نماذج التعلم العميق. تتكون مجموعة البيانات، المستمدة من Rauf وآخرين، من 759 صورة لكل من الفواكه والأوراق الصحية والمريضة، مصنفة إلى فئات مرضية مختلفة مثل البقعة السوداء، والقرحة، والتخضير. تم التقاط الصور تحت ظروف متنوعة لتعزيز قوة النموذج، وتم استخدام تعليقات الخبراء لضمان جودة عالية للتسميات. لمعالجة عدم توازن الفئات، وخاصة لصور الأوراق الصحية، تم تنفيذ استراتيجيات مثل زيادة العينة ووزن الفئات. تم تقسيم مجموعة البيانات إلى مجموعات التدريب (70%)، والتحقق (15%)، والاختبار (15%) باستخدام أخذ عينات طبقية للحفاظ على تمثيل الفئات.

تم استخدام تقنيات تعزيز البيانات والمعالجة المسبقة لتحسين تعميم النموذج، بما في ذلك التحولات الهندسية، وتعديلات الألوان، وإدخال الضوضاء. تم تطبيق طرق تحسين الصور، مثل تحسين التباين المحدود التكيفي (CLAHE) والترشيح الوسيط، لتحسين رؤية أعراض الأمراض. تم تدريب وتحسين عدة نماذج من التعلم العميق، بما في ذلك EfficientNetB0 وResNet50 وInceptionV3 وDenseNet121، من خلال ضبط المعلمات الفائقة وتقنيات التنظيم لمنع الإفراط في التكيف. أشارت النتائج إلى أن InceptionV3 وDenseNet121 حققتا أعلى دقة اختبار تبلغ 99.12%، مما يدل على فعالية الهياكل المتقدمة واستراتيجيات التحسين في تصنيف أمراض الحمضيات بدقة. زادت دمج أدوات الشرح مثل رسم خرائط تنشيط الفئة المعتمدة على التدرج (Grad-CAM) من قابلية تفسير النموذج، مما يسمح للمستخدمين بتصور عملية اتخاذ القرار وبناء الثقة في نظام الذكاء الاصطناعي. بشكل عام، تؤكد النتائج على إمكانيات التعلم العميق في التطبيقات الزراعية، لا سيما في مراقبة وإدارة الأمراض.

القيود

تسلط قيود الدراسة الحالية الضوء على عدة مجالات لتحسين تطوير النماذج للكشف عن أمراض الحمضيات. أولاً، قد تعيق حجم مجموعة البيانات الصغير وعدم التوازن في بعض فئات الأمراض قدرات تعميم النماذج. يجب أن تهدف الأبحاث المستقبلية إلى توسيع مجموعة البيانات، خاصة من خلال تضمين المزيد من الصور من الفئات الممثلة تمثيلاً ناقصًا، لتعزيز قوة النموذج وتقليل التحيز. بالإضافة إلى ذلك، فإن الاعتماد على الميزات البصرية وحدها يتجاهل العوامل السياقية مثل درجة الحرارة والرطوبة، والتي يمكن دمجها لتحسين دقة التصنيف من خلال أساليب البيانات متعددة الوسائط.

علاوة على ذلك، يتطلب النشر العملي في البيئات الزراعية التركيز على الكفاءة الحاسوبية، حيث إن العديد من الأجهزة المستخدمة محدودة الموارد. سيكون من الضروري التحقيق في تقنيات التحسين مثل تقليم النموذج والتكميم لتمكين هذه النماذج من العمل بفعالية على الأجهزة الطرفية دون المساس بالأداء. يجب أن تتحقق الدراسات المستقبلية أيضًا من صحة النماذج عبر أنواع الحمضيات المتنوعة والمواقع الجغرافية، لضمان القابلية للتكيف والفائدة العملية في التطبيقات الواقعية. سيكون من الضروري إجراء تحليل شامل لتعقيد الحوسبة لتقييم جدوى نشر هذه النماذج في البيئات المحدودة الموارد، مع خطط لاستكشاف الهياكل الخفيفة والنماذج المعتمدة على المحولات لتحسين الأداء في تصنيف أمراض النباتات.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-97159-0
PMID: https://pubmed.ncbi.nlm.nih.gov/40221550
Publication Date: 2025-04-12
Author(s): Archna Goyal et al.
Primary Topic: Smart Agriculture and AI

Overview

In this study, four advanced deep learning models—EfficientNetB0, ResNet50, DenseNet121, and InceptionV3—were assessed for their effectiveness in classifying citrus diseases with optimized hyperparameters. The results revealed that both InceptionV3 and DenseNet121 achieved the highest classification accuracy, approximately 99.12%, with InceptionV3 recording a macro average F1-score of about 0.986 and a weighted average F1-score near 0.991, indicating exceptional precision and recall for both fruit and leaf classes. DenseNet121 demonstrated similarly strong performance metrics, highlighting its robust feature extraction capabilities.

ResNet50 and EfficientNetB0 also showed improved performance, achieving accuracies of 84.58% and 80.18%, respectively, though they were less consistent across disease categories compared to the top models. The study emphasized the significance of fine-tuning learning rates and optimizers, as well as the effectiveness of Mixup augmentation in enhancing classification performance. These findings underscore the potential of convolutional neural networks for accurate citrus disease detection, which is crucial for reducing crop losses and ensuring high-quality yields. Furthermore, the research calls attention to the need for addressing data limitations and enhancing computational efficiency for practical applications in agriculture, thereby contributing to the broader understanding of AI-driven solutions in plant disease classification.

Methods

In this study, feature extraction was conducted using Convolutional Neural Networks (CNNs) to automatically derive hierarchical feature representations from raw image data. The researchers evaluated several pre-trained CNN architectures, including VGG16, ResNet50, InceptionV3, and EfficientNetB0, utilizing transfer learning to capitalize on the knowledge from models previously trained on the ImageNet dataset.

The pre-trained models underwent modifications to adapt them for the citrus disease classification task. Key architectural changes included the introduction of a global average pooling layer to minimize spatial dimensions while preserving essential information, followed by one or more fully connected dense layers with ReLU activation functions to capture non-linear feature interactions. To mitigate overfitting, dropout layers with a rate of 0.5 were incorporated, and the final output layer employed a softmax activation function to generate probability distributions across the defined classes.

Results

The results of the study evaluated the performance of four deep learning models—EfficientNetB0, ResNet50, InceptionV3, and DenseNet121—in classifying citrus diseases. Performance metrics were systematically analyzed to assess the efficacy of each model, highlighting their strengths and weaknesses in the context of disease classification. A comparative analysis revealed significant differences in accuracy and reliability among the models, providing insights into their applicability for citrus disease management.

The discussion further explores the implications of these findings for improving disease detection and management strategies in citrus agriculture. Additionally, the section addresses the limitations encountered during the study and suggests potential avenues for future research, emphasizing the need for continued exploration of deep learning techniques in agricultural applications.

Discussion

The discussion section of the research paper outlines the methodology and findings related to the detection and classification of citrus diseases using deep learning models. The dataset, sourced from Rauf et al., comprises 759 images of both healthy and diseased citrus fruits and leaves, categorized into various disease classes such as Black spot, Canker, and Greening. The images were captured under diverse conditions to enhance model robustness, and expert annotations were employed to ensure high-quality labels. To address class imbalances, particularly for healthy leaf images, strategies like oversampling and class weighting were implemented. The dataset was divided into training (70%), validation (15%), and testing (15%) sets using stratified sampling to maintain class representation.

Data augmentation and preprocessing techniques were employed to improve model generalization, including geometric transformations, color adjustments, and noise injection. Image enhancement methods, such as Contrast Limited Adaptive Histogram Equalization (CLAHE) and median filtering, were applied to improve the visibility of disease symptoms. Several deep learning models, including EfficientNetB0, ResNet50, InceptionV3, and DenseNet121, were trained and optimized through hyperparameter tuning and regularization techniques to prevent overfitting. The results indicated that InceptionV3 and DenseNet121 achieved the highest test accuracy of 99.12%, demonstrating the effectiveness of advanced architectures and optimization strategies in accurately classifying citrus diseases. The integration of explainability tools like Gradient-weighted Class Activation Mapping (Grad-CAM) further enhanced model interpretability, allowing users to visualize the decision-making process and build trust in the AI system. Overall, the findings underscore the potential of deep learning in agricultural applications, particularly for disease monitoring and management.

Limitations

The limitations of the current study highlight several areas for improvement in the development of models for citrus disease detection. Firstly, the dataset’s small size and imbalance in certain disease categories may hinder the models’ generalization capabilities. Future research should aim to expand the dataset, particularly by including more images from underrepresented classes, to enhance model robustness and mitigate bias. Additionally, the reliance on visual features alone neglects contextual factors such as temperature and humidity, which could be integrated to improve classification accuracy through multimodal data approaches.

Moreover, practical deployment in agricultural settings necessitates a focus on computational efficiency, as many devices used are resource-constrained. Investigating optimization techniques like model pruning and quantization will be crucial for enabling these models to function effectively on edge hardware without compromising performance. Future studies should also validate the models across diverse citrus varieties and geographic locations, ensuring adaptability and practical utility in real-world applications. A thorough computational complexity analysis will be essential to assess the feasibility of deploying these models in resource-limited environments, with plans to explore lightweight architectures and transformer-based models for enhanced performance in plant disease classification.