تشخيص مرض أوراق الذرة: تعزيز الدقة باستخدام resnet152 و grad-cam للذكاء الاصطناعي القابل للتفسير Corn leaf disease diagnosis: enhancing accuracy with resnet152 and grad-cam for explainable AI

المجلة: BMC Plant Biology، المجلد: 25، العدد: 1
DOI: https://doi.org/10.1186/s12870-025-06386-0
تاريخ النشر: 2025-04-07
المؤلف: Kirubasri Gopalan وآخرون
الموضوع الرئيسي: الزراعة الذكية والذكاء الاصطناعي

نظرة عامة

تركز هذه الدراسة على تطوير نموذج تعلم عميق، تحديداً ResNet152، للكشف الدقيق وتصنيف أمراض أوراق الذرة، والتي تعتبر حيوية للحفاظ على إمدادات الغذاء وإنتاجية الزراعة. باستخدام مجموعة بيانات تحتوي على 4,188 صورة مصنفة إلى أربع فئات—الذبول، الصدأ الشائع، البقع الرمادية، والأوراق الصحية—حقق النموذج دقة تدريب مثيرة للإعجاب بلغت 99.95% ودقة اختبار بلغت 98.34%. يعزز دمج Grad-CAM (تخطيط تنشيط الفئة المعتمد على التدرج) من قابلية تفسير النموذج من خلال إنشاء خرائط حرارية تبرز المناطق الأكثر أهمية في صور الأوراق التي تؤثر على التنبؤات، مما يساعد المزارعين في فهم تشخيصات الأمراض.

تشير النتائج إلى أن نموذج ResNet152، المدعوم بـ Grad-CAM، لا يتجاوز فقط النماذج السابقة من حيث الدقة والخصوصية ولكن أيضاً يدعم المزارعين في إدارة الأمراض بشكل فعال، مما يعزز في النهاية الممارسات الزراعية المستدامة. تؤكد الدراسة على أهمية الذكاء الاصطناعي القابل للتفسير (X-AI) في الزراعة، مقترحة أن الأبحاث المستقبلية يمكن أن توسع من تطبيقات النموذج لتشمل محاصيل أخرى وتدمج تقنيات إنترنت الأشياء للكشف عن الأمراض في الوقت الحقيقي. تسلط التحديات مثل الحاجة إلى موارد حوسبة محددة للموقع وقيود مجموعات البيانات الحالية الضوء على ضرورة إجراء دراسات إضافية لتعزيز موثوقية النموذج وسهولة وصول المستخدمين في سياقات زراعية متنوعة.

مقدمة

تسلط مقدمة هذه الورقة البحثية الضوء على الدور الحاسم للذرة كمحصول غذائي أساسي ضروري للأمن الغذائي العالمي والاقتصادات الزراعية. على الرغم من قدرتها على مقاومة الأمراض المختلفة، تتأثر الذرة بشكل كبير بالعوامل الممرضة التي يمكن أن تقلل من كل من الغلة والجودة، مما يؤدي إلى عواقب اقتصادية على المزارعين وزيادة تكاليف الغذاء. تؤكد الورقة على أهمية التعرف المبكر والدقيق على الأمراض، خاصةً لأمراض الأوراق مثل ذبول أوراق الذرة الشمالية، والبقع الرمادية، والصدأ الشائع، التي تنتشر عبر مناطق جغرافية مختلفة.

لقد حولت التقدمات في التكنولوجيا، وخاصة من خلال التعلم العميق (DL) والذكاء الاصطناعي (AI)، الكشف عن الأمراض من الفحص اليدوي إلى الأنظمة الآلية. تقترح الدراسة نموذج DL فعال وقابل للتفسير باستخدام بنية ResNet152، التي تتضمن اتصالات متبقية لتعزيز التدريب وتحسين دقة الكشف عن الأمراض. يحقق النموذج دقة تدريب تبلغ 99.95% ودقة اختبار تبلغ 98.34%، مع استخدام Grad-CAM للتفسير، مما يسمح للمزارعين بفهم أي أجزاء من أوراق الذرة تتأثر أكثر. تهدف هذه الدراسة إلى تقليل فقد المحاصيل وتعزيز ممارسات الزراعة المستدامة، وبالتالي المساهمة في الأمن الغذائي العالمي.

طرق

في هذه الدراسة، يتم اقتراح نموذج قائم على التعلم العميق للتصنيف الآلي لأمراض أوراق النباتات إلى أربع فئات متميزة: H، LB، CR، وGLS. تبدأ المنهجية بتعزيز البيانات، تليها حذف التكرارات، تغيير حجم الصور، وتعديل التنسيق، جميعها تهدف إلى تحسين جودة البيانات. يتم استخراج الميزات وتصنيفها باستخدام نموذج ResNet152 المدرب مسبقاً، المحسن باستخدام خوارزمية آدم، مع معدل تعلم يبلغ 0.0001، وحجم دفعة يبلغ 32، و50 دورة.

يتم تقييم أداء النموذج باستخدام مقاييس قياسية على بيانات الاختبار لضمان الموثوقية والاتساق. بالإضافة إلى ذلك، تتضمن الدراسة تقنيات الذكاء الاصطناعي القابل للتفسير، وتحديداً تخطيط تنشيط الفئة المعتمد على التدرج (Grad-CAM)، لتصور الميزات الحيوية التي تؤثر على أداء النموذج ونتائج التصنيف. يهدف النموذج المقترح إلى توفير كشف واضح وفعال عن أمراض النباتات، مما يساهم في تحسين الممارسات الزراعية. توضح الشكل 1 الهيكل العام للنموذج المقترح.

نتائج

تكشف نتائج الدراسة حول نموذج ResNet152 للكشف عن أمراض أوراق الذرة عن تقييم شامل لأدائه من خلال مقاييس مختلفة. تشمل مؤشرات الأداء الرئيسية الحساسية (Se)، الخصوصية (Sp)، الدقة (Acc)، الدقة (Pr)، معدل الإيجابيات الحقيقية (TPR)، معدل الإيجابيات الكاذبة (FPR)، ومعدل التصنيف الخاطئ (MR). نجح النموذج في تصنيف الصور إلى أربع فئات مرضية: الذبول البكتيري (B)، الصدأ الشائع (Cr)، البقع الرمادية (GLS)، والصحي (H). أشارت مصفوفة الالتباس إلى أن النموذج حقق دقة اختبار بلغت 93.69%، مع تصنيفات صحيحة ملحوظة وبعض التصنيفات الخاطئة، خاصة بين B وGLS.

لزيادة قوة النموذج، تم استخدام طريقة التحقق المتقاطع بخمس طيات، مما يضمن أداءً مستقراً ومتسقاً عبر مجموعات تدريب مختلفة. قدم دمج تخطيط تنشيط الفئة المعتمد على التدرج (Grad-CAM) رؤى قيمة حول عملية اتخاذ القرار للنموذج، مبرزاً المناطق المحددة للأمراض في الصور التي أثرت على التنبؤات. بينما أظهر النموذج استخراج ميزات فعالة لمعظم الحالات، كشفت التصنيفات الخاطئة عن تحديات في تمييز الفروق الدقيقة بين الأمراض المتشابهة. تشمل التوصيات للتحسينات المستقبلية تحسين وضوح الميزات من خلال تقنيات مثل تقليل الضوضاء وزيادة مجموعة البيانات لتعزيز دقة النموذج وقابليته للتفسير في تشخيصات الزراعة.

مناقشة

تسلط المناقشة الضوء على القيود الكبيرة في النماذج الحالية للتعلم العميق لكشف أمراض النباتات، وخاصة نقص قابلية التفسير، مما يعيق اعتمادها في الممارسات الزراعية. بينما تتفوق نماذج مثل DenseViT وYOLOv8 في الدقة، إلا أنها تفشل في تقديم رؤى حول عمليات اتخاذ القرار الخاصة بها، مما يترك المزارعين وعلماء الزراعة غير قادرين على الثقة أو الاستفادة بشكل فعال من هذه التقنيات. بالإضافة إلى ذلك، تكافح النماذج الخفيفة مثل MobileNetV3 وEfficientNet، على الرغم من فعاليتها في البيئات المسيطر عليها، مع تباين الظروف الزراعية في العالم الحقيقي. يجادل المؤلفون بضرورة وجود نموذج أكثر قابلية للتفسير يحافظ على دقة عالية، مما يؤدي إلى اختيار ResNet152، الذي يقدم قدرات استخراج ميزات متفوقة بفضل هيكله الأعمق واتصالاته المتبقية.

يعالج دمج Grad-CAM مع ResNet152 مشكلة قابلية التفسير من خلال إنشاء خرائط حرارية تمثل بصرياً المناطق في الصور المدخلة التي تؤثر على تنبؤات النموذج. هذه الميزة حيوية للمستخدمين النهائيين، حيث تعزز فهمهم لتركيز النموذج على المناطق المريضة مقابل الخلفية غير ذات الصلة، مما يعزز الثقة ويسهل اتخاذ القرارات المستنيرة في إدارة الأمراض. يظهر النموذج المقترح، المدرب على مجموعة بيانات تحتوي على 4,188 صورة لأوراق الذرة، نهجاً قوياً لتصنيف الأمراض مع ضمان أن النتائج قابلة للتفسير، مما يلبي في النهاية متطلبات الزراعة الدقيقة. يخلص المؤلفون إلى أن هذا الجمع بين ResNet152 وGrad-CAM لا يحسن الدقة فحسب، بل يجسر أيضاً الفجوة بين تكنولوجيا الذكاء الاصطناعي والتطبيقات الزراعية العملية.

Journal: BMC Plant Biology, Volume: 25, Issue: 1
DOI: https://doi.org/10.1186/s12870-025-06386-0
Publication Date: 2025-04-07
Author(s): Kirubasri Gopalan et al.
Primary Topic: Smart Agriculture and AI

Overview

This study focuses on the development of a deep learning model, specifically ResNet152, for the accurate detection and classification of corn leaf diseases, which are critical for maintaining food supply and agricultural productivity. Utilizing a dataset of 4,188 images categorized into four classes—blight, common rust, gray spot, and healthy leaves—the model achieved an impressive training accuracy of 99.95% and a testing accuracy of 98.34%. The integration of Grad-CAM (Gradient-weighted Class Activation Mapping) enhances the model’s interpretability by generating heatmaps that highlight the most significant areas of the leaf images influencing predictions, thereby aiding farmers in understanding disease diagnoses.

The findings indicate that the ResNet152 model, augmented with Grad-CAM, not only surpasses previous models in accuracy and specificity but also supports farmers in effective disease management, ultimately promoting sustainable agricultural practices. The study emphasizes the importance of Explainable AI (X-AI) in agriculture, suggesting that future research could expand the model’s applicability to other crops and integrate IoT technologies for real-time disease detection. Challenges such as the need for site-specific computing resources and the limitations of current datasets highlight the necessity for further studies to enhance model reliability and user accessibility in diverse agricultural contexts.

Introduction

The introduction of this research paper highlights the critical role of corn as a staple food crop essential for global food security and agricultural economies. Despite its resilience to various diseases, corn is significantly impacted by pathogens that can diminish both yield and quality, leading to economic repercussions for farmers and increased food costs. The paper emphasizes the importance of early and accurate disease identification, particularly for leaf diseases such as Northern Corn Leaf Blight, Grey Leaf Spot, and Common Rust, which are prevalent across different geographical regions.

Advancements in technology, particularly through deep learning (DL) and artificial intelligence (AI), have transformed disease detection from manual inspection to automated systems. The study proposes an efficient and interpretable DL model utilizing ResNet152 architecture, which incorporates residual connections to enhance training and improve disease detection accuracy. The model achieves a training accuracy of 99.95% and a testing accuracy of 98.34%, with the use of Grad-CAM for interpretability, allowing farmers to understand which parts of the corn leaves are most affected. This research aims to mitigate crop loss and promote sustainable farming practices, thereby contributing to global food security.

Methods

In this study, a deep learning-based model is proposed for the automated categorization of plant leaf diseases into four distinct classes: H, LB, CR, and GLS. The methodology begins with data augmentation, followed by duplicate deletion, image resizing, and format modification, all aimed at enhancing data quality. The feature extraction and classification are performed using a pre-trained ResNet152 model, optimized with the Adam algorithm, employing a learning rate of 0.0001, a batch size of 32, and 50 epochs.

Model performance is evaluated using standard metrics on test data to ensure reliability and consistency. Additionally, the study incorporates explainable artificial intelligence techniques, specifically Gradient-weighted Class Activation Mapping (Grad-CAM), to visualize the critical features influencing model performance and classification outcomes. The proposed model aims to provide clear and effective detection of plant diseases, thereby contributing to improved agricultural practices. Figure 1 illustrates the overall architecture of the proposed model.

Results

The results of the study on the ResNet152-based model for detecting corn leaf diseases reveal a comprehensive evaluation of its performance through various metrics. Key performance indicators include sensitivity (Se), specificity (Sp), accuracy (Acc), precision (Pr), true positive rate (TPR), false positive rate (FPR), and misclassification rate (MR). The model successfully classified images into four disease categories: Bacterial Blight (B), Common Rust (Cr), Gray Leaf Spot (GLS), and Healthy (H). The confusion matrix indicated that the model achieved a testing accuracy of 93.69%, with notable correct classifications and some misclassifications, particularly between B and GLS.

To enhance the model’s robustness, a fivefold cross-validation method was employed, ensuring a stable and consistent performance across different training subsets. The incorporation of Gradient-weighted Class Activation Mapping (Grad-CAM) provided valuable insights into the model’s decision-making process, highlighting disease-specific regions in the images that influenced predictions. While the model demonstrated effective feature extraction for most cases, misclassifications revealed challenges in distinguishing subtle differences between similar diseases. Recommendations for future improvements include refining feature clarity through techniques such as noise reduction and dataset augmentation to enhance the model’s accuracy and interpretability in agricultural diagnostics.

Discussion

The discussion highlights significant limitations in current deep learning models for plant disease detection, particularly their lack of interpretability, which impedes their adoption in agricultural practices. While models like DenseViT and YOLOv8 excel in accuracy, they fail to provide insights into their decision-making processes, leaving farmers and agronomists unable to trust or effectively utilize these technologies. Additionally, lightweight models such as MobileNetV3 and EfficientNet, although efficient in controlled environments, struggle with the variability of real-world agricultural settings. The authors argue for the necessity of a more interpretable model that maintains high accuracy, leading to the selection of ResNet152, which offers superior feature extraction capabilities due to its deeper architecture and residual connections.

The integration of Grad-CAM with ResNet152 addresses the interpretability issue by generating heatmaps that visually represent the areas of the input images that influence the model’s predictions. This feature is crucial for end-users, as it enhances their understanding of the model’s focus on diseased areas versus irrelevant background, thereby fostering trust and facilitating informed decision-making in disease management. The proposed model, trained on a dataset of 4,188 corn leaf images, demonstrates a robust approach to disease classification while ensuring that the results are interpretable, ultimately meeting the demands of precision agriculture. The authors conclude that this combination of ResNet152 and Grad-CAM not only improves accuracy but also bridges the gap between AI technology and practical agricultural applications.