LeafSightX: نموذج دمج CNN معزز بالانتباه القابل للتفسير لتحديد أمراض أوراق التفاح LeafSightX: an explainable attention-enhanced CNN fusion model for apple leaf disease identification

المجلة: Frontiers in Artificial Intelligence، المجلد: 8
DOI: https://doi.org/10.3389/frai.2025.1689865
PMID: https://pubmed.ncbi.nlm.nih.gov/41693760
تاريخ النشر: 2026-01-30
المؤلف: Md. Ehsanul Haque وآخرون
الموضوع الرئيسي: الزراعة الذكية والذكاء الاصطناعي

نظرة عامة

تقدم البحث LeafSightX، وهو هيكل معماري جديد للتعلم العميق ذو عمودين مصمم للتعرف السريع والدقيق على أمراض أوراق التفاح، مما يعالج التحديات الكبيرة في الزراعة الدقيقة مثل محدودية القابلية للتعميم، وقابلية التفسير، والحساسية تجاه التحولات في المجال. من خلال دمج الميزات من DenseNet201 و InceptionV3 مع تقنيات الانتباه الذاتي متعدد الرؤوس (MHSA)، يعزز LeafSightX كل من القدرة التمثيلية والتفكير في السياق المكاني. يستخدم النموذج استراتيجية معالجة مسبقة متخصصة وتكبير بيانات محدود، مما يساهم في قوته ومرونته عبر سيناريوهات مختلفة. من المهم أنه يتضمن طرق الذكاء الاصطناعي القابلة للتفسير، باستخدام تصورات Grad-CAM لتعزيز الشفافية.

تظهر التقييمات التجريبية على مجموعة بيانات لأمراض أوراق التفاح من خمس فئات أن LeafSightX يحقق مقاييس أداء ملحوظة، بما في ذلك دقة اختبار تبلغ 99.64%، ودرجة F1 تبلغ 0.9962، ودرجات AUC و PR-AUC المثالية 1.000، متفوقًا بشكل كبير على الشبكات العصبية التلافيفية (CNNs) الأساسية. تشير نتائج التحقق المتقاطع إلى مستوى عالٍ من الاتساق التنبؤي، مع متوسط كابا كوهين يبلغ 0.9917. كما يظهر الإطار أيضًا قدرات استدلال في الوقت الحقيقي، مما يجعله مناسبًا للنشر على الأجهزة الطرفية، خاصة في المناطق الريفية ذات الموارد المحدودة. بينما يظهر النموذج قوة استثنائية وقابلية للتعميم، حيث حقق دقة اختبار تبلغ 99.69% على مجموعة بيانات مستقلة، قد يحد تدريبه على بيانات محددة بالمنطقة من قابليته للتطبيق في سياقات جغرافية أخرى. ستركز الأعمال المستقبلية على تعزيز قدرات LeafSightX من خلال مجموعات بيانات أكبر وتكييفه مع الديناميات الزمنية للعدوى الورقية، بهدف تحسين إدارة أمراض أوراق التفاح في الزراعة الدقيقة.

مقدمة

تؤكد مقدمة ورقة البحث على الدور الحاسم للزراعة في الأمن الغذائي العالمي، مع تسليط الضوء على الأهمية التجارية لإنتاج التفاح (Malus domestica)، الذي يهدده العديد من الأمراض الورقية مثل بقعة أوراق Alternaria والصدأ. لا تؤدي هذه الأمراض فقط إلى خسائر كبيرة في المحاصيل ولكنها تؤثر أيضًا سلبًا على جودة الفاكهة، مما يستلزم تشخيصًا مبكرًا ودقيقًا لإدارة فعالة للأمراض. تعتبر طرق الفحص اليدوي التقليدية، على الرغم من انتشارها، كثيفة العمالة وذات طابع شخصي، مما يؤدي غالبًا إلى عدم الكفاءة في البساتين الكبيرة حيث تكون توفر الخبراء محدودًا.

لمعالجة هذه التحديات، يقترح المؤلفون LeafSightX، وهو إطار متقدم للتعلم العميق مصمم للكشف التلقائي عن أمراض أوراق التفاح. يدمج LeafSightX هياكل DenseNet201 و InceptionV3 مع آلية الانتباه الذاتي متعدد الرؤوس، مما يعزز قدرته على التقاط كل من الميزات المحلية والسياقية. يستخدم النموذج خط أنابيب معالجة مسبقة شامل لتحسين جودة الصورة والتخفيف من التغيرات البيئية، إلى جانب تصور Grad-CAM من أجل القابلية للتفسير. تدعي الدراسة أن LeafSightX يتفوق على الطرق الحالية من حيث الدقة والموثوقية والكفاءة الحاسوبية، مما يجعله أداة قيمة للزراعة الدقيقة. توضح الورقة هيكله، مشيرة إلى أن الأقسام التالية ستتناول مراجعة الأدبيات، ومنهجية مفصلة، ونتائج تجريبية، واستنتاجات.

طرق

تم هيكلة المنهجية المقترحة للكشف عن أمراض أوراق التفاح كخط أنابيب منهجي مصمم لتحديد وتصنيف حالات مختلفة من أوراق التفاح بدقة. يتم تمثيل سير العمل لهذا الإطار بصريًا في الشكل 2. تشمل المراحل الرئيسية للمنهجية جمع البيانات، والمعالجة المسبقة والتكبير، وتشخيص الأمراض، وتطوير النماذج الأساسية والمقترحة، والتدريب، وتقييم الأداء. تلعب كل مرحلة دورًا حاسمًا في تعزيز الفعالية العامة لعملية الكشف عن الأمراض.

نتائج

في هذا القسم، يتم تحليل أداء نموذج LeafSightX بشكل منهجي مقارنة بالمعايير المعمول بها، مع التركيز على دقته، وقوته، وقابلية تفسيره في سياق تشخيص الأمراض النباتية التلقائي. تشير النتائج إلى أن LeafSightX يتفوق على الهياكل الأخرى للتعلم العميق، حيث حقق دقة اختبار تبلغ 0.9969 ودرجة F1 تبلغ 0.9970، كما هو موضح في الجدول 17. يتم تأكيد هذا الأداء من خلال قيم AUC العالية (0.9999) و PR AUC (1.0000)، مما يدل على قدرات النموذج القوية في التعميم عبر مجموعات التدريب والتحقق والاختبار.

بينما أظهرت DenseNet201 و InceptionV3 أيضًا أداءً مشرفًا مع دقة اختبار تتجاوز 0.99، إلا أنها أظهرت دقة واسترجاع أقل قليلاً، مما يشير إلى عدم توازن طفيف في توقعات الفئات. تؤكد التفوق المستمر لـ LeafSightX عبر جميع المقاييس التي تم تقييمها قوته وقدرته على التكيف مع مجموعات البيانات الجديدة، مما يثبت قابليته للتوسع وفعاليته في تطبيقات الذكاء الاصطناعي الزراعية في العالم الحقيقي.

مناقشة

تسلط المناقشة الضوء على التحديات المستمرة في تحديد أمراض أوراق التفاح بدقة ضمن الزراعة الدقيقة، مع التأكيد على الدور الكبير للتقدم في التعلم العميق. لقد أظهرت نماذج مختلفة، مثل A-Net و ELM-YOLOv8n، معدلات دقة متوسطة عالية (mAP)، حيث حقق ELM-YOLOv8n 96.7% mAP@0.5. ومع ذلك، تواجه العديد من هذه النماذج قيودًا في التكيف مع الظروف البيئية المتنوعة، والصلابة عبر مجموعات البيانات غير المتجانسة، وقابلية التفسير. من الجدير بالذكر أنه بينما حققت بعض النماذج دقة مثيرة للإعجاب، إلا أنها غالبًا ما تفتقر إلى طرق التحقق الصارمة، مثل التحقق المتقاطع، وتستخدم تقنيات معالجة مسبقة بسيطة تعيق فعاليتها في عزل الآفات المعقدة.

تحدد الورقة الفجوات الحرجة في الأدبيات الحالية، بما في ذلك الحاجة إلى تحسين القابلية للتعميم، والقدرة على التفسير، والكفاءة الحاسوبية في نماذج الكشف عن الأمراض. وتجادل بضرورة وجود طرق معالجة مسبقة متقدمة وطرق تحقق إحصائية لتعزيز موثوقية هذه الأنظمة وقابليتها للتطبيق في البيئات الزراعية الحقيقية. لمعالجة هذه القضايا، يقدم المؤلفون LeafSightX، وهو إطار جديد للتعلم العميق ذو عمودين يدمج عمليات الانتباه الذاتي متعدد الرؤوس (MHSA) من أجل تعزيز تمثيل الميزات ودقة التصنيف. تم التحقق من صحة هذا النموذج من خلال التحقق المتقاطع الشامل والاختبار الإحصائي، مما يضمن تحسينات كبيرة في الأداء مع التركيز أيضًا على القابلية للتفسير والنشر العملي في ظروف بيئية متنوعة.

Journal: Frontiers in Artificial Intelligence, Volume: 8
DOI: https://doi.org/10.3389/frai.2025.1689865
PMID: https://pubmed.ncbi.nlm.nih.gov/41693760
Publication Date: 2026-01-30
Author(s): Md. Ehsanul Haque et al.
Primary Topic: Smart Agriculture and AI

Overview

The research presents LeafSightX, a novel dual-backbone deep learning architecture designed for the rapid and accurate identification of apple leaf diseases, addressing significant challenges in precision agriculture such as limited generalizability, interpretability, and sensitivity to domain shifts. By integrating features from DenseNet201 and InceptionV3 with Multi-Head Self-Attention (MHSA) techniques, LeafSightX enhances both representational capability and spatial context reasoning. The model employs a specialized preprocessing and limited data augmentation strategy, which contributes to its robustness and resilience across various scenarios. Importantly, it incorporates explainable AI methods, utilizing Grad-CAM visualizations to enhance transparency.

Experimental evaluations on a five-class apple leaf disease dataset reveal that LeafSightX achieves remarkable performance metrics, including a test accuracy of 99.64%, an F1-score of 0.9962, and perfect AUC and PR-AUC scores of 1.000, significantly outperforming baseline convolutional neural networks (CNNs). Cross-validation results indicate a high level of predictive consistency, with a mean Cohen’s Kappa of 0.9917. The framework also demonstrates real-time inference capabilities, making it suitable for deployment on edge devices, particularly in resource-limited rural areas. While the model shows exceptional robustness and generalization, having achieved a test accuracy of 99.69% on an independent dataset, its training on region-specific data may limit applicability to other geographic contexts. Future work will focus on enhancing LeafSightX’s capabilities through larger datasets and adapting it to the temporal dynamics of leaf infections, ultimately aiming to improve the management of apple leaf diseases in precision agriculture.

Introduction

The introduction of the research paper emphasizes the critical role of agriculture in global food security, highlighting the commercial significance of apple (Malus domestica) production, which is threatened by various foliar diseases such as Alternaria leaf spot and rust. These diseases not only lead to substantial yield losses but also adversely affect fruit quality, necessitating early and accurate diagnosis for effective disease management. Traditional manual inspection methods, while prevalent, are labor-intensive and subjective, often leading to inefficiencies in large-scale orchards where expert availability is limited.

To address these challenges, the authors propose LeafSightX, an advanced deep learning framework designed for the automated detection of apple leaf diseases. LeafSightX integrates the architectures of DenseNet201 and InceptionV3 with a Multi-Head Self-Attention mechanism, enhancing its ability to capture both local and contextual features. The model employs a comprehensive preprocessing pipeline to improve image quality and mitigate environmental variations, alongside Grad-CAM visualization for interpretability. The study claims that LeafSightX surpasses existing methods in terms of accuracy, reliability, and computational efficiency, making it a valuable tool for precision agriculture. The paper outlines its structure, indicating that subsequent sections will cover a literature review, detailed methodology, experimental results, and conclusions.

Methods

The proposed methodology for detecting apple leaf disease is structured as a systematic pipeline designed to accurately identify and classify various states of apple leaves. The workflow of this framework is visually represented in Figure 2. Key phases of the methodology include data gathering, preprocessing and augmentation, disease diagnosis, the development of baseline and suggested models, training, and performance evaluation. Each phase plays a critical role in enhancing the overall effectiveness of the disease detection process.

Results

In this section, the performance of the LeafSightX model is systematically analyzed against established baselines, focusing on its accuracy, robustness, and interpretability in the context of automated plant disease diagnosis. The results indicate that LeafSightX outperforms other deep learning architectures, achieving a test accuracy of 0.9969 and an F1-score of 0.9970, as detailed in Table 17. This performance is corroborated by high AUC (0.9999) and PR AUC (1.0000) values, demonstrating the model’s strong generalization capabilities across training, validation, and test sets.

While DenseNet201 and InceptionV3 also exhibited commendable performance with test accuracies exceeding 0.99, they showed slightly lower precision and recall, indicating a minor imbalance in class predictions. The consistent superiority of LeafSightX across all evaluated metrics underscores its robustness and adaptability to new datasets, validating its scalability and effectiveness in real-world agricultural AI applications.

Discussion

The discussion highlights the ongoing challenges in accurately identifying apple leaf diseases within precision agriculture, emphasizing the significant role of deep learning advancements. Various models, such as A-Net and ELM-YOLOv8n, have demonstrated high mean average precision (mAP) rates, with ELM-YOLOv8n achieving 96.7% mAP@0.5. However, many of these models face limitations in adaptability to diverse environmental conditions, robustness across heterogeneous datasets, and interpretability. Notably, while some models achieved impressive accuracy, they often lacked rigorous validation methods, such as cross-validation, and employed simplistic preprocessing techniques that hindered their effectiveness in isolating complex lesions.

The paper identifies critical gaps in the current literature, including the need for improved generalizability, explainability, and computational efficiency in disease detection models. It argues for the necessity of advanced preprocessing and statistical validation methods to enhance the reliability and applicability of these systems in real-world agricultural settings. To address these issues, the authors introduce LeafSightX, a novel dual-backbone transfer learning framework that integrates Multi-Head Self-Attention (MHSA) operations for enhanced feature representation and classification accuracy. This model was validated through comprehensive cross-validation and statistical testing, ensuring significant improvements in performance while also focusing on interpretability and practical deployment in varied environmental conditions.