الكشف عن أمراض أوراق النباتات باستخدام محولات الرؤية للزراعة الدقيقة Plant leaf disease detection using vision transformers for precision agriculture

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-05102-0
PMID: https://pubmed.ncbi.nlm.nih.gov/40594191
تاريخ النشر: 2025-07-01
المؤلف: S. Murugavalli وآخرون
الموضوع الرئيسي: الزراعة الذكية والذكاء الاصطناعي

نظرة عامة

تقدم البحث نظام تحليل أوراق الدقة مع المحولات البصرية (PLA-ViT)، الذي يستفيد من المحولات البصرية (ViTs) للكشف المبكر وتصنيف أمراض الأوراق، مما يعالج قيود الطرق التقليدية التي تتطلب مجموعات بيانات كبيرة وعرضة للتكيف الزائد. من خلال استخدام آليات الانتباه الذاتي، تعزز ViTs قدرة النموذج على التقاط المعلومات السياقية العالمية، مما يؤدي إلى تحسين الدقة في الكشف عن الأمراض وتحديد مواقعها. يتضمن نهج PLA-ViT تحسين البيانات، والتطبيع، والترشيح الثنائي لتحسين جودة الصورة والتعميم. بالإضافة إلى ذلك، يعمل التعلم بالنقل مع ViTs المدربة مسبقًا على تحسين الكفاءة الحسابية واستخراج الميزات، مما يؤدي إلى أداء متفوق مقارنةً بنماذج الشبكات العصبية الأخرى.

يخلص الدراسة إلى أن PLA-ViT يعزز الزراعة الدقيقة بشكل كبير من خلال تمكين المراقبة في الوقت الحقيقي من خلال تكامل إنترنت الأشياء، مما يسمح للمزارعين بالاستجابة بسرعة لتهديدات الآفات والضغوط غير الحيوية. بينما يظهر النموذج قدرات قوية في تحديد الأمراض ومراقبة صحة المحاصيل، فإنه يحدد أيضًا الحاجة إلى تحسينات إضافية، لا سيما في دمج كتل الانتباه مع كتل الالتفاف لتعزيز سرعة ودقة التنبؤ. تشمل اتجاهات البحث المستقبلية تطوير شبكات تحديد المواقع للأجسام لتحديد أوراق النباتات في الصور الكبيرة ومعالجة تحديات التصنيف متعدد التسميات، إلى جانب إنشاء مجموعات بيانات موثقة جيدًا لدعم هذه التطورات.

الطرق

في هذا القسم، يناقش المؤلفون تقنيات متقدمة وطرق تجميع للكشف عن أمراض النباتات، مع التركيز على نماذج التعلم العميق المختلفة. أظهر خوارزمية الطائر الطنان المعزز المتقاطع القائم على AX-RetinaNet (CAHA-AXRNet) دقة مثيرة للإعجاب تبلغ 98.1% في الكشف عن أمراض نبات الأرز، متفوقًا على الطرق الحالية. بالإضافة إلى ذلك، حقق نموذج Gan لأوراق المحاصيل (CLGan) مع الشبكات العصبية الالتفافية العميقة (CNNs) دقة تدريب تبلغ 99.97% ودقة اختبار تبلغ 99.04% لتحديد أمراض أوراق الذرة، مع معالجة التحديات مثل عدم استقرار التدريب ومشكلة التدرج المتلاشي من خلال وظيفة خسارة محسنة.

يسلط القسم الضوء أيضًا على شبكة دمج الميزات متعددة المستويات (MFFN) لتصنيف أمراض نبات الطماطم، والتي حققت دقة تدريب وتحقق تبلغ 99.88% ودقة اختبار خارجية تبلغ 99.83%. يتضمن هذا النموذج آلية انتباه تكيفية لإدارة عدم التوازن في البيانات والأعراض المعقدة بشكل فعال. استخدم الإعداد التجريبي منصة سحابية للتعلم العميق بمواصفات قوية، بما في ذلك وحدة معالجة الرسوميات NVIDIA GeForce RTX 3090. يؤكد المؤلفون على مزايا المحولات البصرية (ViTs) مقارنةً بالشبكات العصبية الالتفافية التقليدية، مشيرين إلى قدرتها على التقاط الاعتمادات بعيدة المدى وتوفير تفسير أفضل ومرونة وقابلية للتوسع لتطبيقات الزراعة الدقيقة.

النتائج

يقدم الدراسة نظام تحليل أوراق الدقة مع المحولات البصرية (PLA-ViT)، المصمم للتنبؤ الذاتي بأمراض الأوراق والمراقبة في الزراعة الدقيقة. تم تقييم أداء نموذج PLA-ViT مقابل عدة نماذج موجودة، بما في ذلك HRF-MCSVM، DLMC-Net، CycleGAN، وESDNN. كانت المقاييس الرئيسية للمقارنة تشمل الدقة، تحديد مواقع المرض، وقت الاستدلال، وتعقيد الحسابات، مما يبرز فعالية وكفاءة نظام PLA-ViT في التطبيقات الزراعية.

المناقشة

تستعرض قسم المناقشة من ورقة البحث المنهجيات الحالية للكشف عن أمراض أوراق النباتات، مع تسليط الضوء على نقاط قوتها وضعفها. تشير إلى اتجاه كبير نحو استخدام تقنيات التعلم الآلي ومعالجة الصور لتحديد الأمراض بشكل آلي. أظهرت نماذج هجينة مختلفة، مثل آلة الدعم متعددة الفئات (HRF-MCSVM) وكيس الكلمات مع ميزات قوية معززة (SURF)، دقة تصنيف عالية عبر مجموعات بيانات متعددة، حيث حقق بعضها أكثر من 97% دقة. كما أظهرت أساليب التعلم العميق، بما في ذلك نموذج التصنيف متعدد الفئات خفيف الوزن (DLMC-Net) وCycleGAN، وعدًا، مع دقة تتراوح من 83.12% إلى 99.50% لمحاصيل مختلفة.

يؤكد القسم على قيود الطرق التقليدية، التي غالبًا ما تكون كثيفة العمالة وبطيئة، ويبرز الحاجة إلى أنظمة قابلة للتوسع وآلية يمكن أن تعمل بفعالية على أجهزة ذات موارد منخفضة. يعالج إطار عمل تحليل أوراق الدقة مع المحولات البصرية (PLA-ViT) هذه التحديات من خلال الاستفادة من آليات الانتباه الذاتي للمحولات البصرية لتعزيز استخراج الميزات وتحديد مواقع الأمراض. تشير النتائج التجريبية إلى أن PLA-ViT يتفوق على النماذج الحالية من حيث الكفاءة الحسابية والدقة، مما يجعله حلاً قابلاً للتطبيق للنشر في الوقت الحقيقي في الزراعة الدقيقة. كما تعزز دمج تقنيات متقدمة مثل تحسين البيانات والتعلم التكيفي من قوة النموذج ضد مجموعات البيانات المتنوعة وغير المتوازنة.

القيود

يسلط قسم القيود في ورقة البحث الضوء على عدة تحديات مرتبطة بأساليب التعلم الآلي (ML) والتعلم العميق (DL) الحالية للكشف عن أمراض أوراق النباتات. بينما تستعرض الدراسة 160 ورقة بحثية تشمل نماذج مختلفة، بما في ذلك التقنيات الهجينة وطرق الكشف في الوقت الحقيقي، تحدد قضايا كبيرة مثل التكاليف الحسابية العالية، الاعتماد على مجموعات بيانات كبيرة، والصعوبات في التعميم على أمراض جديدة. تكافح العديد من النماذج الحالية مع حساسية الضوضاء وليست مناسبة جيدًا للتنفيذ على نطاق واسع في الزراعة الذكية المعتمدة على إنترنت الأشياء بسبب عدم قدرتها على التكيف مع الظروف الواقعية.

لمعالجة هذه القيود، تقدم الدراسة إطار عمل تحليل أوراق الدقة مع المحولات البصرية (PLA-ViT)، الذي يستفيد من المحولات البصرية (ViT) لتحسين استخراج الميزات وتصنيف الأمراض. يوفر PLA-ViT دقة ومرونة محسنتين مقارنةً بالنماذج التقليدية القائمة على CNN من خلال التقاط العلاقات المكانية المعقدة والاعتمادات بعيدة المدى في صور الأوراق بشكل فعال. علاوة على ذلك، تعزز البنية المعتمدة على المحولات من قابلية التفسير وتقلل من حساسية الضوضاء، مما يمكّن من مراقبة الأمراض في الوقت الحقيقي مع عبء حسابي أقل عند دمجه في أنظمة الزراعة المعتمدة على إنترنت الأشياء.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-05102-0
PMID: https://pubmed.ncbi.nlm.nih.gov/40594191
Publication Date: 2025-07-01
Author(s): S. Murugavalli et al.
Primary Topic: Smart Agriculture and AI

Overview

The research presents the Precision Leaf Analysis with Vision Transformers (PLA-ViT) system, which leverages Vision Transformers (ViTs) for the early detection and classification of leaf diseases, addressing the limitations of traditional methods that require extensive datasets and are prone to overfitting. By utilizing self-attention mechanisms, ViTs enhance the model’s ability to capture global contextual information, leading to improved accuracy in disease detection and localization. The PLA-ViT approach incorporates data augmentation, normalization, and bilateral filtering to enhance image quality and generalization. Additionally, transfer learning with pre-trained ViTs optimizes computational efficiency and feature extraction, resulting in superior performance compared to other neural network models.

The study concludes that PLA-ViT significantly enhances precision agriculture by enabling real-time monitoring through IoT integration, allowing farmers to respond promptly to pest threats and abiotic stresses. While the model demonstrates strong capabilities in disease identification and crop health monitoring, it also identifies the need for further improvements, particularly in integrating attention blocks with convolutional blocks to enhance predictive speed and accuracy. Future research directions include developing object localization networks for identifying plant leaves in large images and addressing multi-label classification challenges, alongside the creation of well-annotated datasets to support these advancements.

Methods

In this section, the authors discuss advanced techniques and ensemble methods for plant disease detection, focusing on various deep learning models. The Crossover Boosted Artificial Hummingbird Algorithm-based AX-RetinaNet (CAHA-AXRNet) demonstrated an impressive accuracy of 98.1% in detecting rice plant diseases, outperforming existing methods. Additionally, the Crop Leaf Gan (CLGan) combined with deep convolutional neural networks (CNNs) achieved a training accuracy of 99.97% and a testing accuracy of 99.04% for maize leaf disease identification, addressing challenges such as training instability and the vanishing gradient problem through an enhanced loss function.

The section also highlights the Multilevel Feature Fusion Network (MFFN) for tomato plant disease classification, which achieved training and validation accuracies of 99.88% and an external testing accuracy of 99.83%. This model incorporates an Adaptive Attention Mechanism to effectively manage data imbalances and complex symptoms. The experimental setup utilized a deep learning cloud platform with robust specifications, including an NVIDIA GeForce RTX 3090 GPU. The authors emphasize the advantages of Vision Transformers (ViTs) over traditional CNNs, noting their ability to capture long-range dependencies and provide better interpretability, flexibility, and scalability for precision agriculture applications.

Results

The study presents the Precision Leaf Analysis with Vision Transformers (PLA-ViT) system, designed for autonomous prediction of leaf diseases and monitoring in precision agriculture. The performance of the PLA-ViT model was evaluated against several existing models, including HRF-MCSVM, DLMC-Net, CycleGAN, and ESDNN. Key metrics for comparison included accuracy, illness localization, inference time, and computational complexity, highlighting the effectiveness and efficiency of the PLA-ViT system in agricultural applications.

Discussion

The discussion section of the research paper reviews existing methodologies for plant leaf disease detection, highlighting both their strengths and weaknesses. It notes a significant trend towards the use of machine learning and image processing techniques for automated disease identification. Various hybrid models, such as the Hybrid Random Forest Multi-class Support Vector Machine (HRF-MCSVM) and Bag-of-Words with Speeded Up Robust Features (SURF), have demonstrated high classification accuracies across multiple datasets, with some achieving over 97% accuracy. Deep learning approaches, including the Deeper Lightweight Multi-class Classification Model (DLMC-Net) and CycleGAN, have also shown promise, with accuracies ranging from 83.12% to 99.50% for different crops.

The section emphasizes the limitations of traditional methods, which are often labor-intensive and slow, and highlights the need for scalable, automated systems that can operate effectively on low-resource devices. The proposed Precision Leaf Analysis with Vision Transformers (PLA-ViT) framework addresses these challenges by leveraging the self-attention mechanisms of Vision Transformers to enhance feature extraction and disease localization. Experimental results indicate that PLA-ViT outperforms existing models in terms of computational efficiency and accuracy, making it a viable solution for real-time deployment in precision agriculture. The integration of advanced techniques such as data augmentation and adaptive learning further enhances the model’s robustness against diverse and unbalanced datasets.

Limitations

The limitations section of the research paper highlights several challenges associated with existing machine learning (ML) and deep learning (DL) approaches for plant leaf disease detection. While the study reviews 160 research papers encompassing various models, including hybrid techniques and real-time detection methods, it identifies significant issues such as high computational costs, reliance on large datasets, and difficulties in generalizing to new diseases. Many existing models struggle with noise susceptibility and are not well-suited for large-scale implementation in IoT-based smart agriculture due to their inability to adapt to real-world conditions.

To address these limitations, the study introduces the Precision Leaf Analysis with Vision Transformers (PLA-ViT) framework, which leverages Vision Transformers (ViT) for enhanced feature extraction and disease classification. PLA-ViT offers improved accuracy and flexibility compared to traditional CNN-based models by effectively capturing complex spatial relationships and long-range dependencies in leaf images. Furthermore, the transformer-based architecture enhances interpretability and reduces noise sensitivity, enabling real-time disease monitoring with lower computational overhead when integrated into IoT-based agricultural systems.