إطار عمل التعلم العميق القائم على YOLO لمراقبة صحة النباتات متعددة الفئات في الوقت الحقيقي في الزراعة الدقيقة YOLO-based deep learning framework for real-time multi-class plant health monitoring in precision agriculture

المجلة: Scientific Reports، المجلد: 16، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-29132-w
PMID: https://pubmed.ncbi.nlm.nih.gov/41484288
تاريخ النشر: 2026-01-03
المؤلف: Anurag Rana وآخرون
الموضوع الرئيسي: الزراعة الذكية والذكاء الاصطناعي

نظرة عامة

تقدم البحث نهجًا قويًا للكشف عن صحة أوراق النباتات باستخدام أطر كشف الكائنات YOLOv8 و YOLOv11. من خلال استخدام نماذج مسبقة التدريب تم تعزيزها من خلال التعلم الانتقالي والتخصيص الدقيق الخاص بالمجال، حققت الدراسة متوسط دقة (mAP) يتجاوز 92% على مجموعة بيانات التحقق. أدى تنفيذ تقنيات تعزيز البيانات المتقدمة إلى تعزيز مرونة النماذج بشكل كبير تجاه التغيرات في حجم الورقة، والاتجاه، والإضاءة، مما أدى إلى دقة واسترجاع قويين عبر ثلاث فئات صحية: أوراق صحية، وأوراق متوترة، وأوراق تالفة. أظهرت النماذج سرعات استدلال واعدة على وحدات معالجة الرسوميات الحديثة، مما يشير إلى إمكانيتها للنشر في الوقت الحقيقي في السياقات الزراعية.

كما أجرت الدراسة تحليلًا شاملًا للأخطاء، كاشفة عن تحديات تتعلق بالاحتجاب والأعراض البصرية الغامضة، مما يشير إلى ضرورة دمج بيانات إضافية أو معلومات زمنية في الأبحاث المستقبلية. تشمل المساهمات الرئيسية لهذا العمل شفافية مجموعة البيانات مع إصدار مجموعة فرعية عامة، وتقييم مقارن ضد المعايير التقليدية والحديثة، وتحليل صارم للأخطاء وعدم اليقين. على الرغم من القيود مثل تنوع المحاصيل المحدود وعدم توازن الفئات، تؤكد النتائج على جدوى وقيمة الأنظمة المعتمدة على التعلم العميق للزراعة الدقيقة. ستركز الاتجاهات المستقبلية على توسيع مجموعات البيانات، ودمج الصور متعددة الطيف، والتحقيق في هياكل المحولات الخفيفة لتعزيز تعميم النموذج.

طرق

تحدد ورقة البحث منهجية شاملة لكشف الكائنات، مع التركيز على تطور تقنيات الكشف من الطرق التقليدية المعتمدة على الميزات المصنوعة يدويًا إلى الأساليب الحديثة المعتمدة على التعلم العميق. واجهت التقنيات المبكرة، مثل cascades Haar و Histogram of Oriented Gradients (HOG)، تحديات في الدقة والموثوقية ضد التغيرات في الظروف البيئية. أدت مقدمة CNNs المعتمدة على المناطق (R-CNNs) إلى تقدم كبير، مما أدى إلى تحسين الدقة من خلال اقتراحات مناطق المرشحين. عززت النماذج اللاحقة مثل Faster R-CNN و YOLO قدرات الكشف، حيث قامت إصدارات YOLO المختلفة (من YOLOv2 إلى YOLOv8) بتحسين الأداء للتطبيقات في الوقت الحقيقي. ومن الجدير بالذكر أن محولات الرؤية (ViTs) ظهرت كبدائل واعدة للتطبيقات المحمولة، كما يتضح من إطار عمل ViT-SmartAgri.

تتفصل قسم المنهجية عن سير عمل منظم لتنفيذ نموذج YOLOv8، بدءًا من إعداد مجموعة البيانات التي تشمل جمع الصور، والتنظيف، والتعزيز، والتقسيم إلى مجموعات تدريب، والتحقق، واختبار. يتم استخدام شبكة YOLO مسبقة التدريب، مع إجراء تعديلات على الطبقات النهائية لاستيعاب الفئات المستهدفة المحددة. يتم تعريف المعلمات الفائقة الرئيسية لتوجيه عملية التدريب، والتي يتم مراقبتها لمنع الإفراط في التكيف من خلال التوقف المبكر بناءً على فقدان التحقق. يتم تقييم أداء النموذج باستخدام مقاييس مثل الدقة، والاسترجاع، ومتوسط الدقة (mAP)، مدعومة بمصفوفات الالتباس ومنحنيات الدقة والاسترجاع لتحليل شامل لدقة الكشف. تم استخدام إعداد تجريبي يعتمد على Python و PyTorch ووحدة معالجة الرسوميات NVIDIA RTX A6000، مما يضمن بيئة قوية لتدريب النموذج وتقييمه.

النتائج

تشير النتائج إلى أن YOLOv11 يتفوق على YOLOv8 في الدقة، والاسترجاع، ومتوسط الدقة (mAP) عبر جميع المقاييس التي تم تقييمها. تظهر كلا النموذجين تحسينات كبيرة بسبب تقنيات التعلم الانتقالي وتعزيز البيانات. بينما يوفر YOLOv8 أوقات استدلال أسرع، يحقق YOLOv11 دقة أكبر، على الرغم من زيادة طفيفة في الطلب الحاسوبي.

بالإضافة إلى ذلك، يكشف التحليل أن الأخطاء في التصنيف تحدث بشكل رئيسي بين الفئات المتشابهة بصريًا من النباتات ‘المتوترة’ و ‘التالفة’. على الرغم من هذا التحدي، تظهر كل من YOLOv8 و YOLOv11 إمكانات كبيرة للتطبيق في أنظمة مراقبة صحة النباتات في الوقت الحقيقي، مما يوفر قدرات كشف موثوقة وفعالة.

المناقشة

تسلط قسم المناقشة في ورقة البحث الضوء على عدة تقدمات هامة في مراقبة صحة النباتات من خلال تقديم مجموعة بيانات متعددة الفئات شاملة وتطبيق نماذج YOLO. تتضمن مجموعة البيانات تعليقات توضيحية لصناديق الحدود عالية الدقة لصور النباتات الصحية، والمتوترة، والتالفة، مما يعزز القدرة على مراقبة صحة المحاصيل بشكل فعال. كما توضح البحث تفاصيل تخصيص هياكل YOLO، وخاصة YOLOv8 و YOLOv11، مع التركيز على كفاءتها في مهام كشف الكائنات في الوقت الحقيقي المناسبة للتطبيقات الزراعية. يوفر الدراسة خط أنابيب معياري لمعالجة مجموعة البيانات، والتدريب، والتقييم، مما يسهل الأبحاث المستقبلية والتكرار.

علاوة على ذلك، تتناول الورقة قيود الطرق السابقة في كشف أمراض النباتات، التي غالبًا ما اعتمدت على تصنيف فئة واحدة أو افتقرت إلى مجموعات بيانات قوية. من خلال استخدام تقنيات تعزيز البيانات المتقدمة والتعلم الانتقالي، تعزز الدراسة قدرات تعميم النموذج عبر ظروف زراعية متنوعة. تظهر النتائج أن YOLOv11 يتفوق على أسلافه في كل من دقة الكشف وكفاءة الاستدلال، محققًا تحسينًا ذا دلالة إحصائية في متوسط الدقة (mAP). لا يساهم هذا البحث فقط في مجال الزراعة الدقيقة ولكن أيضًا يضع أساسًا لاستكشافات مستقبلية في الكشف متعدد الفئات عن حالات صحة النباتات.

Journal: Scientific Reports, Volume: 16, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-29132-w
PMID: https://pubmed.ncbi.nlm.nih.gov/41484288
Publication Date: 2026-01-03
Author(s): Anurag Rana et al.
Primary Topic: Smart Agriculture and AI

Overview

The research presents a robust approach to plant leaf health detection utilizing the YOLOv8 and YOLOv11 object detection frameworks. By employing pretrained models enhanced through transfer learning and domain-specific fine-tuning, the study achieved a mean Average Precision (mAP) exceeding 92% on the validation dataset. The implementation of advanced data augmentation techniques significantly bolstered the models’ resilience to variations in leaf size, orientation, and lighting, resulting in strong precision and recall across three health categories: healthy, stressed, and damaged leaves. The models demonstrated promising inference speeds on modern GPUs, indicating their potential for real-time deployment in agricultural contexts.

The study also conducted a thorough error analysis, revealing challenges related to occlusion and ambiguous visual symptoms, which suggests the necessity for integrating additional data modalities or temporal information in future research. Key contributions of this work include the transparency of the dataset with a public subset release, a comparative evaluation against classical and modern baselines, and a rigorous analysis of errors and uncertainties. Despite limitations such as restricted crop diversity and class imbalance, the findings underscore the feasibility and practical value of deep learning-based systems for precision agriculture. Future directions will focus on expanding datasets, incorporating multispectral imagery, and investigating lightweight Transformer architectures to enhance model generalization.

Methods

The research paper outlines a comprehensive methodology for object detection, emphasizing the evolution of detection techniques from traditional methods reliant on handcrafted features to modern deep learning approaches. Early techniques, such as Haar cascades and Histogram of Oriented Gradients (HOG), faced challenges in accuracy and robustness against variations in environmental conditions. The introduction of Region-based CNNs (R-CNNs) marked a significant advancement, leading to improved accuracy through candidate region proposals. Subsequent models like Faster R-CNN and YOLO have further enhanced detection capabilities, with YOLO’s various iterations (YOLOv2 to YOLOv8) optimizing performance for real-time applications. Notably, Vision Transformers (ViTs) have emerged as promising alternatives for mobile applications, exemplified by the ViT-SmartAgri framework.

The methodology section details a structured workflow for implementing the YOLOv8 model, beginning with dataset preparation that includes image collection, cleaning, augmentation, and partitioning into training, validation, and testing subsets. A pretrained YOLO network is utilized, with adjustments made to its final layers to accommodate specific target classes. Key hyperparameters are defined to guide the training process, which is monitored to prevent overfitting through early stopping based on validation loss. The evaluation of model performance is conducted using metrics such as precision, recall, and mean average precision (mAP), supplemented by confusion matrices and precision-recall curves for a thorough analysis of detection accuracy. The experimental setup employed Python, PyTorch, and an NVIDIA RTX A6000 GPU, ensuring a robust environment for model training and evaluation.

Results

The results indicate that YOLOv11 surpasses YOLOv8 in precision, recall, and mean Average Precision (mAP) across all evaluated metrics. Both models exhibit substantial improvements due to transfer learning and data augmentation techniques. While YOLOv8 offers faster inference times, YOLOv11 achieves greater accuracy, albeit with a minor increase in computational demand.

Additionally, the analysis reveals that misclassifications predominantly occur between the visually similar classes of ‘stressed’ and ‘damaged’ plants. Despite this challenge, both YOLOv8 and YOLOv11 show considerable potential for application in real-time plant health monitoring systems, providing reliable and efficient detection capabilities.

Discussion

The discussion section of the research paper highlights several significant advancements in plant health monitoring through the introduction of a comprehensive multi-class dataset and the application of YOLO models. The dataset includes high-precision bounding box annotations for images of healthy, stressed, and damaged plants, which enhances the ability to monitor crop health effectively. The research also details the fine-tuning of YOLO architectures, particularly YOLOv8 and YOLOv11, emphasizing their efficiency in real-time object detection tasks suitable for agricultural applications. The study provides a modular pipeline for dataset preprocessing, training, and evaluation, facilitating future research and replication.

Furthermore, the paper addresses the limitations of previous methods in plant disease detection, which often relied on single-class classification or lacked robust datasets. By employing advanced data augmentation and transfer learning techniques, the study enhances the model’s generalization capabilities across diverse agricultural conditions. The results demonstrate that YOLOv11 outperforms its predecessors in both detection accuracy and inference efficiency, achieving a statistically significant improvement in mean Average Precision (mAP). This research not only contributes to the field of precision agriculture but also sets a foundation for future explorations in multi-class detection of plant health conditions.