الكشف عن أمراض النباتات باستخدام محولات الرؤية Plant disease detection using vision transformers

المجلة: International Journal of Power Electronics and Drive Systems/International Journal of Electrical and Computer Engineering، المجلد: 15، العدد: 2
DOI: https://doi.org/10.11591/ijece.v15i2.pp2334-2344
تاريخ النشر: 2025-01-26
المؤلف: Ali Mhaned وآخرون
الموضوع الرئيسي: الزراعة الذكية والذكاء الاصطناعي

نظرة عامة

تتناول ورقة البحث التحدي الحاسم في الكشف عن أمراض النباتات، والذي يشكل مخاطر كبيرة على الأمن الغذائي العالمي واستدامة الزراعة. غالبًا ما تكون طرق الكشف التقليدية، مثل الفحص البصري والتقنيات المخبرية، غير فعالة وتفتقر إلى القابلية للتوسع. للتغلب على هذه القيود، تستكشف الورقة تطبيق المحولات البصرية (ViTs)، وهي نماذج متقدمة من التعلم الآلي تستخدم آليات الانتباه الذاتي لالتقاط الأنماط المعقدة في صور النباتات بشكل فعال. تسلط مراجعة الأدبيات الضوء على الاهتمام المتزايد بتقنيات التعلم العميق، وخاصة ViTs، لأتمتة تصنيف أمراض النباتات.

تظهر النتائج أن ViTs يمكن أن تحدد بدقة مجموعة متنوعة من أمراض النباتات عبر مجموعة بيانات متوازنة من 55 فئة، مما يبرز إمكاناتها في تعزيز الزراعة الدقيقة وتعزيز ممارسات الزراعة المستدامة. تؤكد الورقة على أهمية معالجة مجموعة البيانات في تحسين دقة النموذج وتقترح أن تركز الأبحاث المستقبلية على تحسين هياكل ViT، وتعزيز القابلية للتفسير، واستكشاف التطبيقات في العالم الحقيقي. بشكل عام، يمثل اعتماد ViTs تقدمًا كبيرًا في تكنولوجيا الزراعة، حيث يقدم حلولًا واعدة للتخفيف من تأثير أمراض النباتات ودعم الأمن الغذائي العالمي.

مقدمة

تسلط المقدمة الضوء على الدور الحاسم للزراعة في بقاء الإنسان، مشددة على مساهماتها في الغذاء، والعلف، والألياف، والوقود. وتؤكد على ضرورة ممارسات الزراعة المستدامة لضمان الأمن الغذائي، وتعزيز جودة البيئة، والحفاظ على الجدوى الاقتصادية للمزارع. تُقدم الزراعة المستدامة كنهج شامل يوازن بين العوامل الاقتصادية والاجتماعية والبيئية، مما يدعم الإنتاجية على المدى الطويل والحفاظ على الموارد.

تناقش هذه الفقرة أيضًا أهمية الكشف عن أمراض النباتات في الزراعة، موصلة إياه بعائد المحاصيل، والأمن الغذائي، والاستدامة الاقتصادية. وتشير إلى قيود الطرق التقليدية، التي تعتمد على الفحوصات البصرية من قبل الخبراء والتقنيات المخبرية، وتدعو إلى اعتماد تقنيات متقدمة مثل التعلم الآلي، والتعلم العميق، ورؤية الكمبيوتر. تعد هذه الأساليب الحديثة بتحسين الكفاءة والدقة في الكشف عن أمراض النباتات، مما يعزز النتائج الزراعية.

النتائج

تظهر نتائج الدراسة فعالية نموذج المحول البصري (ViT) في تصنيف أمراض النباتات من خلال بنية جديدة تجمع بين الترميز القائم على القطع مع طبقات المحولات. يقوم النموذج بمعالجة الصور المدخلة عن طريق تقسيمها إلى قطع بحجم 16×16 بكسل، والتي يتم بعد ذلك إسقاطها خطيًا وتعزيزها مع تضمينات موضعية للحفاظ على المعلومات المكانية. تستخدم طبقات TransformerEncoder اللاحقة آليات الانتباه الذاتي متعدد الرؤوس والشبكات الأمامية لالتقاط الاعتماديات المحلية والعالمية، مع معلمات فرعية تشمل ثمانية رؤوس انتباه، وحجم مخفي يبلغ 512، وأربع طبقات محولات. تم تدريب النموذج باستخدام مُحسِّن آدم بمعدل تعلم يبلغ 0.0001 على مدى 20 دورة، محققًا دقة تدريب تحسنت من حوالي 24% إلى ما يقرب من 94.5%، بينما تراوحت دقة التحقق من الصحة من 44.7% إلى 91.6%.

تشير مقاييس التقييم إلى أن نموذج ViT حقق دقة إجمالية تبلغ 90%، مع متوسطات ماكرو ومرجحة للدقة، والاسترجاع، ودرجة F1 أيضًا حوالي 90%. من الجدير بالذكر أن النموذج تفوق في الكشف عن الأمراض في محاصيل متنوعة، مثل أمراض التفاح (درجات F1 من 0.88 إلى 0.93) ومرض فسيفساء الكسافا (كشف مثالي). ومع ذلك، واجه تحديات مع بعض الحالات، مثل انخفاض الاسترجاع للرقعة الخضراء في الكسافا (درجة F1 تبلغ 0.88) والصدأ الشائع في الذرة (درجة F1 تبلغ 0.81). بشكل عام، تؤكد النتائج على قوة النموذج وقدرته على التكيف في تحديد مجموعة متنوعة من أمراض النباتات بدقة، مما يبرز إمكاناته للتطبيقات العملية في إدارة أمراض الزراعة.

المناقشة

تؤكد فقرة المناقشة في ورقة البحث على قيود طرق الكشف التقليدية عن أمراض النباتات، مثل الفحص البصري والتقنيات المخبرية، التي غالبًا ما تكون ذات طابع ذاتي، وتستغرق وقتًا طويلاً، ومكلفة. قد تفشل هذه الطرق في تحديد العدوى غير العرضية، مما يبرز الحاجة إلى أساليب متقدمة وآلية. يُقترح دمج تقنيات مثل التسلسل عالي الإنتاجية، والذكاء الاصطناعي (AI)، وإنترنت الأشياء (IoT) كحل لتعزيز سرعة ودقة وقابلية التوسع في الكشف عن الأمراض. بشكل خاص، ظهرت المحولات البصرية (ViTs) كبديل واعد للشبكات العصبية التلافيفية (CNNs) لأتمتة الكشف عن أمراض النباتات، حيث أظهرت أداءً متفوقًا في مهام رؤية الآلة المختلفة مع الحاجة إلى موارد حسابية أقل.

تستعرض الورقة التقدمات الحديثة في تطبيق ViTs على الكشف عن أمراض النباتات، مما يبرز فعاليتها في تحديد الأمراض من خلال تحليل الصور. تشير دراسات متنوعة إلى أن ViTs تتفوق على الطرق التقليدية، مع ابتكارات مثل نموذج GreenViT ووحدات توجيه الميزات الحافة التي تعزز قدرات استخراج الميزات. تتضمن المنهجية المقترحة تدريب نماذج ViT على مجموعة بيانات متنوعة من صور النباتات، والتي تم معالجتها بشكل منهجي لضمان تمثيل متوازن عبر فئات الأمراض. تشير النتائج إلى أن ViTs يمكن أن تحسن بشكل كبير من دقة وكفاءة تصنيف أمراض النباتات، مما يسهم في ممارسات الزراعة المستدامة والزراعة الدقيقة. يُشجع على أن تركز الأبحاث المستقبلية على تحسين هياكل ViT واستكشاف تطبيقاتها العملية في البيئات الزراعية الحقيقية.

Journal: International Journal of Power Electronics and Drive Systems/International Journal of Electrical and Computer Engineering, Volume: 15, Issue: 2
DOI: https://doi.org/10.11591/ijece.v15i2.pp2334-2344
Publication Date: 2025-01-26
Author(s): Ali Mhaned et al.
Primary Topic: Smart Agriculture and AI

Overview

The research paper addresses the critical challenge of plant disease detection, which poses significant risks to global food security and agricultural sustainability. Traditional detection methods, such as visual inspection and laboratory techniques, are often inefficient and lack scalability. To overcome these limitations, the paper explores the application of vision transformers (ViTs), advanced machine learning models that utilize self-attention mechanisms to effectively capture complex patterns in plant images. The literature review highlights the increasing interest in deep learning techniques, particularly ViTs, for automating plant disease classification.

The findings demonstrate that ViTs can accurately identify a diverse range of plant diseases across a balanced dataset of 55 classes, showcasing their potential to enhance precision agriculture and promote sustainable farming practices. The paper emphasizes the importance of dataset preprocessing in improving model accuracy and suggests that future research should focus on optimizing ViT architectures, enhancing interpretability, and exploring real-world applications. Overall, the adoption of ViTs represents a significant advancement in agricultural technology, offering promising solutions to mitigate the impact of plant diseases and support global food security.

Introduction

The introduction highlights the critical role of agriculture in human survival, emphasizing its contributions to food, feed, fiber, and fuel. It underscores the necessity of sustainable agricultural practices to ensure food security, enhance environmental quality, and maintain the economic viability of farms. Sustainable agriculture is presented as a holistic approach that balances economic, social, and environmental factors, thereby supporting long-term productivity and resource conservation.

The section also addresses the significance of plant disease detection in agriculture, linking it to crop yield, food security, and economic sustainability. It notes the limitations of traditional methods, which rely on expert visual inspections and laboratory techniques, and advocates for the adoption of advanced technologies such as machine learning, deep learning, and computer vision. These modern approaches promise improved efficiency and accuracy in detecting plant diseases, thereby enhancing agricultural outcomes.

Results

The results of the study demonstrate the effectiveness of the Vision Transformer (ViT) model in classifying plant diseases through a novel architecture that combines patch-based encoding with transformer layers. The model processes input images by segmenting them into 16×16 pixel patches, which are then linearly projected and enhanced with positional embeddings to maintain spatial information. The subsequent TransformerEncoder layers utilize multi-head self-attention and feedforward networks to capture both local and global dependencies, with hyperparameters including eight attention heads, a hidden size of 512, and four transformer layers. The model was trained using the Adam optimizer with a learning rate of 0.0001 over 20 epochs, achieving a training accuracy that improved from approximately 24% to nearly 94.5%, while validation accuracy ranged from 44.7% to 91.6%.

The evaluation metrics indicate that the ViT model achieved an overall accuracy of 90%, with macro and weighted averages for precision, recall, and F1-score also around 90%. Notably, the model excelled in detecting diseases in various crops, such as apple diseases (F1-scores of 0.88 to 0.93) and cassava mosaic disease (perfect detection). However, it faced challenges with certain conditions, such as lower recall for green mottle in cassava (F1-score of 0.88) and common rust in corn (F1-score of 0.81). Overall, the results underscore the model’s robustness and adaptability in accurately identifying a diverse range of plant diseases, showcasing its potential for practical applications in agricultural disease management.

Discussion

The discussion section of the research paper emphasizes the limitations of traditional plant disease detection methods, such as visual inspection and laboratory techniques, which are often subjective, time-consuming, and expensive. These methods may fail to identify asymptomatic infections, highlighting the need for advanced, automated approaches. The integration of technologies like high-throughput sequencing, artificial intelligence (AI), and the Internet of Things (IoT) is proposed as a solution to enhance the speed, accuracy, and scalability of disease detection. Specifically, vision transformers (ViTs) have emerged as a promising alternative to convolutional neural networks (CNNs) for automating plant disease detection, demonstrating superior performance in various machine vision tasks while requiring fewer computational resources.

The paper reviews recent advancements in applying ViTs to plant disease detection, showcasing their effectiveness in identifying diseases through image analysis. Various studies indicate that ViTs outperform traditional methods, with innovations such as the GreenViT model and edge-feature guidance modules enhancing feature extraction capabilities. The proposed methodology involves training ViT models on a diverse dataset of plant images, which has been systematically preprocessed to ensure balanced representation across disease categories. The findings suggest that ViTs can significantly improve the accuracy and efficiency of plant disease classification, thereby contributing to sustainable agricultural practices and precision farming. Future research is encouraged to focus on optimizing ViT architectures and exploring their practical applications in real-world agricultural settings.