PND-Net: تصنيف نقص التغذية النباتية والأمراض باستخدام الشبكة العصبية التلافيفية PND-Net: plant nutrition deficiency and disease classification using graph convolutional network

المجلة: Scientific Reports، المجلد: 14، العدد: 1
DOI: https://doi.org/10.1038/s41598-024-66543-7
PMID: https://pubmed.ncbi.nlm.nih.gov/38969738
تاريخ النشر: 2024-07-05
المؤلف: Asish Bera وآخرون
الموضوع الرئيسي: الزراعة الذكية والذكاء الاصطناعي

نظرة عامة

تقدم البحث إطار عمل جديد للتعلم العميق، يسمى شبكة نقص التغذية والأمراض النباتية (PND-Net)، مصمم لتعزيز الكشف عن نقص التغذية والأمراض في النباتات من خلال دمج الشبكات التلافيفية البيانية (GCNs) والشبكات التلافيفية العصبية (CNNs). يؤكد الدراسة على أهمية التعرف المبكر على مشاكل صحة النباتات لتحسين إنتاجية المحاصيل والنمو الزراعي. من خلال استخدام تجميع هرمى مكاني لتعلم ميزات إقليمية متعددة المقاييس، تلتقط شبكة PND-Net تفاصيل حاسمة قد يتم تجاهلها من قبل موصوفات الميزات العالمية التقليدية، مما يحسن دقة التصنيف.

تم تقييم الطريقة المقترحة بدقة على أربعة مجموعات بيانات عامة، محققة نتائج مثيرة للإعجاب: 90.00% و90.54% دقة لنقص تغذية الموز والقهوة، على التوالي، و96.18% و84.30% لأمراض البطاطس ومجموعة بيانات PlantDoc، باستخدام هيكل Xception. بالإضافة إلى ذلك، أظهرت شبكة PND-Net أداءً متفوقًا على مجموعات بيانات صحة الإنسان، بما في ذلك BreakHis وSIPaKMeD، بدقة 95.50% و99.18%، على التوالي. وهذا يشير إلى تعددية استخدام النموذج خارج التطبيقات الزراعية. يدعو المؤلفون إلى مزيد من البحث لتحسين تقنيات الكشف في المراحل المبكرة لمراقبة صحة النباتات، مما يبرز إمكانية شبكة PND-Net في المساهمة بشكل كبير في الممارسات الزراعية المستدامة وتشخيص صحة الإنسان.

مقدمة

في المقدمة، يقدم المؤلفون نظرة شاملة على الشبكات التلافيفية البيانية (GCNs) وتطبيقاتها عبر مجالات مختلفة، بما في ذلك تصنيف العقد، ونمذجة سمات الحواف، وشبكات الاقتباس، والرسوم البيانية المعرفية. تعزز GCNs الشبكات التلافيفية التقليدية (CNNs) من خلال تكديس طبقات تلافيفية بيانية متعددة مع غير الخطية، مما يحسن قدرة النموذج على التعلم. يبرز المؤلفون فعالية GCNs في التخفيف من الإفراط في التكيف ومعالجة مشكلة تلاشي التدرج من خلال تقنيات التطبيع.

تشير القسم إلى خوارزمية GCN متعددة الطبقات المعتمدة على نطاق واسع التي اقترحها كيف وويلينغ، والتي تستخدم طريقة انتشار طبقية تعتمد على التقريب من الدرجة الأولى للتلافيف الطيفية. هذه الطريقة مناسبة بشكل خاص لمهام تصنيف العقد شبه المشرف عليها. يقدم المؤلفون أيضًا صياغة خطية مبسطة لـ GCNs، مما يسمح بتحسين المعلمات في كل طبقة باستخدام التلافيف مع معلمات المرشح $\theta$. تتضمن الصياغة مصفوفة لابلاس البيانية المعنونة، الممثلة بمصفوفة الجوار الثنائية $\tilde{A} = A + I_P$، حيث $I_P$ هي مصفوفة الهوية. يتم تمثيل البيانات المدخلة بـ $X \in \mathbb{R}^{P \times C}$، حيث يمثل $P$ عدد العقد، و$C$ قنوات الإدخال، و$F$ خرائط الميزات، مما يؤدي إلى مصفوفة إشارة تلافيفية مركزية في تحليلهم.

طرق

في هذا القسم، يحدد المؤلفون منهجيات مختلفة تم استخدامها لتقييم نقص التغذية في النباتات عبر أنواع مختلفة، لا سيما الموز، والأرز، والطماطم. قاموا بتطوير مجموعة بيانات صور توضح أعراض النقص لثمانية مغذيات أساسية واختبروها جنبًا إلى جنب مع مجموعة بيانات CoLeaf لنباتات القهوة. تم استخدام تقنيات التعلم الآلي، وخاصة الشبكات التلافيفية العصبية (CNNs)، لتقييم الحالة الغذائية لأوراق زيت النخيل ولتحديد نقص المغذيات الكبرى في نباتات الأرز، محققين معدلات دقة عالية (تصل إلى 97.0%). بالإضافة إلى ذلك، أظهرت CNNs فعالية في التعرف على نقص المغذيات في محاصيل أخرى، مثل الطماطم ونباتات الفلفل الحار، مع دقة تتراوح من 82.61% إلى 98.65%. استكشف المؤلفون أيضًا مخططات تجميع الميزات وطرق التجميع، مما زاد من دقة الكشف.

علاوة على ذلك، يناقش القسم تطبيق الشبكات التلافيفية البيانية (GCNs) في تحليل صحة النباتات، مشيرًا إلى التقدم المحدود في استخدام GCNs للتعرف البصري على أمراض النباتات مقارنةً بـ CNNs. يشير المؤلفون إلى نموذج SR-GNN، الذي يدمج تمثيل الميزات المعتمد على العلاقات مع GCNs، ويبرز إمكانيته في معالجة التحديات في كل من التعرف على أمراض النباتات وتصنيف صور السرطان البشري. تهدف هذه الطريقة المبتكرة إلى التقاط الأهمية الإقليمية للميزات السياقية المحلية، مما يعزز دقة وفعالية أنظمة مراقبة صحة النباتات.

نتائج

في قسم النتائج، يوضح المؤلفون تنفيذ نموذج التصنيف الخاص بهم ويختصرون مجموعات البيانات المستخدمة للتجربة. يتم تقييم أداء النموذج من خلال تقنيات التصنيف التقليدية وطرق التحقق المتبادل، باستخدام مقاييس قياسية مثل الدقة، والدقة، والاسترجاع، ودرجة F1، كما هو محدد في المعادلات المقدمة. على وجه التحديد، يتم حساب درجة F1 على أنها $ F1 = \frac{2 \times \text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}} $، حيث يتم تعريف الإيجابيات الحقيقية (TP)، والسلبيات الحقيقية (TN)، والإيجابيات الكاذبة (FP)، والسلبيات الكاذبة (FN).

يشير المؤلفون إلى أن الدقة وحدها غير كافية لتقييم أداء النموذج، خاصة في حالات توزيع الفئات غير المتوازن. لمعالجة هذه القيود، يؤكدون على أهمية الدقة والاسترجاع، اللذان يساهمان في الحصول على درجة F1 أكثر موثوقية. بالإضافة إلى ذلك، يتم تحليل الأداء بشكل أكبر باستخدام مصفوفة الالتباس، مما يوفر تقييمًا شاملاً لقدرات النموذج التنبؤية. تتم مقارنة النتائج بعد ذلك مع المنهجيات الحالية، مما يبرز فعالية نهجهم.

مناقشة

في قسم المناقشة من ورقة البحث، يبرز المؤلفون التقدم في منهجيات الكشف عن أمراض النباتات، مع التركيز بشكل خاص على قيود الدراسات الحالية التي تستخدم بشكل أساسي مجموعات بيانات محكومة. يشيرون إلى أن عددًا قليلاً فقط من الأساليب قد عالجت بفعالية التحديات التي تطرحها الظروف الواقعية. تستعرض الورقة تقنيات مختلفة، بما في ذلك الشبكات التلافيفية العصبية (CNNs) وطرق التعلم الانتقالي، التي أظهرت معدلات دقة عالية في تصنيف صور النباتات المريضة. على سبيل المثال، حققت طريقة تعتمد على CNN دقة 98.0% لصور أوراق الحمضيات، بينما وصلت طريقة التعلم الانتقالي المعدلة للكشف عن أمراض أوراق البطاطس إلى دقة 99.0%. يذكر المؤلفون أيضًا تطوير نماذج هجينة وتطبيقات الهواتف الذكية التي زادت من قدرات الكشف عن الأمراض.

تدمج الطريقة المقترحة، المسماة PND-Net، الشبكات التلافيفية العصبية (CNNs) والشبكات التلافيفية البيانية (GCNs) في إطار عمل شامل لتحسين استخراج الميزات وأداء التصنيف. تستخدم البنية الأساسية CNN لاستخراج الميزات الأولية، تليها GCN التي تصقل هذه الميزات من خلال استراتيجيات التجميع الإقليمي والهرمي. تهدف هذه الطريقة إلى التقاط تفاصيل أدق والعلاقات المكانية بين مناطق مختلفة من الصور المدخلة، مما يعزز قدرة النموذج على تصنيف أمراض النباتات بدقة. يؤكد المؤلفون أن طريقتهم تتفوق على التقنيات الحالية، كما يتضح من النتائج التجريبية عبر مجموعات بيانات متعددة، مما يشير إلى قوتها وفعاليتها في التطبيقات الواقعية.

Journal: Scientific Reports, Volume: 14, Issue: 1
DOI: https://doi.org/10.1038/s41598-024-66543-7
PMID: https://pubmed.ncbi.nlm.nih.gov/38969738
Publication Date: 2024-07-05
Author(s): Asish Bera et al.
Primary Topic: Smart Agriculture and AI

Overview

The research presents a novel deep learning framework, termed Plant Nutrition Deficiency and Disease Network (PND-Net), designed to enhance the detection of plant nutrition deficiencies and diseases through a combination of graph convolutional networks (GCNs) and convolutional neural networks (CNNs). The study emphasizes the importance of early identification of plant health issues to improve crop yield and agricultural growth. By employing spatial pyramidal pooling for multi-scale regional feature learning, the PND-Net effectively captures critical details that may be overlooked by traditional global feature descriptors, thereby improving classification accuracy.

The proposed method was rigorously evaluated on four public datasets, achieving impressive results: 90.00% and 90.54% accuracy for banana and coffee nutrition deficiencies, respectively, and 96.18% and 84.30% for potato diseases and the PlantDoc dataset, using the Xception backbone. Additionally, PND-Net demonstrated state-of-the-art performance on human health datasets, including BreakHis and SIPaKMeD, with accuracies of 95.50% and 99.18%, respectively. This indicates the versatility of the model beyond agricultural applications. The authors advocate for further research to refine early-stage detection techniques for plant health monitoring, underscoring the potential of PND-Net to contribute significantly to sustainable agricultural practices and human health diagnostics.

Introduction

In the introduction, the authors provide a comprehensive overview of Graph Convolutional Networks (GCNs) and their applications across various domains, including node classification, edge attribute modeling, citation networks, and knowledge graphs. GCNs enhance traditional convolutional networks (CNNs) by stacking multiple graph convolutional layers with non-linearities, which improves the model’s learning capacity. The authors highlight the effectiveness of GCNs in mitigating overfitting and addressing the vanishing gradient problem through normalization techniques.

The section references a widely adopted multi-layer GCN algorithm proposed by Kiff and Welling, which utilizes a layer-wise propagation method based on the first-order approximation of spectral convolutions. This approach is particularly suitable for semi-supervised node classification tasks. The authors also present a simplified linear formulation of GCNs, which allows for parameter optimization at each layer using convolution with filter parameters $\theta$. The formulation incorporates the normalized graph Laplacian, represented by the binary adjacency matrix $\tilde{A} = A + I_P$, where $I_P$ is the identity matrix. The input data is denoted as $X \in \mathbb{R}^{P \times C}$, with $P$ representing the number of nodes, $C$ the input channels, and $F$ the feature maps, culminating in a convoluted signal matrix that is central to their analysis.

Methods

In this section, the authors outline various methodologies employed to assess plant nutrition deficiencies across different species, notably bananas, rice, and tomatoes. They developed an image dataset illustrating deficiency symptoms for eight essential nutrients and tested it alongside the CoLeaf dataset for coffee plants. Machine learning techniques, particularly Convolutional Neural Networks (CNNs), were utilized to evaluate the nutritional status of oil palm leaves and to identify macro-nutrient deficiencies in rice plants, achieving high accuracy rates (up to 97.0%). Additionally, CNNs demonstrated effectiveness in recognizing nutrient deficiencies in other crops, such as tomatoes and chili plants, with reported accuracies ranging from 82.61% to 98.65%. The authors also explored feature aggregation schemes and ensemble methods, which further enhanced detection accuracy.

Furthermore, the section discusses the application of Graph Convolutional Networks (GCNs) in plant health analysis, noting the limited progress in utilizing GCNs for visual recognition of plant diseases compared to CNNs. The authors reference the SR-GNN model, which integrates relation-aware feature representation with GCNs, and highlight its potential in addressing challenges in both plant disease recognition and human cancer image classification. This innovative approach aims to capture the regional importance of local contextual features, thereby enhancing the accuracy and effectiveness of plant health monitoring systems.

Results

In the Results section, the authors detail the implementation of their classification model and summarize the datasets utilized for experimentation. The evaluation of model performance is conducted through conventional classification techniques and cross-validation methods, employing standard metrics such as accuracy, precision, recall, and F1-score, as defined by the equations provided. Specifically, the F1-score is calculated as $ F1 = \frac{2 \times \text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}} $, where true positives (TP), true negatives (TN), false positives (FP), and false negatives (FN) are defined.

The authors note that accuracy alone is insufficient for assessing model performance, particularly in cases of imbalanced class distributions. To address this limitation, they emphasize the importance of precision and recall, which contribute to a more reliable F1-score. Additionally, the performance is further analyzed using a confusion matrix, providing a comprehensive evaluation of the model’s predictive capabilities. The results are subsequently compared with existing methodologies, highlighting the effectiveness of their approach.

Discussion

In the discussion section of the research paper, the authors highlight the advancements in plant disease detection methodologies, particularly emphasizing the limitations of existing studies that primarily utilize controlled datasets. They note that only a few approaches have effectively addressed the challenges posed by real-world conditions. The paper reviews various techniques, including convolutional neural networks (CNNs) and transfer learning methods, which have demonstrated high accuracy rates in classifying diseased plant images. For instance, a CNN-based method achieved 98.0% accuracy for citrus leaf images, while a modified transfer learning approach for potato leaf disease detection reached 99.0% accuracy. The authors also mention the development of hybrid models and smartphone applications that have further enhanced disease detection capabilities.

The proposed method, termed PND-Net, integrates CNNs and graph convolutional networks (GCNs) in an end-to-end framework to improve feature extraction and classification performance. The architecture employs a backbone CNN for initial feature extraction, followed by a GCN that refines these features through region-based and pyramid pooling strategies. This approach aims to capture finer details and spatial relationships among different regions of the input images, thereby enhancing the model’s ability to classify plant diseases accurately. The authors assert that their method outperforms existing techniques, as evidenced by experimental results across multiple datasets, indicating its robustness and effectiveness in real-world applications.