إطار عمل قائم على CNN مزدوج لتصنيف أمراض أوراق الأرز المحسن مع دمج الميزات A twin CNN-based framework for optimized rice leaf disease classification with feature fusion

المجلة: Journal Of Big Data، المجلد: 12، العدد: 1
DOI: https://doi.org/10.1186/s40537-025-01148-z
تاريخ النشر: 2025-04-11
المؤلف: Prameetha Pai وآخرون
الموضوع الرئيسي: الزراعة الذكية والذكاء الاصطناعي

نظرة عامة

تقدم هذه الورقة البحثية إطار عمل جديد قائم على الشبكات العصبية التلافيفية المزدوجة (CNN) يهدف إلى تصنيف أمراض أوراق الأرز بدقة محسّنة. يستخدم الإطار خوارزمية دمج ميزات محسّنة تدمج نماذج CNN المدربة مسبقًا، مما يمكّن من تصنيف صور أوراق الأرز إلى فئات صحية أو مصابة بشكل أكثر فعالية من الطرق التقليدية. تشير النتائج التجريبية على مجموعات البيانات المتاحة للجمهور إلى أن بنية Twin CNN المقترحة، جنبًا إلى جنب مع آلية دمج الميزات القوية، تتجاوز بشكل كبير تقنيات التصنيف الحالية من حيث الدقة والكفاءة الحاسوبية، مما يبرز إمكاناتها للتطبيقات الواقعية في الزراعة الدقيقة.

في الختام، تفيد الدراسة بأن إطار عمل Twin CNN يحقق دقة مثيرة للإعجاب تبلغ 96.8%، مستفيدًا من دمج ميزات متعددة المقاييس من نماذج مثل VGG16 وResNet50 وInceptionV3، بالإضافة إلى تقليل الأبعاد عبر تحليل المكونات الرئيسية (PCA). تعزز إضافة خرائط Grad-CAM الحرارية من قابلية تفسير النموذج، مما يجعله مناسبًا للتطبيقات الزراعية العملية. على الرغم من التحديات المتعلقة بحجم مجموعة البيانات والمتطلبات الحاسوبية، يعالج الإطار قضايا مثل الإفراط في التكيف وتنوع مجموعة البيانات، مما يجعله أداة قيمة للكشف المبكر عن الأمراض في محاصيل الأرز. ستركز الأبحاث المستقبلية على توسيع مجموعة البيانات وتحسين النموذج لتطبيقات أوسع، مما قد يوسع استخدامه ليشمل محاصيل أخرى، وبالتالي دعم ممارسات الزراعة المستدامة من خلال تحسين تقييمات صحة المحاصيل وتقليل الأخطاء في التصنيف.

مقدمة

تسلط مقدمة الورقة البحثية الضوء على الدور الحاسم للأرز كغذاء أساسي لنحو نصف سكان العالم والتحديات الكبيرة التي تطرحها الأمراض مثل البقعة البنية، وذبول الأوراق البكتيري، والانفجار الورقي، والتي تهدد غلة المحاصيل والجدوى الاقتصادية. تعتبر طرق تحديد الأمراض التقليدية غير فعالة وعرضة للأخطاء، مما يستدعي تطوير حلول آلية. أظهرت التطورات الأخيرة في التعلم العميق، وخاصة الشبكات العصبية التلافيفية (CNN)، وعدًا في الكشف عن أمراض النباتات المعتمدة على الصور، ومع ذلك، غالبًا ما تكافح نماذج CNN ذات المسار الواحد مع الإفراط في التكيف، والتعميم، وقابلية التفسير.

لمعالجة هذه القضايا، يقترح الدراسة إطار عمل قائم على Twin CNN يستخدم دمج ميزات متعددة المقاييس وتقليل الأبعاد لتعزيز دقة وكفاءة تصنيف أمراض أوراق الأرز. يستخدم هذا الإطار ثلاثة نماذج مدربة مسبقًا—VGG16 وResNet50 وInceptionV3—لاستخراج ميزات متنوعة من صور أوراق الأرز، والتي يتم دمجها بعد ذلك في مجموعة ميزات شاملة. يتم تطبيق تحليل المكونات الرئيسية (PCA) لتقليل التكرار، وتقوم بنية Twin CNN بمعالجة هذه الميزات من خلال مسارين متوازيين، مما يمكّن من تصنيف أوراق الأرز إلى أربع فئات: صحية، بقعة بنية، ذبول أوراق بكتيري، وانفجار ورقي. يتضمن الإطار أيضًا تصورات Grad-CAM لتحسين قابلية التفسير ويهدف إلى ضمان التعميم عبر مجموعات بيانات متنوعة، بما في ذلك تلك الخاصة بمحاصيل أخرى مثل القمح والطماطم. يركز التصميم على الكفاءة الحاسوبية، مما يجعله مناسبًا للنشر في البيئات الزراعية ذات الموارد المحدودة.

طرق

تقدم المنهجية المقترحة إطار عمل قائم على Twin CNN مدمج مع خوارزمية دمج الميزات تهدف إلى تعزيز تصنيف أمراض أوراق الأرز. يعالج هذا الإطار أوجه القصور في نماذج CNN التقليدية من خلال تحسين دقة التصنيف والصلابة، خاصة في تمييز الفروق الدقيقة بين الأوراق الصحية والمصابة. تستخدم البنية نماذج CNN المدربة مسبقًا—VGG16 وResNet50 وInceptionV3—لاستخراج الميزات، تليها عملية دمج الميزات التي تجمع خرائط الميزات المتميزة في متجه واحد. يتم بعد ذلك تطبيق تقليل الأبعاد للاحتفاظ بأكثر المكونات معلوماتية، والتي تتم معالجتها من خلال بنية Twin CNN تتكون من فرعين متوازيين. يتم دمج المخرجات من هذه الفروع وتمريرها عبر طبقة متصلة بالكامل ومصنف softmax لتحديد نتيجة التصنيف النهائية.

تظهر التحليلات المقارنة أن إطار عمل Twin CNN يتفوق على النماذج الأساسية، بما في ذلك VGG16 وResNet50 وInceptionV3، من حيث الدقة ودرجات F1. تسهم قدرة البنية على الاستفادة من تمثيلات الميزات المتعددة وموثوقية خوارزمية دمج الميزات في هذه التحسينات. على وجه التحديد، حقق نموذج Twin CNN أعلى مقاييس الأداء، مما يميز بفعالية بين الأمراض المتشابهة مثل البقعة البنية وذبول الأوراق البكتيري، بينما أظهر النموذج المخصص Simple CNN دقة أقل بكثير. بشكل عام، تؤكد النتائج فعالية إطار عمل Twin CNN المقترح في تصنيف أمراض أوراق الأرز.

نتائج

تم تنفيذ إطار عمل Twin CNN المقترح في بيئة تعلم الآلة باستخدام Python 3.8 ومكتبات مثل Keras وNumPy وPandas لتطوير النموذج وتحليل البيانات. تم تدريب النموذج بحجم دفعة قدره 32 على مدى 50 حقبة، مستخدمًا مُحسِّن Adam بمعدل تعلم قدره 0.001. تم استخدام خسارة الانتروبيا المتقاطعة كدالة خسارة لتقييم أداء النموذج. أظهر الإطار مقاييس أداء عالية، محققًا دقة تبلغ 96.8%، ودقة 96.5%، واسترجاع 96.7%، ودرجة F1 تبلغ 96.7%، متجاوزًا النماذج الأساسية VGG16 وResNet50 وInceptionV3.

لإثبات هذه التحسينات في الأداء إحصائيًا، تم إجراء اختبار تحليل التباين أحادي الاتجاه (ANOVA)، مقارنةً بمقاييس أداء Twin CNN مع النماذج الأخرى. أشارت النتائج إلى وجود فرق كبير في الأداء، مع إحصائية F تبلغ 18.65 وقيمة p تبلغ 0.0012، وكلاهما يؤكد أن الفروق الملحوظة ليست بسبب التباين العشوائي. تؤكد هذه الأدلة الإحصائية فعالية إطار عمل Twin CNN، مما يبرز تقدمها المعماري وقدراتها التصنيفية المتفوقة في مهمة التصنيف متعدد الفئات.

مناقشة

تقدم البحث بنية Twin CNN جديدة مصممة لتصنيف أمراض أوراق الأرز، مدمجة مع آلية دمج الميزات التي تعزز قدرة النموذج على التقاط الميزات التكميلية من عدة نماذج CNN مدربة مسبقًا، وتحديدًا VGG16 وResNet50 وInceptionV3. تعالج هذه البنية ذات المسارين خرائط الميزات المدمجة، مما يحسن دقة التصنيف بشكل كبير بنسبة 96.8%، مما يتجاوز الطرق الحالية. تشمل المساهمات الرئيسية تنفيذ تحليل المكونات الرئيسية (PCA) لتقليل الأبعاد، مما يقلل من التكرار مع الاحتفاظ بالمعلومات الأساسية، وبالتالي تحسين الكفاءة الحاسوبية. بالإضافة إلى ذلك، يعزز استخدام Grad-CAM لتصور عمليات اتخاذ القرار من قابلية تفسير النموذج، مما يعزز الثقة في توقعاته.

تناقش الورقة أيضًا تطور طرق الكشف عن أمراض النباتات، مسلطة الضوء على الانتقال من الفحوصات البصرية التقليدية إلى الأساليب الآلية التي تستخدم التعلم العميق، وخاصة CNNs. بينما حققت الدراسات السابقة نتائج جديرة بالثناء، غالبًا ما كانت تفتقر إلى التخصص لأمراض أوراق الأرز أو كانت تتطلب حسابات مكثفة. يعالج إطار عمل Twin CNN المقترح هذه القيود من خلال الاستفادة من بنية ذات مسارين تلتقط الميزات الشائعة والمميزة، مما يسمح بالكشف الأكثر دقة عن أعراض الأمراض. تؤكد الدراسة على أهمية تحسين المعلمات الفائقة القوي، الذي ساهم في الدقة العالية للتصنيف، وتحدد الاتجاهات المستقبلية لتضمين نماذج خفيفة الوزن وهياكل متقدمة مثل Transformers البصرية لتعزيز الأداء بشكل أكبر في التطبيقات الزراعية الواقعية.

Journal: Journal Of Big Data, Volume: 12, Issue: 1
DOI: https://doi.org/10.1186/s40537-025-01148-z
Publication Date: 2025-04-11
Author(s): Prameetha Pai et al.
Primary Topic: Smart Agriculture and AI

Overview

This research paper introduces a novel Twin Convolutional Neural Network (CNN)-based framework aimed at classifying rice leaf diseases with enhanced accuracy. The framework employs an optimized feature fusion algorithm that integrates pre-trained CNN models, enabling the classification of rice leaf images into healthy or diseased categories more effectively than traditional methods. Experimental results on publicly available datasets indicate that the proposed Twin CNN architecture, paired with a robust feature fusion mechanism, significantly surpasses existing classification techniques in both accuracy and computational efficiency, highlighting its potential for real-world applications in precision agriculture.

In the conclusion, the study reports that the Twin CNN framework achieves an impressive accuracy of 96.8%, leveraging multi-scale feature fusion from models such as VGG16, ResNet50, and InceptionV3, along with dimensionality reduction via Principal Component Analysis (PCA). The incorporation of Grad-CAM heatmaps enhances the model’s explainability, making it well-suited for practical agricultural applications. Despite challenges related to dataset size and computational requirements, the framework addresses issues like overfitting and dataset diversity, positioning itself as a valuable tool for early disease detection in rice crops. Future research will focus on expanding the dataset and optimizing the model for broader applicability, potentially extending its use to other crops, thereby supporting sustainable farming practices through improved crop health assessments and reduced misclassifications.

Introduction

The introduction of the research paper highlights the critical role of rice as a staple food for nearly half of the global population and the significant challenges posed by diseases such as Brown Spot, Bacterial Leaf Blight, and Leaf Blast, which threaten crop yields and economic viability. Traditional disease identification methods are inefficient and prone to errors, necessitating the development of automated solutions. Recent advancements in deep learning, particularly Convolutional Neural Networks (CNNs), have shown promise in image-based plant disease detection, yet single-stream CNN models often struggle with overfitting, generalization, and explainability.

To address these issues, the study proposes a Twin CNN-based framework that employs multi-scale feature fusion and dimensionality reduction to enhance the accuracy and efficiency of rice leaf disease classification. This framework utilizes three pre-trained models—VGG16, ResNet50, and InceptionV3—to extract diverse features from rice leaf images, which are then fused into a comprehensive feature set. Principal Component Analysis (PCA) is applied to reduce redundancy, and the Twin CNN architecture processes these features through two parallel streams, enabling the classification of rice leaves into four categories: Healthy, Brown Spot, Bacterial Leaf Blight, and Leaf Blast. The framework also incorporates Grad-CAM visualizations for improved interpretability and aims to ensure generalizability across various datasets, including those of other crops like wheat and tomatoes. The design prioritizes computational efficiency, making it suitable for deployment in resource-limited agricultural environments.

Methods

The proposed methodology introduces a Twin CNN-based framework integrated with a feature fusion algorithm aimed at enhancing the classification of rice leaf diseases. This framework addresses the shortcomings of traditional CNN models by improving classification accuracy and robustness, particularly in distinguishing subtle differences between healthy and diseased leaves. The architecture employs pre-trained CNNs—VGG16, ResNet50, and InceptionV3—for feature extraction, followed by a feature fusion process that combines distinct feature maps into a single vector. Dimensionality reduction is then applied to retain the most informative components, which are processed through a Twin CNN architecture consisting of two parallel branches. The outputs from these branches are merged and passed through a fully connected layer and a softmax classifier to determine the final classification result.

Comparative analysis demonstrates that the Twin CNN framework outperforms baseline models, including VGG16, ResNet50, and InceptionV3, in terms of accuracy and F1-scores. The architecture’s ability to leverage multiple feature representations and the robustness of the feature fusion algorithm contribute to these improvements. Specifically, the Twin CNN model achieved the highest performance metrics, effectively differentiating between similar diseases such as brown spot and bacterial leaf blight, while the custom Simple CNN model exhibited significantly lower accuracy. Overall, the findings underscore the effectiveness of the proposed Twin CNN framework in classifying rice leaf diseases.

Results

The proposed Twin CNN framework was implemented in a machine learning environment utilizing Python 3.8 and libraries such as Keras, NumPy, and Pandas for model development and data analysis. The model was trained with a batch size of 32 over 50 epochs, employing the Adam optimizer with a learning rate of 0.001. Categorical cross-entropy was used as the loss function to assess model performance. The framework demonstrated high performance metrics, achieving an accuracy of 96.8%, precision of 96.5%, recall of 96.7%, and an F1-score of 96.7%, surpassing baseline models VGG16, ResNet50, and InceptionV3.

To statistically validate these performance improvements, a one-way Analysis of Variance (ANOVA) test was conducted, comparing the performance metrics of the Twin CNN against the other models. The results indicated a significant difference in performance, with an F-statistic of 18.65 and a p-value of 0.0012, both of which confirm that the observed differences are not due to random variation. This statistical evidence underscores the effectiveness of the Twin CNN framework, highlighting its architectural advancements and superior classification capabilities in the multi-class classification task.

Discussion

The research presents a novel Twin CNN architecture designed for the classification of rice leaf diseases, integrating a feature fusion mechanism that enhances the model’s ability to capture complementary features from multiple pre-trained CNNs, specifically VGG16, ResNet50, and InceptionV3. This dual-stream architecture processes fused feature maps, significantly improving classification accuracy by 96.8%, which surpasses existing methods. Key contributions include the implementation of Principal Component Analysis (PCA) for dimensionality reduction, which minimizes redundancy while retaining essential information, thus optimizing computational efficiency. Additionally, the use of Grad-CAM for visualizing decision-making processes enhances model explainability, fostering trust in its predictions.

The paper also discusses the evolution of plant disease detection methods, highlighting the transition from traditional visual inspections to automated approaches utilizing deep learning, particularly CNNs. While previous studies have achieved commendable results, they often lacked specificity for rice leaf diseases or were computationally intensive. The proposed Twin CNN framework addresses these limitations by leveraging a dual-stream architecture that captures both common and distinguishing features, allowing for more nuanced detection of disease symptoms. The study emphasizes the importance of robust hyperparameter optimization, which contributed to the high classification accuracy, and outlines future directions for incorporating lightweight models and advanced architectures like Vision Transformers to further enhance performance in real-world agricultural applications.