تحديد الجاموس في بيئات الأنواع المختلطة: نهج تعلم عميق مقارن باستخدام ResNet50 و EfficientNetB3 Buffalo Identification in Mixed-Species Environments: A Comparative Deep Learning Approach Using ResNet50 and EfficientNetB3

المجلة: Journal of Buffalo Science، المجلد: 14
DOI: https://doi.org/10.6000/1927-520x.2025.14.08
تاريخ النشر: 2025-05-15
المؤلف: Nagaraj Naik وآخرون
الموضوع الرئيسي: دراسات سلوك الحيوان ورفاهيته

نظرة عامة

تستكشف هذه الدراسة فعالية اثنين من هياكل الشبكات العصبية التلافيفية، ResNet50 و EfficientNetB3، لتصنيف صور الجاموس ضمن سياق مختلط الأنواع يشمل الفيلة، ووحيد القرن، والحمار الوحشي. تم إنشاء مجموعة بيانات متوازنة، حيث تم تخصيص 80% للتدريب و20% للتحقق. تم تحسين النماذج باستخدام التعلم الانتقالي، حيث استخدم EfficientNetB3 مدخلات عالية الدقة وتوسيعاً واسعاً. كانت مقاييس الأداء، بما في ذلك دقة التحقق، والدقة، والاسترجاع، ودرجة F1، تركز بشكل أساسي على تصنيف الجاموس. أظهرت النتائج أن ResNet50 حققت دقة تحقق تبلغ 47% لكنها واجهت صعوبة في تحديد الجاموس، مما أسفر عن استرجاع قدره 0.07 ودرجة F1 قدرها 0.11. في المقابل، أظهر EfficientNetB3 أداءً متفوقًا، حيث صنف بشكل صحيح 72 من أصل 200 صورة جاموس، مما أدى إلى استرجاع قدره 0.36 ودرجة F1 قدرها 0.32.

تؤكد النتائج على أن EfficientNetB3 هو هيكل أكثر فعالية لاكتشاف الجاموس، مما يبرز قدراته المحسنة في استخراج الميزات والتمثيل. تؤكد الدراسة على أهمية اختيار النماذج المناسبة المصممة لأهداف التصنيف المحددة، خاصة عندما تكون بعض الأنواع، مثل الجاموس، ذات أولوية. تشمل اتجاهات البحث المستقبلية دمج خسارة التركيز لمعالجة عدم توازن الفئات واستكشاف تقنيات تعزيز محددة للفئات، مثل آليات الانتباه والتدريب الموجه بواسطة Grad-CAM، لتحسين التعرف على الجاموس مع ضمان أداء متوازن عبر جميع الفئات. تمهد هذه العمل الطريق لتحقيق تقدم في أدوات مراقبة الأنواع المدفوعة بالذكاء الاصطناعي، والتي تعتبر ضرورية للدراسات البيئية والممارسات الزراعية الذكية.

مقدمة

تسلط مقدمة ورقة البحث الضوء على الدور الحاسم للجاموس (Bubalus bubalis) في سبل عيش المجتمعات الريفية في البلدان النامية، وخاصة في آسيا، حيث يساهمون بشكل كبير في إنتاج الحليب ويكونون أكثر مقاومة للمناخات الاستوائية مقارنة بالماشية. على الرغم من أهميتهم، فإن التقدم في التكنولوجيا لمراقبة واكتشاف الأمراض في الجاموس نادر، خاصة في سياق تطبيقات الذكاء الاصطناعي (AI) والتعلم العميق (DL). تشير الورقة إلى أن الجاموس غالبًا ما يكون ممثلاً تمثيلًا ناقصًا في مجموعات بيانات التدريب لرؤية الكمبيوتر، مما يؤدي إلى تصنيف خاطئ في البيئات المختلطة الأنواع، مما يعقد تحديدهم بسبب التشابهات البصرية مع الثدييات الكبيرة الأخرى.

لمعالجة هذه التحديات، تقيم الدراسة أداء اثنين من هياكل الشبكات العصبية التلافيفية (CNN)، ResNet50 و EfficientNetB3، لتصنيف الجاموس في سياقات متعددة الأنواع. يُعترف بـ ResNet50 لهيكله العميق الذي يعزز تدفق التدرج، بينما يستخدم EfficientNetB3 استراتيجية توسيع مركب لتحقيق دقة عالية مع عدد أقل من المعلمات. تم تحسين كلا النموذجين باستخدام مجموعة بيانات مخصصة تشمل الجاموس، والفيلة، ووحيد القرن، والحمار الوحشي. تهدف الدراسة إلى تحسين أدوات الذكاء الاصطناعي الخاصة بالجاموس، مما يساهم في مراقبة الماشية بدقة وأنظمة إدارة المزارع الذكية. تشمل المساهمات الرئيسية تنفيذ وتحسين هذه الهياكل الشبكية لتعزيز أداء تصنيف الجاموس.

الطرق

تستخدم المنهجية المقترحة لتصنيف أربع فئات حيوانية—الجاموس، الفيل، وحيد القرن، والحمار الوحشي—التعلم العميق الانتقالي مع هيكلين متقدمين: ResNet50 و EfficientNetB3. يتم هيكلة عملية التصنيف كخط أنابيب شامل يشمل عدة مراحل، بما في ذلك معالجة البيانات، والتوسيع، واستخراج الميزات، وتصميم المصنف، والتدريب، والتقييم. مجموعة البيانات متوازنة جيدًا من حيث توزيع الفئات، مما يضمن تمثيلًا عادلًا لكل نوع.

لزيادة قوة النموذج وتقليل الإفراط في التكيف، تم استخدام تقنيات توسيع بيانات متنوعة، مثل الانعكاس الأفقي، والتكبير ضمن نطاق 0.2، والقص أيضًا ضمن نطاق 0.2. علاوة على ذلك، تم تطبيع جميع الصور إلى نطاق [0، 1] لضمان إدخال متسق عبر مجموعة البيانات. تهدف هذه الطريقة الشاملة إلى تحسين دقة تصنيف الفئات الحيوانية من خلال استخراج الميزات الفعالة وتدريب النموذج.

النتائج

في هذه الدراسة، قمنا بتقييم أداء هيكلين من التعلم العميق، ResNet50 و EfficientNetB3، لتصنيف متعدد الفئات لصور الحيوانات، مع التركيز بشكل خاص على تحديد أربع فئات: الجاموس، الفيل، وحيد القرن، والحمار الوحشي. أظهرت كلا النموذجين أداءً متنوعًا في التصنيف، مع إيلاء اهتمام خاص لفئة الجاموس، حيث كان تحسين تحديدها هو الهدف الرئيسي للبحث. لضمان تقييم قوي لأداء النموذج، استخدمنا التحقق المتقاطع 10-fold، مما قلل من التباين وقدم تقديرات موثوقة لقدرات تعميم كل نموذج.

تم تحليل النتائج باستخدام مصفوفات الارتباك وتقارير التصنيف، مما يبرز مقاييس التقييم الرئيسية مثل الدقة، والاسترجاع، ودرجة F1، والدعم. هذه المقاييس ضرورية لفهم فعالية النماذج في تصنيف الأنواع الحيوانية المستهدفة، وخاصة في تعزيز دقة تحديد الجاموس.

المناقشة

في هذه الدراسة، تم تقييم أداء هيكلين من التعلم العميق، ResNet50 و EfficientNetB3، لتصنيف متعدد الفئات لصور الحيوانات، مع التركيز بشكل خاص على تحديد الجاموس بدقة بين الفيلة، ووحيد القرن، والحمار الوحشي. أظهرت النتائج أن EfficientNetB3 تفوق على ResNet50 في تمييز الجاموس، حيث حقق دقة قدرها 0.30، واسترجاع قدره 0.36، ودرجة F1 قدرها 0.32، مقارنة بدقة ResNet50 التي بلغت 0.29، واسترجاع قدره 0.07، ودرجة F1 قدرها 0.11. كشفت مصفوفات الارتباك عن تصنيف خاطئ كبير للجاموس كفيلة، مما يبرز الحاجة إلى تحسين استخراج الميزات وتوسيع مجموعة البيانات المصممة خصيصًا لخصائص الجاموس.

استخدمت الدراسة تقسيم 80-20 للتدريب والتحقق على مجموعة بيانات تتكون من 4000 صورة، مع تطبيق توسيعات بيانات في الوقت الحقيقي لتعزيز تعميم النموذج. استخدمت كلا النموذجين خسارة الانتروبيا المتقاطعة الفئوية ومحسن آدم، حيث أظهر EfficientNetB3 قدرات استخراج ميزات متفوقة، خاصة في التعرف على ميزات الجاموس. ستركز الأعمال المستقبلية على معالجة عدم توازن الفئات من خلال خسارة التركيز واستكشاف تقنيات متقدمة مثل آليات الانتباه لتعزيز التعرف على الجاموس بشكل أكبر مع الحفاظ على توازن التصنيف العام. تمهد هذه الأبحاث الطريق لتحسين تحديد الجاموس في التطبيقات الواقعية، مما يبرز أهمية اختيار الهياكل المناسبة لأهداف التصنيف المحددة.

Journal: Journal of Buffalo Science, Volume: 14
DOI: https://doi.org/10.6000/1927-520x.2025.14.08
Publication Date: 2025-05-15
Author(s): Nagaraj Naik et al.
Primary Topic: Animal Behavior and Welfare Studies

Overview

This study investigates the effectiveness of two convolutional neural network architectures, ResNet50 and EfficientNetB3, for the classification of buffalo images within a mixed-species context that includes elephants, rhinos, and zebras. A balanced dataset was created, with 80% allocated for training and 20% for validation. The models were fine-tuned using transfer learning, with EfficientNetB3 employing higher-resolution inputs and extensive augmentation. Performance metrics, including validation accuracy, precision, recall, and F1-score, were primarily focused on buffalo classification. Results indicated that ResNet50 achieved a validation accuracy of 47% but struggled with buffalo identification, yielding a recall of only 0.07 and an F1-score of 0.11. In contrast, EfficientNetB3 demonstrated superior performance, correctly classifying 72 out of 200 buffalo images, resulting in a recall of 0.36 and an F1-score of 0.32.

The findings establish EfficientNetB3 as a more effective architecture for buffalo detection, highlighting its enhanced feature extraction and representation capabilities. The study emphasizes the importance of selecting appropriate models tailored to specific classification goals, particularly when certain species, like buffalo, are prioritized. Future research directions include the integration of focal loss to address class imbalance and the exploration of class-specific enhancement techniques, such as attention mechanisms and Grad-CAM-guided training, to improve buffalo recognition while ensuring balanced performance across all classes. This work lays the groundwork for advancements in AI-driven species monitoring tools, which are essential for ecological studies and smart agricultural practices.

Introduction

The introduction of the research paper highlights the critical role of buffaloes (Bubalus bubalis) in the livelihoods of rural communities in developing countries, particularly in Asia, where they contribute significantly to milk production and are more resilient to tropical climates compared to cattle. Despite their importance, advancements in technology for monitoring and disease detection in buffaloes are scarce, particularly in the context of artificial intelligence (AI) and deep learning (DL) applications. The paper notes that buffaloes are often underrepresented in training datasets for computer vision, leading to misclassification in mixed-species environments, which complicates their identification due to visual similarities with other large mammals.

To address these challenges, the study evaluates the performance of two convolutional neural network (CNN) architectures, ResNet50 and EfficientNetB3, for buffalo classification in multi-species contexts. ResNet50 is recognized for its deep architecture that enhances gradient flow, while EfficientNetB3 employs a compound scaling strategy to achieve high accuracy with fewer parameters. Both models were fine-tuned using a custom dataset that includes buffaloes, elephants, rhinos, and zebras. The research aims to improve buffalo-specific AI tools, contributing to precision livestock monitoring and intelligent farm management systems. Key contributions include the implementation and optimization of these CNN architectures to enhance buffalo classification performance.

Methods

The proposed methodology for classifying four animal categories—buffalo, elephant, rhino, and zebra—utilizes deep transfer learning with two advanced architectures: ResNet50 and EfficientNetB3. The classification process is structured as an end-to-end pipeline that encompasses several stages, including data preprocessing, augmentation, feature extraction, classifier design, training, and evaluation. The dataset is well-balanced in terms of class distribution, ensuring equitable representation of each species.

To enhance the robustness of the model and mitigate overfitting, various data augmentation techniques were employed, such as horizontal flipping, zooming within a range of 0.2, and shearing also within a range of 0.2. Furthermore, all images were normalized to a range of [0, 1] to ensure consistent input across the dataset. This comprehensive approach aims to improve the classification accuracy of the animal categories through effective feature extraction and model training.

Results

In this study, we assessed the performance of two deep learning architectures, ResNet50 and EfficientNetB3, for the multi-class classification of animal images, specifically targeting the identification of four species: Buffalo, Elephant, Rhino, and Zebra. Both models demonstrated varying classification performance, with particular attention given to the Buffalo class, as optimizing its identification was the primary goal of the research. To ensure a robust evaluation of model performance, we employed 10-fold cross-validation, which mitigated variability and provided reliable estimates of each model’s generalization capabilities.

The results were analyzed using confusion matrices and classification reports, highlighting key evaluation metrics such as Precision, Recall, F1-score, and Support. These metrics are crucial for understanding the models’ effectiveness in classifying the targeted animal species, particularly in enhancing the accuracy of Buffalo identification.

Discussion

In this study, the performance of two deep learning architectures, ResNet50 and EfficientNetB3, was evaluated for multi-class classification of animal images, specifically focusing on accurately identifying buffalo among elephants, rhinos, and zebras. The results indicated that EfficientNetB3 outperformed ResNet50 in distinguishing buffalo, achieving a precision of 0.30, recall of 0.36, and an F1-score of 0.32, compared to ResNet50’s precision of 0.29, recall of 0.07, and F1-score of 0.11. The confusion matrices revealed significant misclassification of buffalo as elephants, highlighting the need for improved feature extraction and dataset augmentation tailored to buffalo characteristics.

The study employed an 80-20 training-validation split on a dataset of 4,000 images, applying real-time data augmentations to enhance model generalization. Both models utilized categorical cross-entropy loss and the Adam optimizer, with EfficientNetB3 demonstrating superior feature extraction capabilities, particularly in recognizing buffalo features. Future work will focus on addressing class imbalance through focal loss and exploring advanced techniques such as attention mechanisms to further enhance buffalo recognition while maintaining overall classification balance. This research lays the groundwork for optimizing buffalo identification in real-world applications, emphasizing the importance of selecting appropriate architectures for specific classification goals.