نحو تصنيف دقيق وفعال لصور النفايات: نهج هجين يجمع بين التعلم العميق والتعلم الآلي Towards accurate and efficient waste image classification: A hybrid deep learning and machine learning approach

المجلة: Ain Shams Engineering Journal، المجلد: 17، العدد: 4
DOI: https://doi.org/10.1016/j.asej.2026.104062
تاريخ النشر: 2026-03-07
المؤلف: Ngoc-Bao-Quang Nguyen وآخرون
الموضوع الرئيسي: إدارة النفايات الصلبة البلدية

نظرة عامة

تقدم الورقة البحثية تقييمًا منهجيًا لتصنيف النفايات المعتمد على الصور بشكل آلي، مع التأكيد على الحاجة إلى معايير قوية تشمل التعلم الآلي (ML) والتعلم العميق (DL) والمنهجيات الهجينة. تقارن الدراسة بين ثلاثة نماذج: (1) خوارزميات التعلم الآلي التقليدية التي تستخدم ميزات مصنوعة يدويًا، (2) هياكل التعلم العميق المختلفة، بما في ذلك متغيرات ResNet و EfficientNetV2S، و (3) نهج هجيني يجمع بين التعلم العميق لاستخراج الميزات مع المصنفات التقليدية مثل آلة الدعم الناقل والانحدار اللوجستي. تكشف التجارب التي أجريت على ثلاثة مجموعات بيانات عامة – TrashNet، وتصنيف النفايات، ومجموعة بيانات النفايات المنزلية المنقحة – أن الطريقة الهجينة تتفوق باستمرار على الأساليب الأخرى، محققة دقة تصل إلى 100% على TrashNet ومجموعة البيانات المنزلية المنقحة، و99.87% على تصنيف النفايات، مما يتجاوز المعايير الحالية.

تؤكد النتائج فعالية الإطار الهجين، خاصة قدرته على الحفاظ على دقة عالية مع تقليل أبعاد الميزات بشكل كبير بأكثر من 95%، مما يعزز سرعة التدريب والاستدلال. لا تؤسس هذه الدراسة معايير أكثر موثوقية لتصنيف النفايات فحسب، بل تسلط الضوء أيضًا على القابلية العملية للإطار المقترح في البيئات ذات الموارد المحدودة. تشمل اتجاهات البحث المستقبلية دمج تقنيات الذكاء الاصطناعي القابلة للتفسير لتعزيز قابلية تفسير توقعات النموذج بشكل أكبر.

مقدمة

تسلط المقدمة الضوء على القضية الملحة المتعلقة بتزايد النفايات المنزلية والصناعية، والتي تضغط على أنظمة إدارة النفايات وتستنفد سعة مدافن النفايات. يعد فرز النفايات بشكل فعال أمرًا ضروريًا للإدارة المستدامة، ومع ذلك لا يزال الفرز اليدوي سائدًا، مما يؤدي إلى عدم الكفاءة. لتعزيز تصنيف النفايات، ركزت الأبحاث بشكل متزايد على الذكاء الاصطناعي (AI)، وخاصة من خلال تقنيات التعلم الآلي (ML) والتعلم العميق (DL). بينما أظهرت الدراسات السابقة وعدًا مع الميزات المصنوعة يدويًا والمصنفات التقليدية، بالإضافة إلى الشبكات العصبية التلافيفية (CNNs)، هناك نقص ملحوظ في المقارنات المنهجية عبر هذه المنهجيات، خاصة فيما يتعلق بالنهج الهجينة التي تجمع بين التعلم العميق والمصنفات التقليدية.

تهدف هذه الدراسة إلى سد هذه الفجوات من خلال إجراء تقييم شامل للإطارات التقليدية للتعلم الآلي، والتعلم العميق، والهجينة لتصنيف النفايات المعتمد على الصور. يستخدم الإطار الهجين المقترح هيكل ResNet50 مخصص لاستخراج الميزات العميقة، والتي يتم دمجها بعد ذلك مع مصنفات التعلم الآلي مثل آلة الدعم الناقل (SVM) والانحدار اللوجستي (LR). بالإضافة إلى ذلك، تعالج الدراسة قضايا جودة البيانات من خلال إعادة تصنيف الصور التي تم تصنيفها بشكل خاطئ يدويًا في مجموعة بيانات النفايات المنزلية، مما يعزز موثوقية المعايير. تشمل المساهمات الرئيسية إجراء تقييم شامل عبر ثلاث مجموعات بيانات عامة، وتحسين سلامة البيانات من خلال تصحيح مجموعة البيانات، وإطار هجيني يحقق تقليلًا كبيرًا في الأبعاد مع الحفاظ على دقة عالية.

الطرق

ت outlines قسم المنهجية خط أنابيب تعلم آلي هجيني مصمم لتصنيف النفايات، يجمع بين الميزات المصنوعة يدويًا وميزات التعلم العميق لتعزيز أداء التصنيف. يستخدم خط الأنابيب مصنفات متنوعة، بما في ذلك الانحدار اللوجستي (LoR)، وأقرب الجيران (KNN)، وآلة الدعم الناقل (SVM)، وأشجار القرار (DT)، والغابات العشوائية (RF)، وتعزيز التدرج المتطرف (XGBoost)، وآلة تعزيز التدرج الخفيف (LightGBM)، لتقييم فعاليتها على بيانات الصور عالية الأبعاد. تشير النتائج إلى أنه بينما تحقق طرق التعلم الآلي التقليدية مع الميزات المصنوعة يدويًا أداءً معتدلاً، فإن الطرق التجميعية مثل XGBoost و LightGBM تحقق دقة تتراوح بين 78-85%، إلا أنها تتفوق عليها نماذج التعلم العميق، التي تتعلم تلقائيًا تمثيلات ميزات أغنى.

تظهر النتائج التجريبية أن نماذج التعلم العميق، وخاصة ResNet50 المخصص، تتجاوز بشكل كبير الطرق التقليدية عبر ثلاث مجموعات بيانات. على سبيل المثال، في مجموعة البيانات A، حقق ResNet50 المخصص دقة قدرها 97.37%، بينما وصل أفضل نموذج تقليدي (XGBoost) إلى 80.79% فقط. في مجموعة البيانات B، اقترب ResNet50 المخصص من الأداء المثالي بدقة 99.74%، متفوقًا على LightGBM بفارق 14.38%. حقق النهج الهجين، الذي يستخدم الميزات العميقة من ResNet50 المخصص كمدخلات للمصنفات التقليدية، أداءً أعلى، حيث حقق مصنفون مثل LoR دقة 100% في مجموعة البيانات C. تجمع هذه الاستراتيجية الهجينة بشكل فعال بين نقاط القوة في استخراج ميزات التعلم العميق مع تصنيف التعلم الآلي التقليدي، مما يؤدي إلى نتائج متفوقة ومستقرة عبر مجموعات بيانات متنوعة.

النتائج

يوفر قسم النتائج تقييمًا شاملاً للإطارات التصنيفية المقترحة المطبقة على ثلاث مجموعات بيانات متميزة. تم استخدام خط أنابيب معالجة مسبقة موحد لجميع الصور، والذي شمل تغيير الحجم إلى دقة $400 \times 400$ بكسل وتنفيذ تقنيات تعزيز البيانات لتحسين قوة النموذج وتقليل الإفراط في التكيف.

تركز التحليل على تقييم مقارن لثلاثة نماذج: التعلم الآلي التقليدي (ML)، والتعلم العميق (DL)، والنهج الهجين المقترح. يهدف هذا المقارنة إلى تحديد الإطار الأكثر فعالية لمهام التصنيف المعنية. يتم مناقشة مقاييس الأداء التفصيلية لكل نهج في الأقسام الفرعية التالية، مع تسليط الضوء على نقاط القوة والضعف لكل منهجية.

المناقشة

تسلط قسم المناقشة في الورقة البحثية الضوء على كفاءة وقابلية توسيع النموذج الهجين المقترح لتصنيف النفايات، الذي يحقق دقة متقدمة مع تقليل تكاليف الاستدلال بشكل كبير. يجعل هذا النموذج مناسبًا بشكل خاص للنشر في البيئات ذات الموارد المحدودة أو الأنظمة الذكية في الوقت الحقيقي. تم هيكلة الورقة لمراجعة الأعمال ذات الصلة حول التعلم العميق والإطارات الهجينة، تليها وصف تفصيلي للنهج المقترح، وإعداد التجارب، والتقييمات المقارنة. يتم تحليل النتائج من حيث القوة والكفاءة، مما يؤدي إلى ملخص للنتائج والاتجاهات المستقبلية.

كما يضع القسم تطور منهجيات تصنيف النفايات في سياقها، مقارنًا بين طرق التعلم الآلي التقليدية، التي تعتمد غالبًا على الميزات المصنوعة يدويًا وتظهر قابلية توسيع محدودة، وتقنيات التعلم العميق الحديثة التي تستفيد من الشبكات العصبية التلافيفية (CNNs). بينما أظهرت الشبكات العصبية التلافيفية دقة متفوقة، لا تزال التحديات قائمة بشأن متطلباتها الحاسوبية وكفاءتها للتطبيقات في الوقت الحقيقي. أظهرت التقدمات الأخيرة في النماذج الهجينة، التي تجمع بين استخراج ميزات التعلم العميق مع مصنفات التعلم الآلي التقليدية، وعدًا في تحقيق توازن بين الدقة والكفاءة. تؤكد النتائج على أهمية المعايير المنهجية التي تقيم الدقة والكفاءة وقابلية التوسع، والتي تهدف الدراسة الحالية إلى معالجتها من خلال تقييماتها.

Journal: Ain Shams Engineering Journal, Volume: 17, Issue: 4
DOI: https://doi.org/10.1016/j.asej.2026.104062
Publication Date: 2026-03-07
Author(s): Ngoc-Bao-Quang Nguyen et al.
Primary Topic: Municipal Solid Waste Management

Overview

The research paper presents a systematic evaluation of automated image-based garbage classification, emphasizing the need for robust benchmarks that encompass Machine Learning (ML), Deep Learning (DL), and hybrid methodologies. The study compares three paradigms: (1) traditional ML algorithms utilizing handcrafted features, (2) various deep learning architectures, including ResNet variants and EfficientNetV2S, and (3) a hybrid approach that combines deep learning for feature extraction with classical classifiers like Support Vector Machine and Logistic Regression. Experiments conducted on three public datasets—TrashNet, Garbage Classification, and a refined Household Garbage Dataset—reveal that the hybrid method consistently outperforms the other approaches, achieving up to 100% accuracy on TrashNet and the refined Household dataset, and 99.87% on Garbage Classification, thus exceeding existing benchmarks.

The findings underscore the hybrid framework’s effectiveness, particularly its ability to maintain high accuracy while significantly reducing feature dimensionality by over 95%, which enhances training and inference speed. This study not only establishes more reliable benchmarks for waste classification but also highlights the practical applicability of the proposed framework in resource-constrained environments. Future research directions include the integration of explainable AI techniques to further enhance the interpretability of the model’s predictions.

Introduction

The introduction highlights the urgent issue of escalating household and industrial waste, which strains waste management systems and depletes landfill capacity. Effective waste sorting is essential for sustainable management, yet manual sorting remains prevalent, leading to inefficiencies. To enhance waste classification, research has increasingly focused on Artificial Intelligence (AI), particularly through machine learning (ML) and deep learning (DL) techniques. While previous studies have shown promise with handcrafted features and classical classifiers, as well as with Convolutional Neural Networks (CNNs), there is a notable lack of systematic comparisons across these methodologies, especially regarding hybrid approaches that combine deep learning with traditional classifiers.

This study aims to fill these gaps by conducting a comprehensive evaluation of classical ML, DL, and hybrid frameworks for image-based waste classification. The proposed hybrid framework utilizes a customized ResNet50 backbone to extract deep features, which are then integrated with ML classifiers like Support Vector Machine (SVM) and Logistic Regression (LR). Additionally, the study addresses data quality issues by manually relabeling misclassified images in the Household Garbage dataset, thereby enhancing benchmark reliability. The main contributions include a thorough benchmarking across three public datasets, improved data integrity through dataset correction, and a hybrid framework that achieves significant dimensionality reduction while maintaining high accuracy.

Methods

The methodology section outlines a hybrid machine learning pipeline designed for waste classification, integrating both handcrafted and deep learning features to enhance classification performance. The pipeline employs various classifiers, including Logistic Regression (LoR), K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Decision Trees (DT), Random Forest (RF), Extreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM), to evaluate their effectiveness on high-dimensional image data. The results indicate that while classical machine learning methods with handcrafted features yield moderate performance, particularly ensemble methods like XGBoost and LightGBM achieving accuracies between 78-85%, they are outperformed by deep learning models, which automatically learn richer feature representations.

Experimental results demonstrate that deep learning models, particularly a customized ResNet50, significantly surpass traditional methods across three datasets. For instance, in dataset A, the customized ResNet50 achieved an accuracy of 97.37%, while the best classical model (XGBoost) only reached 80.79%. In dataset B, the customized ResNet50 nearly saturated performance with 99.74% accuracy, outperforming LightGBM by 14.38%. The hybrid approach, utilizing deep features from the customized ResNet50 as input for traditional classifiers, yielded even higher performance, with classifiers like LoR achieving 100% accuracy in dataset C. This hybrid strategy effectively combines the strengths of deep learning feature extraction with classical machine learning classification, leading to superior and stable results across diverse datasets.

Results

The results section provides a thorough assessment of the proposed classification frameworks applied to three distinct datasets. A standardized preprocessing pipeline was utilized for all images, which included resizing to a resolution of $400 \times 400$ pixels and implementing data augmentation techniques to improve model robustness and mitigate overfitting.

The analysis focuses on a comparative evaluation of three modeling paradigms: classical machine learning (ML), deep learning (DL), and the proposed hybrid approach. This comparison aims to identify the most effective framework for the classification tasks at hand. Detailed performance metrics for each approach are discussed in subsequent subsections, highlighting the strengths and weaknesses of each methodology.

Discussion

The discussion section of the research paper highlights the efficiency and scalability of the proposed hybrid model for waste classification, which achieves state-of-the-art accuracy while significantly reducing inference costs. This makes it particularly suitable for deployment in resource-constrained environments or real-time intelligent systems. The paper is structured to first review related work on deep learning and hybrid frameworks, followed by a detailed description of the proposed approach, experimental setup, and comparative evaluations. The results are analyzed for robustness and efficiency, culminating in a summary of findings and future directions.

The section also contextualizes the evolution of waste classification methodologies, contrasting traditional machine learning approaches, which often rely on handcrafted features and exhibit limited scalability, with modern deep learning techniques that leverage Convolutional Neural Networks (CNNs). While CNNs have demonstrated superior accuracy, challenges remain regarding their computational demands and efficiency for real-time applications. Recent advancements in hybrid models, which combine deep learning feature extraction with classical machine learning classifiers, have shown promise in balancing accuracy and efficiency. The findings underscore the importance of systematic benchmarks that assess accuracy, efficiency, and scalability, which the current study aims to address through its evaluations.