إطار عمل التعلم العميق لتوقعات سلسلة التوريد القابلة للتفسير باستخدام SOM ANN وSHAP Deep learning framework for interpretable supply chain forecasting using SOM ANN and SHAP

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-11510-z
PMID: https://pubmed.ncbi.nlm.nih.gov/40685434
تاريخ النشر: 2025-07-20
المؤلف: Khandakar Rabbi Ahmed وآخرون
الموضوع الرئيسي: تقنيات التنبؤ وتطبيقاتها

نظرة عامة

تقدم البحث إطار عمل جديد للتحليلات التنبؤية لإدارة سلسلة التوريد (SCM) يدمج الخرائط الذاتية التنظيم (SOM) وتحليل المكونات الرئيسية (PCA) والشبكات العصبية الاصطناعية (ANN) لتعزيز دقة التنبؤ واتخاذ القرار في سياق الصناعة 4.0. تعاني ممارسات SCM التقليدية من صعوبة التعامل مع عدم اليقين في الوقت الحقيقي بسبب التعقيد المتزايد وتقلب شبكات سلسلة التوريد. نموذج SOM+ANN المقترح، الذي تم التحقق من صحته باستخدام مجموعة بيانات DataCo Smart Supply Chain، تفوق بشكل كبير على تقنيات التعلم الآلي التقليدية مثل Random Forest وXGBoost وDecision Trees، محققًا $R^2$ قدره 0.92، وخطأ الجذر التربيعي المتوسط (RMSE) قدره 0.936، وخطأ مطلق متوسط (MAE) قدره 0.8459 لتنبؤات مدة الشحن. بالإضافة إلى ذلك، أظهر النموذج دقة تصنيف قدرها 96% ودرجة F1 قدرها 96.22%.

يؤكد البحث على أهمية القابلية للتفسير في النماذج المدفوعة بالذكاء الاصطناعي، باستخدام SHAP (تفسيرات شابلي الإضافية) لتوضيح كيفية إجراء التنبؤات، مما يعالج مشكلة “الصندوق الأسود” المرتبطة غالبًا بالتعلم العميق. بينما يظهر النموذج نتائج واعدة، يُلاحظ أن اعتماده على البيانات التاريخية المنظمة قد يحد من استجابته للتغيرات في الوقت الحقيقي ومصادر البيانات غير المنظمة. علاوة على ذلك، قد تشكل المتطلبات الحسابية للتعلم العميق تحديات للمؤسسات الصغيرة إلى المتوسطة. تشمل اتجاهات البحث المستقبلية تعديل الإطار لدمج تدفقات البيانات في الوقت الحقيقي واختبارها عبر صناعات متنوعة لتعزيز قابليتها للتطبيق وقابليتها للتوسع، مما يسهم في الأدبيات حول الذكاء الاصطناعي في إدارة سلسلة التوريد.

الطرق

تقدم المنهجية المقترحة إطار عمل منهجي مصمم لتعزيز معالجة البيانات، واستخراج الميزات، وتدريب النماذج للتحليلات التنبؤية. تبدأ العملية بمعالجة البيانات، والتطبيع، ومعالجة القيم المفقودة، تليها تطبيق تحليل المكونات الرئيسية (PCA) لتقليل الأبعاد مع الحفاظ على الأنماط الأساسية. ثم يتم استخدام الخرائط الذاتية التنظيم (SOM) لتجميع الأنماط المتشابهة، مع تصور U-Matrix لهيكل البيانات لتسهيل فهم توزيعات التجمعات. يتم إدخال الميزات المستمدة من هذه التجمعات بعد ذلك في شبكة عصبية اصطناعية (ANN) للتصنيف والتنبؤ، باستخدام مخطط تعلم التراجع المحسن من خلال ضبط المعلمات الفائقة.

تقدم هذه الدراسة إطار عمل هجين جديد، يسمى PCA-SOM-GA-BPNN، الذي يدمج PCA وSOM وANN بطريقة متماسكة، محققًا عدة ابتكارات رئيسية. من الجدير بالذكر أن الجمع بين PCA وSOM يوفر تمثيلات قابلة للتفسير ومنخفضة الأبعاد لبيانات سلسلة التوريد المعقدة، وهي استراتيجية نادرًا ما يتم تطبيقها في هذا السياق. تعمل U-Matrix ليس فقط كأداة تصور ولكن أيضًا تساعد في اكتشاف الهيكل الاستكشافي، معززة بـ SHAP لقابلية تفسير النموذج. علاوة على ذلك، يتم استخدام الخوارزميات الجينية (GA) لتحسين الهيكل والأوزان الخاصة بـ ANN، مما يحسن التعميم عبر البيانات غير المرئية. يتم التحقق من صحة المنهجية تجريبيًا من خلال تحليل إحصائي شامل، مما يظهر تحسينات كبيرة مقارنة بالطرق التقليدية، ويتم وضعها كإطار عمل قابل لإعادة الاستخدام لتحديات النمذجة التنبؤية المنظمة حيث تكون القابلية للتفسير والتحسين أمرًا بالغ الأهمية.

المناقشة

تسلط قسم المناقشة في ورقة البحث الضوء على التأثير التحويلي للتحليلات التنبؤية والذكاء الاصطناعي (AI) على إدارة سلسلة التوريد (SCM). أصبحت الطرق التقليدية للتحسين غير كافية بشكل متزايد في مواجهة التعقيدات وعدم اليقين المتأصل في سلاسل التوريد. تؤكد الدراسات الحديثة على دور التعلم الآلي (ML) والتعلم العميق (DL) في تعزيز دقة التنبؤ وكفاءة العمليات. تشير النتائج الرئيسية إلى أنه بينما يمكن أن تحسن تحليلات البيانات الضخمة (BDA) النتائج المستدامة بشكل كبير في SCM، لا تزال التحديات مثل الاعتماد على البيانات المبلغ عنها ذاتيًا، والعمومية المحدودة عبر الصناعات، وتعقيد الحسابات لنماذج DL قائمة. علاوة على ذلك، لا يزال دمج الذكاء الاصطناعي في أنظمة سلسلة التوريد الحالية عقبة كبيرة بسبب البنى التحتية القديمة.

تتوسع هذه القسم في التقدم في تطبيقات DL لتنبؤ سلسلة التوريد، مشيرًا إلى أن النماذج الهجينة، مثل تلك التي تجمع بين الشبكات العصبية الذاكرة الطويلة القصيرة (LSTM) والشبكات العصبية التلافيفية (CNNs)، قد تفوقت على الطرق التقليدية مثل ARIMA. ومع ذلك، تحدد الأدبيات عقبات حاسمة، بما في ذلك نقص القابلية للتفسير في نماذج الذكاء الاصطناعي والحاجة إلى موارد حسابية كبيرة، مما يمكن أن يعيق التبني، خاصة بين المؤسسات الصغيرة والمتوسطة (SMEs). تؤكد المناقشة على ضرورة إجراء المزيد من الأبحاث لسد الفجوة بين التحليلات التنبؤية ومرونة سلسلة التوريد، فضلاً عن تعزيز دمج تقنيات الذكاء الاصطناعي ضمن الأطر الحالية لتحقيق إمكاناتها بالكامل في تحسين عمليات سلسلة التوريد.

القيود

تعتبر قيود الإطار المقترح الذي يجمع بين الخرائط الذاتية التنظيم (SOM) والشبكات العصبية الاصطناعية (ANN) كبيرة، مما يؤثر على فعاليته وقابليته للتعميم عبر سياقات سلسلة التوريد المتنوعة. يحد اعتماد الدراسة على مجموعتين فقط من البيانات من القدرة على استنتاج النتائج بشكل واسع، مما يتطلب مزيدًا من الاختبارات على مجموعة واسعة من مجموعات البيانات ذات الخصائص المتنوعة لتقييم قوة النموذج. أثر عدم توازن الفئات في مجموعة البيانات الثانية، حيث كانت حالات التسليم المتأخر أقل بكثير من التسليم في الوقت المحدد، سلبًا على دقة وموثوقية تنبؤات النموذج، حيث لم يتم تنفيذ أي استراتيجيات لموازنة البيانات. لم يؤثر هذا التوازن فقط على أداء النموذج ولكن أيضًا أدخل تحيزًا نحو الفئة الغالبة، مما يهدد عدالته وحساسيته في تحديد حالات الفئة الأقل.

بالإضافة إلى ذلك، قد يواجه مكون SOM تحديات في قابلية التوسع عند تطبيقه على مجموعات بيانات كبيرة أو عالية الأبعاد، مما يؤدي إلى زيادة وقت التدريب ومتطلبات الموارد. تفتقر بنية ANN الثابتة المستخدمة في الدراسة إلى تحسين ديناميكي أو ضبط معلمات فائقة مصممة خصيصًا لمجموعات بيانات معينة، مما قد يحد من الأداء التنبؤي العام. بينما يتم استخدام SHAP لتعزيز شفافية النموذج، قد لا تلتقط بشكل كافٍ التفاعلات غير الخطية المعقدة داخل إطار SOM-ANN، مما يشير إلى الحاجة لاستكشاف طرق تفسير بديلة. علاوة على ذلك، قد تعيق غياب العوامل الزمنية أو السياقية الخارجية—مثل الاتجاهات السوقية، أو الطقس، أو الأحداث الجيوسياسية—دقة النموذج التنبؤية في سيناريوهات سلسلة التوريد في العالم الحقيقي. تظل فعالية الإطار في البيئات التشغيلية غير مختبرة، مما يبرز ضرورة إجراء أبحاث مستقبلية لتقييم النشر في الوقت الحقيقي ومقارنة نهج SOM-ANN مع هياكل التعلم العميق المتقدمة الأخرى المصممة لبيانات سلسلة التوريد العلائقية أو التسلسلية.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-11510-z
PMID: https://pubmed.ncbi.nlm.nih.gov/40685434
Publication Date: 2025-07-20
Author(s): Khandakar Rabbi Ahmed et al.
Primary Topic: Forecasting Techniques and Applications

Overview

The research presents a novel predictive analytics framework for supply chain management (SCM) that integrates Self-Organizing Maps (SOM), Principal Component Analysis (PCA), and Artificial Neural Networks (ANN) to enhance forecasting accuracy and decision-making in the context of Industry 4.0. Traditional SCM practices struggle to address real-time uncertainties due to the increasing complexity and volatility of supply chain networks. The proposed SOM+ANN model, validated using the DataCo Smart Supply Chain dataset, significantly outperformed conventional machine learning techniques such as Random Forest, XGBoost, and Decision Trees, achieving an $R^2$ of 0.92, Root Mean Squared Error (RMSE) of 0.936, and Mean Absolute Error (MAE) of 0.8459 for shipping duration predictions. Additionally, the model demonstrated a classification accuracy of 96% and an F1-score of 96.22%.

The research emphasizes the importance of interpretability in AI-driven models, utilizing SHAP (Shapley Additive exPlanations) to clarify how predictions are made, thereby addressing the “black-box” issue often associated with deep learning. While the model shows promising results, it is noted that its reliance on structured historical data may limit its responsiveness to real-time changes and unstructured data sources. Furthermore, the computational demands of deep learning may pose challenges for small to medium-sized enterprises. Future research directions include adapting the framework to incorporate real-time data streams and testing across diverse industries to enhance its applicability and scalability, ultimately contributing to the literature on AI in supply chain management.

Methods

The proposed methodology presents a systematic framework designed to enhance data treatment, feature extraction, and model training for predictive analytics. The process begins with data preprocessing, normalization, and addressing missing values, followed by the application of Principal Component Analysis (PCA) for dimensionality reduction while preserving essential patterns. Self-Organizing Maps (SOM) are then employed to cluster similar patterns, with the U-Matrix visualizing the data structure to facilitate understanding of cluster distributions. The features derived from these clusters are subsequently input into an Artificial Neural Network (ANN) for classification and forecasting, utilizing a backpropagation learning scheme optimized through hyperparameter tuning.

This study introduces a novel hybrid framework, termed PCA-SOM-GA-BPNN, which integrates PCA, SOM, and ANN in a cohesive manner, achieving several key innovations. Notably, the combination of PCA and SOM provides interpretable, low-dimensional representations of complex supply chain data, a strategy rarely applied in this context. The U-Matrix serves not only as a visualization tool but also aids in exploratory structure discovery, enhanced by SHAP for model interpretability. Furthermore, Genetic Algorithms (GA) are utilized for meta-optimization of the ANN’s architecture and weights, improving generalization across unseen data. The methodology is empirically validated through comprehensive statistical analysis, demonstrating significant improvements over traditional methods, and is positioned as a reusable framework for structured predictive modeling challenges where interpretability and optimization are paramount.

Discussion

The discussion section of the research paper highlights the transformative impact of predictive analytics and artificial intelligence (AI) on supply chain management (SCM). Traditional optimization methods are increasingly inadequate in the face of the complexities and uncertainties inherent in supply chains. Recent studies emphasize the role of machine learning (ML) and deep learning (DL) in enhancing forecasting accuracy and operational efficiency. Key findings indicate that while Big Data Analytics (BDA) can significantly improve sustainable outcomes in SCM, challenges such as reliance on self-reported data, limited generalizability across industries, and the computational complexity of DL models persist. Moreover, the integration of AI into existing supply chain systems remains a significant hurdle due to legacy infrastructures.

The section further elaborates on the advancements in DL applications for supply chain forecasting, noting that hybrid models, such as those combining Long Short-Term Memory (LSTM) networks with Convolutional Neural Networks (CNNs), have outperformed traditional methods like ARIMA. However, the literature identifies critical obstacles, including the lack of explainability in AI models and the need for substantial computational resources, which can hinder adoption, particularly among small and medium-sized enterprises (SMEs). The discussion underscores the necessity for further research to bridge the gap between predictive analytics and supply chain resilience, as well as to enhance the integration of AI technologies within existing frameworks to fully realize their potential in optimizing supply chain operations.

Limitations

The limitations of the proposed framework combining Self-Organizing Maps (SOM) and Artificial Neural Networks (ANN) are significant, impacting its effectiveness and generalizability across diverse supply chain contexts. The study’s reliance on only two datasets restricts the ability to extrapolate findings broadly, necessitating further testing on a wider array of datasets with varying characteristics to assess the model’s robustness. The second dataset’s class imbalance, where instances of late deliveries are notably fewer than on-time deliveries, adversely affected the model’s prediction accuracy and reliability, as no data balancing strategies were implemented. This imbalance not only diminished the model’s performance but also introduced bias towards the majority class, thereby compromising its fairness and sensitivity in identifying minority-class cases.

Additionally, the SOM component may face scalability challenges when applied to large or high-dimensional datasets, leading to increased training time and resource demands. The fixed ANN architecture used in the study lacks dynamic optimization or hyperparameter tuning tailored to specific datasets, potentially limiting overall predictive performance. While SHAP is employed to enhance model transparency, it may not adequately capture the complex nonlinear interactions within the SOM-ANN framework, suggesting a need for exploring alternative interpretability methods. Furthermore, the absence of external temporal or contextual factors—such as market trends, weather, or geopolitical events—could hinder the model’s predictive accuracy in real-world supply chain scenarios. The framework’s efficacy in operational environments remains untested, highlighting the necessity for future research to evaluate real-time deployment and compare the SOM-ANN approach with other advanced deep learning architectures designed for relational or sequential supply chain data.