نحو عمليات فولاذية أكثر أمانًا مع إطار متعدد النماذج لتنبؤ الحوادث ومحاكاة تقييم المخاطر Towards safer steel operations with a multi model framework for accident prediction and risk assessment simulation

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-96028-0
PMID: https://pubmed.ncbi.nlm.nih.gov/40246914
تاريخ النشر: 2025-04-17
المؤلف: Shatrudhan Pandey وآخرون
الموضوع الرئيسي: تحليل المخاطر والسلامة

نظرة عامة

تقدم هذه الدراسة إطار عمل متعدد النماذج يدمج بين منهجيات الذكاء الاصطناعي (AI) المختلفة، بما في ذلك الشبكات البايزية (BN)، ونماذج التعلم الآلي (ML)، ومعالجة اللغة الطبيعية (NLP) مع تحليل المشاعر، والنمذجة المعتمدة على الوكلاء (ABM)، وتحليل البقاء، لتعزيز النمذجة التنبؤية لأسباب الحوادث في صناعات الصلب عالية المخاطر. توضح الدراسة أن الاستخدام المشترك لهذه الأساليب الذكاء الاصطناعي يحسن بشكل كبير من دقة التنبؤ مقارنة بالطرق التقليدية. على وجه الخصوص، تحدد الشبكات البايزية ظروف الآلات والأخطاء البشرية التي تسهم في الحوادث، بينما تركز آلات تعزيز التدرج (GBM) على الحوادث المتعلقة بالمعدات. يكشف تحليل NLP عن المشاعر السلبية المرتبطة بعدم الامتثال لبروتوكولات السلامة، وتؤكد محاكاة ABM على أهمية الالتزام بمعدات الحماية الشخصية (PPE) وصيانة الآلات. يبرز تحليل البقاء الدور الحاسم للتدخلات في الوقت المناسب في التخفيف من الحوادث الخطيرة.

تؤكد النتائج على نقاط القوة التكميلية لكل نموذج ضمن إطار العمل متعدد النماذج، مما يثبت الفرضية بأن هذا النهج يتفوق على التحليلات التقليدية ذات الطريقة الواحدة. تلتقط الشبكات البايزية الاعتماديات الاحتمالية، وتتفوق نماذج التعلم الآلي في التعرف على الأنماط، وتثري NLP الرؤى النوعية من تقارير الحوادث غير المنظمة. يسهل دمج ABM وتحليل البقاء تحديد فترات التدخلات الأمنية الحرجة. كما تحدد الدراسة القيود، مثل الحاجة إلى التحقق الخارجي من النتائج وإمكانية الدراسات المستقبلية لدمج البيانات في الوقت الحقيقي من إنترنت الأشياء (IoT) لتحسين إدارة المخاطر. بشكل عام، يوفر الإطار نموذجًا قويًا وديناميكيًا لتنبؤ الحوادث وتطوير استراتيجيات سلامة استباقية، مع إعطاء الأولوية لعوامل الخطر الرئيسية مثل “السبب الرئيسي”، “حالة الآلة”، و”الامتثال لمعدات الحماية الشخصية”.

الطرق

يقدم قسم المنهجية في ورقة البحث نظرة عامة مفصلة عن الأساليب المستخدمة في الدراسة. يوضح التقنيات المحددة للذكاء الاصطناعي (AI) المستخدمة، مع التأكيد على صلتها بأهداف البحث. بالإضافة إلى ذلك، يتضمن القسم صيغًا رياضية تدعم هذه الأساليب الذكاء الاصطناعي، مما يوفر إطار عمل صارم لفهم تنفيذها وفعاليتها في معالجة أسئلة البحث. يضمن هذا النقاش الشامل وضوح الخيارات المنهجية وأسسها النظرية.

النتائج

تكشف نتائج الدراسة التي تستخدم نماذج التعلم الآلي (ML)، وتحديدًا آلات تعزيز التدرج (GBM/XGBoost)، عن رؤى مهمة حول أهمية الميزات وأداء النموذج. تشمل المتنبئات الرئيسية المحددة “السبب الرئيسي”، “نوع الملاحظة”، و”الموقع”، والتي ترتبط بقوة بفئات الحوادث العالية، بينما يكون لـ “PPE” و”حالة الآلة” تأثير أقل. تشير مصفوفة الالتباس إلى أن النموذج يتفوق في التنبؤ بالفئات الشائعة مثل “تلف ممتلكات المعدات” و”الانحراف”، ولكنه يواجه صعوبة مع الأحداث النادرة مثل “الوفيات” و”الجسم الغريب”، التي سجلت تصنيفات صحيحة صفرية بسبب تمثيل البيانات المحدود.

تظهر منحنيات ROC مناطق مختلفة تحت منحنى (AUC) مع قوة تمييز قوية للحوادث المتكررة مثل “الضخ” (AUC = 0.97) و”الإفراج السام” (AUC = 0.91)، بينما تظهر أداءً ضعيفًا للفئات النادرة مثل “المكابس” (AUC = 0.23) و”12 خياطة” (AUC = 0.06). بشكل عام، حقق النموذج دقة اختبار تبلغ 53.55% ومتوسط F1-score ماكرو قدره 0.30، مما يشير إلى أداء معتدل بشكل أساسي للفئات الأكثر تكرارًا. تشير الفجوة بين دقة التدريب (90.57%) ودقة الاختبار إلى احتمال حدوث فرط التخصيص، مما يبرز الحاجة إلى مزيد من التحسين والاستراتيجيات لمعالجة عدم توازن الفئات، مثل إعادة أخذ العينات أو تحسين الميزات، لتحسين الدقة التنبؤية عبر جميع فئات الحوادث.

المناقشة

تسلط قسم المناقشة في ورقة البحث الضوء على أهمية دمج أساليب النمذجة المختلفة لتعزيز التنبؤ بالحوادث وإدارة المخاطر في صناعة الصلب. تُعترف الشبكات البايزية (BN) بقدرتها على التقاط الاعتماديات وعدم اليقين، ومع ذلك قد تتجاهل العلاقات غير الخطية المعقدة. بالمقابل، تُظهر نماذج التعلم الآلي (ML)، بما في ذلك الغابات العشوائية (RF) وآلات تعزيز التدرج (GBM/XGBoost)، قدرات تنبؤية متفوقة من خلال التعرف التلقائي على الأنماط في مجموعات البيانات الكبيرة. ومع ذلك، غالبًا ما تعمل نماذج ML كـ “صناديق سوداء”، مما يحد من قابلية التفسير وفهم أسباب الحوادث. يهدف دمج النماذج الاحتمالية مع ML إلى تحسين الشفافية والدقة في التنبؤات.

كما يتم التأكيد على معالجة اللغة الطبيعية (NLP) لدورها في تحليل البيانات غير المنظمة، مثل تقارير الحوادث، لاستخراج رؤى قيمة. تدعو الورقة إلى إطار عمل متعدد النماذج للذكاء الاصطناعي يجمع بين BN وGBM/XGBoost وRF وNLP مع تحليل المشاعر، والنمذجة المعتمدة على الوكلاء (ABM)، وتحليل البقاء. يهدف هذا النهج الشامل إلى معالجة تعقيدات أسباب الحوادث وإدارة المخاطر، مما يوفر منظورًا شاملًا يفتقر حاليًا في صناعة الصلب. من خلال الاستفادة من نقاط القوة لكل نموذج، يعزز الإطار المقترح الدقة التنبؤية ويدعم تطوير استراتيجيات التخفيف من المخاطر الاستباقية، بهدف تقليل الحوادث والوفيات في البيئات الخطرة.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-96028-0
PMID: https://pubmed.ncbi.nlm.nih.gov/40246914
Publication Date: 2025-04-17
Author(s): Shatrudhan Pandey et al.
Primary Topic: Risk and Safety Analysis

Overview

This research presents a multi-model framework that integrates various artificial intelligence (AI) methodologies, including Bayesian Networks (BN), Machine Learning (ML) models, Natural Language Processing (NLP) with Sentiment Analysis, Agent-Based Modeling (ABM), and Survival Analysis, to enhance predictive modeling of accident causation in high-risk steel industries. The study demonstrates that the combined use of these AI approaches significantly improves prediction accuracy compared to traditional methods. Specifically, BN identifies machine conditions and human errors contributing to accidents, while Gradient Boosting Machines (GBM) focus on equipment-related incidents. NLP analysis reveals negative sentiments associated with safety protocol non-compliance, and ABM simulations emphasize the importance of personal protective equipment (PPE) adherence and machine maintenance. Survival analysis highlights the critical role of timely interventions in mitigating severe accidents.

The findings underscore the complementary strengths of each model within the multi-model framework, validating the hypothesis that this approach outperforms conventional single-method analyses. BN captures probabilistic dependencies, ML models excel in pattern recognition, and NLP enriches qualitative insights from unstructured incident reports. The integration of ABM and survival analysis facilitates the identification of critical safety intervention intervals. The research also identifies limitations, such as the need for external validation of findings and the potential for future studies to incorporate real-time data from IoT for enhanced risk management. Overall, the framework provides a robust and dynamic model for predicting accidents and developing proactive safety strategies, prioritizing key risk factors like “Primary Cause,” “Machine Condition,” and “PPE Compliance.”

Methods

The methodology section of the research paper presents a detailed overview of the approaches employed in the study. It outlines the specific artificial intelligence (AI) techniques utilized, emphasizing their relevance to the research objectives. Additionally, the section includes mathematical formulations that underpin these AI methods, providing a rigorous framework for understanding their implementation and effectiveness in addressing the research questions. This comprehensive discussion ensures clarity in the methodological choices and their theoretical foundations.

Results

The results of the study employing machine learning (ML) models, specifically Gradient Boosting Machines (GBM/XGBoost), reveal significant insights into feature importance and model performance. Key predictors identified include “Primary Cause,” “Observation Type,” and “Location,” which are strongly associated with high-incident categories, while “PPE” and “Machine Condition” have lesser influence. The confusion matrix indicates that the model excels in predicting common classes like “Equipment Property Damage” and “Derailment,” but struggles with rare events such as “Fatality” and “Foreign Body,” which recorded zero correct classifications due to limited data representation.

The ROC curves demonstrate varying areas under the curve (AUC) scores, with strong discriminatory power for frequent incidents like “Pumping” (AUC = 0.97) and “Toxic Release” (AUC = 0.91), while showing poor performance for rare categories like “Winders” (AUC = 0.23) and “12 Seam” (AUC = 0.06). Overall, the model achieved a test accuracy of 53.55% and a macro average F1-score of 0.30, indicating moderate performance primarily for more frequent classes. The disparity between training (90.57%) and test accuracy suggests potential overfitting, highlighting the need for further optimization and strategies to address class imbalance, such as resampling or enhanced feature engineering, to improve predictive accuracy across all incident categories.

Discussion

The discussion section of the research paper highlights the significance of integrating various modeling approaches to enhance accident prediction and risk management in the steel industry. Bayesian Networks (BN) are recognized for their ability to capture dependencies and uncertainties, yet they may overlook complex nonlinear relationships. In contrast, Machine Learning (ML) models, including Random Forest (RF) and Gradient Boosting Machines (GBM/XGBoost), demonstrate superior predictive capabilities by automatically identifying patterns in extensive datasets. However, ML models often function as “black boxes,” limiting interpretability and understanding of accident causation. The integration of probabilistic models with ML aims to improve transparency and accuracy in predictions.

Natural Language Processing (NLP) is also emphasized for its role in analyzing unstructured data, such as incident reports, to extract valuable insights. The paper advocates for a multi-model AI framework that combines BN, GBM/XGBoost, RF, NLP with sentiment analysis, Agent-Based Modeling (ABM), and Survival Analysis. This comprehensive approach aims to address the complexities of accident causation and risk management, providing a holistic perspective that is currently lacking in the steel industry. By leveraging the strengths of each model, the proposed framework enhances predictive accuracy and supports the development of proactive risk mitigation strategies, ultimately aiming to reduce accidents and fatalities in hazardous environments.