التقييم المقارن لنماذج التعلم الآلي لرسم خرائط قابلية الانزلاق: التركيز على التحقق والدقة Comparative assessment of machine learning models for landslide susceptibility mapping: a focus on validation and accuracy

المجلة: Natural Hazards، المجلد: 121، العدد: 9
DOI: https://doi.org/10.1007/s11069-025-07197-0
تاريخ النشر: 2025-03-13
المؤلف: Mohamed M. Abdelkader وآخرون
الموضوع الرئيسي: الانهيارات الأرضية والمخاطر المرتبطة بها

نظرة عامة

تستكشف هذه الدراسة فعالية ثلاثة نماذج تعلم آلي—الانحدار اللوجستي (LR)، الغابة العشوائية (RF)، وآلة الدعم المتجه (SVM)—في إنشاء خرائط عرضة الانزلاق الأرضي الدقيقة (LSM) لمنطقة جبلية تقع جنوب شرق القاهرة، مصر. باستخدام مجموعة بيانات تتكون من 183 موقع انزلاق أرضي و183 موقع غير انزلاق أرضي، استخدمت الدراسة أربعة عشر عاملاً شرطياً عبر فئات متنوعة، بما في ذلك المتغيرات الطبوغرافية والهيدرولوجية. بينما حققت جميع النماذج درجات عالية من منطقة تحت منحنى خاصية التشغيل المستقبلية (ROC-AUC) (RF: 0.95، SVM: 0.90، LR: 0.88)، أظهرت التقييمات الإضافية باستخدام مقاييس أداء أخرى أن نموذج RF تفوق على الآخرين من حيث الدقة، والاسترجاع، والدقة، ودرجة F1، مما يدل على موثوقيته في تصنيف المناطق المعرضة للانزلاق الأرضي بشكل صحيح وتقليل الأخطاء في التصنيف.

تسلط الدراسة الضوء أيضًا على أهمية العوامل المختلفة التي تؤثر على حدوث الانزلاقات الأرضية، حيث أظهرت العوامل الطبوغرافية، وخاصة الانحدار، التأثير الأكبر. كانت الأنشطة البشرية والعوامل الهيدرولوجية أيضًا ذات أهمية، بينما كانت العوامل المحفزة مثل الأمطار والزلازل لها تأثير أقل. تؤكد النتائج على ضرورة استخدام نهج شامل لتقييم النماذج يتجاوز مقاييس ROC-AUC، داعيةً إلى نماذج لا تظهر فقط دقة تنبؤية عالية ولكن أيضًا موثوقية عملية لإدارة مخاطر الانزلاق الأرضي بشكل فعال. إن الأداء المتفوق لنموذج RF يجعله الأداة الأكثر موثوقية لتخفيف المخاطر في المجتمعات الضعيفة.

مقدمة

تسلط مقدمة هذه الورقة البحثية الضوء على الزيادة المتزايدة في حدوث الانزلاقات الأرضية في المناطق الجبلية، والتي تفاقمت بسبب تغير المناخ العالمي والتحضر، مما أدى إلى فقدان كبير في الأرواح والبنية التحتية. يتم التأكيد على ضرورة وجود خرائط عرضة للانزلاق الأرضي موثوقة (LSMs)، حيث تعتبر هذه الأدوات ضرورية لتوقع حدوث الانزلاقات الأرضية وإدارة مخاطر الكوارث. يتم مناقشة منهجيات مختلفة لإنشاء LSMs، بما في ذلك الأساليب النوعية، والفيزيائية، والكمية. بينما تعتمد الأساليب النوعية على حكم الخبراء، تعتمد الأساليب الفيزيائية على القوانين الأساسية للفيزياء والبيانات التفصيلية، مما يجعلها أقل عملية للمناطق الكبيرة. على النقيض من ذلك، اكتسبت الأساليب الكمية، وخاصة تلك التي تستخدم التعلم الآلي (ML)، زخمًا لقدرتها على تحليل الأنماط المعقدة وإنتاج نماذج تنبؤية.

على الرغم من التقدم في تطبيقات ML لـ LSMs، تكشف الأدبيات عن نقص في النماذج الموثوقة عالميًا القابلة للتطبيق عبر مناطق متنوعة، حيث تختلف الأداء بشكل كبير بسبب عوامل مثل جودة البيانات والخصائص الجغرافية. تحدد الورقة فجوة حاسمة في الاعتماد على منطقة تحت منحنى خاصية التشغيل المستقبلية (ROC-AUC) كمقياس التحقق الأساسي، مما قد يؤدي إلى تفسيرات مضللة لفعالية النموذج. لمعالجة هذه القضايا، تقترح الدراسة إطارًا شاملاً يدمج تقنيات ML مع مقاييس تقييم قوية، بما في ذلك ROC-AUC، والدقة، والاسترجاع، ودرجة F1، جنبًا إلى جنب مع تقنيات التحقق المكاني. بالإضافة إلى ذلك، يهدف استخدام قيم SHapley Additive exPlanations (SHAP) إلى تعزيز قابلية تفسير النموذج من خلال تحديد العوامل الرئيسية التي تؤثر على عرضة الانزلاق الأرضي. تسعى هذه الدراسة إلى تحسين موثوقية LSM في المناطق الجافة، مما يساهم في تقديم رؤى قيمة لتخطيط استخدام الأراضي وتقليل مخاطر الكوارث، وتأسيس معيار للبحوث المستقبلية في هذا المجال.

الطرق

في هذه الدراسة، استخدم المؤلفون طرق التعلم الآلي (ML) لتقييم عرضة الانزلاق الأرضي، معترفين بأهمية هذه التقنيات المتزايدة في تحليل المخاطر الجيوتقنية بسبب قدرتها على إدارة التعقيدات المتأصلة في الأنظمة الجيوتقنية (Zhang et al. 2020; Phoon and Zhang 2023). تم استخدام ثلاثة نهج ML على وجه التحديد: الانحدار اللوجستي (LR)، الغابة العشوائية (RF)، وآلة الدعم المتجه (SVM). تم تقييم فعالية هذه الطرق من حيث قدرتها على رسم عرضة الانزلاق الأرضي بدقة، مما يساهم في تحسين النمذجة التنبؤية في التطبيقات الجيوتقنية.

النتائج

تشير نتائج التحقق من النموذج إلى أن جميع النماذج التي تم فحصها حققت قيم ROC-AUC تتجاوز 85%، مع قيم محددة تبلغ 0.954 للغابة العشوائية (RF)، 0.900 لآلة الدعم المتجه (SVM)، و0.882 للانحدار اللوجستي (LR). تشير هذه النتائج إلى أن النماذج توفر نتائج تنبؤية مقبولة ضمن منطقة الدراسة، كما تدعمه Zhang et al. (2023).

تم حساب مقاييس تقييم إضافية لضمان تقييم شامل لأداء النموذج. تكشف النتائج، المفصلة في الجدول 3، أن RF تفوقت باستمرار على كل من SVM وLR عبر مقاييس متنوعة. على وجه التحديد، أظهر RF دقة (ACC) أكبر من SVM وLR بمقدار 0.07 و0.13، على التوالي. بالإضافة إلى ذلك، تفوق RF على SVM وLR في الدقة بمقدار 0.16 و0.29، وفي درجة F1 بمقدار 0.09 و0.17، على التوالي. على الرغم من هذه الاختلافات، حققت جميع النماذج دقة جديرة بالثناء تبلغ 0.89.

المناقشة

تركز البحث على عرضة الانزلاق الأرضي في منطقة حضرية تتطور بسرعة جنوب شرق القاهرة، مصر، حيث تشكل الكثافة السكانية العالية تحديات اجتماعية واقتصادية كبيرة. تم إنشاء جرد شامل للانزلاقات الأرضية، حيث تم تحديد 183 موقعًا من خلال العمل الميداني وتحليل الصور الفضائية، مما يشكل أساسًا لتطوير نماذج عرضة الانزلاق الأرضي. تم اختيار أربعة عشر عاملاً شرطياً عبر فئات طبوغرافية، وجيولوجية، وهيدرولوجية، وإنسانية، ومحفزة، وتمت معالجتها مسبقًا لضمان الدقة وتقليل التعدد الخطي. استخدمت الدراسة تقنيات تعلم آلي (ML) متنوعة، بما في ذلك الانحدار اللوجستي (LR)، والغابة العشوائية (RF)، وآلة الدعم المتجه (SVM)، لإنشاء خرائط عرضة مصنفة إلى خمسة مستويات من المخاطر.

أشارت النتائج إلى وجود اختلافات كبيرة في توزيع مناطق المخاطر بين النماذج، حيث حدد LR نسبة أعلى بشكل ملحوظ من المناطق عالية المخاطر مقارنةً بـ RF وSVM. كشفت التحليلات أن الانحدار كان العامل الأكثر تأثيرًا عبر جميع النماذج، يليه مؤشر الرطوبة الطبوغرافية (TWI) وانحناء الملف. تم استخدام قيم SHAP لتفسير النماذج، مما أكد أهمية العوامل الطبوغرافية مع تسليط الضوء على التأثير المحدود للعوامل الجيولوجية والهيدرولوجية. تؤكد النتائج على ضرورة إجراء تقييمات مخصصة لعرضة الانزلاق الأرضي في المناطق الحضرية، مما يبرز دور طرق ML في تعزيز الدقة التنبؤية والمساعدة في استراتيجيات التخفيف من الكوارث.

Journal: Natural Hazards, Volume: 121, Issue: 9
DOI: https://doi.org/10.1007/s11069-025-07197-0
Publication Date: 2025-03-13
Author(s): Mohamed M. Abdelkader et al.
Primary Topic: Landslides and related hazards

Overview

This study investigates the effectiveness of three machine learning models—logistic regression (LR), random forest (RF), and support vector machine (SVM)—in generating accurate landslide susceptibility maps (LSM) for a hilly region southeast of Cairo, Egypt. Utilizing a dataset comprising 183 landslide and 183 non-landslide locations, the research employed fourteen conditioning factors across various categories, including topographical and hydrological variables. While all models achieved high receiver operating characteristic-area under the curve (ROC-AUC) scores (RF: 0.95, SVM: 0.90, LR: 0.88), further evaluation using additional performance metrics revealed that the RF model outperformed the others in terms of accuracy, recall, precision, and F1 score, demonstrating its reliability in correctly classifying landslide-prone areas and minimizing misclassifications.

The study also highlights the significance of various factors influencing landslide occurrences, with topographic factors, particularly slope, showing the most substantial impact. Human activities and hydrological factors were also significant, while trigger factors like rainfall and earthquakes had a lesser effect. The findings emphasize the necessity of using a comprehensive approach to model evaluation that goes beyond ROC-AUC metrics, advocating for models that not only exhibit high predictive accuracy but also practical reliability for effective landslide risk management. The RF model’s superior performance positions it as the most dependable tool for mitigating risks in vulnerable communities.

Introduction

The introduction of this research paper highlights the increasing prevalence of landslides in hilly regions, exacerbated by global climate change and urbanization, leading to significant loss of life and infrastructure. The necessity for reliable landslide susceptibility maps (LSMs) is emphasized, as these tools are crucial for predicting landslide occurrences and managing disaster risks. Various methodologies for generating LSMs are discussed, including qualitative, physical, and quantitative approaches. While qualitative methods rely on expert judgment, physical methods depend on fundamental laws of physics and detailed data, making them less practical for large areas. In contrast, quantitative methods, particularly those utilizing machine learning (ML), have gained traction for their ability to analyze complex patterns and produce predictive models.

Despite the advancements in ML applications for LSMs, the literature reveals a lack of universally reliable models applicable across diverse regions, with performance varying significantly due to factors such as data quality and geographic characteristics. The paper identifies a critical gap in the reliance on the receiver operating characteristic-area under the curve (ROC-AUC) as the primary validation metric, which may lead to misleading interpretations of model efficacy. To address these issues, the research proposes a comprehensive framework that integrates ML techniques with robust evaluation metrics, including ROC-AUC, accuracy, recall, and F1-score, alongside spatial validation techniques. Additionally, the use of SHapley Additive exPlanations (SHAP) values aims to enhance model interpretability by identifying key factors influencing landslide susceptibility. This study seeks to improve LSM reliability in arid regions, contributing valuable insights for land-use planning and disaster risk reduction, and establishing a benchmark for future research in this domain.

Methods

In this study, the authors employed machine learning (ML) methods to evaluate landslide susceptibility, recognizing the increasing importance of these techniques in geotechnical hazard analysis due to their capacity to manage the complexities inherent in geotechnical systems (Zhang et al. 2020; Phoon and Zhang 2023). Specifically, three ML approaches were utilized: logistic regression (LR), random forest (RF), and support vector machine (SVM). The effectiveness of these methods was assessed in terms of their ability to accurately map landslide susceptibility, contributing to improved predictive modeling in geotechnical applications.

Results

The results of the model validation indicate that all examined models achieved ROC-AUC values exceeding 85%, with specific values of 0.954 for Random Forest (RF), 0.900 for Support Vector Machine (SVM), and 0.882 for Logistic Regression (LR). These findings suggest that the models provide acceptable predictive outcomes within the study area, as supported by Zhang et al. (2023).

Further evaluation metrics were calculated to ensure a comprehensive assessment of model performance. The results, detailed in Table 3, reveal that RF consistently outperformed both SVM and LR across various metrics. Specifically, RF exhibited an accuracy (ACC) greater than SVM and LR by 0.07 and 0.13, respectively. Additionally, RF surpassed SVM and LR in precision by 0.16 and 0.29, and in F1 score by 0.09 and 0.17, respectively. Despite these differences, all models achieved a commendable precision of 0.89.

Discussion

The research focuses on landslide susceptibility in a rapidly urbanizing area southeast of Cairo, Egypt, where high population density poses significant social and economic challenges. A comprehensive landslide inventory was created, identifying 183 locations through fieldwork and satellite imagery analysis, which serves as a foundation for developing landslide susceptibility models. Fourteen conditioning factors were selected across topographic, geological, hydrological, anthropogenic, and triggering categories, and were preprocessed to ensure accuracy and reduce multicollinearity. The study employed various machine learning (ML) techniques, including Logistic Regression (LR), Random Forest (RF), and Support Vector Machine (SVM), to generate susceptibility maps categorized into five risk levels.

The results indicated significant variations in risk zone distribution among the models, with LR identifying a notably higher proportion of high-risk areas compared to RF and SVM. The analysis revealed that slope was the most influential factor across all models, followed by topographic wetness index (TWI) and profile curvature. SHAP values were utilized to interpret the models, confirming the importance of topographic factors while highlighting the limited influence of geological and hydrological factors. The findings underscore the necessity for tailored landslide susceptibility assessments in urbanizing regions, emphasizing the role of ML methods in enhancing predictive accuracy and aiding disaster mitigation strategies.