التنبؤ بالزلازل باستخدام تقنيات التعلم الآلي: الأضواء والظلال Forecasting earthquakes by Machine Learning techniques: lights and shadows

المجلة: Earth Science Informatics، المجلد: 19، العدد: 2
DOI: https://doi.org/10.1007/s12145-026-02078-x
تاريخ النشر: 2026-02-01
المؤلف: Jesús Jover-Alfaro وآخرون
الموضوع الرئيسي: كشف الزلازل وتحليلها

نظرة عامة

تناقش هذه الفقرة التحول في منهجيات توقع الزلازل من الأساليب الحتمية إلى التنبؤ الاحتمالي، الذي يهدف إلى تقدير احتمال حدوث الأحداث الزلزالية ضمن معلمات مكانية وزمنية محددة. يدعم هذا الانتقال التطبيق المتزايد لتقنيات التعلم الآلي (ML) في علم الزلازل، والتي يمكن أن تعزز النمذجة الإحصائية لتوقعات الزلازل.

يميز النص بين تطبيقين رئيسيين للتعلم الآلي في هذا المجال: (1) مهام معالجة البيانات، مثل اختيار المراحل وتوليد الفهارس، حيث أثبت التعلم الآلي فعاليته وأصبح مقبولاً على نطاق واسع، و(2) الاستخدام المباشر للتعلم الآلي في توقع الزلازل، والذي لا يزال مثيرًا للجدل. على الرغم من الجدل المحيط بالأخير، أظهرت العديد من الدراسات نجاحًا كبيرًا في توقع النشاط الزلزالي باستخدام خوارزميات متقدمة، حيث حقق البعض معدلات دقة تتجاوز 90% في التحليلات الاستعادية.

مقدمة

تسلط مقدمة ورقة البحث الضوء على التهديد المستمر الذي تشكله الأحداث الزلزالية على السلامة العالمية والبنية التحتية. تنتقد نماذج التنبؤ الحتمية للزلازل، مشددة على طبيعتها “الصندوق الأسود” ونقص بروتوكولات التحقق الحالية. يشير المؤلفون إلى المخاوف التي أثارها مينيان وبروكاردو (2019) بشأن الإفراط في ملاءمة نماذج التعلم العميق المعقدة للقوانين الفيزيائية والتحديات التي تواجهها هذه النماذج عند مواجهة تحولات المجال، كما هو موضح من قبل تشاترجي وآخرون (2025). تجادل الورقة بأن طرق التحقق القياسية، مثل التحقق العشوائي باستخدام k-fold، تفشل في معالجة الخصائص غير الثابتة والسببية للعمليات الزلزالية، مما يؤدي إلى تسرب البيانات ورفع مقاييس الأداء.

تهدف الدراسة إلى إعادة تقييم سير العمل في توقع الزلازل المعتمد على التعلم الآلي، من خلال تقييم قابلية تعميم النموذج في بيئة تكتونية مختلفة (طوكيو، اليابان) مقارنة بالدراسة الأصلية (لوس أنجلوس، الولايات المتحدة الأمريكية). كما تسعى إلى تحليل كيفية تشويه انتهاكات السببية الزمنية للأداء المبلغ عنه. تم هيكلة المخطوطة لتوفير خلفية عن الحالة الحالية للتعلم الآلي في علم الزلازل، وتفصيل النسخ التجريبية باستخدام مجموعة بيانات طوكيو، ومناقشة العيوب المنهجية المحددة. تشير النتائج إلى أن فرض التحقق الزمني الصارم يقلل بشكل كبير من أداء النموذج، مما يبرز الحاجة إلى تحسين طرق التنبؤ التي تأخذ في الاعتبار الطبيعة الفوضوية وغير الخطية للعمليات الزلزالية.

الطرق

في هذا القسم، يصف المؤلفون نهجهم المنهجي لتكرار الدراسة التي أجراها يافاس وآخرون (2024)، والتي ركزت على توقع الأحداث الزلزالية الكبيرة في منطقة لوس أنجلوس كمشكلة تصنيف متعددة الفئات. استخدمت الدراسة الأصلية مجموعة بيانات من مركز بيانات الزلازل في جنوب كاليفورنيا (SCEDC)، والتي تشمل أكثر من 23,000 حدث زلزالي من يناير 2012 إلى سبتمبر 2024، مع درجات تتراوح من 0.65 إلى 5.5. يبرز المؤلفون المخاطر الزلزالية الحرجة التي تشكلها منطقة لوس أنجلوس ذات الكثافة السكانية العالية، والتي تتميز بأنظمة صدع نشطة، ويشيرون إلى الأداء التنبؤي المثير للإعجاب الذي أبلغ عنه يافاس وآخرون.

لتقييم قوة نتائج الدراسة الأصلية، يمدد المؤلفون المنهجية إلى طوكيو، اليابان، وهي منطقة ذات إعداد تكتوني مختلف يهيمن عليه عمليات الغمر. من خلال تطبيق نفس المعايير المكانية لنطاق 100 كم حول مركز مدينة طوكيو، يهدفون إلى تحديد ما إذا كانت القدرات التنبؤية التي لوحظت في لوس أنجلوس قابلة للتطبيق عبر بيئات زلزالية تكتونية متنوعة. يتبع المؤلفون بدقة إجراءات الدراسة الأصلية، بدءًا من تحليل البيانات الاستكشافية وتوليد الميزات، باستخدام مجموعة فرعية من ميزات الإدخال كما هو موضح في دراستهم المرجعية. للحصول على حسابات مفصلة وتبريرات لهذه المتغيرات، يوجهون القراء إلى المقالة الأصلية.

المناقشة

تقيّم قسم المناقشة في ورقة البحث بشكل نقدي فعالية منهجيات التعلم الآلي في توقع الزلازل، مع تسليط الضوء على نتائج مهمة من دراسات مختلفة. تصنف الأساليب الحالية إلى نوعين رئيسيين: تلك التي تستخدم التعلم الآلي على بيانات الفهارس وتلك التي تستخدم تحليل السلاسل الزمنية للإشارات السابقة. من الجدير بالذكر أن عمل وانغ وآخرون (2022) أظهر أن نموذج الذاكرة الطويلة والقصيرة (LSTM) تفوق على النماذج التنبؤية التقليدية في مراقبة الإشارات السابقة الزلزالية الشاذة. وبالمثل، حقق سعد وآخرون (2023) دقة بنسبة 70% في التنبؤ في الوقت الحقيقي باستخدام بيانات كهرومغناطيسية وجيو-صوتية، مما يبرز دمج الإشارات الفيزيائية مع التعلم الآلي.

تشدد الورقة على أهمية التحقق من هذه المنهجيات ضد السببية الزمنية، كاشفة أن العديد من مقاييس الأداء العالية المبلغ عنها في الأدبيات قد تنبع من تسرب البيانات بدلاً من القدرة التنبؤية الحقيقية. على سبيل المثال، عندما قام المؤلفون بتكرار منهجية يافاس وآخرون (2024) تحت قيود زمنية صارمة، انخفضت دقة نموذج الغابة العشوائية من أكثر من 99% إلى حوالي 20-25%. تشير هذه الانخفاض الحاد إلى أن النتائج الأصلية كانت تعتمد بشكل كبير على العوامل المنهجية بدلاً من قدرة النموذج على تعلم الإشارات السابقة الزلزالية. في النهاية، ترفض الدراسة الفرضيات المتعلقة بقابلية تعميم وصلاحية الزمن للمنهجيات، مما يشير إلى أن النجاحات المبلغ عنها في توقع الزلازل قد لا تترجم بشكل فعال عبر إعدادات تكتونية مختلفة أو تحت ظروف تشغيل واقعية.

Journal: Earth Science Informatics, Volume: 19, Issue: 2
DOI: https://doi.org/10.1007/s12145-026-02078-x
Publication Date: 2026-02-01
Author(s): Jesús Jover-Alfaro et al.
Primary Topic: Earthquake Detection and Analysis

Overview

The section discusses the shift in earthquake prediction methodologies from deterministic approaches to probabilistic forecasting, which aims to estimate the likelihood of seismic events within defined spatio-temporal parameters. This transition is supported by the increasing application of Machine Learning (ML) techniques in seismology, which can enhance statistical modeling for earthquake predictions.

The text distinguishes between two primary applications of ML in this field: (1) data processing tasks, such as phase picking and catalog generation, where ML has proven effective and is now widely accepted, and (2) the direct use of ML for earthquake forecasting, which remains contentious. Despite the controversies surrounding the latter, several studies have demonstrated significant success in predicting seismic activity using advanced algorithms, with some achieving accuracy rates exceeding 90% in retrospective analyses.

Introduction

The introduction of the research paper highlights the persistent threat posed by seismic events to global safety and infrastructure. It critiques the deterministic prediction models for earthquakes, emphasizing their “black box” nature and the inadequacies of current validation protocols. The authors reference concerns raised by Mignan and Broccardo (2019) regarding the overfitting of complex Deep Learning models to physical laws and the challenges these models face when encountering domain shifts, as illustrated by Chatterjee et al. (2025). The paper argues that standard validation methods, such as random k-fold cross-validation, fail to adequately address the nonstationary and causal characteristics of seismogenic processes, leading to data leakage and inflated performance metrics.

The study aims to reassess Machine Learning-based earthquake forecasting workflows, specifically by evaluating model generalizability in a different tectonic setting (Tokyo, Japan) compared to the original study (Los Angeles, USA). It also seeks to analyze how violations of temporal causality can distort reported performance. The manuscript is structured to provide a background on the current state of Machine Learning in seismology, detail the experimental replication using the Tokyo dataset, and discuss identified methodological flaws. The findings suggest that enforcing strict temporal validation significantly reduces model performance, underscoring the need for improved forecasting methods that account for the chaotic and nonlinear nature of seismic processes.

Methods

In this section, the authors describe their methodological approach to replicating the study by Yavas et al. (2024), which focused on forecasting significant seismic events in the Los Angeles area as a multi-class classification problem. The original study utilized a dataset from the Southern California Earthquake Data Center (SCEDC), encompassing over 23,000 earthquake events from January 2012 to September 2024, with magnitudes ranging from 0.65 to 5.5. The authors highlight the critical seismic risk posed by the densely populated Los Angeles region, characterized by active fault systems, and note the impressive predictive performance reported by Yavas et al.

To evaluate the robustness of the original study’s findings, the authors extend the methodology to Tokyo, Japan, a region with a different tectonic setting dominated by subduction processes. By applying the same spatial criterion of a 100 km radius around Tokyo’s city center, they aim to determine whether the predictive capabilities observed in Los Angeles are applicable across varying seismotectonic environments. The authors meticulously follow the original study’s procedures, beginning with exploratory data analysis and feature generation, utilizing a subset of input features as outlined in their reference study. For detailed calculations and justifications of these variables, they direct readers to the original article.

Discussion

The discussion section of the research paper critically evaluates the efficacy of machine learning methodologies in earthquake forecasting, highlighting significant findings from various studies. It categorizes existing approaches into two main types: those utilizing machine learning on catalog data and those employing time-series analysis of precursor signals. Notably, the work of Wang et al. (2022) demonstrated that the Long-Short Term Memory (LSTM) model outperformed traditional predictive models in monitoring anomalous earthquake precursors. Similarly, Saad et al. (2023) achieved a 70% accuracy in real-time forecasting using electromagnetic and geo-acoustic data, underscoring the integration of physical signals with machine learning.

The paper emphasizes the importance of validating these methodologies against temporal causality, revealing that many high-performance metrics reported in the literature may stem from data leakage rather than genuine predictive capability. For instance, when the authors replicated the methodology of Yavas et al. (2024) under strict temporal constraints, the accuracy of the Random Forest model plummeted from over 99% to approximately 20-25%. This stark decline indicates that the original results were heavily reliant on methodological artifacts rather than the model’s ability to learn seismic precursors. The study ultimately refutes the hypotheses regarding the generalizability and temporal validity of the methodologies, suggesting that the reported successes in earthquake forecasting may not translate effectively across different tectonic settings or under realistic operational conditions.