التقدم وآفاق التعلم العميق في التنبؤ بالطقس القاسي على المدى المتوسط Advances and prospects of deep learning for medium-range extreme weather forecasting

المجلة: Geoscientific model development، المجلد: 17، العدد: 6
DOI: https://doi.org/10.5194/gmd-17-2347-2024
تاريخ النشر: 2024-03-21
المؤلف: Leonardo Olivetti وآخرون
الموضوع الرئيسي: الظواهر الجوية والمحاكاة

نظرة عامة

في السنوات الأخيرة، ظهرت نماذج التعلم العميق كبدائل قابلة للتطبيق لنماذج الأرصاد الجوية التقليدية المعتمدة على الفيزياء للتنبؤ بالطقس على المدى المتوسط، مع وجود عدة دراسات تشير إلى أن هذه النماذج يمكن أن تتفوق على الأساليب التقليدية المعتمدة على الفيزياء. ومع ذلك، لا تزال هناك تحديات كبيرة، خاصة في التنبؤ بدقة بالأحداث الجوية المتطرفة. تستعرض هذه الورقة التقدمات الأخيرة في التنبؤ بالطقس باستخدام التعلم العميق وتسلط الضوء على قيود النماذج الحالية في التعامل مع الطقس المتطرف، بما في ذلك عدم كفاية التحسين لعينات التدريب المحدودة والافتراضات البسيطة بشأن توزيعات أخطاء التنبؤ.

يؤكد المؤلفون أنه لكي تكون نماذج التنبؤ بالطقس باستخدام التعلم العميق (DLWP) قابلة للتطبيق عمليًا، يجب أن تتجاوز ليس فقط المهارة المتوسطة للنماذج المعتمدة على الفيزياء ولكن أيضًا أن تظهر كفاءة في التنبؤ بالأحداث الجوية المتطرفة. يحددون الحاجة الملحة إلى سير عمل مخصص لـ DLWP يركز على توقعات الطقس المتطرف، داعين إلى تعديل الهياكل الحالية بدلاً من تطوير نماذج جديدة تمامًا. يهدف سير العمل الأساسي المقترح إلى تعزيز قوة التنبؤات باستخدام التعلم العميق للطقس المتطرف من خلال مراعاة الأسئلة المناخية المحددة وفترات العودة ذات الصلة، وبالتالي الاستفادة من التقدمات الأخيرة في تقنيات التعلم العميق.

مقدمة

تستعرض مقدمة ورقة البحث تطور الحالة الحالية لنماذج التعلم العميق في التنبؤ بالطقس، متتبعة أصولها إلى التسعينيات. كانت التطبيقات الأولية تركز على التنبؤات المحلية وقصيرة المدى، لكن التقدمات الأخيرة وضعت التعلم العميق كأداة قابلة للتطبيق للتنبؤات على المدى المتوسط وما دون الموسمي. ومن الجدير بالذكر أنه منذ أوائل عام 2022، أبلغت مجموعات بحثية متعددة عن نماذج تعلم عميق تتفوق على التنبؤات الحتمية التقليدية من المركز الأوروبي للتنبؤات الجوية المتوسطة المدى (ECMWF). وقد تم تعزيز هذا التقدم بشكل كبير من خلال مساهمات من شركات التكنولوجيا الكبرى، التي عززت الموارد الحاسوبية وجذبت خبرات التعلم الآلي إلى علم الأرصاد الجوية.

على الرغم من هذه التقدمات، تسلط الورقة الضوء على التحديات المستمرة التي تواجه نماذج التعلم العميق، خاصة نقص القيود الفيزيائية، مما يمكن أن يؤدي إلى توقعات غير واقعية، وطبيعتها الحتمية، مما يعقد تقدير عدم اليقين. بالإضافة إلى ذلك، تكافح النماذج المعتمدة على البيانات مع الاستقراء خارج بيانات التدريب الخاصة بها، خاصة في التنبؤ بالأحداث الجوية المتطرفة. يؤكد المؤلفون على أهمية التنبؤات الدقيقة لمنع الكوارث والأنشطة الاقتصادية، مع الإشارة أيضًا إلى أن التطورات السريعة جارية لمعالجة هذه التحديات. تهدف المقالة إلى استعراض التقدمات الأخيرة في التنبؤ بالطقس باستخدام التعلم العميق على نطاق واسع، وفحص القيود التقنية المتعلقة بالتنبؤ بالطقس المتطرف، واقتراح طرق بديلة لتحسين أداء النموذج في هذا المجال.

نقاش

تستعرض قسم النقاش في ورقة البحث تطور الحالة الحالية لنماذج التنبؤ بالطقس باستخدام التعلم العميق (DLWP)، مسلطة الضوء على الانتقال من الشبكات العصبية الأمامية المبكرة (FNNs) إلى هياكل أكثر تقدمًا مثل الشبكات العصبية التلافيفية (CNNs) والشبكات العصبية المتكررة (RNNs) وشبكات الذاكرة طويلة الأمد (LSTM). بينما كانت FNNs تكافح مع البيانات المكانية بسبب طبيعتها أحادية الاتجاه، حسنت CNNs من قابلية التوسع والتعرف على الأنماط المكانية. سمح إدخال RNNs وLSTMs بالتعامل بشكل أفضل مع بيانات السلاسل الزمنية، لكن معالجتها المتسلسلة حدت من إمكانية المعالجة المتوازية، وهو أمر حاسم للاستفادة من مجموعات بيانات المناخ عالية الدقة. شهدت التقدمات الأخيرة ظهور هياكل التشفير-فك التشفير والمحولات، التي تعزز كفاءة معالجة البيانات المتسلسلة وقد تم تكييفها لكل من المهام المناخية ورؤية الكمبيوتر.

على الرغم من هذه التقدمات، تواجه نماذج DLWP الحالية تحديات، خاصة في التنبؤ بدقة بالأحداث الجوية المتطرفة. تعطي العديد من النماذج الأولوية لمهارة التنبؤ المتوسطة على حساب التنبؤ بالقيم المتطرفة، مما يؤدي إلى قيود في أدائها خلال السيناريوهات ذات التأثير العالي. تؤكد الورقة على الحاجة إلى نماذج مصممة خصيصًا للتنبؤ بالمتطرفات، داعية إلى استخدام دوال خسارة مستهدفة تركز على توزيعات الذيل بدلاً من الأخطاء المتوسطة العامة. تقترح استخدام تقنيات مثل خسارة الكرة الدوارة ونماذج التصنيف الثنائي لتحسين دقة التنبؤات الجوية المتطرفة. بالإضافة إلى ذلك، يقترح المؤلفون نهجًا من خطوتين يتجاوز العتبة يجمع بين الانحدار الكمي ونظرية القيم المتطرفة لتقدير خصائص الأحداث المتطرفة بشكل أفضل، على الرغم من أنهم يعترفون بالصعوبات الكامنة في تحقيق توقعات حتمية للأحداث النادرة. بشكل عام، يدعو القسم إلى تغيير التركيز نحو تعزيز قوة نماذج DLWP في التنبؤ بالأحداث الجوية المتطرفة، وهو أمر حاسم لمنع الكوارث وتخفيف المخاطر.

Journal: Geoscientific model development, Volume: 17, Issue: 6
DOI: https://doi.org/10.5194/gmd-17-2347-2024
Publication Date: 2024-03-21
Author(s): Leonardo Olivetti et al.
Primary Topic: Meteorological Phenomena and Simulations

Overview

In recent years, deep learning models have emerged as viable alternatives to traditional physics-based numerical models for medium-range weather forecasting, with several studies indicating that these models can outperform state-of-the-art physics-based approaches. However, significant challenges remain, particularly in accurately forecasting extreme weather events. This paper reviews recent advancements in deep learning weather forecasting and highlights the limitations of current models in handling extreme weather, including inadequate optimization for limited training samples and simplistic assumptions regarding forecasting error distributions.

The authors emphasize that for deep learning weather prediction (DLWP) models to be operationally viable, they must not only surpass the average skill of physics-based models but also demonstrate proficiency in predicting extreme weather events. They identify the urgent need for a tailored DLWP workflow focused on extreme weather forecasts, advocating for the adaptation of existing architectures rather than the development of entirely new models. The proposed foundational workflow aims to enhance the robustness of deep learning forecasts for extreme weather by considering the specific meteorological questions and return periods of interest, thereby leveraging recent advancements in deep learning technologies.

Introduction

The introduction of the research paper outlines the evolution and current state of deep learning models in weather forecasting, tracing their origins back to the 1990s. Initial applications focused on local and short-term predictions, but recent advancements have positioned deep learning as a viable tool for medium-range and subseasonal forecasting. Notably, since early 2022, multiple research groups have reported deep learning models that outperform traditional deterministic forecasts from the European Centre for Medium-Range Weather Forecasts (ECMWF). This progress has been significantly bolstered by contributions from major technology companies, which have enhanced computational resources and attracted machine learning expertise to meteorology.

Despite these advancements, the paper highlights ongoing challenges faced by deep learning models, particularly their lack of physical constraints, which can lead to unrealistic forecasts, and their deterministic nature, which complicates uncertainty estimation. Additionally, data-driven models struggle with extrapolation beyond their training data, particularly in forecasting extreme weather events. The authors emphasize the importance of accurate predictions for disaster prevention and economic activities, while also noting that rapid developments are underway to address these challenges. The article aims to survey recent advancements in large-scale deep learning weather prediction, examine the technical limitations regarding extreme weather forecasting, and propose alternative approaches for improving model performance in this area.

Discussion

The discussion section of the research paper outlines the evolution and current state of deep learning weather prediction (DLWP) models, highlighting the transition from early feedforward neural networks (FNNs) to more advanced architectures like convolutional neural networks (CNNs), recurrent neural networks (RNNs), and long short-term memory (LSTM) networks. While FNNs struggled with spatial data due to their unidirectional nature, CNNs improved scalability and spatial pattern recognition. The introduction of RNNs and LSTMs allowed for better handling of time-series data, but their sequential processing limited parallelization, which is crucial for utilizing high-resolution climate datasets. Recent advancements have seen the emergence of encoder-decoder architectures and transformers, which enhance the efficiency of processing sequential data and have been adapted for both meteorological and computer vision tasks.

Despite these advancements, current DLWP models face challenges, particularly in accurately forecasting extreme weather events. Many models prioritize average forecast skill over extreme value prediction, leading to limitations in their performance during high-impact scenarios. The paper emphasizes the need for models specifically designed to forecast extremes, advocating for the use of targeted loss functions that focus on tail distributions rather than overall mean errors. It suggests employing techniques such as pinball loss and binary classification models to improve the accuracy of extreme weather predictions. Additionally, the authors propose a two-step peak-over-threshold approach that combines quantile regression with extreme value theory to better estimate the properties of extreme events, although they acknowledge the inherent difficulties in achieving deterministic forecasts for rare occurrences. Overall, the section calls for a shift in focus towards enhancing the robustness of DLWP models in predicting extreme weather events, which is critical for disaster prevention and risk mitigation.