تحليل مقارن لهياكل التعلم العميق في توقع الطاقة الشمسية Comparative analysis of deep learning architectures in solar power prediction

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-14908-x
PMID: https://pubmed.ncbi.nlm.nih.gov/40877313
تاريخ النشر: 2025-08-28
المؤلف: Montaser Abdelsattar وآخرون
الموضوع الرئيسي: الإشعاع الشمسي والطاقة الشمسية

نظرة عامة

تدرس هذه الدراسة فعالية هياكل التعلم العميق (DL) المختلفة في توقع الطاقة الشمسية، باستخدام مجموعة بيانات تتكون من 4200 سجل تاريخي و20 ميزة من الميزات الجوية والفلكية. تم مقارنة ثمانية نماذج: Autoencoder، وLong Short-Term Memory (LSTM)، وGated Recurrent Unit (GRU)، وSimple Recurrent Neural Network (SimpleRNN)، وConvolutional Neural Network (CNN)، وTemporal Convolutional Network (TCN)، وTransformer، وLightweight Informer for Long Sequence Time-Series Forecasting (InformerLite). تم تقييم مقاييس الأداء، بما في ذلك Root Mean Squared Error (RMSE)، وMean Absolute Error (MAE)، وMean Absolute Percentage Error (MAPE)، وCoefficient of Determination (R²) عبر مجموعات بيانات التدريب والتحقق والاختبار.

أشارت النتائج إلى أن نموذج TCN تفوق على جميع النماذج الأخرى، محققًا R² اختبار قدره 0.7786 وأظهر قدرة قوية على التقاط الأنماط الزمنية. كما أن Autoencoder قدم أداءً جيدًا، مع R² كامل قدره 0.8437، بينما أظهر نموذج Transformer أداءً أقل بكثير (R² اختبار = 0.0714)، مما يشير إلى عدم ملاءمته لهذه المهمة التنبؤية دون تعديلات. على الرغم من أن نماذج GRU وLSTM أظهرت نجاحًا معتدلاً، إلا أنها واجهت صعوبة مع الاعتماديات الأطول. تسلط الدراسة الضوء على إمكانيات هذه النماذج DL في تعزيز دقة توقع الطاقة الشمسية وتوفر إطارًا للبحوث المستقبلية في النمذجة الهجينة والتنبؤات متعددة الآفاق، مع الاعتراف بالقيود مثل القيود الجغرافية وغياب مدخلات البيانات في الوقت الحقيقي.

طرق

تم هيكلة منهجية هذه البحث بعناية لضمان الاتساق وإمكانية التكرار والتقييم الموثوق للنماذج. خضعت مجموعة بيانات استعادة توليد الطاقة الشمسية للتطبيع القياسي وتم تقسيمها إلى مجموعات تدريب والتحقق والاختبار. تم استخدام هياكل متعددة من التعلم العميق (DL)، مع استخدام طريقة تحسين آدم بمعدل تعلم ثابت وإيقاف مبكر للتخفيف من الإفراط في التكيف. شملت مقاييس تقييم الأداء Root Mean Square Error (RMSE)، وMean Absolute Error (MAE)، وMean Absolute Percentage Error (MAPE)، ومعامل التحديد (R²) عبر تقسيمات البيانات المختلفة.

بالإضافة إلى ذلك، دمجت الدراسة أسلوب Monte Carlo dropout لتقييم عدم اليقين في التنبؤ. تم تقديم نظرة عامة مفصلة عن المعلمات الرئيسية المستخدمة في الدراسة، بما في ذلك تكوينات معالجة البيانات، واختيار النموذج، وخيارات التدريب، والاختبارات التشخيصية، في الجدول 1. يبرز هذا النهج الشامل قوة التصميم التجريبي وموثوقية النتائج.

نتائج

قيمت الدراسة أداء نماذج الشبكات العصبية العميقة المتقدمة (DNN) في توقع توليد الطاقة الشمسية من خلال تجارب واسعة عبر هياكل مختلفة. تم تقييم النماذج باستخدام ثلاث مجموعات بيانات متميزة: التدريب، والتحقق، والاختبار. تم استخدام مقاييس الانحدار القياسية، بما في ذلك Root Mean Square Error (RMSE)، وMean Absolute Error (MAE)، وMean Absolute Percentage Error (MAPE)، ومعامل التحديد (R²) لقياس أداء النموذج.

بالإضافة إلى ذلك، دمجت البحث تقدير عدم اليقين وتحليل المتبقيات لتعزيز التقييم التشخيصي. قدمت هذه المنهجيات رؤى حول موثوقية ودقة التنبؤات التي قدمتها نماذج DNN، مما ساهم في فهم شامل لفعاليتها في توقع توليد الطاقة الشمسية.

مناقشة

في هذه الدراسة، تم تحليل مجموعة بيانات تتكون من 4200 سجل تاريخي لتوليد الطاقة الشمسية، مع دمج 20 ميزة من الميزات الجوية والفلكية لتوقع إنتاج الطاقة بالكيلووات. كشفت مصفوفة الارتباط أن الإشعاع القصير الموجي وزاوية الزينيث كان لهما أقوى الارتباطات مع إنتاج الطاقة، بينما حدد الانحدار Lasso الإشعاع القصير الموجي كأهم متنبئ، يليه ضغط مستوى سطح البحر المتوسط وسرعة الرياح عند 80 م. تم اعتبار الميزات ذات القيمة التنبؤية المنخفضة، مثل زاوية السقوط والزاوية الأفقية، أقل تأثيرًا على تدريب النموذج. استخدمت الدراسة نماذج متعددة من التعلم العميق (DL)، بما في ذلك Autoencoder، وRNN المتغيرة (SimpleRNN، GRU، LSTM)، وCNN، وTransformer، لتقييم قدراتها التنبؤية لتوليد الطاقة الشمسية.

تم تدريب النماذج باستخدام مجموعة بيانات معالجة مسبقًا، مع حساب مقاييس الأداء مثل RMSE، وMAE، وMAPE، وR² لمجموعات التدريب والتحقق والاختبار. أظهر Autoencoder وTemporal Convolutional Network (TCN) أداءً تنبؤيًا متفوقًا، محققين قيم R² قدرها 0.8677 و0.8374 على التوالي، بينما كان أداء نموذج Transformer أقل بكثير مع R² قدره 0.1551. سلطت الدراسة الضوء على أهمية قابلية تفسير النموذج وتقدير عدم اليقين، مستخدمةً أسلوب Monte Carlo dropout لتقييم ثقة التنبؤ. يُقترح أن تستكشف الأعمال المستقبلية الهياكل الهجينة، والتحقق الجغرافي، ودمج المتغيرات المستمدة من الأقمار الصناعية لتعزيز قوة النموذج وقابليته للتطبيق في السيناريوهات الواقعية. بشكل عام، بينما أظهرت النماذج أداءً قويًا، هناك حاجة إلى تحسينات إضافية واختبارات أوسع للنشر العملي في توقع الطاقة الشمسية.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-14908-x
PMID: https://pubmed.ncbi.nlm.nih.gov/40877313
Publication Date: 2025-08-28
Author(s): Montaser Abdelsattar et al.
Primary Topic: Solar Radiation and Photovoltaics

Overview

This study investigates the effectiveness of various deep learning (DL) architectures for solar power forecasting, utilizing a dataset comprising 4,200 historical records and 20 meteorological and astronomical features. Eight models were compared: Autoencoder, Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), Simple Recurrent Neural Network (SimpleRNN), Convolutional Neural Network (CNN), Temporal Convolutional Network (TCN), Transformer, and Lightweight Informer for Long Sequence Time-Series Forecasting (InformerLite). Performance metrics, including Root Mean Squared Error (RMSE), Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), and Coefficient of Determination (R²), were assessed across training, validation, and test datasets.

The results indicated that the TCN model outperformed all others, achieving a test R² of 0.7786 and demonstrating a strong ability to capture temporal patterns. The Autoencoder also performed well, with a Whole R² of 0.8437, while the Transformer model exhibited significantly lower performance (test R² = 0.0714), suggesting its unsuitability for this forecasting task without modifications. Although GRU and LSTM models showed moderate success, they struggled with longer dependencies. The study highlights the potential of these DL models in enhancing solar power forecasting accuracy and provides a framework for future research in hybrid modeling and multi-horizon predictions, while acknowledging limitations such as geographical constraints and the absence of real-time data inputs.

Methods

The methodology of this research was carefully structured to ensure consistency, reproducibility, and reliable evaluation of the models. The solar power generation recovery dataset underwent standard normalization and was stratified into training, validation, and test sets. Multiple deep learning (DL) architectures were employed, utilizing the Adam optimization method with a fixed learning rate and early stopping to mitigate overfitting. Performance evaluation metrics included Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), and the coefficient of determination (R²) across different data partitions.

Additionally, the study incorporated Monte Carlo dropout to assess prediction uncertainty. A detailed overview of the key parameters used in the study, including data processing configurations, model selection, training options, and diagnostic tests, is provided in Table 1. This comprehensive approach underlines the robustness of the experimental design and the reliability of the findings.

Results

The study evaluated the performance of various advanced deep neural network (DNN) models in predicting solar power generation through extensive experimentation across different architectures. The models were assessed using three distinct datasets: training, validation, and testing. Standard regression metrics, including Root Mean Square Error (RMSE), Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE), and the coefficient of determination (R²), were employed to quantify model performance.

Additionally, the research incorporated uncertainty quantification and residual analysis to enhance diagnostic evaluation. These methodologies provided insights into the reliability and accuracy of the predictions made by the DNN models, thereby contributing to a comprehensive understanding of their effectiveness in solar power generation forecasting.

Discussion

In this study, a dataset of 4200 historical solar power generation records was analyzed, incorporating 20 meteorological and astronomical features to predict power output in kilowatts. A correlation matrix revealed that shortwave radiation and zenith angle had the strongest correlations with output power, while Lasso regression identified shortwave radiation as the most significant predictor, followed by mean sea level pressure and wind speed at 80 m. Features with low predictive value, such as angle of incidence and azimuth, were deemed less impactful for model training. The study employed various deep learning (DL) models, including Autoencoder, RNN variants (SimpleRNN, GRU, LSTM), CNN, and Transformer, to evaluate their predictive capabilities for solar power generation.

The models were trained using a pre-processed dataset, with performance metrics such as RMSE, MAE, MAPE, and R² calculated for training, validation, and test sets. The Autoencoder and Temporal Convolutional Network (TCN) demonstrated superior predictive performance, achieving R² values of 0.8677 and 0.8374, respectively, while the Transformer model underperformed with an R² of 0.1551. The study highlighted the importance of model interpretability and uncertainty estimation, employing Monte Carlo dropout to assess prediction confidence. Future work is suggested to explore hybrid architectures, geographic validation, and the inclusion of satellite-derived variables to enhance model robustness and applicability in real-world scenarios. Overall, while the models showed strong performance, further improvements and broader testing are necessary for practical deployment in solar power forecasting.