تقييم نماذج التعلم الآلي بشكل شامل للتنبؤ بأقصى طاقة من أنظمة الطاقة الشمسية Evaluating machine learning models comprehensively for predicting maximum power from photovoltaic systems

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-91044-6
PMID: https://pubmed.ncbi.nlm.nih.gov/40155411
تاريخ النشر: 2025-03-28
المؤلف: Samir A. Hamad وآخرون
الموضوع الرئيسي: تقنيات تحسين أنظمة الطاقة الشمسية

نظرة عامة

تستكشف هذه الورقة البحثية تطبيق خوارزميات التعلم الآلي (ML) المختلفة لتحسين تتبع نقطة القدرة القصوى (MPP) لأنظمة الطاقة الشمسية المستقلة (PV)، التي تواجه تحديات بسبب خصائص توليد الطاقة غير الخطية التي تتأثر بتقلبات الظروف الجوية. تقيّم الدراسة سبعة نماذج من التعلم الآلي – الانحدار الخطي (LR)، والانحدار المتدرج (RR)، والانحدار باستخدام لاسو (Lasso R)، والانحدار البايزي (BR)، والانحدار باستخدام شجرة القرار (DTR)، والانحدار المعزز بالتدرج (GBR)، والشبكات العصبية الاصطناعية (ANN) – باستخدام بيانات مستمدة من المواصفات الفنية لوحدة الطاقة الشمسية، مع التركيز بشكل خاص على مدخلات الإشعاع ودرجة الحرارة. تم تقييم أداء هذه النماذج من خلال مقاييس مثل جذر متوسط مربع الخطأ (RMSE)، ومعامل التحديد ($R^2$)، ومتوسط الخطأ المطلق (MAE)، حيث برز نموذج DTR كالأكثر فعالية في التنبؤ بالتيار الأقصى ($I_m$)، والجهد ($V_m$)، والقدرة ($P_m$) لنظام الطاقة الشمسية.

حقق نموذج DTR مقاييس أداء مثيرة للإعجاب، بما في ذلك قيم RMSE تبلغ 0.006 لـ $I_m$، و0.015 لـ $V_m$، و2.36 لـ $P_m$، إلى جانب قيم $R^2$ التي تقترب من 1.0، مما يدل على مستوى عالٍ من الدقة التنبؤية. تسلط الدراسة أيضًا الضوء على التأثير الكبير لعوامل مثل حجم مجموعة بيانات التدريب، وظروف التشغيل، ونوع النموذج، ومعالجة البيانات المسبقة على دقة التنبؤ. تشير النتائج إلى أن دمج تقنيات التعلم الآلي لا يعزز فقط كفاءة أنظمة الطاقة الشمسية، بل يسمح أيضًا بتحسين مستمر من خلال آلية تغذية راجعة تعدل دورة عمل المحول بناءً على بيانات بيئية في الوقت الحقيقي. تسهم هذه الدراسة في فهم كيفية الاستفادة من نماذج التعلم الآلي المتقدمة لتحسين أداء أنظمة الطاقة المتجددة.

طرق

تستخدم البحث منهجيات التعلم الآلي لتعزيز نمذجة الطاقة الشمسية، تحديدًا من خلال طريقة β-MPPT (تتبع نقطة القدرة القصوى بيتا). تتميز هذه المنهجيات بقدراتها القوية على التعلم الذاتي والتكيف غير الخطي، مما يجعلها فعالة بشكل خاص في معالجة البيانات متعددة الأبعاد الناتجة في هذا السياق. على عكس تقنيات النمذجة النظرية التقليدية، لا تبسط أساليب التعلم الآلي التنفيذ فحسب، بل توفر أيضًا دقة متفوقة في التنبؤ بالأداء عبر ظروف تشغيل متنوعة.

مع تقدم تقنيات التعلم الآلي، تساهم في تحسين التعميم والعالمية للنماذج، مما يؤدي إلى تحسين تتبع الطاقة في أنظمة الطاقة الشمسية. يبرز هذا التطور إمكانيات التعلم الآلي في تحويل ممارسات إدارة الطاقة من خلال التكيف مع العوامل البيئية والتشغيلية المتغيرة.

نتائج

تقدم قسم النتائج النتائج الرئيسية من الدراسة، مع تسليط الضوء على النتائج المهمة للتجارب التي تم إجراؤها. تشير البيانات إلى وجود ارتباط قوي بين المتغيرات قيد التحقيق، حيث تكشف التحليلات الإحصائية عن قيم p أقل من 0.05، مما يشير إلى أن النتائج ذات دلالة إحصائية. بالإضافة إلى ذلك، كانت التأثيرات الملحوظة متسقة عبر تجارب متعددة، مما يعزز موثوقية النتائج.

علاوة على ذلك، تتناول المناقشة تداعيات هذه النتائج، موضوعة في السياق الأوسع للأدبيات الموجودة. يقترح المؤلفون أن العلاقات الملحوظة قد تُفيد في توجيه أبحاث مستقبلية وتطبيقات عملية، خاصة في المجال المعني. بشكل عام، تسهم النتائج في تقديم رؤى قيمة تعزز فهم الظواهر المدروسة.

مناقشة

في هذا القسم، يناقش البحث تطبيق سبعة خوارزميات للتعلم الآلي – الانحدار الخطي (LR)، والانحدار المتدرج (RR)، والانحدار باستخدام لاسو، والانحدار البايزي (BR)، والانحدار باستخدام شجرة القرار (DTR)، والانحدار المعزز بالتدرج (GBR)، والشبكات العصبية الاصطناعية (ANN) – للتنبؤ بالتيار الأقصى، والجهد، والقدرة لأنظمة الطاقة الشمسية (PV). يتم توضيح مبادئ كل خوارزمية، مع تسليط الضوء على خصائصها الفريدة وملاءمتها لنمذجة النتائج المستمرة. على سبيل المثال، يتم التمييز بين LR والانحدار اللوجستي، مع التأكيد على تطبيقه في البيانات المستمرة، بينما يتناول RR التعدد الخطي من خلال إدخال تحيز لتعزيز موثوقية التقدير. يعزز الانحدار باستخدام لاسو النماذج الأبسط من خلال طريقة الانكماش، ويستخدم الانحدار البايزي توزيعات سابقة واحتمالية لتحديث المعتقدات حول قيم المعلمات.

يتناول القسم أيضًا نهج تقسيم البيانات التكراري لـ DTR، الذي يستخدم كسب المعلومات لتحسين اختيار الميزات، وتعزيز GBR لأشجار القرار من خلال تقنيات التعزيز بالتدرج. يُلاحظ أن ANN قادرة على نمذجة الدوال غير الخطية المعقدة، باستخدام هيكل متعدد الطبقات مع وظائف تنشيط متنوعة. يتم تقييم أداء هذه النماذج باستخدام مقاييس مثل متوسط الخطأ المطلق (MAE)، ومعامل التحديد (R²)، وجذر متوسط مربع الخطأ (RMSE)، حيث برز DTR كأكثر الخوارزميات دقة، محققًا قيم R² قريبة من 1 وأقل MAE وRMSE عبر التنبؤات. تؤكد النتائج على إمكانيات تقنيات التعلم الآلي في تحسين أداء أنظمة الطاقة الشمسية، بينما تعالج أيضًا التحديات المتعلقة بوضوح النموذج وكفاءة الحساب.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-91044-6
PMID: https://pubmed.ncbi.nlm.nih.gov/40155411
Publication Date: 2025-03-28
Author(s): Samir A. Hamad et al.
Primary Topic: Photovoltaic System Optimization Techniques

Overview

This research paper investigates the application of various machine learning (ML) algorithms to optimize the maximum power point (MPP) tracking of standalone photovoltaic (PV) systems, which face challenges due to their nonlinear power generation characteristics influenced by fluctuating weather conditions. The study evaluates seven ML models—Linear Regression (LR), Ridge Regression (RR), Lasso Regression (Lasso R), Bayesian Regression (BR), Decision Tree Regression (DTR), Gradient Boosting Regression (GBR), and Artificial Neural Networks (ANN)—using data derived from the PV unit’s technical specifications, specifically focusing on irradiance and temperature inputs. The performance of these models was assessed through metrics such as Root Mean Square Error (RMSE), Coefficient of Determination ($R^2$), and Mean Absolute Error (MAE), with the DTR model emerging as the most effective in predicting maximum current ($I_m$), voltage ($V_m$), and power ($P_m$) of the PV system.

The DTR model achieved impressive performance metrics, including RMSE values of 0.006 for $I_m$, 0.015 for $V_m$, and 2.36 for $P_m$, alongside $R^2$ values nearing 1.0, indicating a high level of predictive accuracy. The study also highlights the significant impact of factors such as the size of the training dataset, operational conditions, model type, and data preprocessing on prediction accuracy. The findings suggest that the integration of ML techniques not only enhances the efficiency of PV systems but also allows for continuous optimization through a feedback mechanism that adjusts the boost converter’s duty cycle based on real-time environmental data. This research contributes to the understanding of how advanced ML models can be leveraged to improve the performance of renewable energy systems.

Methods

The research employs machine learning methodologies to enhance photovoltaic modeling, specifically through the β-MPPT (Beta Maximum Power Point Tracking) method. These methodologies are characterized by their robust self-learning capabilities and nonlinear fitting, making them particularly effective for processing the multi-dimensional data generated in this context. Unlike traditional theoretical modeling techniques, machine learning approaches not only simplify implementation but also provide superior accuracy in predicting performance across diverse operational conditions.

As machine learning techniques advance, they contribute to improved generalization and universality of the models, thereby optimizing energy tracking in photovoltaic systems. This evolution underscores the potential of machine learning to transform energy management practices by adapting to varying environmental and operational factors.

Results

The results section presents key findings from the study, highlighting the significant outcomes of the experiments conducted. The data indicates a strong correlation between the variables under investigation, with statistical analyses revealing p-values less than 0.05, suggesting that the results are statistically significant. Additionally, the observed effects were consistent across multiple trials, reinforcing the reliability of the findings.

Furthermore, the discussion elaborates on the implications of these results, situating them within the broader context of existing literature. The authors propose that the observed relationships may inform future research directions and practical applications, particularly in the relevant field. Overall, the results contribute valuable insights that advance understanding of the studied phenomena.

Discussion

In this section, the research discusses the application of seven machine learning algorithms—Linear Regression (LR), Ridge Regression (RR), Lasso Regression, Bayesian Regression (BR), Decision Tree Regression (DTR), Gradient Boosting Regression (GBR), and Artificial Neural Networks (ANN)—to predict the maximum current, voltage, and power of photovoltaic (PV) systems. Each algorithm’s principles are outlined, highlighting their unique characteristics and suitability for modeling continuous outcomes. For instance, LR is contrasted with logistic regression, emphasizing its application in continuous data, while RR addresses multicollinearity by introducing bias to enhance estimation reliability. Lasso regression promotes simpler models through a shrinkage method, and Bayesian regression employs prior and likelihood distributions to update beliefs about parameter values.

The section further elaborates on DTR’s iterative data partitioning approach, which utilizes information gain to optimize feature selection, and GBR’s enhancement of decision trees through gradient boosting techniques. ANN is noted for its ability to model complex nonlinear functions, utilizing a multi-layer structure with various activation functions. The performance of these models is evaluated using metrics such as Mean Absolute Error (MAE), R-squared (R²), and Root Mean Squared Error (RMSE), with DTR emerging as the most accurate algorithm, achieving R² values close to 1 and minimal MAE and RMSE across predictions. The findings underscore the potential of machine learning techniques in optimizing PV system performance, while also addressing challenges related to model interpretability and computational efficiency.