توقع الأداء الأكاديمي القائم على التعلم الآلي مع إمكانية الشرح لتحسين اتخاذ القرار في المؤسسات التعليمية Machine learning-based academic performance prediction with explainability for enhanced decision-making in educational institutions

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-12353-4
PMID: https://pubmed.ncbi.nlm.nih.gov/40707614
تاريخ النشر: 2025-07-24
المؤلف: Wesam Ahmed وآخرون
الموضوع الرئيسي: التعلم عبر الإنترنت والتحليلات

نظرة عامة

تؤكد ورقة البحث على أهمية التعليم في تطوير المهارات الحياتية الأساسية وتخصيص الموارد، مع تسليط الضوء على دمج التقنيات المتقدمة مثل الذكاء الاصطناعي (AI) في التعليم العالي. تركز على التنبؤ بالأداء الأكاديمي لتعزيز تصنيفات الجامعات وفرص الطلاب، مع معالجة التحديات في تحليل الأداء والتقييم من خلال نماذج التعلم الآلي (ML). تستخدم الدراسة عشرة نماذج انحدار، بما في ذلك الجيران الأقرب، والانحدار الخطي، والانحدار بالتصويت الجماعي (VR)، للتنبؤ بالنتائج الأكاديمية باستخدام مجموعتين بيانات متميزتين. تكشف المجموعة الأولى، التي تحتوي على 10,000 عينة وست ميزات، والثانية، التي تحتوي على 6,607 سجلات و20 ميزة، أن نموذج VR الجماعي يتفوق على الآخرين، محققًا R² قدره 0.9890 للمجموعة الأولى و0.7716 للثانية، مما يدل على قوته عبر سياقات أكاديمية متنوعة.

في الختام، يقترح المؤلفون منهجية تجمع بين تقنيات المعالجة المسبقة ونماذج الانحدار المتعددة للتنبؤ بأداء الطلاب بشكل فعال. يظهر نموذج VR الجماعي مقاييس أداء متفوقة، بما في ذلك MAE قدره 0.0837 وRMSE قدره 0.1050. تحدد الدراسة المتنبئين الرئيسيين لنجاح الطلاب، مثل المشاركة في الأنشطة اللامنهجية ومستويات التحضير، باستخدام نماذج قابلة للتفسير مثل LIME وSHAP. تشمل اتجاهات البحث المستقبلية استكشاف تقنيات التعلم العميق، ودمج العوامل الاجتماعية والاقتصادية، وتطوير أنظمة التعلم التكيفية، ومعالجة التحيزات في النماذج التنبؤية. تؤكد الدراسة على أهمية التعامل الأخلاقي مع البيانات وحماية الخصوصية في تحليلات التعليم، داعيةً إلى الشفافية والعدالة في عمليات اتخاذ القرار.

الطرق

تتضمن المنهجية المقترحة للتنبؤ بأداء الطلاب نهجًا منهجيًا يبدأ بمعالجة البيانات لضمان أن مجموعة البيانات مناسبة للتحليل. بعد ذلك، تم اختيار تسعة نماذج انحدار مختلفة للتعلم الآلي (ML) للتقييم. تم تقييم أداء هذه النماذج، بما في ذلك نموذج الانحدار بالتصويت، باستخدام ثلاثة مقاييس مختلفة – خطأ الجذر التربيعي المتوسط (RMSE)، وخطأ القيمة المطلقة المتوسطة (MAE)، وR² – على مجموعة بيانات التحقق. تختتم القسم بتفسير تنبؤات النموذج، مدعومًا بمخطط معماري توضيحي لنظام التنبؤ.

تم تصميم المنهجية لمعالجة القيود التي تم تحديدها في الدراسات السابقة، مثل الاعتماد على مجموعات بيانات ضيقة وقياسات تقييم غير كافية. من خلال استخدام مجموعة بيانات شاملة وتنفيذ نموذج تصويت يدمج ميزات متنوعة، تعزز النهج المقترح من التعرف المبكر على المخاطر في أداء الطلاب. علاوة على ذلك، فإنه يقدم تحليلًا مقارنًا قويًا لنماذج الانحدار، مما يوفر فهمًا أكثر شمولية لفعاليتها في التنبؤ بالنتائج.

النتائج

تقيم قسم النتائج أداء نماذج الانحدار المطورة لتوقع أداء الطلاب، باستخدام كل من الميزات المختارة ومجموعة الميزات الكاملة. تم تقسيم مجموعة البيانات إلى 80% للتدريب و20% للاختبار، مع تقييم دقة النموذج من خلال مقاييس مثل خطأ القيمة المطلقة المتوسطة (MAE)، وخطأ الجذر التربيعي المتوسط (RMSE)، ودرجة R². تؤكد النتائج على فعالية المنهجية المقترحة وقابليتها للتطبيق في السياقات الواقعية. من الجدير بالذكر أن التحليل يكشف أن مشاركة الطلاب في الأنشطة اللامنهجية منخفضة، مما قد يعيق الأداء العام، بينما توجد علاقة إيجابية بين ساعات النوم ومؤشر الأداء، مما يبرز ضرورة الراحة الكافية للنجاح الأكاديمي.

يوضح التحليل الإضافي من خلال SHAP (SHapley Additive exPlanations) تأثير الميزات المختلفة على تنبؤات النموذج. ظهرت الدرجات السابقة كأهم متنبئ، حيث ارتبطت الدرجات الأعلى بتحسين النتائج. كما أثرت ساعات الدراسة بشكل إيجابي على التنبؤات، بينما كانت العلاقة مع ساعات النوم أكثر تعقيدًا، مما يشير إلى أن النوم المفرط أو غير الكافي يمكن أن يؤثر سلبًا على الأداء. ساهمت أوراق الأسئلة النموذجية التي تم ممارستها بشكل إيجابي ولكن كانت أقل تأثيرًا من الدرجات السابقة وساعات الدراسة. كانت الأنشطة اللامنهجية لها تأثير ضئيل على التنبؤات. تختتم القسم بتحليل LIME (تفسيرات محلية قابلة للتفسير وغير مرتبطة بالنموذج)، مما يوفر رؤى حول تنبؤات النموذج بناءً على الميزات المختارة.

المناقشة

تسلط قسم المناقشة في ورقة البحث الضوء على التقدم في خوارزميات التعلم الآلي (ML) لتوقع الأداء الأكاديمي للطلاب، مع التأكيد على أهمية هذه التنبؤات للمؤسسات التعليمية. استكشفت دراسات مختلفة نهج ML المختلفة، كاشفةً عن قيود مثل الاعتماد على مجموعات بيانات صغيرة، ونقص في تحليل الميزات الشامل، ومقاييس تقييم غير متسقة. من الجدير بالذكر أن نموذج الانحدار بالتصويت المقترح في هذه الدراسة يعالج هذه النواقص من خلال دمج نماذج متعددة، مما يعزز دقة التنبؤ وقوتها مع تقليل مخاطر الإفراط في التكيف. يساعد استخدام LIME وSHAP في تفسير النموذج أيضًا المعلمين في فهم العوامل التي تؤثر على أداء الطلاب.

تظهر تقييم أداء نموذج الانحدار بالتصويت الجماعي المقترح (VR) تفوقه على النماذج الفردية، محققًا أقل خطأ قيمة مطلقة (MAE) قدره 0.0837، وخطأ الجذر التربيعي المتوسط (RMSE) قدره 0.1050، وأعلى درجة R² قدرها 0.9890. بالمقابل، أظهرت نماذج مثل AdaBoost وK-Neighbors مقاييس خطأ أعلى، مما يدل على عدم فعاليتها في التقاط أنماط البيانات الأساسية. يقلل النهج الجماعي بشكل فعال من التحيز والتباين، مما يؤدي إلى نموذج تنبؤ قوي يتعمم بشكل جيد على البيانات غير المرئية، كما يتضح من الأداء المتسق عبر اختبارات التحقق المتقاطع. بشكل عام، تؤكد النتائج على إمكانيات الأساليب الجماعية في تعزيز دقة التنبؤ في البيئات التعليمية، مما يوفر رؤى قابلة للتنفيذ لتحديد الطلاب المعرضين للخطر.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-12353-4
PMID: https://pubmed.ncbi.nlm.nih.gov/40707614
Publication Date: 2025-07-24
Author(s): Wesam Ahmed et al.
Primary Topic: Online Learning and Analytics

Overview

The research paper emphasizes the significance of education in developing essential life skills and resource allocation, highlighting the integration of advanced technologies like artificial intelligence (AI) in higher education. It focuses on predicting academic performance to enhance university rankings and student opportunities, addressing challenges in performance analysis and evaluation through machine learning (ML) models. The study employs ten regression models, including K-Nearest Neighbors, Linear Regression, and ensemble voting regression (VR), to predict academic outcomes using two distinct datasets. The first dataset, with 10,000 samples and six features, and the second, with 6,607 records and 20 features, reveal that the ensemble VR model outperforms others, achieving an R² of 0.9890 for the first dataset and 0.7716 for the second, indicating its robustness across various academic contexts.

In the conclusion, the authors propose a methodology that combines preprocessing techniques and multiple regression models to predict student performance effectively. The ensemble VR model demonstrates superior performance metrics, including an MAE of 0.0837 and RMSE of 0.1050. The study identifies key predictors of student success, such as extracurricular participation and preparation levels, using interpretable models like LIME and SHAP. Future research directions include exploring deep learning techniques, incorporating socioeconomic factors, developing adaptive learning systems, and addressing biases in predictive models. The study underscores the importance of ethical data handling and privacy protection in educational analytics, advocating for transparency and fairness in decision-making processes.

Methods

The proposed methodology for predicting student performance involves a systematic approach that begins with data preprocessing to ensure the dataset is suitable for analysis. Following this, nine distinct machine learning (ML) regression models were selected for evaluation. The performance of these models, including a voting regressor, was assessed using three different metrics—Root Mean Square Error (RMSE), Mean Absolute Error (MAE), and R²—on a validation dataset. The section concludes with an interpretation of the model’s predictions, supported by an illustrative architecture diagram of the prediction system.

The methodology is designed to address limitations identified in previous studies, such as reliance on narrow datasets and inadequate evaluation metrics. By utilizing a comprehensive dataset and implementing a voting model that incorporates diverse features, the proposed approach enhances early risk identification in student performance. Furthermore, it offers a robust comparative analysis of the regression models, thereby providing a more holistic understanding of their effectiveness in predicting outcomes.

Results

The results section evaluates the performance of regression models developed to forecast student performance, utilizing both selected features and the complete feature set. The dataset was split into 80% for training and 20% for testing, with model accuracy assessed through metrics such as Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and R² Score. The findings underscore the proposed methodology’s effectiveness and its applicability in real-world contexts. Notably, the analysis reveals that student engagement in extracurricular activities is low, potentially hindering overall performance, while a positive correlation exists between sleep hours and performance index, emphasizing the necessity of adequate rest for academic success.

Further analysis through SHAP (SHapley Additive exPlanations) illustrates the influence of various features on model predictions. Previous scores emerged as the most significant predictor, with higher scores correlating with improved outcomes. Hours studied also positively impacted predictions, whereas the relationship with sleep hours was more nuanced, indicating that both excessive and insufficient sleep could adversely affect performance. Sample question papers practiced contributed positively but were less influential than previous scores and study hours. Extracurricular activities had minimal impact on predictions. The section concludes with a LIME (Local Interpretable Model-agnostic Explanations) analysis, providing insights into the model’s predictions based on selected features.

Discussion

The discussion section of the research paper highlights the advancements in machine learning (ML) algorithms for predicting student academic performance, emphasizing the importance of such predictions for educational institutions. Various studies have explored different ML approaches, revealing limitations such as reliance on small datasets, lack of comprehensive feature analysis, and inconsistent evaluation metrics. Notably, the proposed voting regression model in this study addresses these shortcomings by integrating multiple models, enhancing prediction accuracy and robustness while mitigating overfitting risks. The use of LIME and SHAP for model interpretability further aids educators in understanding the factors influencing student performance.

The performance evaluation of the proposed Ensemble Voting Regressor (VR) demonstrates its superiority over individual models, achieving the lowest Mean Absolute Error (MAE) of 0.0837, Root Mean Square Error (RMSE) of 0.1050, and the highest R² score of 0.9890. In contrast, models like AdaBoost and K-Neighbors exhibited higher error metrics, indicating their inefficacy in capturing the underlying data patterns. The ensemble approach effectively minimizes bias and variance, leading to a robust predictive model that generalizes well to unseen data, as evidenced by consistent performance across cross-validation tests. Overall, the findings underscore the potential of ensemble methods in enhancing predictive accuracy in educational settings, providing actionable insights for identifying at-risk students.