تعزيز التنبؤ بعبء السكري المدفوع بالذكاء الاصطناعي: تحليل مقارن لنماذج التعلم العميق والنماذج الإحصائية Enhancing AI-driven forecasting of diabetes burden: a comparative analysis of deep learning and statistical models

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-14599-4
PMID: https://pubmed.ncbi.nlm.nih.gov/40783432
تاريخ النشر: 2025-08-09
المؤلف: Rasool Esmaeilyfard وآخرون
الموضوع الرئيسي: الذكاء الاصطناعي في الرعاية الصحية

نظرة عامة

تدرس هذه الدراسة فعالية أربعة نماذج للتنبؤ – المحول مع مشفر تلقائي متغير (VAE)، الذاكرة طويلة وقصيرة الأجل (LSTM)، وحدة التكرار المغلقة (GRU)، ومتوسط الحركة التكاملي الذاتي (ARIMA) – في التنبؤ بالعبء العالمي للسكري. باستخدام بيانات سنوية عن سنوات الحياة المعدلة حسب الإعاقة (DALYs)، والوفيات، وانتشار المرض من 1990 إلى 2021، تم تدريب النماذج على بيانات من 1990 إلى 2014 وتقييمها من 2015 إلى 2021. شملت مقاييس الأداء متوسط الخطأ المطلق (MAE) وجذر متوسط مربع الخطأ (RMSE)، مع تقييم المتانة من خلال محاكاة الضوضاء والبيانات المفقودة. أظهر نموذج المحول-VAE أعلى دقة تنبؤية (MAE: 0.425، RMSE: 0.501) ومرونة تجاه عيوب البيانات (p < 0.01)، بينما أظهرت LSTM وGRU قيودًا في نمذجة الاتجاهات طويلة الأجل. على الرغم من كفاءته، واجه ARIMA صعوبة في التنبؤ الديناميكي. تؤكد النتائج على إمكانيات نماذج التعلم العميق المتقدمة في التنبؤ بالرعاية الصحية، خاصة قدرة المحول-VAE على إدارة سيناريوهات البيانات المعقدة. ومع ذلك، قد تحد متطلباته الحسابية العالية والتحديات في قابلية التفسير من تطبيقه في البيئات ذات الموارد المحدودة. تدعو الدراسة إلى مزيد من البحث لتعزيز كفاءة النموذج من خلال تقنيات مثل SHAP للشفافية، وأساليب النمذجة الهجينة، ودمج العوامل الاجتماعية والبيئية لتحسين دقة التنبؤ وإبلاغ التدخلات الصحية العامة المستهدفة.

الطرق

في هذا القسم، يحدد المؤلفون إطارًا منهجيًا للتنبؤ بدقة بعبء السكري المستقبلي، مع التأكيد على أهمية معالجة الاعتماد الزمني المعقد المتأصل في بيانات السلاسل الزمنية. تنشأ هذه الاعتمادات من عوامل متفاعلة متنوعة، بما في ذلك الانتشار، ومعدلات الوفيات، وسنوات الحياة المعدلة حسب الإعاقة (DALYs). يتكون النهج المقترح من عدة خطوات رئيسية: جمع البيانات، المعالجة المسبقة (التي تشمل تعويض القيم المفقودة، والتطبيع، والتنسيق)، تهيئة معلمات النموذج، وتطبيق نموذج تعلم عميق للتنبؤ.

يبرز المؤلفون استخدام ثلاثة نماذج متقدمة من التعلم الآلي – المحول مع مشفر تلقائي متغير (VAE)، الذاكرة طويلة وقصيرة الأجل (LSTM)، ووحدة التكرار المغلقة (GRU) – لإدارة التحديات التي تطرحها الاعتمادات الزمنية والبيانات غير المكتملة بشكل فعال. تم اختيار هذه النماذج لقدرتها على التقاط الاعتمادات طويلة الأجل وتعزيز المتانة التنبؤية. سيتم تقييم أداء النماذج باستخدام مقاييس مناسبة، مما يضمن تقييمًا شاملاً لدقة التنبؤ الخاصة بها.

النتائج

يقيم قسم النتائج أداء أربعة نماذج: المشفر التلقائي المتغير (VAE)، الذاكرة طويلة وقصيرة الأجل (LSTM)، وحدة التكرار المغلقة (GRU)، ومتوسط الحركة التكاملي الذاتي (ARIMA). يتم تقييم فعالية كل نموذج بناءً على مقاييس محددة، والتي لم يتم تفصيلها في النص المقدم. تشير النتائج إلى نقاط القوة والضعف المقارنة بين النماذج، مما يبرز ملاءمتها لأنواع مختلفة من البيانات أو مهام التنبؤ. قد تكشف التحليلات الإضافية عن رؤى حول قدرات النماذج التنبؤية وأدائها تحت ظروف متغيرة.

المناقشة

في قسم المناقشة من ورقة البحث، يبرز المؤلفون التقدم في التنبؤ بعبء السكري من خلال الذكاء الاصطناعي (AI)، مع التركيز بشكل خاص على تفوق تقنيات التعلم العميق (DL) على النماذج الإحصائية التقليدية. يشيرون إلى أن أساليب DL قد أظهرت دقة ملحوظة في تقدير الانتشار وعوامل الخطر المرتبطة بمضاعفات السكري، مثل اعتلال الشبكية السكري، وغالبًا ما تتطابق مع أداء المقيمين البشريين الخبراء ولكن مع تقليل كبير في وقت المعالجة. تقارن الدراسة منهجيتها بالأعمال السابقة، مثل تلك التي قام بها المطيري وآخرون وخان وآخرون، والتي استخدمت بشكل أساسي طرقًا إحصائية تقليدية وافتقرت إلى عمق التحليل الذي توفره هياكل DL الحديثة. يؤكد المؤلفون أن دراستهم رائدة في دمج التعلم العميق مع التصنيف القائم على الدخل والتنبؤ متعدد الأبعاد بعبء السكري، باستخدام مجموعة بيانات شاملة تمتد حتى عام 2021.

كما يناقش المؤلفون التحديات التي تواجه هذا المجال، بما في ذلك توفر البيانات وقابلية تفسير النموذج. يؤكدون على أهمية تعزيز شفافية النموذج وملاءمته السريرية، والتي تعالجها منهجيتهم من خلال دمج التعلم العميق مع التقييم المنظم والمعايير المقارنة. تستخدم الدراسة نماذج متقدمة، بما في ذلك مشفر تلقائي متغير قائم على المحول (VAE)، والذاكرة طويلة وقصيرة الأجل (LSTM)، ووحدة التكرار المغلقة (GRU)، للتنبؤ بالمؤشرات الرئيسية للسكري عبر مجموعات الدخل المختلفة للبنك الدولي. تشير النتائج إلى أن نهجهم لا يحسن فقط الأداء التنبؤي ولكن أيضًا يقدم إطارًا أكثر عدلاً لفهم العبء العالمي للسكري، مما يساهم في تقديم رؤى قيمة للتدخل المبكر واستراتيجيات الصحة العامة.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-14599-4
PMID: https://pubmed.ncbi.nlm.nih.gov/40783432
Publication Date: 2025-08-09
Author(s): Rasool Esmaeilyfard et al.
Primary Topic: Artificial Intelligence in Healthcare

Overview

This study investigates the effectiveness of four forecasting models—Transformer with Variational Autoencoder (VAE), Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), and AutoRegressive Integrated Moving Average (ARIMA)—in predicting the global burden of diabetes. Utilizing annual data on Disability-Adjusted Life Years (DALYs), Deaths, and Prevalence from 1990 to 2021, the models were trained on data from 1990 to 2014 and evaluated from 2015 to 2021. Performance metrics included Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE), with robustness assessed through noise and missing data simulations. The Transformer-VAE model demonstrated the highest predictive accuracy (MAE: 0.425, RMSE: 0.501) and resilience to data imperfections (p < 0.01), while LSTM and GRU showed limitations in long-term trend modeling. ARIMA, although efficient, struggled with dynamic forecasting. The findings underscore the potential of advanced deep learning models for healthcare forecasting, particularly the Transformer-VAE's ability to manage complex data scenarios. However, its high computational demands and challenges in interpretability may restrict its application in resource-limited settings. The study advocates for future research to enhance model efficiency through techniques like SHAP for transparency, hybrid modeling approaches, and the incorporation of socioeconomic and environmental factors to improve forecasting accuracy and inform targeted public health interventions.

Methods

In this section, the authors outline a methodological framework for accurately forecasting the future burden of diabetes, emphasizing the importance of addressing complex temporal dependencies inherent in time-series data. These dependencies arise from various interacting factors, including prevalence, mortality rates, and Disability-Adjusted Life Years (DALYs). The proposed approach consists of several key steps: data collection, preprocessing (which includes imputation of missing values, normalization, and formatting), initialization of model parameters, and the application of a deep learning model for forecasting.

The authors highlight the use of three advanced machine learning models—Transformer with Variational Autoencoder (VAE), Long Short-Term Memory (LSTM), and Gated Recurrent Unit (GRU)—to effectively manage the challenges posed by temporal dependencies and incomplete data. These models were chosen for their capabilities in capturing long-term dependencies and enhancing predictive robustness. The performance of the models will be evaluated using appropriate metrics, ensuring a comprehensive assessment of their forecasting accuracy.

Results

The results section evaluates the performance of four models: Variational Autoencoder (VAE), Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), and Autoregressive Integrated Moving Average (ARIMA). Each model’s effectiveness is assessed based on specific metrics, which are not detailed in the provided text. The findings indicate comparative strengths and weaknesses among the models, highlighting their suitability for different types of data or forecasting tasks. Further analysis may reveal insights into the models’ predictive capabilities and their performance under varying conditions.

Discussion

In the discussion section of the research paper, the authors highlight the advancements in forecasting diabetes burden through Artificial Intelligence (AI), particularly emphasizing the superiority of deep learning (DL) techniques over traditional statistical models. They note that DL approaches have demonstrated remarkable accuracy in estimating the prevalence and risk factors associated with diabetic complications, such as diabetic retinopathy, often matching the performance of expert human graders but with significantly reduced processing time. The study contrasts its methodology with previous works, such as those by Almutairi et al. and Khan et al., which primarily utilized traditional statistical methods and lacked the depth of analysis provided by modern DL architectures. The authors assert that their study is pioneering in combining deep learning with income-based stratification and multidimensional forecasting of diabetes burden, utilizing a comprehensive dataset that spans up to 2021.

The authors also discuss the challenges faced in the field, including data availability and model interpretability. They emphasize the importance of enhancing model transparency and clinical relevance, which their methodology addresses by integrating deep learning with structured evaluation and comparative baselines. The study employs advanced models, including a Transformer-based Variational Autoencoder (VAE), Long Short-Term Memory (LSTM), and Gated Recurrent Unit (GRU), to forecast key diabetes indicators across different World Bank income groups. The results indicate that their approach not only improves predictive performance but also offers a more equitable framework for understanding the global diabetes burden, thus contributing valuable insights for early intervention and public health strategies.