إطار عمل قوي وعام في تصنيف السكري عبر بيئات غير متجانسة A robust and generalized framework in diabetes classification across heterogeneous environments

المجلة: Computers in Biology and Medicine، المجلد: 186
DOI: https://doi.org/10.1016/j.compbiomed.2025.109720
PMID: https://pubmed.ncbi.nlm.nih.gov/39864329
تاريخ النشر: 2025-01-25
المؤلف: Zhenyun Du وآخرون
الموضوع الرئيسي: الذكاء الاصطناعي في الرعاية الصحية

نظرة عامة

تتناول ورقة البحث القضية الصحية العالمية الهامة لمرض السكري (DM)، وخاصة تأثيره على النساء أثناء الحمل وبعد الولادة. تسلط الضوء على الزيادة في انتشار مرض السكري بسبب أنماط الحياة المستقرة والعادات الغذائية السيئة، مما يبرز الحاجة إلى التعرف المبكر لتجنب المضاعفات وتحسين نتائج العلاج. غالبًا ما تفتقر النماذج التنبؤية التقليدية إلى الصلاحية الخارجية عبر مجموعات بيانات متنوعة، مما دفع المؤلفين إلى اقتراح إطار عمل لتعلم الآلة (ML) لتنبؤ مرض السكري باستخدام مجموعتي بيانات PIMA و BD. تستخدم الدراسة تقنيات تحقق متنوعة، كاشفة أن نموذج تعزيز التدرج المتطرف (XGBoost) حقق دقة 79% على مجموعة بيانات PIMA، بينما وصلت نماذج الغابة العشوائية (RF) وتعزيز التدرج (GB) إلى ما يقرب من 99% على مجموعة بيانات BD. ومن الجدير بالذكر أن النموذج التجميعي حقق دقة 88% في التحقق بين المجموعات عندما تم تدريبه على PIMA واختباره على BD، على الرغم من أن الأداء انخفض إلى 74% عندما تم عكس مجموعات بيانات التدريب والاختبار.

تؤكد الاستنتاجات على إمكانية الإطار لتحسين تنبؤ مرض السكري من خلال دمج مجموعات بيانات متنوعة، مما يوضح أن دمج مجموعات البيانات الجزئية يمكن أن يحسن بشكل كبير من قوة النموذج ودقته. كما تبرز الدراسة أهمية تقنيات معالجة البيانات المسبقة لموثوقية النموذج. تهدف الأبحاث المستقبلية إلى دمج مجموعات بيانات إضافية تشمل عوامل ديموغرافية ونمط حياة، والتي تفتقر حاليًا، لتعزيز أداء النموذج وقابليته للتعميم. بالإضافة إلى ذلك، يخطط المؤلفون لتحسين تنبؤ أحداث نقص السكر في الدم وارتفاع السكر في الدم من خلال دمج بيانات السجلات الصحية الإلكترونية (EHR)، مما يوسع من تطبيق نتائجهم عبر مجموعات سكانية متنوعة.

مقدمة

تسلط مقدمة ورقة البحث الضوء على الأزمة الصحية العالمية المتزايدة التي يسببها مرض السكري، حيث يؤثر حاليًا على حوالي 463 مليون بالغ، مع توقعات تشير إلى أن هذا العدد قد يتجاوز 700 مليون بحلول عام 2045. يتم تصنيف مرض السكري إلى ثلاثة أنواع رئيسية: السكري من النوع 1 (T1D)، السكري من النوع 2 (T2D)، وسكري الحمل (GDM)، والذي يمثل مخاطر لكل من الأمهات وأطفالهن. يتم التأكيد على أن الإدارة الفعالة لمرض السكري، وخاصة من خلال الكشف المبكر وتعديلات نمط الحياة، هي أمر حاسم للتخفيف من المضاعفات الصحية على المدى الطويل.

تناقش الورقة إمكانيات تقنيات تعلم الآلة (ML) والتعلم العميق (DL) في تحسين تنبؤ مرض السكري، والتشخيص، والإدارة، متجاوزة الأساليب السريرية التقليدية. على الرغم من التقدم، لا تزال هناك تحديات في تطوير نماذج تنبؤية قوية وقابلة للتعميم قابلة للتطبيق عبر مجموعات بيانات متنوعة. تهدف الدراسة إلى سد هذه الفجوة من خلال اقتراح إطار عمل شامل لتعلم الآلة يقيم الأداء التنبؤي لمختلف النماذج عبر مجموعات بيانات متعددة. تشمل المساهمات الرئيسية تطوير نموذج تنبؤي عام بقيم ميزات موحدة، وتحليل مقارن للبيانات الأساسية لصلابة النموذج، والتحقق مقابل الأساليب الحديثة. يتم توضيح هيكل الورقة، مع تفاصيل الأقسام التالية التي ستتناول الأعمال ذات الصلة، ووصف مجموعات البيانات، وتقييم النماذج، ومناقشات حول النتائج والاتجاهات المستقبلية.

طرق البحث

في هذه الدراسة، نستخدم مجموعتين مختلفتين من البيانات لتدريب واختبار نموذج تنبؤ مرض السكري، كل منهما تشمل ثمانية ميزات مشابهة ولكنها مأخوذة من مناطق مختلفة. الهدف هو إجراء تحليل شامل ومقارنة لهذه المجموعات من البيانات لتعزيز فهمنا لعوامل خطر مرض السكري وتطوير نموذج تنبؤي يكون قابلاً للتعميم وقويًا. يتكون الإطار المقترح، الموضح في الشكل 1، من أربعة أقسام رئيسية: إنشاء ثلاث استراتيجيات مختلفة لتقسيم مجموعة بيانات التدريب والاختبار، وتطبيق طريقتين لمعالجة البيانات المسبقة، وبناء النماذج التنبؤية، وتقييم أداء النموذج باستخدام مقاييس متنوعة.

بالإضافة إلى ذلك، نحقق في تأثيرات التحقق من دمج مجموعات البيانات الجزئية من خلال تدريب النماذج على مجموعة بيانات مشتركة مستمدة من بيانات PIMA و BD. يهدف هذا النهج إلى تقييم التحسينات المحتملة في دقة التنبؤ التي قد تنشأ من دمج مصادر بيانات متنوعة، مما يساهم في الفعالية العامة لنموذج تنبؤ مرض السكري.

النتائج

يقدم قسم النتائج تقييمًا شاملاً لمختلف نماذج تعلم الآلة، بما في ذلك الغابة العشوائية (RF)، وتعزيز التدرج (GB)، والبيرسيبترون متعدد الطبقات (MLP)، وXGBoost (XGB)، ونموذج التجميع، ونماذج التعلم العميق (DL)، عبر سيناريوهات تحقق مختلفة. تشمل هذه السيناريوهات التحقق داخل مجموعة البيانات على مجموعتي بيانات PIMA و BD، والتحقق بين مجموعات البيانات، والتحقق من دمج مجموعات البيانات الجزئية. يتم تفصيل مقاييس الأداء في الجداول 4 إلى 9، مما يبرز فعالية النماذج في التنبؤ بالنتائج المتعلقة بالسكري بناءً على ميزات رئيسية مثل “الحمل”، “الجلوكوز”، “ضغط الدم”، “مؤشر كتلة الجسم”، “العمر”، و”وظيفة شجرة عائلة السكري”.

تشير التحليلات إلى أن النماذج تظهر مستويات أداء متفاوتة اعتمادًا على مجموعة البيانات وطريقة التحقق المستخدمة. ومن الجدير بالذكر أن النتائج تؤكد على أهمية اختيار الميزات في تعزيز دقة النموذج، حيث تساهم ميزات معينة باستمرار في تحسين القدرات التنبؤية عبر مجموعات البيانات التي تم تقييمها.

مناقشة

في قسم المناقشة هذا، يقدم المؤلفون نظرة شاملة على تقنيات تعلم الآلة (ML) والتعلم العميق (DL) الكلاسيكية الموجودة المطبقة على تنبؤ مرض السكري، مع التأكيد على قيود الدراسات السابقة. يشيرون إلى أن العديد من نماذج تعلم الآلة الكلاسيكية، مثل آلات الدعم المتجهة (SVM) وأشجار القرار (DT)، تم تقييمها بشكل أساسي على مجموعة بيانات PIMA، وغالبًا ما تفتقر إلى التعميم عبر مجموعات بيانات متنوعة. على سبيل المثال، بينما استكشفت دراسات مثل تلك التي أجراها تشانغ وآخرون وأشور وآخرون نماذج تعلم الآلة المختلفة، لم تعالج بشكل كافٍ دمج التعلم التجميعي أو استخدام مجموعات بيانات متعددة، وهو أمر حاسم لتعزيز قوة النموذج ودقته. بالإضافة إلى ذلك، يبرز المؤلفون أن العديد من أساليب التعلم العميق تعاني أيضًا من تحقق محدود، وغالبًا ما تعتمد فقط على مجموعة بيانات PIMA، مما يقيد قابليتها للتطبيق على مجموعات سكانية أوسع.

يقترح المؤلفون نهجًا جديدًا يدمج استراتيجية تعلم تجميعي شاملة ويجمع بين مجموعات بيانات متعددة، بما في ذلك مجموعتي بيانات PIMA و BD، لتحسين القابلية للتعميم ودقة التنبؤ. يؤكدون على أهمية معالجة البيانات المسبقة، بما في ذلك التعامل مع القيم المفقودة وتوحيد الميزات، لضمان موثوقية تدريب النموذج. تستخدم الدراسة نماذج تعلم آلة متنوعة (مثل الغابة العشوائية، وتعزيز التدرج، والبيرسيبترون متعدد الطبقات) ونموذج تعلم عميق سطحي، مع استخدام تقنيات مثل SMOTE لتحقيق توازن في بيانات التدريب مع الحفاظ على سلامة بيانات الاختبار. تشير النتائج إلى أن دمج مجموعات البيانات يعزز أداء النموذج، حيث تظهر النماذج التجميعية دقة وقوة أفضل مقارنة بنماذج مجموعة البيانات الواحدة. بشكل عام، تسهم هذه الأبحاث في المجال من خلال معالجة قيود الدراسات السابقة واقتراح إطار عمل أكثر فعالية لتنبؤ مرض السكري يستفيد من مصادر بيانات متنوعة وتقنيات نمذجة متقدمة.

Journal: Computers in Biology and Medicine, Volume: 186
DOI: https://doi.org/10.1016/j.compbiomed.2025.109720
PMID: https://pubmed.ncbi.nlm.nih.gov/39864329
Publication Date: 2025-01-25
Author(s): Zhenyun Du et al.
Primary Topic: Artificial Intelligence in Healthcare

Overview

The research paper addresses the significant global health issue of diabetes mellitus (DM), particularly its impact on women during pregnancy and postpartum. It highlights the rising prevalence of DM due to sedentary lifestyles and poor dietary habits, emphasizing the need for timely identification to prevent complications and improve treatment outcomes. Traditional predictive models often lack external validity across diverse datasets, prompting the authors to propose a machine learning (ML) framework for diabetes prediction utilizing the PIMA and BD datasets. The study employs various validation techniques, revealing that the Extreme Gradient Boosting (XGBoost) model achieved 79% accuracy on the PIMA dataset, while Random Forest (RF) and Gradient Boosting (GB) models reached nearly 99% on the BD dataset. Notably, the ensemble model attained 88% accuracy in inter-dataset validation when trained on PIMA and tested on BD, although performance declined to 74% when the training and testing datasets were reversed.

The conclusions emphasize the framework’s potential to enhance diabetes prediction through the integration of diverse datasets, demonstrating that partial dataset fusion can significantly improve model robustness and accuracy. The study also underscores the importance of data preprocessing techniques for model reliability. Future research aims to incorporate additional datasets that include demographic and lifestyle factors, which are currently lacking, to further enhance model performance and generalizability. Additionally, the authors plan to improve the prediction of hypoglycemic and hyperglycemic events by integrating electronic health records (EHR) data, thereby broadening the applicability of their findings across various populations.

Introduction

The introduction of the research paper highlights the escalating global health crisis posed by diabetes, with approximately 463 million adults currently affected and projections indicating this number could surpass 700 million by 2045. Diabetes is categorized into three main types: type 1 diabetes (T1D), type 2 diabetes (T2D), and gestational diabetes mellitus (GDM), the latter of which presents risks for both mothers and their offspring. Effective management of diabetes, particularly through early detection and lifestyle modifications, is emphasized as critical to mitigating long-term health complications.

The paper discusses the potential of machine learning (ML) and deep learning (DL) methods in enhancing diabetes prediction, diagnosis, and management, surpassing traditional clinical approaches. Despite advancements, challenges remain in developing robust and generalizable prediction models applicable across diverse datasets. The study aims to fill this gap by proposing a comprehensive ML framework that evaluates the predictive performance of various models across multiple datasets. Key contributions include the development of a generalized predictive model with standardized feature values, a comparative analysis of essential data for model robustness, and validation against state-of-the-art methods. The structure of the paper is outlined, detailing subsequent sections that will cover related work, dataset descriptions, model evaluations, and discussions on findings and future directions.

Methods

In this study, we utilize two distinct datasets for training and testing a diabetes prediction model, each encompassing eight similar features but sourced from different regions. The objective is to conduct a comprehensive analysis and comparison of these datasets to enhance our understanding of diabetes risk factors and to develop a prediction model that is both generalizable and robust. The proposed framework, illustrated in Fig. 1, consists of four key sections: establishing three different train-test dataset split strategies, applying two data preprocessing methods, constructing the predictive models, and evaluating model performance using various metrics.

Additionally, we investigate the effects of partial fusion dataset validation by training the models on a combined dataset derived from the PIMA and BD data. This approach aims to assess the potential improvements in predictive accuracy that may arise from integrating diverse data sources, thereby contributing to the overall effectiveness of the diabetes prediction model.

Results

The results section presents a comprehensive evaluation of various machine learning models, including Random Forest (RF), Gradient Boosting (GB), Multi-Layer Perceptron (MLP), XGBoost (XGB), an Ensemble model, and Deep Learning (DL) models, across different validation scenarios. These scenarios encompass intra-dataset validation on the PIMA and BD datasets, inter-dataset validation, and partial fusion dataset validation. The performance metrics are detailed in Tables 4 through 9, highlighting the models’ effectiveness in predicting diabetes-related outcomes based on key features such as ‘Pregnancies’, ‘Glucose’, ‘Blood Pressure’, ‘BMI’, ‘Age’, and ‘Diabetes Pedigree Function’.

The analysis indicates that the models exhibit varying performance levels depending on the dataset and validation approach employed. Notably, the results underscore the importance of feature selection in enhancing model accuracy, with specific features consistently contributing to improved predictive capabilities across the evaluated datasets.

Discussion

In this discussion section, the authors provide a comprehensive overview of existing classical machine learning (ML) and deep learning (DL) techniques applied to diabetes prediction, emphasizing the limitations of prior studies. They note that many classical ML models, such as Support Vector Machines (SVM) and Decision Trees (DT), have been evaluated primarily on the PIMA dataset, often lacking generalization across diverse datasets. For instance, while studies like those by Chang et al. and Ashour et al. explored various ML models, they did not adequately address the integration of ensemble learning or the use of multiple datasets, which are crucial for enhancing model robustness and accuracy. Additionally, the authors highlight that many DL approaches also suffer from limited validation, often relying solely on the PIMA dataset, which restricts their applicability to broader populations.

The authors propose a novel approach that incorporates a comprehensive ensemble learning strategy and integrates multiple datasets, including the PIMA and BD datasets, to improve generalizability and prediction accuracy. They emphasize the importance of data preprocessing, including handling missing values and standardizing features, to ensure the reliability of model training. The study employs various ML models (e.g., Random Forest, Gradient Boosting, and Multi-Layer Perceptron) and a shallow DL model, utilizing techniques like SMOTE to balance training data while preserving the integrity of test data. The results indicate that combining datasets enhances model performance, with ensemble models demonstrating superior accuracy and robustness compared to single-dataset models. Overall, this research contributes to the field by addressing the limitations of previous studies and proposing a more effective framework for diabetes prediction that leverages diverse data sources and advanced modeling techniques.