تحسين توقع مرض السكري باستخدام الشبكات العصبية التلافيفية المدربة مسبقاً، LSTM، وGAN الشرطي على البيانات الرقمية المحولة Enhanced diabetes prediction using pre-trained CNNs, LSTM, and conditional GAN on transformed numerical data

المجلة: Scientific Reports، المجلد: 16، العدد: 1
DOI: https://doi.org/10.1038/s41598-026-38942-5
PMID: https://pubmed.ncbi.nlm.nih.gov/41667617
تاريخ النشر: 2026-02-10
المؤلف: K. Singh وآخرون
الموضوع الرئيسي: الذكاء الاصطناعي في الرعاية الصحية

نظرة عامة

تتناول الأبحاث التحدي الكبير الذي تمثله السكري للصحة العامة، والذي يمكن أن يؤدي إلى مضاعفات خطيرة إذا لم يتم اكتشافه مبكرًا. تقترح الدراسة إطار عمل جديد للتعلم العميق يدمج تحويل البيانات الجدولية إلى صور، والشبكات العصبية التلافيفية المدربة مسبقًا (CNNs)، وشبكات الذاكرة طويلة وقصيرة المدى (LSTM) لتعزيز توقع السكري. باستخدام مجموعة بيانات السكري لسكان البيما، قام المؤلفون بتحويل الميزات العددية إلى تمثيلات صور ثنائية الأبعاد بناءً على أنماط الارتباط وأهمية الميزات. استخدموا الشبكات التنافسية التوليدية الشرطية (cGANs) لتوليد عينات تدريب اصطناعية، محققين دقة تصنيف تبلغ 94% وAUC تبلغ 98% من خلال نهج التحقق المتقاطع بخمسة أضعاف. تم التحقق من أداء النموذج على مجموعة بيانات السكري في فرانكفورت، مما يشير إلى نتائج قابلة للمقارنة، على الرغم من أن حجم العينة المحدود يتطلب مزيدًا من الدراسات لتأسيس القابلية للتعميم.

تشير النتائج إلى أن الإطار المقترح يحسن بشكل فعال توقع السكري من خلال الاستفادة من تقنيات تمثيل البيانات المتقدمة وتعلم الميزات. ومن الجدير بالذكر أن دمج ResNet152 مع LSTM ودمج الميزات أسفر عن أعلى دقة، مما يبرز فوائد الجمع بين أساليب التعلم العميق المكاني والتسلسلي. لقد تفوقت البيانات المعززة بواسطة cGAN على طرق التعزيز التقليدية، مما يبرز مزايا توليد البيانات الاصطناعية الواقعية. بينما تشير النتائج إلى إمكانات واعدة للتطبيق السريري، يؤكد المؤلفون على الحاجة إلى مزيد من التحقق على مجموعات بيانات أكبر وأكثر تنوعًا لضمان نشر موثوق في بيئات الرعاية الصحية الواقعية.

طرق

في هذه الدراسة، تم تطوير منهجية جديدة لتحويل مجموعة بيانات عددية إلى تمثيلات شبيهة بالصور لاستخدامها مع الشبكات العصبية التلافيفية (CNNs). تضمنت العملية تجميع الميزات بناءً على الارتباط، وتنقيح ترتيبها باستخدام t-SNE للحفاظ على علاقات التشابه، ورسم قيم الميزات القياسية إلى كثافات البكسل. يعزز هذا النهج، الذي أظهر وعدًا في الأبحاث الطبية الحيوية، قدرة CNN على اكتشاف الأنماط المحلية والعلاقات الأوسع ضمن البيانات. شمل سير العمل معالجة البيانات، وتحويل البيانات الجدولية إلى صور، وزيادة البيانات باستخدام الشبكات التنافسية التوليدية الشرطية (cGANs). تم استخدام بنية CNN-LSTM للاستفادة من نقاط القوة في كلا النموذجين، حيث تقوم CNNs باستخراج الميزات المكانية وLSTMs بالتقاط الاعتماديات التسلسلية.

أظهرت النتائج التجريبية أن نموذج ResNet152-LSTM الهجين المقترح حقق دقة تبلغ 94% (AUC = 97.9%) على مجموعة بيانات PIMA المعززة، مما يشير إلى أداء قوي مقارنة بأساليب التعلم الآلي التقليدية. تم تقييم قابلية تعميم النموذج على مجموعة بيانات خارجية، حيث حافظ على أداء مستقر. زادت تقنيات زيادة البيانات من حجم مجموعة البيانات من 1,000 إلى 9,000 صورة، مما يسهل التدريب والتحقق الفعال. استخدمت الدراسة نماذج CNN المدربة مسبقًا مختلفة لاستخراج الميزات، وتم تطبيق تحسين بايزي لضبط معلمات LSTM. بشكل عام، تشير النتائج إلى أن الطريقة المقترحة تعزز بشكل فعال نمذجة التفاعلات بين الميزات المحلية والطويلة المدى مع تقليل الإفراط في التخصيص، على الرغم من أنه يجب تفسير النتائج على أنها أولية بسبب الطبيعة الاصطناعية لبيانات التدريب.

مناقشة

في قسم المناقشة من ورقة البحث، يتم مراجعة استراتيجيات التعلم الآلي المختلفة لتشخيص السكري وتصنيفه، مع تسليط الضوء على فعالية الخوارزميات المختلفة. ومن الجدير بالذكر أن مصنف الأشجار الإضافية أظهر معدلات دقة عالية تبلغ 89% و96% على مجموعتي بيانات PIMA وBRFSS، على التوالي. تؤكد الورقة على استخدام طرق التجميع، مثل دمج XGBoost وAdaBoost، التي حققت دقة تبلغ 90%. بالإضافة إلى ذلك، أظهرت تقنيات التعلم العميق، ولا سيما تلك التي تستخدم الشبكات العصبية التلافيفية (CNNs)، نتائج واعدة، حيث حقق نموذج التعلم العميق دقة مثيرة للإعجاب تبلغ 99.5% على مجموعة بيانات تمت معالجتها مسبقًا. تناقش الدراسة أيضًا تحويل البيانات الجدولية إلى تنسيقات صور للاستفادة من قدرات CNN، مع معالجة التحديات مثل عدم توازن الفئات من خلال تقنيات مثل SMOTE والشبكات التنافسية التوليدية الشرطية (cGANs) لزيادة البيانات.

يقترح المؤلفون منهجية جديدة تدمج تحويل البيانات الجدولية إلى صور مع تقنيات موازنة الفئات، مما يعزز قدرة النموذج على التقاط العلاقات المعقدة بين الميزات التي تعتبر حاسمة لتوقع السكري. من خلال استخدام بنية CNN-LSTM الهجينة، تهدف الدراسة إلى نمذجة الاعتماديات المكانية والتسلسلية في الميزات السريرية، مما يحسن الأداء التنبؤي. تؤكد الأبحاث أيضًا على أهمية التحقق الخارجي باستخدام مجموعات بيانات مستقلة، مما يعزز قابلية تعميم النتائج. بشكل عام، تسهم الدراسة في زيادة الأدبيات المتزايدة حول تطبيقات التعلم الآلي في توقع السكري، مع عرض أساليب مبتكرة لتحسين دقة النموذج وقابليته للتفسير في البيئات السريرية.

القيود

يظهر الإطار الهجين المقترح CNN-LSTM، المعزز بواسطة الشبكات التنافسية التوليدية الشرطية (cGAN)، قدرات تنبؤية قوية؛ ومع ذلك، فإنه يواجه عدة قيود. تثير الاعتماد على مجموعة بيانات السكري لسكان البيما، التي تعتبر صغيرة نسبيًا، مخاوف بشأن تمثيلها لمجموعات المرضى العالمية المتنوعة. على الرغم من معالجة عدم توازن الفئات من خلال تعزيز cGAN وتقنية زيادة العينات للأقليات الاصطناعية (SMOTE)، قد لا تعكس البيانات الاصطناعية التعقيدات التي تواجهها في السيناريوهات السريرية الواقعية. بالإضافة إلى ذلك، قد تتجاهل عملية التحويل من البيانات الجدولية إلى بيانات الصور العلاقات السريرية الهامة بين الميزات. قد لا يكون التحقق الخارجي الذي تم إجراؤه على مجموعة بيانات السكري في فرانكفورت، على الرغم من كونه مستقلًا، كافيًا لتقييم قابلية تعميم النموذج عبر بيئات الرعاية الصحية المختلفة.

سيركز العمل المستقبلي على عدة مجالات رئيسية لتعزيز قوة الإطار وملاءمته. تشمل هذه المجالات التحقق من النموذج على مجموعات بيانات أكبر ومتعددة الأنماط وأكثر تنوعًا، واستكشاف تقنيات تحويل البيانات البديلة، ودمج الذكاء الاصطناعي القابل للتفسير لتحسين قابلية التفسير للأطباء، والتحقيق في استراتيجيات نشر خفيفة مناسبة للبيئات ذات الموارد المحدودة. تهدف هذه المبادرات إلى تعزيز الفائدة الواقعية لأنظمة توقع السكري المدفوعة بالذكاء الاصطناعي.

Journal: Scientific Reports, Volume: 16, Issue: 1
DOI: https://doi.org/10.1038/s41598-026-38942-5
PMID: https://pubmed.ncbi.nlm.nih.gov/41667617
Publication Date: 2026-02-10
Author(s): K. Singh et al.
Primary Topic: Artificial Intelligence in Healthcare

Overview

The research addresses the significant public health challenge posed by diabetes, which can lead to severe complications if not detected early. The study proposes a novel deep learning framework that integrates tabular-to-image transformation, pre-trained Convolutional Neural Networks (CNNs), and Long Short-Term Memory (LSTM) networks to enhance diabetes prediction. Utilizing the Pima Indians Diabetes Dataset, the authors transformed numerical features into 2D image representations based on correlation patterns and feature importance. They employed Conditional Generative Adversarial Networks (cGANs) to generate synthetic training samples, achieving a classification accuracy of 94% and an AUC of 98% through a five-fold cross-validation approach. The model’s performance was validated on the Frankfurt Diabetes Dataset, indicating comparable results, although the limited sample size necessitates further studies to establish generalizability.

The findings suggest that the proposed framework effectively improves diabetes prediction by leveraging advanced data representation and feature learning techniques. Notably, the integration of ResNet152 with LSTM and feature fusion yielded the highest accuracy, underscoring the benefits of combining deep spatial and sequential learning approaches. The use of cGAN-augmented data outperformed traditional augmentation methods, highlighting the advantages of realistic synthetic data generation. While the results indicate promising potential for clinical application, the authors emphasize the need for further validation on larger and more diverse datasets to ensure reliable deployment in real-world healthcare settings.

Methods

In this study, a novel methodology was developed to transform a numerical dataset into image-like representations for use with convolutional neural networks (CNNs). The process involved grouping features based on correlation, refining their arrangement using t-SNE to maintain similarity relationships, and mapping standardized feature values to pixel intensities. This approach, which has shown promise in biomedical research, enhances the CNN’s ability to detect both local patterns and broader relationships within the data. The workflow included data preprocessing, tabular-to-image transformation, and data augmentation using conditional Generative Adversarial Networks (cGANs). The CNN-LSTM architecture was employed to leverage the strengths of both models, with CNNs extracting spatial features and LSTMs capturing sequential dependencies.

The experimental results demonstrated that the proposed hybrid ResNet152-LSTM model achieved an accuracy of 94% (AUC = 97.9%) on the augmented PIMA dataset, indicating strong performance compared to traditional machine learning methods. The model’s generalizability was assessed on an external dataset, where it maintained stable performance. Data augmentation techniques increased the dataset from 1,000 to 9,000 images, facilitating effective training and validation. The study utilized various pre-trained CNN models for feature extraction, and Bayesian optimization was applied to fine-tune the LSTM hyperparameters. Overall, the results suggest that the proposed method effectively models both local and long-range feature interactions while mitigating overfitting, although the findings should be interpreted as preliminary due to the synthetic nature of the training data.

Discussion

In the discussion section of the research paper, various machine learning strategies for diabetes diagnosis and classification are reviewed, highlighting the effectiveness of different algorithms. Notably, the Extra Trees classifier demonstrated high accuracy rates of 89% and 96% on the PIMA and BRFSS datasets, respectively. The paper emphasizes the use of ensemble methods, such as combining XGBoost and AdaBoost, which achieved an accuracy of 90%. Additionally, deep learning techniques, particularly those utilizing convolutional neural networks (CNNs), have shown promising results, with a deep learning model achieving an impressive 99.5% accuracy on a pre-processed dataset. The study also discusses the transformation of tabular data into image formats to leverage CNN capabilities, addressing challenges like class imbalance through techniques such as SMOTE and conditional Generative Adversarial Networks (cGANs) for data augmentation.

The authors propose a novel methodology that integrates tabular-to-image transformation with class-balancing techniques, enhancing the model’s ability to capture complex inter-feature relationships critical for diabetes prediction. By employing a hybrid CNN-LSTM architecture, the study aims to model both spatial and sequential dependencies in clinical features, improving predictive performance. The research further underscores the importance of external validation using independent datasets, thereby enhancing the generalizability of the findings. Overall, the study contributes to the growing body of literature on machine learning applications in diabetes prediction, showcasing innovative approaches to improve model accuracy and interpretability in clinical settings.

Limitations

The proposed hybrid CNN-LSTM framework, augmented by conditional Generative Adversarial Networks (cGAN), demonstrates strong predictive capabilities; however, it faces several limitations. The reliance on the Pima Indians Diabetes Dataset, which is relatively small, raises concerns about its representativeness of diverse global patient populations. Although class imbalance was addressed through cGAN augmentation and Synthetic Minority Over-sampling Technique (SMOTE), synthetic data may not fully encapsulate the complexities encountered in real-world clinical scenarios. Additionally, the transformation from tabular to image data may overlook significant clinical relationships among features. The external validation conducted on the Frankfurt Diabetes Dataset, while independent, may not adequately evaluate the model’s generalizability across different healthcare settings.

Future work will focus on several key areas to enhance the framework’s robustness and applicability. These include validating the model on larger, multi-modal, and more diverse datasets, exploring alternative data transformation techniques, incorporating explainable AI to improve interpretability for clinicians, and investigating lightweight deployment strategies suitable for resource-limited environments. These initiatives aim to strengthen the real-world utility of AI-driven diabetes prediction systems.