التعلم العميق للنقل وإطار الذكاء الاصطناعي القابل للتفسير لاكتشاف اضطراب طيف التوحد عبر مجموعات بيانات متعددة Deep transfer learning and explainable AI framework for autism spectrum disorder detection across multiple datasets

المجلة: Frontiers in Neurology، المجلد: 16
DOI: https://doi.org/10.3389/fneur.2025.1617446
PMID: https://pubmed.ncbi.nlm.nih.gov/41607838
تاريخ النشر: 2026-01-13
المؤلف: Shtwai Alsubai
الموضوع الرئيسي: البحث في اضطراب طيف التوحد

نظرة عامة

تقدم هذه الورقة البحثية نهج التعلم المنقول للكشف عن اضطراب طيف التوحد (ASD) باستخدام الشبكات العصبية العميقة (DNN) عبر ثلاثة مجموعات بيانات متميزة. أنشأت الدراسة خط أساس من خلال تدريب نماذج التعلم الآلي والتعلم العميق المختلفة على مجموعة بيانات فحص التوحد للأطفال الصغار من المملكة العربية السعودية، باستخدام تقنية زيادة العينة للأقليات الاصطناعية (SMOTE) للتخفيف من عدم توازن الفئات. تضمنت بنية DNN طبقات تنظيم وطبقات إسقاط، وتم تقييم أدائها مقارنةً بنماذج أخرى، مثل LSTM وAttention LSTM. أظهرت النتائج أن DNN تفوقت على هذه النماذج، مع تعزيز التعلم المنقول للأداء حتى مع بيانات تدريب محدودة. قدمت تقنيات الذكاء الاصطناعي القابلة للتفسير رؤى قيمة حول الميزات الرئيسية ذات الصلة بتصنيف ASD عبر مجموعات سكانية متنوعة.

تشير النتائج إلى أن التعلم المنقول فعال لتصنيف ASD عبر مجموعات البيانات، كاشفة عن مؤشرات سلوكية شائعة على الرغم من الاختلافات الديموغرافية. ومع ذلك، كان أداء النموذج متغيرًا عبر مجموعات البيانات، حيث انخفض بشكل خاص في مجموعة البيانات 3 بسبب عدم توازن الفئات. بينما يظهر النهج وعدًا للكشف المبكر، فإنه يعتمد على البيانات المبلغ عنها ذاتيًا من الأوصياء، مما قد يقدم تحيزًا ويفتقر إلى التحقق السريري. يهدف العمل المستقبلي إلى معالجة هذه القيود من خلال التعاون مع الأطباء، واختبارها على مجموعات بيانات متعددة المراكز أكبر، ودمج التصوير العصبي المتعدد الوسائط والأجهزة القابلة للارتداء. بالإضافة إلى ذلك، تقترح الدراسة تعزيز قابلية تفسير النموذج من خلال واجهات الذكاء الاصطناعي القابلة للتفسير وتقنيات التعلم التكيفية لتحسين دقة التشخيص وتسهيل التدخل المبكر.

مقدمة

تناقش مقدمة هذه الورقة البحثية اضطراب طيف التوحد (ASD) كحالة عصبية تنموية متعددة الأوجه تتميز بتحديات في التواصل الاجتماعي وسلوكيات متكررة. تتطلب تعقيدات أعراض ASD إطارًا تشخيصيًا دقيقًا، يعتمد حاليًا على التقييمات الذاتية من قبل الأطباء المدربين، مما يؤدي إلى تباين في نتائج التشخيص. تبرز الورقة إمكانيات تقنيات التعلم الآلي (ML) والتعلم العميق (DL) لتعزيز تشخيص ASD من خلال تحديد أنماط دقيقة في أنواع بيانات متنوعة، بما في ذلك المعلومات السلوكية، والتصوير العصبي، والمعلومات الجينية.

يبني المؤلفون على الدراسات السابقة التي استخدمت بنجاح منهجيات ML وDL المختلفة، مثل الشبكات العصبية الالتفافية (CNNs) والتعلم المنقول، لتحسين دقة التصنيف في الكشف عن ASD. ومن الجدير بالذكر أنهم يشيرون إلى نماذج تحقق معدلات كشف عالية، مثل نموذج ASDDTLA بمعدل نجاح 87.7% ونهج قائم على CNN حقق دقة 81.56% باستخدام بيانات تصوير الدماغ. تؤكد الورقة على أهمية دمج مصادر البيانات المتعددة الوسائط، بما في ذلك تتبع العين والتعبيرات الوجهية، لتعزيز دقة التشخيص وقابلية التفسير. يهدف البحث المقترح إلى تطوير نموذج تعلم منقول لا يحسن فقط التعميم عبر مجموعات ASD المختلفة، بل يدمج أيضًا تقنيات الذكاء الاصطناعي القابلة للتفسير لتوفير رؤى قابلة للتفسير حول عملية اتخاذ القرار للنموذج، مما يعالج فجوة كبيرة في الدراسات الحالية لتصنيف ASD.

الطرق

في قسم الطرق، يوضح المؤلفون الإطار التجريبي المستخدم لتقييم أداء مختلف مصنفات التعلم العميق (DL) عبر مجموعات بيانات متعددة. تم تقييم فعالية كل نموذج باستخدام مقاييس الأداء الرئيسية، بما في ذلك الدقة، والدقة، والاسترجاع، ودرجة F1. لتعزيز تعميم النموذج، دمجت الدراسة تقنيات التعلم المنقول. بالإضافة إلى ذلك، تم استخدام طرق الذكاء الاصطناعي القابلة للتفسير (XAI) لتوفير رؤى حول عمليات اتخاذ القرار للنماذج، مما يسهل فهم أفضل لمخرجاتها.

المناقشة

في هذه الدراسة، تم استخدام ثلاث مجموعات بيانات متميزة تتعلق باضطراب طيف التوحد (ASD) لتطوير إطار عمل للتعلم المنقول لتصنيف ASD. تضمنت مجموعة البيانات الأولى معلومات فحص من أطفال سعوديين تتراوح أعمارهم بين 12-36 شهرًا، تم جمعها عبر استبيان Q-CHAT-10 المترجم إلى العربية. تضمنت هذه المجموعة أسئلة فحص سلوكية وبيانات ديموغرافية، مما سمح بتحديد سمات ASD المحتملة بناءً على نظام تسجيل. تضمنت مجموعة البيانات الثانية، مجموعة بيانات SciDB، بيانات أعراض من أطفال تتراوح أعمارهم بين 3-15 عامًا باستخدام قائمة التحقق المعدلة للتوحد في الأطفال الصغار، المعدلة (M-CHAT-R)، التي تم جمعها من خلال منظمات مختلفة في مصر. كانت هذه المجموعة تهدف إلى دعم تطبيقات التعلم الآلي للسكان الناطقين بالعربية وواجهت تحديات بسبب عدم اتساق التسميات والميزات المتغيرة. تضمنت مجموعة البيانات الثالثة، المستمدة من مسابقة Kaggle، بيانات سريرية وديموغرافية من البالغين، مما زاد من تعقيد التحليل مع خلفيات المرضى المتنوعة.

تضمنت المنهجية المقترحة معالجة مجموعات البيانات لمعالجة القضايا الشائعة في بيانات الرعاية الصحية، مثل القيم المفقودة وعدم توازن الفئات، باستخدام تقنيات مثل الاستيفاء القائم على الوسيط، والترميز الأحادي، وتقنية زيادة العينة للأقليات الاصطناعية (SMOTE). تم إجراء محاذاة الميزات لضمان التوافق عبر مجموعات البيانات غير المتجانسة، مع التركيز على القابلية الإحصائية للنقل بدلاً من المعادلة السريرية المباشرة. تم استخدام بنية شبكة عصبية عميقة (DNN)، مما أظهر أداءً متفوقًا في تصنيف سمات ASD مقارنةً بنماذج LSTM وAttention-LSTM. حققت DNN دقة تحقق قصوى تبلغ 0.9908، مما يدل على فعاليتها في المهمة. عزز دمج تقنيات الذكاء الاصطناعي القابلة للتفسير، مثل LIME، من قابلية تفسير النموذج، مما يسمح للأطباء بفهم تأثير الميزات المختلفة على التنبؤات، مما يساهم في سد الفجوة بين الرؤى المستندة إلى البيانات والخبرة السريرية.

القيود

الإطار المقترح للكشف عن اضطراب طيف التوحد (ASD) من خلال التعلم العميق المنقول والذكاء الاصطناعي القابل للتفسير مبني على عدة افتراضات رئيسية تهدف إلى ضمان الاتساق المنهجي وقابلية النقل عبر مجموعات بيانات مختلفة، بما في ذلك تلك الخاصة بأطفال سعوديين، وSCDB، وذوي التوحد من البالغين. يفترض أن هذه المجموعات تظهر ميزات مشابهة بعد الخضوع لمعالجة قياسية، وأن خصائص EEG والسلوك تعمل كعلامات موثوقة للاختلافات العصبية والمعرفية المتعلقة بـ ASD. يُفترض أن تنفيذ SMOTE لتوازن البيانات الاصطناعية يقدم تحيزًا ضئيلًا، وقد تم تصميم الإطار للاستفادة من أنماط الميزات الكامنة المشتركة لتحقيق تعلم منقول فعال عبر مجموعات البيانات.

على الرغم من قدراته الواعدة، فإن الإطار له قيود ملحوظة. قد يتراجع أداؤه عند تطبيقه على مجموعات بيانات غير مرئية تختلف في ظروف الاكتساب، مما قد يتطلب إعادة تدريب أو ضبط. بالإضافة إلى ذلك، قد لا يعكس الاعتماد على التوازن الاصطناعي من خلال SMOTE التوزيع الطبيعي للبيانات بدقة. يركز النموذج الحالي فقط على ميزات EEG والسلوك، مما يشير إلى الحاجة لاستكشاف دمج البيانات المتعددة الوسائط في المستقبل، مثل بيانات التصوير بالرنين المغناطيسي، أو البيانات الجينية، أو بيانات الكلام. هذه القيود، بينما تقدم تحديات، تبرز أيضًا الفرص لمزيد من التطوير وتعزيز قابلية تطبيق الإطار في تحليل ASD.

Journal: Frontiers in Neurology, Volume: 16
DOI: https://doi.org/10.3389/fneur.2025.1617446
PMID: https://pubmed.ncbi.nlm.nih.gov/41607838
Publication Date: 2026-01-13
Author(s): Shtwai Alsubai
Primary Topic: Autism Spectrum Disorder Research

Overview

This research paper introduces a transfer learning approach for detecting Autism Spectrum Disorder (ASD) using Deep Neural Networks (DNN) across three distinct datasets. The study established a baseline by training various machine learning and deep learning models on a toddler ASD screening dataset from Saudi Arabia, employing the Synthetic Minority Oversampling Technique (SMOTE) to mitigate class imbalance. The DNN architecture incorporated regularization and dropout layers, and its performance was evaluated against other models, such as LSTM and Attention LSTM. Results indicated that the DNN outperformed these models, with transfer learning enhancing performance even with limited training data. Explainable AI techniques provided valuable insights into key features relevant for ASD classification across diverse populations.

The findings suggest that transfer learning is effective for cross-dataset ASD classification, revealing common behavioral indicators despite demographic variations. However, the model’s performance varied across datasets, particularly declining on Dataset 3 due to class imbalance. While the approach shows promise for early detection, it relies on self-reported data from guardians, which may introduce bias and lacks clinical validation. Future work aims to address these limitations by collaborating with clinicians, testing on larger multicenter datasets, and integrating multimodal neuroimaging and wearable devices. Additionally, the study proposes enhancing model interpretability through explainable AI interfaces and adaptive learning techniques to improve diagnostic accuracy and facilitate early intervention.

Introduction

The introduction of this research paper discusses Autism Spectrum Disorder (ASD) as a multifaceted neurodevelopmental condition characterized by social communication challenges and repetitive behaviors. The complexity of ASD symptoms necessitates a nuanced diagnostic framework, which is currently reliant on subjective assessments by trained clinicians, leading to variability in diagnostic outcomes. The paper highlights the potential of machine learning (ML) and deep learning (DL) techniques to enhance ASD diagnosis by identifying subtle patterns in diverse data types, including behavioral, neuroimaging, and genetic information.

The authors build on previous studies that have successfully employed various ML and DL methodologies, such as convolutional neural networks (CNNs) and transfer learning, to improve classification accuracy in ASD detection. Notably, they reference models achieving high detection rates, such as the ASDDTLA model with an 87.7% success rate and a CNN-based approach that attained 81.56% accuracy using brain imaging data. The paper emphasizes the importance of integrating multimodal data sources, including eye-tracking and facial expressions, to enhance diagnostic precision and interpretability. The proposed research aims to advance the field by developing a transfer learning model that not only improves generalization across different ASD populations but also incorporates explainable AI techniques to provide interpretable insights into the model’s decision-making process, addressing a significant gap in existing ASD classification studies.

Methods

In the Methods section, the authors detail the experimental framework used to evaluate the performance of various deep learning (DL) classifiers across multiple datasets. The effectiveness of each model was assessed using key performance metrics, including accuracy, precision, recall, and F1-score. To enhance model generalization, the study incorporated transfer learning techniques. Additionally, Explainable AI (XAI) methods were utilized to provide insights into the decision-making processes of the models, thereby facilitating a better understanding of their outputs.

Discussion

In this study, three distinct datasets related to Autism Spectrum Disorder (ASD) were utilized to develop a transfer learning framework for ASD classification. The first dataset comprised screening information from Saudi Arabian toddlers aged 12-36 months, collected via an Arabic-translated Q-CHAT-10 questionnaire. This dataset included behavioral screening questions and demographic data, allowing for the identification of potential ASD traits based on a scoring system. The second dataset, the SciDB dataset, included symptom data from children aged 3-15 years using the Modified Checklist for Autism in Toddlers, Revised (M-CHAT-R), collected through various organizations in Egypt. This dataset aimed to support machine learning applications for Arabic-speaking populations and presented challenges due to label inconsistencies and variable features. The third dataset, sourced from a Kaggle competition, contained clinical and demographic data from adults, enhancing the complexity of the analysis with diverse patient backgrounds.

The proposed methodology involved preprocessing the datasets to address common healthcare data issues, such as missing values and class imbalance, using techniques like median-based imputation, one-hot encoding, and the Synthetic Minority Over-sampling Technique (SMOTE). Feature alignment was performed to ensure compatibility across the heterogeneous datasets, focusing on statistical transferability rather than direct clinical equivalence. A Deep Neural Network (DNN) architecture was employed, demonstrating superior performance in classifying ASD traits compared to LSTM and Attention-LSTM models. The DNN achieved a maximum validation accuracy of 0.9908, indicating its effectiveness for the task. The integration of explainable AI techniques, such as LIME, further enhanced the interpretability of the model, allowing clinicians to understand the influence of various features on predictions, thereby bridging the gap between data-driven insights and clinical expertise.

Limitations

The proposed framework for detecting Autism Spectrum Disorder (ASD) through Deep Transfer Learning and Explainable AI is built on several key assumptions aimed at ensuring methodological consistency and transferability across various datasets, including those from Saudi Arabia Toddlers, SCDB, and Autism Adults. It assumes that these datasets exhibit similar features after undergoing standardized preprocessing, and that EEG and behavioral characteristics serve as reliable markers for ASD-related neural and cognitive differences. The implementation of SMOTE for synthetic data balancing is presumed to introduce minimal bias, and the framework is designed to leverage common latent feature patterns for effective transfer learning across datasets.

Despite its promising capabilities, the framework has notable limitations. Its performance may decline when applied to unseen datasets that differ in acquisition conditions, potentially requiring retraining or fine-tuning. Additionally, the reliance on synthetic balancing through SMOTE may not accurately capture the natural distribution of data. The current model focuses solely on EEG and behavioral features, suggesting a need for future exploration of multimodal data integration, such as MRI, genetic, or speech data. These limitations, while presenting challenges, also highlight opportunities for further development and enhancement of the framework’s applicability in ASD analysis.