إطار عمل قابل للتفسير وآمن لتوقع التوحد باستخدام تتبع العين متعدد الأنماط وبيانات الحركة Explainable and secure framework for autism prediction using multimodal eye tracking and kinematic data

المجلة: Complex & Intelligent Systems، المجلد: 11، العدد: 3
DOI: https://doi.org/10.1007/s40747-025-01790-3
تاريخ النشر: 2025-02-19
المؤلف: Ahmad Almadhor وآخرون
الموضوع الرئيسي: البحث في اضطراب طيف التوحد

نظرة عامة

تقدم ورقة البحث إطارًا لتشخيص اضطراب طيف التوحد (ASD) ي prioritizes خصوصية البيانات وأمانها من خلال استخدام التعلم الفيدرالي (FL). يدمج هذا النهج بيانات حركية متعددة الأبعاد وبيانات حركة العين مع الشبكات العصبية العميقة (DNN) والذكاء الاصطناعي القابل للتفسير (XAI). من خلال استخدام FL، يسمح الإطار بتدريب النماذج عبر أجهزة متعددة دون مركزية البيانات الحساسة، مما يضمن الخصوصية. شملت التقييمات تجارب باستخدام DNN ضحلة ومتوسط الفيدرالي (FedAvg) عبر سيناريوهين: أحدهما يستخدم جميع الميزات والآخر يستخدم الميزات المختارة بواسطة XAI. أظهرت النتائج أن مجموعة بيانات L_General حققت أعلى دقة، حيث حقق العميل 2 دقة 99.99%.

في الخاتمة، تؤكد الدراسة على فعالية FL في تشخيص ASD مع الحفاظ على دقة تشخيص عالية وحماية البيانات الصحية الحساسة. تسلط الضوء على اختلافات الأداء بين استخدام جميع الميزات مقابل الميزات المختارة، مشيرة إلى أنه بينما يمكن أن تعزز اختيار الميزات أداء النموذج في بعض الحالات، فإن نهج جميع الميزات أثبت تفوقه لمجموعات بيانات معينة. تشير النتائج إلى أن FL لا يحافظ فقط على الخصوصية ولكن أيضًا يعزز أمان تطبيقات الرعاية الصحية، مما يجعله حلاً واعدًا لإدارة البيانات الطبية الحساسة وسط عدد كبير محتمل من العملاء.

مقدمة

تناقش مقدمة ورقة البحث اضطراب طيف التوحد (ASD)، وهو حالة عصبية معقدة تتميز بصعوبات في التفاعل الاجتماعي، والتواصل، ووجود سلوكيات متكررة. يؤثر على حوالي 1 من كل 54 طفلًا في الولايات المتحدة، مع انتشار أعلى بين الأولاد، تظهر أعراض ASD عادة في مرحلة الطفولة المبكرة وتشمل تحديات في تطوير العلاقات، والتواصل غير اللفظي، والحساسيات الحسية. يعتمد التشخيص على التقييمات السلوكية بدلاً من الاختبارات الطبية المحددة، وعلى الرغم من أن الأسباب الدقيقة لـ ASD ليست مفهومة تمامًا، يُعتقد أنها تشمل مزيجًا من العوامل الوراثية والبيئية.

تسلط الورقة الضوء على التطبيق المتزايد لتقنيات التعلم الآلي (ML) في أبحاث ASD، مع التركيز على كيفية تحليل الخوارزميات مثل الشبكات العصبية، وآلات الدعم الناقل (SVMs)، وأشجار القرار لمختلف أنواع البيانات لتعزيز دقة التشخيص وتخصيص العلاج. الابتكار الرئيسي الذي تم مناقشته هو التعلم الفيدرالي (FL)، وهو نهج تعلم آلي لامركزي يسمح بتدريب النماذج عبر أجهزة متعددة مع الحفاظ على البيانات الحساسة محلية. هذه الطريقة ذات صلة خاصة في الرعاية الصحية، حيث تحافظ على سرية المرضى بينما تمكن من تطوير أدوات دقيقة لاكتشاف ASD. تؤكد الدراسة على أن FL يسهل تدريب نماذج التشخيص باستخدام مجموعات بيانات موزعة، مما يضمن الخصوصية مع الاستفادة من بيانات متنوعة لتحسين النتائج في تشخيص ASD.

الطرق

في هذا القسم، يصف المؤلفون المنهجية التجريبية المستخدمة لتحليل البيانات الحركية وبيانات حركة العين. شملت الدراسة حالتين تجريبيتين لكل مجموعة بيانات: الأولى استخدمت قائمة ميزات (FL) مع جميع الميزات المتاحة، بينما استخدمت الثانية FL مع ميزات الذكاء الاصطناعي القابل للتفسير (XAI)، باستخدام نموذج XGBoost بشكل محدد. بالنسبة للحالتين، تم تنفيذ نموذجين محليين من الشبكات العصبية العميقة (DNN)، مع إجراء التجارب عبر ثلاثة عملاء وثلاث جولات تدريب.

لتقييم فعالية الإطار المقترح، ركز المؤلفون على أداء النماذج في تصنيف البيانات غير المرئية. استخدموا عدة مقاييس رئيسية، بما في ذلك الدقة، والدقة، والاسترجاع، ودرجة F1، لتقييم قدرات تعميم النماذج. توفر كل مقياس رؤى فريدة حول أداء النماذج، وهو أمر ضروري لعلماء البيانات والمحللين لاتخاذ قرارات مستنيرة بشأن قابلية تطبيق النماذج في السيناريوهات العملية.

النتائج

تشير نتائج تحليل الأداء لنموذج محلي 1 إلى أن استخدام جميع الميزات بشكل عام يؤدي إلى نتائج أفضل من الميزات المختارة المستمدة من الذكاء الاصطناعي القابل للتفسير (XAI). بالنسبة للعميل 1، تراوحت الدقة من 0.22 إلى 0.33، مع درجة F1 تبلغ 0.25 في كل من K_Attention و L_Attention عند استخدام جميع الميزات. من الجدير بالذكر أن تطبيق الميزات المختارة حسّن الأداء في K_Attention، محققًا دقة تبلغ 0.55 ودرجة F1 تبلغ 0.55. أظهر العميل 2 أعلى أداء بدقة 0.88 ودرجة F1 تبلغ 0.86 باستخدام جميع الميزات في مجموعة بيانات L_General. في المقابل، أظهر العميل 3 تباينًا كبيرًا، حيث حقق K_Attention دقة تبلغ 0.67 باستخدام جميع الميزات، بينما أدت الميزات المختارة إلى درجات F1 أقل.

بالنسبة لنموذج محلي 2، كشفت التحليلات أن العميل 2 حقق أداء تصنيف مثالي بدقة ودرجة F1 تبلغ 1.00 في L_General عند استخدام جميع الميزات. ومع ذلك، كان الأداء مع الميزات المختارة أقل قليلاً، مع دقة قصوى تبلغ 0.88 في K_Attention. حافظ العميل 3 على أداء قوي عبر مجموعتي الميزات، محققًا دقة تبلغ 0.88 ودرجة F1 تبلغ 0.88 في L_General مع جميع الميزات. توضح مصفوفات الارتباك لكلا النموذجين قدرات التصنيف، حيث أظهر النموذج 2 نتائج مثالية. بشكل عام، تشير النتائج إلى أنه بينما يمكن أن تعزز الميزات المختارة الأداء في سياقات معينة، فإن الاستخدام الشامل لجميع الميزات غالبًا ما يؤدي إلى نتائج متفوقة، خاصة لمجموعات البيانات ذات تنوع الميزات الأكبر.

المناقشة

تؤكد قسم المناقشة في ورقة البحث هذه على استكشاف التعلم الفيدرالي (FL) كطريقة لتشخيص اضطراب طيف التوحد (ASD) باستخدام الشبكات العصبية العميقة (DNNs) مع إعطاء الأولوية للأمان والخصوصية. تجري الدراسة تجارب عبر سيناريوهين لكل مجموعة بيانات: أحدهما يستخدم FL مع جميع الميزات والآخر يستخدم FL مع الميزات المختارة من خلال تقنيات الذكاء الاصطناعي القابل للتفسير (XAI). تشير النتائج إلى أن نماذج DNN حققت دقة ملحوظة، حيث حقق العميل 2 دقة مثالية تبلغ 99.99% ودرجة F1 تبلغ 99.99% على مجموعة بيانات L_General، بينما حقق العميل 3 دقة تبلغ 88% ودرجة F1 تبلغ 88%. تسلط هذه النتائج الضوء على إمكانية FL لتعزيز الأداء التشخيصي مع حماية بيانات المرضى الحساسة.

تستعرض الورقة أيضًا الأعمال ذات الصلة، مشيرة إلى منهجيات مختلفة تم استخدامها في مجال تشخيص ASD، بما في ذلك استخدام مقاييس الشبكات المعقدة وتقنيات التعلم الآلي. تؤكد على أهمية دمج XAI لتحسين قابلية تفسير النموذج، وهو أمر حاسم لاتخاذ القرارات السريرية. يضمن الإطار المقترح ليس فقط خصوصية البيانات من خلال الحفاظ على المعلومات الحساسة محلية، ولكنه أيضًا يستخدم تقنيات الخصوصية التفاضلية لحماية النقاط الفردية من البيانات بشكل أكبر. تختتم الدراسة بأن FL يمثل نهجًا واعدًا لتشخيص ASD بشكل آمن وفعال، مما يمهد الطريق لأبحاث مستقبلية يمكن أن تتضمن تقنيات متقدمة، مثل الشبكات العصبية الفيزيائية، لتعزيز كفاءة وأمان تطبيقات الرعاية الصحية.

Journal: Complex & Intelligent Systems, Volume: 11, Issue: 3
DOI: https://doi.org/10.1007/s40747-025-01790-3
Publication Date: 2025-02-19
Author(s): Ahmad Almadhor et al.
Primary Topic: Autism Spectrum Disorder Research

Overview

The research paper presents a framework for diagnosing Autism Spectrum Disorder (ASD) that prioritizes data privacy and security through the use of Federated Learning (FL). This approach integrates multimodal kinematic and eye movement data with Deep Neural Networks (DNN) and Explainable Artificial Intelligence (XAI). By employing FL, the framework allows for model training across multiple devices without centralizing sensitive data, thus ensuring privacy. The evaluation involved experiments using a shallow DNN and Federated Averaging (FedAvg) across two scenarios: one utilizing all features and the other employing features selected by XAI. Results indicated that the L_General dataset yielded the highest accuracy, with Client 2 achieving 99.99% accuracy.

In the conclusion, the study emphasizes the effectiveness of FL in diagnosing ASD while maintaining high diagnostic accuracy and safeguarding sensitive health data. It highlights the performance variations between using all features versus selected features, noting that while feature selection can enhance model performance in certain cases, the all-features approach proved superior for specific datasets. The findings suggest that FL not only preserves privacy but also enhances the security of healthcare applications, making it a promising solution for managing sensitive medical data amidst a potentially large number of clients.

Introduction

The introduction of the research paper discusses Autism Spectrum Disorder (ASD), a complex neurodevelopmental condition characterized by difficulties in social interaction, communication, and the presence of repetitive behaviors. Affecting approximately 1 in 54 children in the U.S., with a higher prevalence in boys, ASD symptoms typically emerge in early childhood and include challenges in relationship development, nonverbal communication, and sensory sensitivities. Diagnosis relies on behavioral assessments rather than specific medical tests, and while the exact causes of ASD are not fully understood, they are thought to involve a combination of genetic and environmental factors.

The paper highlights the growing application of machine learning (ML) techniques in ASD research, focusing on how algorithms like neural networks, support vector machines (SVMs), and decision trees can analyze various data types to enhance diagnostic accuracy and treatment personalization. A key innovation discussed is Federated Learning (FL), a decentralized ML approach that allows model training across multiple devices while keeping sensitive data local. This method is particularly relevant in healthcare, as it preserves patient confidentiality while enabling the development of accurate ASD detection tools. The study emphasizes that FL facilitates the training of diagnostic models using distributed datasets, ensuring privacy while leveraging diverse data for improved outcomes in ASD diagnosis.

Methods

In this section, the authors describe the experimental methodology employed to analyze kinematic and eye movement data. The study involved two experimental cases for each dataset: the first utilized a feature list (FL) with all available features, while the second employed FL with explainable artificial intelligence (XAI) features, specifically using an XGBoost model. For both cases, two deep neural network (DNN) local models were implemented, with experiments conducted across three clients and three training rounds.

To evaluate the effectiveness of the proposed framework, the authors focused on the models’ performance in classifying unseen data. They employed several key metrics, including accuracy, precision, recall, and F1 score, to assess the models’ generalization capabilities. Each metric provides unique insights into the models’ performance, which is essential for data scientists and analysts to make informed decisions regarding the applicability of the models in practical scenarios.

Results

The results of the performance analysis for Local Model 1 indicate that using all features generally yields better outcomes than selected features derived from Explainable Artificial Intelligence (XAI). For Client 1, accuracy ranged from 0.22 to 0.33, with an F1-score of 0.25 in both K_Attention and L_Attention when all features were used. Notably, applying selected features improved performance in K_Attention, achieving an accuracy of 0.55 and an F1-score of 0.55. Client 2 exhibited the highest performance with an accuracy of 0.88 and an F1-score of 0.86 using all features in the L_General dataset. In contrast, Client 3 showed significant variability, with K_Attention achieving an accuracy of 0.67 using all features, while selected features resulted in lower F1-scores.

For Local Model 2, the analysis revealed that Client 2 achieved perfect classification performance with an accuracy and F1-score of 1.00 in L_General when using all features. However, the performance with selected features was slightly lower, with a maximum accuracy of 0.88 in K_Attention. Client 3 maintained robust performance across both feature sets, achieving an accuracy of 0.88 and an F1-score of 0.88 in L_General with all features. The confusion matrices for both models illustrated the classification capabilities, with Model 2 demonstrating flawless results. Overall, the findings suggest that while selected features can enhance performance in certain contexts, the comprehensive use of all features often leads to superior results, particularly for datasets with greater feature diversity.

Discussion

The discussion section of this research paper emphasizes the exploration of federated learning (FL) as a method for diagnosing Autism Spectrum Disorder (ASD) using deep neural networks (DNNs) while prioritizing security and privacy. The study conducts experiments across two scenarios for each dataset: one employing FL with all features and the other utilizing FL with features selected through explainable artificial intelligence (XAI) techniques. The results indicate that the DNN models achieved remarkable accuracies, with Client 2 reaching a perfect accuracy of 99.99% and an F1-score of 99.99% on the L_General dataset, while Client 3 achieved an accuracy of 88% and an F1-score of 88%. These findings highlight the potential of FL to enhance diagnostic performance while safeguarding sensitive patient data.

The paper also reviews related work, noting various methodologies that have been employed in the field of ASD diagnosis, including the use of complex network measures and machine learning techniques. It underscores the importance of integrating XAI to improve model interpretability, which is crucial for clinical decision-making. The proposed framework not only ensures data privacy by keeping sensitive information localized but also employs differential privacy techniques to further protect individual data points. The study concludes that FL represents a promising approach for secure and effective ASD diagnosis, paving the way for future research that could incorporate advanced technologies, such as Physical Neural Networks, to enhance the efficiency and security of healthcare applications.