استكشاف تطبيق الذكاء الاصطناعي في تعليم الأطفال المصابين بالتوحد: منظور الصحة العامة Exploring the application of AI in the education of children with autism: a public health perspective

المجلة: Frontiers in Psychiatry، المجلد: 15
DOI: https://doi.org/10.3389/fpsyt.2024.1521926
PMID: https://pubmed.ncbi.nlm.nih.gov/39935626
تاريخ النشر: 2025-01-28
المؤلف: Lan Liu وآخرون
الموضوع الرئيسي: البحث في اضطراب طيف التوحد

نظرة عامة

تقدم البحث نموذج المحول المدفوع بالصحة العامة (PHDT)، وهو إطار مبتكر يهدف إلى تعزيز المهارات الاجتماعية لدى الأطفال المصابين باضطراب طيف التوحد (ASD). تواجه التدخلات التقليدية، مثل تحليل السلوك التطبيقي (ABA) وتدريب المهارات الاجتماعية (SST)، على الرغم من فعاليتها، قيودًا في الوصول والتكيف. يستفيد نموذج PHDT من تقنيات الذكاء الاصطناعي المتقدمة، وخاصة المنهجيات المعتمدة على المحولات، لإنشاء تدخلات مخصصة وحساسة للسياق. من خلال استخدام مدخلات بيانات متعددة الأنماط – بما في ذلك النصوص والصوت والإشارات الوجهية – يوفر النموذج تعليقات فورية وتفسيرًا للسياقات الاجتماعية، مما يحسن بشكل كبير من المشاركة واكتساب المهارات.

تشير النتائج التجريبية إلى أن نموذج PHDT يتفوق على الطرق التقليدية، محققًا نتائج متقدمة على مجموعات بيانات SST-5 وReDial، متجاوزًا النماذج الرائدة مثل RoBERTa وBERT بفارق 2.15% و3.79% في الدقة، على التوالي. أظهرت التقييمات العملية مع مجموعة من 30 طفلًا زيادة بنسبة 23.4% في التعرف على الإشارات الاجتماعية وتقليل زمن الاستجابة بنسبة 15.7% على مدار ثمانية أسابيع. تشير النتائج إلى أن PHDT لا يكمل فقط التدخلات الحالية، بل يقدم أيضًا حلاً قابلاً للتوسع لتعزيز التنمية الاجتماعية في بيئات متنوعة. ستركز الأعمال المستقبلية على النشر الفوري في البيئات السريرية ودمج مدخلات حسية إضافية لتعزيز قابلية تكيف النموذج وفعاليته، مما يعزز دوره في المبادرات الصحية العامة التي تهدف إلى تحسين الوصول والفعالية في تدخلات ASD.

مقدمة

تناقش مقدمة ورقة البحث الأهمية المتزايدة لتعزيز المهارات الاجتماعية لدى الأطفال المصابين باضطراب طيف التوحد (ASD)، مع تسليط الضوء على التأثير الكبير الذي تحدثه هذه المهارات على تطورهم المعرفي والعاطفي والسلوكي. تواجه التدخلات التقليدية، مثل تحليل السلوك التطبيقي (ABA) وتدريب المهارات الاجتماعية (SST)، على الرغم من فعاليتها، تحديات في قابلية التوسع والوصول بسبب طبيعتها المكثفة. تفترض الورقة أن التقدم في الذكاء الاصطناعي (AI)، وخاصة من خلال نماذج التعلم العميق المعتمدة على المحولات، يقدم فرصًا جديدة لتدريب المهارات الاجتماعية القابلة للتوسع والتكيف، مما قد يصل إلى شريحة سكانية أوسع من خلال المنصات الرقمية.

ينتقد المؤلفون التطبيقات السابقة للذكاء الاصطناعي في تعزيز المهارات الاجتماعية، والتي اعتمدت على الذكاء الاصطناعي الرمزي والأنظمة القائمة على القواعد التي تفتقر إلى المرونة للتكيف مع تعقيدات التفاعلات الاجتماعية في الحياة الواقعية. يشيرون إلى أنه بينما حسنت أساليب التعلم الآلي من قابلية التكيف من خلال التعرف على الأنماط في السلوكيات الاجتماعية، كانت محدودة بسبب الحاجة إلى مجموعات بيانات موسومة بشكل موسع وموارد حسابية. يمثل ظهور التعلم العميق، وخاصة هياكل المحولات، تقدمًا كبيرًا، حيث يمكن لهذه النماذج معالجة الإشارات الاجتماعية المعقدة والمعلومات السياقية بشكل فعال. يهدف نموذج المحول المدفوع بالصحة العامة (PHDT) المقترح إلى معالجة قيود الطرق السابقة من خلال تقديم حل قابل للتوسع ومخصص لتعزيز المهارات الاجتماعية لدى الأطفال المصابين باضطراب طيف التوحد (ASD)، مع دمج وحدة جديدة تعتمد على الانتباه وآلية تعديل حجم الدفعة الديناميكية لتحسين الأداء والوصول في سياقات الصحة العامة.

طرق

استخدم الإعداد التجريبي إطارًا حسابيًا عالي الأداء مع وحدات معالجة الرسوميات NVIDIA A100، حيث تم تنفيذ النماذج في PyTorch وتحسينها باستخدام مُحسِّن آدم. تضمنت عملية التدريب معدل تعلم أولي قدره $1 \times 10^{-4}$، يتناقص بعامل 0.5 كل 10 دورات، مع حجم دفعة قدره 64. تم تدريب كل نموذج لمدة 50 دورة مع إيقاف مبكر بناءً على خسارة التحقق لتعزيز القابلية للتعميم. شمل معالجة البيانات تقسيم النصوص باستخدام مُقسّم BERT المدرب مسبقًا لمجموعات بيانات النصوص (SST-5 وYelp) والحوار المنظم لمجموعة بيانات ReDial. بالنسبة لمجموعة بيانات DAiSEE، تم استخراج وإعادة تحجيم إطارات الفيديو لاستخراج الميزات باستخدام هيكل ResNet المدرب مسبقًا. تم استخدام هياكل نماذج متنوعة، بما في ذلك نموذج قائم على BERT لتصنيف المشاعر، وهيكل قائم على المحولات لـ ReDial، ونموذج من مرحلتين يجمع بين CNN وLSTM لـ DAiSEE.

تجاوزت الطريقة المقترحة النماذج المتقدمة (SOTA) عبر أربع مجموعات بيانات – SST-5 وReDial وYelp وDAiSEE – محققة درجات متفوقة في الدقة والاسترجاع ودرجة F1 وAUC. على سبيل المثال، حقق النموذج دقة 92.45% واسترجاع 90.62% على SST-5، متجاوزًا RoBERTa. في ReDial، أدت آلية الانتباه السياقي للنموذج إلى دقة 87.94%، بينما حقق على مجموعة بيانات Yelp دقة 94.56% ودرجة F1 قدرها 90.24%. أظهرت نتائج مجموعة بيانات DAiSEE دقة 87.98% وAUC قدره 88.34%، مما كان أفضل بكثير من BERT. تم عزو تحسينات الأداء إلى الهيكل المتقدم للنموذج وتقنيات معالجة البيانات الخاصة بالمجموعات، التي التقطت بفعالية المشاعر الدقيقة وحالات المشاركة، مما يؤكد مرونته في التعامل مع تحديات معالجة اللغة الطبيعية ورؤية الكمبيوتر المتنوعة.

مناقشة

تؤكد قسم المناقشة في ورقة البحث على أهمية الأساليب الصحية العامة في تدخلات التوحد، التي تهدف إلى توفير حلول قابلة للتوسع على مستوى المجتمع للتشخيص المبكر والتدخل. تعترف هذه الأساليب بالتوحد كأحد التحديات المجتمعية، داعيةً إلى استراتيجيات قائمة على السكان تعزز الوصول إلى أدوات الكشف المبكر، خاصةً للمجتمعات المحرومة. يتم تسليط الضوء على دمج التكنولوجيا، بما في ذلك الأدوات الرقمية والمدفوعة بالذكاء الاصطناعي، كوسيلة لتحسين الوصول إلى التدخلات، مع إشراك الأطر المجتمعية الأسر والمحترفين في معالجة الاحتياجات المتعلقة بالتوحد. يتم التأكيد على إمكانيات الذكاء الاصطناعي، وخاصة نماذج المحولات، لقدرتها على تحليل البيانات متعددة الأنماط وتقديم تدريب مخصص على المهارات الاجتماعية، متكيفًا مع تقدم الأطفال الفرديين ومعالجة الفجوات في الوصول إلى الرعاية.

تتناول القسم بمزيد من التفصيل تطبيق نماذج المحولات في تدريب المهارات الاجتماعية للأطفال المصابين باضطراب طيف التوحد (ASD). تتفوق هذه النماذج في معالجة بيانات التفاعل الاجتماعي المعقدة، مما يمكّن من تقديم تعليقات فورية وتدخلات مخصصة يمكن أن تتكيف ديناميكيًا بناءً على مشاركة الطفل وأدائه. يُلاحظ أن قابلية تكيف التدخلات المعززة بالذكاء الاصطناعي أمر حاسم لتعزيز المهارات الاجتماعية، حيث توفر الوكلاء الافتراضيون والروبوتات الاجتماعية بيئات آمنة للممارسة والتفاعل. تختتم الورقة بأن دمج الذكاء الاصطناعي في تدخلات التوحد لا يعزز فقط تجارب التعلم الفردية، بل يقدم أيضًا رؤى قيمة للمعالجين والمعلمين، مما يدعم في النهاية نهجًا أكثر فعالية وتخصيصًا لتطوير المهارات الاجتماعية لدى الأطفال المصابين باضطراب طيف التوحد.

Journal: Frontiers in Psychiatry, Volume: 15
DOI: https://doi.org/10.3389/fpsyt.2024.1521926
PMID: https://pubmed.ncbi.nlm.nih.gov/39935626
Publication Date: 2025-01-28
Author(s): Lan Liu et al.
Primary Topic: Autism Spectrum Disorder Research

Overview

The research introduces the Public Health-Driven Transformer (PHDT) model, an innovative framework aimed at enhancing social skills in children with Autism Spectrum Disorder (ASD). Traditional interventions, such as Applied Behavior Analysis (ABA) and Social Skills Training (SST), while effective, often face limitations in accessibility and adaptability. The PHDT model leverages advanced artificial intelligence techniques, particularly Transformer-based methodologies, to create personalized and context-sensitive interventions. By utilizing multi-modal data inputs—including text, audio, and facial cues—the model provides real-time feedback and interpretation of social contexts, significantly improving engagement and skill acquisition.

Experimental results indicate that the PHDT model outperforms traditional methods, achieving state-of-the-art results on the SST-5 and ReDial datasets, surpassing leading models like RoBERTa and BERT by margins of 2.15% and 3.79% in accuracy, respectively. Practical evaluations with a cohort of 30 children demonstrated a 23.4% increase in social cue recognition and a 15.7% reduction in response latency over eight weeks. The findings suggest that PHDT not only complements existing interventions but also offers a scalable solution for fostering social development in diverse environments. Future work will focus on real-time deployment in clinical settings and the integration of additional sensory inputs to enhance the model’s adaptability and effectiveness, thereby solidifying its role in public health initiatives aimed at improving accessibility and efficacy in ASD interventions.

Introduction

The introduction of the research paper discusses the growing importance of enhancing social skills in children with Autism Spectrum Disorder (ASD), highlighting the significant impact these skills have on their cognitive, emotional, and behavioral development. Traditional interventions, such as Applied Behavior Analysis (ABA) and Social Skills Training (SST), while effective, face challenges in scalability and accessibility due to their intensive nature. The paper posits that advancements in artificial intelligence (AI), particularly through Transformer-based deep learning models, present new opportunities for scalable and adaptive social skill training, potentially reaching a wider demographic through digital platforms.

The authors critique earlier AI applications in social skill enhancement, which relied on symbolic AI and rule-based systems that lacked the flexibility to adapt to the complexities of real-life social interactions. They note that while machine learning approaches improved adaptability by recognizing patterns in social behaviors, they were limited by the need for extensive labeled datasets and computational resources. The emergence of deep learning, especially Transformer architectures, marks a significant advancement, as these models can effectively process complex social cues and contextual information. The proposed Public Health-Driven Transformer (PHDT) aims to address the limitations of previous methods by offering a scalable, personalized solution for social skill enhancement in children with ASD, incorporating a novel attention-based module and a dynamic batch size adjustment mechanism to optimize performance and accessibility in public health contexts.

Methods

The experimental setup utilized a high-performance computational framework with NVIDIA A100 GPUs, implementing models in PyTorch and optimizing them with the Adam optimizer. The training process involved an initial learning rate of $1 \times 10^{-4}$, decayed by a factor of 0.5 every 10 epochs, with a batch size of 64. Each model was trained for 50 epochs with early stopping based on validation loss to enhance generalizability. Data preprocessing included tokenization using a pre-trained BERT tokenizer for text datasets (SST-5 and Yelp) and structured dialogues for the ReDial dataset. For the DAiSEE dataset, video frames were extracted and resized for feature extraction using a pre-trained ResNet backbone. Various model architectures were employed, including a BERT-based model for sentiment classification, a Transformer-based architecture for ReDial, and a two-stage model combining CNN and LSTM for DAiSEE.

The proposed method outperformed state-of-the-art (SOTA) models across four datasets—SST-5, ReDial, Yelp, and DAiSEE—achieving superior scores in accuracy, recall, F1 score, and AUC. For instance, the model attained 92.45% accuracy and 90.62% recall on SST-5, surpassing RoBERTa. In ReDial, the model’s contextual attention mechanism led to an accuracy of 87.94%, while on the Yelp dataset, it achieved 94.56% accuracy and 90.24% F1 score. The DAiSEE dataset results demonstrated an accuracy of 87.98% and an AUC of 88.34%, significantly better than BERT. The performance improvements were attributed to the model’s advanced architecture and dataset-specific preprocessing techniques, which effectively captured nuanced sentiment and engagement states, confirming its versatility in handling diverse NLP and computer vision challenges.

Discussion

The discussion section of the research paper emphasizes the significance of public health approaches in autism interventions, which aim to provide scalable, community-wide solutions for early diagnosis and intervention. These approaches recognize autism as a societal challenge, advocating for population-based strategies that enhance access to early detection tools, particularly for underserved communities. The integration of technology, including digital and AI-driven tools, is highlighted as a means to improve access to interventions, with community frameworks engaging families and professionals in addressing autism-related needs. The potential of AI, particularly transformer models, is underscored for their ability to analyze multimodal data and deliver personalized social skill training, adapting to individual children’s progress and addressing disparities in access to care.

The section further elaborates on the application of transformer models in social skill training for children with Autism Spectrum Disorder (ASD). These models excel in processing complex social interaction data, enabling real-time feedback and personalized interventions that can dynamically adjust based on a child’s engagement and performance. The adaptability of AI-enhanced interventions is noted as crucial for fostering social skills, with virtual agents and social robots providing safe environments for practice and interaction. The paper concludes that the integration of AI into autism interventions not only enhances individual learning experiences but also offers valuable insights for therapists and educators, ultimately supporting a more effective and personalized approach to social skill development in children with ASD.