الكشف الآلي عن النوبات في الصرع باستخدام شبكة انتباه ديناميكية زمانية-مكانية جديدة Automated seizure detection in epilepsy using a novel dynamic temporal-spatial graph attention network

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-01015-0
PMID: https://pubmed.ncbi.nlm.nih.gov/40355495
تاريخ النشر: 2025-05-12
المؤلف: Kunxian Yan وآخرون
الموضوع الرئيسي: تخطيط الدماغ وواجهات الدماغ-الكمبيوتر

نظرة عامة

تقدم ورقة البحث نهجًا جديدًا لتحليل الشبكات الدماغية الديناميكية في الصرع باستخدام شبكة انتباه رسومية زمنية-مكانية ديناميكية (DTS-GAN). يتناول هذا النموذج قيود نماذج الرسوم البيانية ذات الطوبولوجيا الثابتة من خلال دمج معالجة إشارات الرسوم البيانية مع إطار عمل تعلم عميق هجين. تستخدم DTS-GAN مشفرًا زمنيًا قائمًا على LSTM لالتقاط الاعتمادات طويلة الأمد في تسلسلات EEG وشبكة انتباه رسومية ديناميكية مع اتصال غاوسي احتمالي لتعلم التفاعلات الوظيفية العابرة بين عقد الأقطاب بشكل تكيفي. تشير النتائج التجريبية على مجموعة بيانات TUSZ إلى أن DTS-GAN تحقق دقة تتراوح بين 89-91% ودرجة F1 مرجحة تتراوح بين 87-91% في تصنيف سبعة أنواع من النوبات، متجاوزة بشكل كبير النماذج الأساسية.

تخلص الدراسة إلى أن DTS-GAN تستخرج بفعالية الميزات الزمانية-المكانية من إشارات EEG، مما يقلل من الحاجة إلى مراجعة EEG اليدوية ويظهر إمكانية تطبيقها على اضطرابات عصبية أخرى ذات اتصال ديناميكي غير طبيعي، مثل الفصام ومرض الزهايمر. تشمل اتجاهات البحث المستقبلية تحسين النموذج للنشر في الوقت الحقيقي على الأجهزة السريرية، ودمج البيانات متعددة الوسائط لتحسين تحديد موقع النوبات والتفريق بين الأنواع الفرعية، وتطوير أدوات التصور لتعزيز ثقة الأطباء في اتخاذ القرارات المدفوعة بالذكاء الاصطناعي. تهدف هذه الجهود إلى ربط التقدم النظري بالتطبيقات الطبية العملية، مما يسهم في مجال علم الأعصاب الدقيق.

الطرق

في هذا القسم، يوضح المؤلفون الطرق المستخدمة في دراستهم، مع التركيز على تدريب نموذج يستخدم الحد الأدنى من الأدلة (ELBO) في سياق الاستدلال التبايني والمشفّرات التلقائية التباينية (VAEs). يتم التعبير عن ELBO كالتالي:

\[
\text{ELBO} = E_{q(z|x)}[\log p(x|z)] – KL[q(z|x) || p(z)]
\]

حيث يمثل \(q(z|x)\) نموذج التعرف، و\(p(x|z)\) النموذج التوليدي، و\(x\) البيانات الملاحظة، و\(z\) المتغير الكامن، و\(p(z)\) التوزيع السابق. الهدف هو تعظيم ELBO لتحسين أداء النموذج مع تقليل تباين كولباك-ليبلر لمواءمة توزيع النموذج مع التوزيع الخلفي الحقيقي. يتم استخدام خوارزمية تحسين آدم لهذا الغرض. تتضمن الدراسة أيضًا وحدات شبكة عصبية رسومية لمقارنة آليات الانتباه الثابتة والديناميكية، مما يكشف أن نموذج الانتباه الديناميكي (DTS-GAN) يتفوق على الآخرين، بما في ذلك نماذج CNN وTransformer، خاصة في تصنيف نوع النوبات.

تشمل مقاييس التقييم درجة F1، التي يتم حسابها باستخدام الإيجابيات الحقيقية (TP) والإيجابيات الكاذبة (FP) والسلبيات الكاذبة (FN)، مع اعتبار كل من درجات Macro F1 وWeighted F1 لتقييم أداء النموذج عبر أنواع النوبات المختلفة. تشير النتائج إلى أن نموذج DTS-GAN يحقق معدل اكتشاف مرجح يتراوح بين 87% و91%، مما يظهر دقة وقوة فائقة، خاصة في التعامل مع اختلالات الفئات. من الجدير بالذكر أن آلية الانتباه الديناميكية تعزز الأداء بشكل كبير، حيث تحسن الدقة بنسبة 4.7% وWeighted F1 بنسبة 6.7% مقارنة بـ GAT الثابت، بينما تقلل أيضًا من وقت التدريب بنسبة 18%. بشكل عام، تؤكد النتائج فعالية آلية الانتباه الديناميكية في تحسين أداء النموذج في مهام التصنيف متعددة الفئات.

المناقشة

يقدم قسم المناقشة في ورقة البحث إطار عمل DTS-GAN، الذي يدمج الشبكات العصبية طويلة وقصيرة المدى (LSTM) للتشفير الزمني مع آليات الانتباه الرسومية الديناميكية لتحليل بيانات EEG المتعلقة بالصرع. يسمح هذا النهج المبتكر بالاستخراج المشترك للتمثيلات الزمانية-المكانية، مما يعكس الاتصال الديناميكي الكامن في نشاط الدماغ. تستخدم استراتيجية توليد الرسوم البيانية الفريدة للنموذج احتمالات غاوسية متغيرة زمنياً لأوزان الحواف، مما يعالج قيود نماذج الرسوم البيانية الثابتة ويمكّن من تحديثات الطوبولوجيا التكيفية التي تتماشى مع أنماط انتشار النوبات. تم اختبار DTS-GAN بدقة على مجموعة بيانات TUSZ، متفوقة على النماذج الأساسية مثل CNN وTransformer وشبكات الانتباه الرسومية الثابتة (GAT) في تصنيف سبعة أنواع من النوبات، مما يبرز أهمية الاتصال الوظيفي الديناميكي في التقاط التوقيعات الكهربية الخاصة بالنوبات.

على الرغم من تقدمها، تواجه النموذج قيودًا تتعلق بمعالجة بيانات EEG، والتي قد تؤثر على قابليتها للتعميم عبر إعدادات سريرية مختلفة بسبب اختلافات في بروتوكولات اكتساب الإشارة. يُقترح أن تتضمن الأبحاث المستقبلية تقنيات تكيف المجال لتوحيد توزيعات الميزات من مصادر EEG متنوعة. بالإضافة إلى ذلك، بينما يقلل متغير GATv2 من وقت التدريب، فإن زيادة عدد المعلمات تتطلب تحسينًا للتطبيقات السريرية في الوقت الحقيقي. يقترح المؤلفون أن إطار عمل DTS-GAN يمكن أن يوسع فائدته إلى ما هو أبعد من الصرع إلى اضطرابات عصبية أخرى تتميز بشذوذات في الاتصال الزماني-المكاني، مثل الفصام. لتعزيز قوة النموذج وقابليته للتطبيق، يجب أن تركز الجهود المستقبلية على إنشاء مجموعات بيانات متوازنة من خلال تعاون متعدد المراكز وتطوير نماذج خفيفة لنشر فعال في البيئات السريرية.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-01015-0
PMID: https://pubmed.ncbi.nlm.nih.gov/40355495
Publication Date: 2025-05-12
Author(s): Kunxian Yan et al.
Primary Topic: EEG and Brain-Computer Interfaces

Overview

The research paper presents a novel approach to analyzing dynamic brain networks in epilepsy using a Dynamic Temporal-Spatial Graph Attention Network (DTS-GAN). This model addresses the limitations of fixed-topology graph models by integrating graph signal processing with a hybrid deep learning framework. DTS-GAN employs an LSTM-based temporal encoder to capture long-term dependencies in EEG sequences and a dynamic graph attention network with probabilistic Gaussian connectivity to adaptively learn transient functional interactions among electrode nodes. Experimental results on the TUSZ dataset indicate that DTS-GAN achieves an accuracy of 89-91% and a weighted F1-score of 87-91% in classifying seven seizure types, significantly surpassing baseline models.

The study concludes that DTS-GAN effectively extracts spatiotemporal features from EEG signals, reducing the need for manual EEG review and showing potential applicability to other neurological disorders with abnormal dynamic connectivity, such as schizophrenia and Alzheimer’s disease. Future research directions include optimizing the model for real-time deployment on clinical devices, integrating multimodal data for improved seizure localization and subtype differentiation, and developing visualization tools to enhance clinician trust in AI-driven decision-making. These efforts aim to bridge theoretical advancements with practical medical applications, ultimately contributing to the field of precision neurology.

Methods

In this section, the authors detail the methods employed in their study, focusing on the training of a model utilizing the Evidence Lower Bound (ELBO) in the context of variational inference and variational autoencoders (VAEs). The ELBO is expressed as:

\[
\text{ELBO} = E_{q(z|x)}[\log p(x|z)] – KL[q(z|x) || p(z)]
\]

where \(q(z|x)\) represents the recognition model, \(p(x|z)\) the generative model, \(x\) the observed data, \(z\) the latent variable, and \(p(z)\) the prior distribution. The objective is to maximize the ELBO to enhance model performance while minimizing the Kullback-Leibler divergence to align the model’s distribution with the true posterior. The Adam optimization algorithm is employed for this purpose. The study also incorporates graph neural network modules to compare static and dynamic attention mechanisms, revealing that the dynamic attention model (DTS-GAN) outperforms others, including CNN and Transformer models, particularly in seizure type classification.

The evaluation metrics include the F1 score, which is calculated using true positives (TP), false positives (FP), and false negatives (FN), with both Macro F1 and Weighted F1 scores considered to assess model performance across different seizure types. The results indicate that the DTS-GAN model achieves a weighted detection rate between 87% and 91%, demonstrating superior accuracy and robustness, especially in handling category imbalances. Notably, the dynamic attention mechanism significantly enhances performance, improving accuracy by 4.7% and Weighted F1 by 6.7% compared to static GAT, while also reducing training time by 18%. Overall, the findings underscore the effectiveness of the dynamic attention mechanism in improving model performance in multi-class classification tasks.

Discussion

The discussion section of the research paper presents the DTS-GAN framework, which integrates Long Short-Term Memory (LSTM) networks for temporal encoding with dynamic graph attention mechanisms for analyzing EEG data related to epilepsy. This innovative approach allows for the joint extraction of spatiotemporal representations, reflecting the dynamic connectivity inherent in brain activity. The model’s unique graph generation strategy employs time-varying Gaussian probabilities for edge weights, addressing the limitations of static graph models and enabling adaptive topology updates that align with the propagation patterns of seizures. The DTS-GAN was rigorously tested on the TUSZ dataset, outperforming baseline models such as CNN, Transformer, and static Graph Attention Networks (GAT) in classifying seven seizure types, highlighting the importance of dynamic functional connectivity in capturing seizure-specific electrophysiological signatures.

Despite its advancements, the model faces limitations related to the preprocessing of EEG data, which may affect its generalizability across different clinical settings due to variations in signal acquisition protocols. Future research is suggested to incorporate domain adaptation techniques to harmonize feature distributions from diverse EEG sources. Additionally, while the GATv2 variant of the model reduces training time, its increased parameter count necessitates optimization for real-time clinical applications. The authors propose that the DTS-GAN framework could extend its utility beyond epilepsy to other neurological disorders characterized by spatiotemporal connectivity anomalies, such as schizophrenia. To enhance the model’s robustness and applicability, future efforts should focus on creating balanced datasets through multi-center collaborations and developing lightweight models for efficient deployment in clinical environments.