AMEEGNet: شبكة EEGNet متعددة المقاييس المعتمدة على الانتباه لفك تشفير صور الحركة الفعالة AMEEGNet: attention-based multiscale EEGNet for effective motor imagery EEG decoding

المجلة: Frontiers in Neurorobotics، المجلد: 19
DOI: https://doi.org/10.3389/fnbot.2025.1540033
PMID: https://pubmed.ncbi.nlm.nih.gov/39911854
تاريخ النشر: 2025-01-22
المؤلف: Xuejian Wu وآخرون
الموضوع الرئيسي: تخطيط الدماغ وواجهات الدماغ-الكمبيوتر

نظرة عامة

في التطورات الأخيرة في تكنولوجيا واجهة الدماغ-الكمبيوتر (BCI)، أظهرت إشارات تخطيط الدماغ الكهربائي (EEG) المستمدة من تخيل الحركة (MI) وعدًا، خاصة في إعادة تأهيل المرضى المشلولين. ومع ذلك، فإن نسبة الإشارة إلى الضوضاء المنخفضة المرتبطة بـ MI EEG تطرح تحديات كبيرة للتشفير الفعال، مما يحد من تقدم BCI. يقدم هذا البحث شبكة EEGNet متعددة المقاييس المعتمدة على الانتباه (AMEEGNet) المصممة لتعزيز أداء التشفير لـ MI-EEG.

تستخدم الطريقة المقترحة ثلاث شبكات EEGNets متوازية تعتمد على نهج نقل دمج لاستخراج ميزات زمنية-مكانية عالية الجودة من بيانات EEG عبر مقاييس متعددة. بالإضافة إلى ذلك، تم دمج وحدة انتباه القناة الفعالة (ECA) لتحسين تحديد الميزات المكانية التمييزية من خلال تطبيق آلية وزن خفيفة على القنوات الحرجة. تشير النتائج التجريبية إلى أن AMEEGNet تحقق دقة تشفير تبلغ 81.17%، 89.83%، و95.49% على مجموعات بيانات BCI-2a، BCI-2b، وHGD، على التوالي. تشير هذه النتائج إلى أن AMEEGNet تقوم بتشفير الميزات الزمنية-المكانية بفعالية، مما يقدم نهجًا جديدًا لتشفير MI-EEG ويساهم في تقدم تطبيقات BCI المستقبلية.

مقدمة

تناقش مقدمة هذه الورقة البحثية التقدم في تكنولوجيا واجهة الدماغ-الكمبيوتر (BCI)، التي تدمج العلوم الطبية الحيوية، وعلم الأعصاب، وتفاعل الإنسان مع الكمبيوتر لإنشاء مسار اتصال مباشر بين الدماغ والأجهزة الخارجية، مما يحسن نوعية الحياة للأفراد ذوي الإعاقة. تؤكد الورقة على مزايا تخيل الحركة (MI) EEG مقارنة بتخطيط الدماغ الكهربائي المستحث (EEG)، مشددة على طبيعته العفوية التي تسمح بالتحكم دون تدخل خارجي. تلاحظ أن مهام تخيل الحركة المختلفة تنشط مناطق دماغية متميزة، مع استجابات سعة متغيرة في قشرة الحركية الحسية خلال الحركات الجانبية والجانبية المعاكسة، خاصة في نطاقات التردد α (8-12 هرتز) وβ (13-30 هرتز).

يقدم المؤلفون أربع مساهمات رئيسية لتعزيز أداء تشفير EEG. أولاً، يقدمون شبكة EEGNet متعددة المقاييس التي تستخرج الميزات عبر مقاييس مختلفة، مما يحسن قدرة النموذج على التقاط إشارات EEG الزمنية والمكانية المعقدة. ثانيًا، يقترحون طريقة نقل دمج تحلل ميزات EEG عبر مقاييس مختلفة من خلال شبكات متوازية، مما يعزز التفاعل متعدد المقاييس ودقة التشفير. ثالثًا، يسمح تنفيذ آلية انتباه القناة الفعالة (ECA) للنموذج بالتركيز على القنوات الهامة مع تقليل المتطلبات الحسابية. أخيرًا، يظهر النموذج أداءً قويًا على مجموعات بيانات مسابقة BCI IV، مدعومًا بتقنيات التصوير التي تعزز القابلية للتفسير وتؤكد فعالية الطريقة المقترحة.

طرق

في قسم الطرق، يوضح المؤلفون إعداد التجربة ومقاييس الأداء المستخدمة لتقييم نموذج AMEEGNet لتشفير تخطيط الدماغ الكهربائي لتخيل الحركة (MI-EEG). تم إجراء جميع التجارب تحت ظروف متسقة باستخدام إطار عمل Pytorch 1.12 على وحدة معالجة الرسوميات Nvidia GTX 3060. تم تحسين النموذج باستخدام مُحسّن آدم بمعدل تعلم قدره 0.001، وحجم دفعة قدره 64، وتدريب لمدة 1,000 دورة مع خسارة الانتروبيا المتقاطعة. تم تقييم الأداء بشكل أساسي من خلال مقاييس الدقة، مما يكشف أن AMEEGNet تفوقت على نماذج أخرى في مهام التشفير ذات الفئتين والأربع فئات، كما يتضح من مصفوفات الالتباس التي أبرزت التمييز الفعال بين معظم الفئات، على الرغم من أن فئة اللسان أظهرت دقة أقل.

بالإضافة إلى ذلك، استخدم المؤلفون طريقة تقييم ترك موضوع واحد خارج (LOSO) لتقييم أداء التشفير عبر الموضوعات. أظهرت النتائج من مجموعة بيانات BCI 2a أن AMEEGNet حققت دقة قدرها 66.68% ودرجة كابا قدرها 0.5558، مما يدل على أداء قوي على الموضوعات غير المرئية. أظهرت المقارنات مع الطرق التقليدية، مثل CSP + SVM، ونماذج التعلم العميق الأخرى (EEGNet، EEGNeX، TCNet) تحسينات كبيرة، خاصة تحسين بنسبة 15.22% مقارنة بـ CSP + SVM (p < 0.02). ساهم دمج نقل الدمج وكتلة ECA في هذه التقدمات، مع تأكيد التحليلات الإحصائية على تفوق AMEEGNet في كل من المهام المعتمدة على الموضوعات وغير المعتمدة على الموضوعات.

نقاش

يسلط قسم النقاش في الورقة البحثية الضوء على تطوير وتقييم شبكة EEGNet متعددة المقاييس المعتمدة على الانتباه (AMEEGNet) لتشفير تخيل الحركة (MI) EEG. تستخدم الدراسة عدة مجموعات بيانات، بما في ذلك مجموعات بيانات مسابقة BCI IV 2a و2b، ومجموعة بيانات غاما العالية (HGD)، لتقييم أداء النموذج عبر مهام تصنيف MI المختلفة. تستخدم AMEEGNet بنية فريدة تتكون من ثلاث كتل EEGNet متوازية لاستخراج الميزات متعددة المقاييس، وكتلة انتباه القناة الفعالة (ECA) لتعزيز الاعتماد على القنوات، وكتلة تصنيف تدمج الميزات لتحسين الدقة. أظهر النموذج تحسينات كبيرة في دقة التصنيف ودرجات كابا مقارنة بأساليب التعلم الآلي التقليدية وغيرها من أساليب التعلم العميق الحديثة، محققًا دقة متوسطة قدرها 81.17% و89.83% على مجموعات بيانات BCI 2a و2b، على التوالي.

تؤكد النتائج على فعالية تصميم AMEEGNet، خاصة قدرته على استخراج ميزات زمنية-مكانية معقدة دون معالجة مسبقة واسعة، مما يقلل من فقدان البيانات. كما تؤكد الدراسة على أهمية كتلة ECA في تحسين اختيار الميزات، مما يساهم في قوة النموذج واستقراره في تشفير MI-EEG. تشمل اتجاهات البحث المستقبلية تعزيز أداء النموذج في البيئات الديناميكية من خلال دمج الشبكات العصبية المتكررة (RNNs) ومعالجة التحديات في التشفير عبر الإنترنت لتحسين قدرات المعالجة في الوقت الحقيقي. بشكل عام، تمثل AMEEGNet المقترحة تقدمًا كبيرًا في مجال واجهات الدماغ-الكمبيوتر (BCIs)، مع آثار واعدة للتطبيقات العملية في تشفير MI-EEG.

Journal: Frontiers in Neurorobotics, Volume: 19
DOI: https://doi.org/10.3389/fnbot.2025.1540033
PMID: https://pubmed.ncbi.nlm.nih.gov/39911854
Publication Date: 2025-01-22
Author(s): Xuejian Wu et al.
Primary Topic: EEG and Brain-Computer Interfaces

Overview

In recent developments within brain-computer interface (BCI) technology, electroencephalogram (EEG) signals derived from motor imagery (MI) have shown promise, particularly for rehabilitating paralyzed patients. However, the low signal-to-noise ratio associated with MI EEG poses significant challenges for effective decoding, thereby limiting BCI advancements. This paper introduces an attention-based multiscale EEGNet (AMEEGNet) designed to enhance the decoding performance of MI-EEG.

The proposed method employs three parallel EEGNets utilizing a fusion transmission approach to extract high-quality temporal-spatial features from EEG data across multiple scales. Additionally, an efficient channel attention (ECA) module is integrated to improve the identification of discriminative spatial features by applying a lightweight weighting mechanism to critical channels. Experimental results indicate that AMEEGNet achieves decoding accuracies of 81.17%, 89.83%, and 95.49% on the BCI-2a, BCI-2b, and HGD datasets, respectively. These findings suggest that AMEEGNet effectively decodes temporal-spatial features, offering a novel approach to MI-EEG decoding and contributing to the advancement of future BCI applications.

Introduction

The introduction of this research paper discusses the advancements in brain-computer interface (BCI) technology, which integrates biomedical science, neuroscience, and human-computer interaction to create a direct communication pathway between the brain and external devices, thereby improving the quality of life for individuals with disabilities. The paper emphasizes the advantages of motor imagery (MI) EEG over evoked electroencephalography (EEG), highlighting its spontaneous nature that allows for control without external interference. It notes that different motor imagery tasks activate distinct brain regions, with varying amplitude responses in the sensorimotor cortex during ipsilateral and contralateral movements, particularly in the α (8-12 Hz) and β (13-30 Hz) frequency bands.

The authors present four key contributions to enhance EEG decoding performance. First, they introduce a multi-scale EEGNet that extracts features across various scales, improving the model’s ability to capture complex temporal and spatial EEG signals. Second, they propose a fusion transmission method that analyzes EEG features across different scales through parallel networks, enhancing multiscale interaction and decoding accuracy. Third, the implementation of an efficient channel attention (ECA) mechanism allows the model to focus on significant channels while minimizing computational demands. Finally, the model demonstrates strong performance on BCI Competition IV datasets, supported by visualization techniques that enhance interpretability and validate the proposed method’s efficacy.

Methods

In the Methods section, the authors detail the experimental setup and performance metrics used to evaluate the AMEEGNet model for motor imagery electroencephalography (MI-EEG) decoding. All experiments were conducted under consistent conditions utilizing the Pytorch 1.12 framework on an Nvidia GTX 3060 GPU. The model was optimized using the Adam optimizer with a learning rate of 0.001, a batch size of 64, and trained for 1,000 epochs with cross-entropy loss. Performance was primarily assessed through accuracy metrics, revealing that AMEEGNet outperformed other models in both two-class and four-class decoding tasks, as evidenced by confusion matrices that highlighted effective discrimination among most classes, although the tongue class showed lower accuracy.

Additionally, the authors employed a Leave-One-Subject-Out (LOSO) evaluation method to assess cross-subject decoding performance. Results from the BCI 2a dataset indicated that AMEEGNet achieved an accuracy of 66.68% and a Kappa score of 0.5558, demonstrating robust performance on unseen subjects. Comparisons with traditional methods, such as CSP + SVM, and other deep learning models (EEGNet, EEGNeX, TCNet) showed significant improvements, particularly a 15.22% enhancement over CSP + SVM (p < 0.02). The integration of fusion transmission and the ECA block contributed to these advancements, with statistical analyses confirming the superiority of AMEEGNet in both subject-dependent and subject-independent tasks.

Discussion

The discussion section of the research paper highlights the development and evaluation of the attention-based multiscale EEGNet (AMEEGNet) for motor imagery (MI) EEG decoding. The study utilizes several datasets, including the BCI Competition IV 2a and 2b datasets, and the High Gamma Dataset (HGD), to assess the model’s performance across various MI classification tasks. AMEEGNet employs a unique architecture comprising three parallel EEGNet blocks for multiscale feature extraction, an efficient channel attention (ECA) block to enhance channel dependencies, and a classification block that integrates features for improved accuracy. The model demonstrated significant improvements in classification accuracy and kappa scores compared to traditional machine learning methods and other state-of-the-art deep learning approaches, achieving average accuracies of 81.17% and 89.83% on the BCI 2a and 2b datasets, respectively.

The findings underscore the effectiveness of AMEEGNet’s design, particularly its ability to extract complex temporal-spatial features without extensive preprocessing, thus minimizing data loss. The study also emphasizes the importance of the ECA block in refining feature selection, which contributes to the model’s robustness and stability in MI-EEG decoding. Future research directions include enhancing the model’s performance in dynamic environments by integrating recurrent neural networks (RNNs) and addressing challenges in online decoding to improve real-time processing capabilities. Overall, the proposed AMEEGNet represents a significant advancement in the field of brain-computer interfaces (BCIs), with promising implications for practical applications in MI-EEG decoding.