شبكة تحسين مركبة محسنة للانتباه لتصنيف تخطيط الدماغ الكهربائي لتخيل الحركة A composite improved attention convolutional network for motor imagery EEG classification

المجلة: Frontiers in Neuroscience، المجلد: 19
DOI: https://doi.org/10.3389/fnins.2025.1543508
PMID: https://pubmed.ncbi.nlm.nih.gov/39981403
تاريخ النشر: 2025-02-05
المؤلف: Wenzhe Liao وآخرون
الموضوع الرئيسي: تخطيط الدماغ وواجهات الدماغ-الكمبيوتر

نظرة عامة

تقدم ورقة البحث شبكة تحسين الانتباه المركبة (CIACNet) المصممة لتصنيف إشارات تخطيط الدماغ الكهربائية (MI-EEG) المتعلقة بالتصور الحركي، والتي تعتبر حاسمة لتطبيقات واجهة الدماغ-الكمبيوتر (BCI). تتضمن بنية CIACNet شبكة عصبية تلافيفية ذات فرعين لاستخراج الميزات الزمنية، ووحدة تحسين الانتباه (CBAM) لتحسين استخراج الميزات، وشبكة تلافيفية زمنية (TCN) لالتقاط الديناميات الزمنية المتقدمة. بالإضافة إلى ذلك، يتم استخدام دمج الميزات متعددة المستويات لتوفير تمثيل أكثر شمولاً للبيانات.

أظهرت التقييمات التجريبية على مجموعات بيانات BCI IV-2a وBCI IV-2b دقة تصنيف بلغت 85.15% و90.05%، على التوالي، مع درجة كابا تبلغ 0.80 لكلا المجموعتين. تشير هذه النتائج إلى أن CIACNet يتفوق بشكل كبير على أربعة نماذج مقارنة، مما يدل على قدراته القوية في التصنيف وكفاءته من حيث وقت التدريب. تختتم الدراسة بأن كل مكون من مكونات CIACNet يساهم بشكل ملموس في أدائه، مما يبرز إمكانياته للتطبيقات العملية في أنظمة MI-BCI دون الحاجة إلى إزالة الشوائب، مما يقلل من التكاليف الزمنية الإجمالية.

مقدمة

تناقش مقدمة ورقة البحث هذه التقدمات في تكنولوجيا واجهة الدماغ-الكمبيوتر (BCI)، التي تسهل التواصل المباشر بين إشارات الدماغ العصبية والأجهزة الخارجية. تمكّن أنظمة BCI، وخاصة تلك التي تستخدم تخطيط الدماغ الكهربائي (EEG)، من تفسير نشاط الدماغ للتحكم في الأجهزة مثل الكراسي المتحركة والأطراف الصناعية. يتم تسليط الضوء على التصور الحركي (MI)، وهو إعادة تصور ذهني للحركات، كمنطقة تطبيق مهمة لإشارات MI-EEG، وخاصة في الطب التأهيلي. ومع ذلك، فإن التحديات مثل التباين بين الأفراد وعدم استقرار إشارات EEG تعقد التفسير الدقيق لنوايا المستخدمين، مما يستلزم طرق تصنيف فعالة.

تستعرض الورقة تقنيات استخراج الميزات التقليدية مثل الأنماط المكانية الشائعة (CSP) وآلات الدعم الناقل (SVM)، مشيرة إلى قيودها في التعامل مع بيانات MI-EEG المعقدة وغير الخطية. بالمقابل، ظهرت طرق التعلم العميق (DL) كحلول واعدة، حيث أظهرت نماذج مثل الشبكات العصبية التلافيفية (CNN) والشبكات العصبية المتكررة (RNN) أداءً محسنًا في التصنيف. تهدف إدخال آليات الانتباه والهياكل الجديدة، بما في ذلك شبكة تحسين الانتباه المركبة المقترحة (CIACNet)، إلى تعزيز استخراج الميزات ودقة التصنيف في معالجة إشارات MI-EEG. يظهر نموذج CIACNet، الذي يتميز بهيكله ثنائي الفرع وآليات الانتباه المحسنة، إمكانيات عالية الأداء في تطبيقات MI-BCI، مع معالجة التحديات التي تطرحها الطرق التقليدية.

طرق

يتم هيكلة نموذج CIACNet حول ثلاثة مكونات رئيسية: كتلة التلافيف (CV)، وكتلة تحسين الانتباه CBAM (IAT)، وكتلة التلافيف الزمنية (TC). تنقسم كتلة CV إلى فرعين فرعيين، CV1 وCV2، يحتوي كل منهما على ثلاثة أنواع من الطبقات التلافيفية—التلافيف الزمنية، والتلافيف العمق القنوي، والتلافيف المكانية—المصممة لترميز الميزات الزمكانية لإشارات MI-EEG. يسمح هذا التكوين لكتلة CV بالتقاط الميزات الزمنية بفعالية عبر خطوات زمنية مختلفة، مما ينتج تمثيلًا شاملاً للإشارات المدخلة.

تعزز كتلة IAT استخراج الميزات من خلال استخدام وحدة تحسين الانتباه التلافيفية (CBAM) المحسنة، التي تركز على كل من الأبعاد القنوية والمكانية. في الوقت نفسه، تستفيد كتلة TC من الشبكات التلافيفية الزمنية (TCN) لاشتقاق ميزات زمنية عالية المستوى من البيانات المتسلسلة. في مرحلة الدمج النهائية، يتم دمج المخرجات من كتل IAT وTC وCV2 ومعالجتها بعد ذلك من خلال طبقة متصلة بالكامل (FC) مع مصنف سوفت ماكس، مما يؤدي إلى قدرات النموذج التنبؤية. سيتم مناقشة مزيد من التفاصيل حول بنية وأداء نموذج CIACNet في الأقسام التالية.

مناقشة

في قسم المناقشة من ورقة البحث، يوضح المؤلفون المعالجة المسبقة وتمثيل إشارات MI-EEG ضمن نموذج CIACNet المقترح. يحتفظ النموذج بالطيف الكامل لترددات EEG وجميع القنوات أثناء المعالجة المسبقة، ويطبق تطبيع Z-score لمعالجة عدم استقرار الإشارات. يضمن هذا التطبيع أن يكون لكل قناة متوسط صفر وانحراف معياري واحد، مما يسهل فك تشفير بيانات MI-EEG بشكل فعال. تتضمن بنية النموذج كتل تلافيفية (CV) مستوحاة من EEGNet، معززة باستراتيجيات تلافيفية متعددة الفروع ومتعددة المقاييس لتحسين استخراج الميزات من البيانات المزعجة. تستخدم كتل CV التلافيف ثنائية الأبعاد وتدمج تطبيع الدفعة، وتنظيم الريج، وطبقات الإسقاط لتحسين الأداء وتقليل الإفراط في التخصيص.

بالإضافة إلى ذلك، يقدم المؤلفون كتلة تحسين الانتباه CBAM (IAT) المحسنة، التي تدمج آليات الانتباه القنوي والمكاني لتعزيز تركيز النموذج على الميزات الهامة ضمن البيانات المدخلة. تستخدم كتلة IAT طريقة تجميع عشوائية جديدة، مما يسمح باختيار أكثر دقة للميزات بناءً على أحجامها، وبالتالي تحسين قدرة النموذج على استخراج المعلومات ذات الصلة. تم تصميم كتلة التلافيف الزمنية (TC) لمعالجة البيانات المتسلسلة بفعالية، باستخدام تلافيف سببية موسعة لالتقاط سياقات زمنية أوسع مع الحفاظ على سلامة المدخلات السابقة. يتم تقييم نموذج CIACNet باستخدام مجموعات بيانات BCI IV-2a وIV-2b، مما يظهر دقة تصنيف واستقرار متفوقين مقارنة بالنماذج الأخرى المعتمدة على CNN، مما يبرز إمكانياته للتطبيقات الواقعية في فك تشفير إشارات MI-EEG.

Journal: Frontiers in Neuroscience, Volume: 19
DOI: https://doi.org/10.3389/fnins.2025.1543508
PMID: https://pubmed.ncbi.nlm.nih.gov/39981403
Publication Date: 2025-02-05
Author(s): Wenzhe Liao et al.
Primary Topic: EEG and Brain-Computer Interfaces

Overview

The research paper introduces a composite improved attention convolutional network (CIACNet) designed for the classification of motor imagery electroencephalography (MI-EEG) signals, which are crucial for brain-computer interface (BCI) applications. The CIACNet architecture incorporates a dual-branch convolutional neural network (CNN) for extracting temporal features, an enhanced convolutional block attention module (CBAM) for improved feature extraction, and a temporal convolutional network (TCN) to capture advanced temporal dynamics. Additionally, multi-level feature concatenation is employed to provide a more comprehensive representation of the data.

Experimental evaluations on the BCI IV-2a and BCI IV-2b datasets yielded classification accuracies of 85.15% and 90.05%, respectively, with a kappa score of 0.80 for both datasets. These results indicate that CIACNet significantly outperforms four comparative models, demonstrating its robust classification capabilities and efficiency in terms of training time. The study concludes that each component of the CIACNet contributes meaningfully to its performance, underscoring its potential for practical applications in MI-BCI systems without the need for artifact removal, thus reducing overall time costs.

Introduction

The introduction of this research paper discusses the advancements in brain-computer interface (BCI) technology, which facilitates direct communication between brain neural signals and external devices. BCI systems, particularly those utilizing electroencephalography (EEG), enable the interpretation of brain activity to control devices such as wheelchairs and prosthetics. Motor imagery (MI), the mental rehearsal of movements, is highlighted as a significant area of application for MI-EEG signals, particularly in rehabilitation medicine. However, challenges such as inter-individual variability and non-stationarity of EEG signals complicate the accurate interpretation of users’ intentions, necessitating effective classification methods.

The paper reviews traditional feature extraction techniques like common spatial patterns (CSP) and support vector machines (SVM), noting their limitations in handling complex, non-linear MI-EEG data. In contrast, deep learning (DL) approaches have emerged as promising solutions, with models such as convolutional neural networks (CNN) and recurrent neural networks (RNN) demonstrating improved classification performance. The introduction of attention mechanisms and novel architectures, including the proposed composite improved attention convolutional network (CIACNet), aims to enhance feature extraction and classification accuracy in MI-EEG signal processing. The CIACNet model, characterized by its dual-branch architecture and improved attention mechanisms, shows potential for high performance in MI-BCI applications, addressing the challenges posed by traditional methods.

Methods

The CIACNet model is structured around three primary components: the convolutional (CV) block, the improved CBAM attention (IAT) block, and the temporal convolution (TC) block. The CV block is subdivided into two subblocks, CV1 and CV2, each containing three types of convolutional layers—temporal, channel depth-wise, and spatial convolutions—designed to encode the spatiotemporal features of motor imagery electroencephalography (MI-EEG) signals. This configuration allows the CV block to effectively capture temporal features across various time steps, producing a comprehensive representation of the input signals.

The IAT block enhances feature extraction by employing an improved Convolutional Block Attention Module (CBAM), which focuses on both channel and spatial dimensions. Meanwhile, the TC block leverages Temporal Convolutional Networks (TCN) to derive high-level temporal features from sequential data. In the final integration stage, the outputs from the IAT, TC, and CV2 blocks are concatenated and subsequently processed through a fully connected (FC) layer with a softmax classifier, culminating in the model’s predictive capabilities. Further elaboration on the CIACNet model’s architecture and performance will be discussed in the following sections.

Discussion

In the discussion section of the research paper, the authors detail the preprocessing and representation of motor imagery electroencephalography (MI-EEG) signals within their proposed CIACNet model. The model retains the full spectrum of EEG frequencies and all channels during preprocessing, applying Z-score normalization to address signal non-stationarity. This normalization ensures that each channel has a mean of zero and a standard deviation of one, facilitating effective decoding of MI-EEG data. The model’s architecture includes convolutional (CV) blocks inspired by EEGNet, enhanced with multi-branch, multi-scale convolutional strategies to improve feature extraction from noisy data. The CV blocks utilize 2D convolutions and incorporate batch normalization, ridge regularization, and dropout layers to optimize performance and mitigate overfitting.

Additionally, the authors introduce an improved CBAM attention (IAT) block, which integrates channel and spatial attention mechanisms to enhance the model’s focus on significant features within the input data. The IAT block employs a novel stochastic pooling method, allowing for a more nuanced selection of features based on their magnitudes, thereby improving the model’s ability to extract relevant information. The temporal convolutional (TC) block is designed to process sequential data effectively, utilizing dilated causal convolutions to capture broader temporal contexts while maintaining the integrity of prior inputs. The CIACNet model is evaluated using the BCI IV-2a and IV-2b datasets, demonstrating superior classification accuracy and stability compared to other CNN-based models, underscoring its potential for real-world applications in MI-EEG signal decoding.