تصنيف إشارة تخطيط الدماغ لتخيل الحركة باستخدام خوارزمية تعلم عميقة جديدة Motor imagery EEG signal classification using novel deep learning algorithm

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-00824-7
PMID: https://pubmed.ncbi.nlm.nih.gov/40628758
تاريخ النشر: 2025-07-08
المؤلف: Sathish Mathiyazhagan وآخرون
الموضوع الرئيسي: تخطيط الدماغ وواجهات الدماغ-الكمبيوتر

نظرة عامة

تقدم هذه الدراسة نموذجًا جديدًا لتصنيف إشارات تخيل الحركة (MI) EEG، حيث تتناول التحديات مثل ضوضاء الإشارة، والتباين بين الأفراد، ومتطلبات المعالجة في الوقت الحقيقي التي تعيق التقنيات الحالية. يتضمن النهج المقترح مرحلة معالجة مسبقة هجينة تستخدم تحليل الوضع التجريبي (EMD) لاستخراج وضع الإشارة الجوهري وتحويل الموجات المستمرة (CWT) للتحليل متعدد الدقة. من أجل تحسين استخراج الميزات المكانية، يستخدم النموذج تماسك قوة المصدر (SPoC) بالتزامن مع الأنماط المكانية الشائعة (CSP). يتم إجراء التصنيف باستخدام شبكة اعتقاد عميقة قابلة للتكيف (ADBN)، مع تحسين المعلمات من خلال خوارزمية تحسين البعيد والقريب (FNO).

تُثبت فعالية النموذج المقترح على مجموعات بيانات مرجعية، محققة نتائج مثيرة للإعجاب: دقة 95.7%، استرجاع 96.2%، دقة 95.9%، وخصوصية 97.5% على مجموعة بيانات مسابقة BCI IV Dataset 2a، ودقة 94.1%، استرجاع 94.0%، دقة 93.6%، وخصوصية 95.0% على مجموعة بيانات PhysioNet. تُظهر هذه النتائج أداءً متفوقًا مقارنة بالطرق الحالية، بما في ذلك الشبكات العصبية التلافيفية (CNN)، والذاكرة طويلة وقصيرة المدى (LSTM)، وخوارزميات LSTM ثنائية الاتجاه (BiLSTM)، مما يبرز قوة النموذج وقدرته على التكيف في تحليل إشارات EEG.

الطرق

في هذا القسم، يصف المؤلفون المنهجيات المستخدمة في بحثهم، مع تسليط الضوء على استخدام MSCARNet مع الالتفاف متعدد المقاييس وتضمينات ريمان، بالإضافة إلى المحولات الجماعية لتصنيف EEG متعدد الفئات. يُلاحظ أن نهج MSCARNet يتميز بكثافة حسابية عالية، خاصة في المهام المستقلة عن الأفراد، واعتماده على ضبط المعلمات. في الوقت نفسه، تواجه المحولات الجماعية متطلبات موارد عالية ولديها استكشاف محدود للتعديلات الخفيفة المناسبة للتطبيقات في الوقت الحقيقي.

يقدم المؤلفون شبكة مُحسّنة باستخدام خوارزمية مشغل فورييه العصبي (FNO)، والتي تستخدم استراتيجية مزدوجة من الاستكشاف العالمي والتنقيح المحلي. يسهل هذا النهج التقارب الفعال ويساعد على تجنب الحد الأدنى المحلي، متفوقًا على طرق التحسين التقليدية مثل الانحدار العشوائي (SGD) وآدم. يُظهر النموذج المقترح دقة تصنيف محسّنة ومرونة ضد التباين والضوضاء، كما هو موضح في مخطط تدفق العملية المرافق (الشكل 1).

النتائج

تُثبت نتائج الدراسة أداء النموذج المقترح باستخدام مجموعة بيانات مسابقة BCI IV Dataset 2a، والتي تتكون من مهام تخيل الحركة (MI) متعددة الفئات مع إشارات EEG مقسمة مسبقًا. تم إجراء التجارب على محطة عمل عالية الأداء تحتوي على معالج Intel Core i7، وذاكرة وصول عشوائي سعتها 32 جيجابايت، و بطاقة رسومات NVIDIA GeForce RTX 3080، مما يسهل حسابات التعلم العميق بكفاءة. تم تطوير النموذج بلغة بايثون 3.8، باستخدام مكتبات رئيسية مثل TensorFlow 2.9 للتعلم العميق، وScikit-learn لبرامج التعلم الآلي، وMatplotlib وSeaborn لتصور البيانات.

يستخدم النموذج المقترح بنية متعددة الطبقات مصممة لدمج الميزات المكانية والزمانية وتردد المجال من بيانات EEG بشكل فعال. تم حساب مقاييس الأداء، بما في ذلك الدقة، والدقة، والاسترجاع، ودرجة F1، وكابا، لإجراء تحليل مقارن شامل ضد نماذج التعلم الآلي (ML) والتعلم العميق (DL) التقليدية، مثل CNN، وLSTM، وBiLSTM، وغابة عشوائية (RF). بالإضافة إلى ذلك، تم تقييم نماذج هجينة تجمع بين CNN مع SVM، وCNN مع RF، وDNN مع SVM. تم تفصيل المعلمات الفائقة المستخدمة في المحاكاة في الجدول 2، مما يوفر أساسًا للمقارنة مع النماذج التقليدية.

المناقشة

يوفر قسم المناقشة في الورقة نظرة شاملة على الأبحاث الحالية في تصنيف إشارات تخيل الحركة (MI) EEG، مع تسليط الضوء على منهجيات مختلفة ونقاط قوتها وقيودها. من الجدير بالذكر أن العديد من النماذج تستخدم تقنيات متقدمة مثل الشبكات العصبية، وتحويلات الموجات، والهندسة الريمانية لتعزيز دقة التصنيف. على سبيل المثال، يستخدم نموذج MSCARNet انتباه الالتفاف متعدد المقاييس وتضمين الهندسة الريمانية، محققًا أداءً محسّنًا في التصنيفات المعتمدة على الأفراد، على الرغم من أنه يواجه صعوبة مع التباين الفردي. تُظهر نماذج أخرى، مثل تلك التي تستخدم تحليل حزمة الموجات وتفكيك الوضع التجريبي الجماعي، أداءً متفوقًا على الطرق التقليدية ولكن تواجه تحديات تتعلق بالتعقيد الحسابي وزيادة الميزات.

تحدد مراجعة الأدبيات الفجوات البحثية الحرجة، خاصة فيما يتعلق بمرونة طرق استخراج الميزات تجاه التباين بين الأفراد وعدم كفاءة الخوارزميات الحالية. تركز العديد من الدراسات على التصنيف الثنائي، متجاهلة تعقيدات مهام BCI في العالم الحقيقي. تهدف الأعمال المقترحة إلى معالجة هذه الفجوات من خلال دمج مرحلة معالجة مسبقة هجينة، وتقنيات استخراج ميزات متقدمة، ونموذج تصنيف مُحسّن باستخدام خوارزمية تحسين البعيد والقريب (FNO). يسعى هذا النهج إلى تعزيز أداء التصنيف مع الحفاظ على عبء حسابي منخفض، مما يحسن من قدرة التكيف وقابلية التوسع لأنظمة تصنيف EEG المعتمدة على MI.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-00824-7
PMID: https://pubmed.ncbi.nlm.nih.gov/40628758
Publication Date: 2025-07-08
Author(s): Sathish Mathiyazhagan et al.
Primary Topic: EEG and Brain-Computer Interfaces

Overview

This research presents a novel model for classifying motor imagery (MI) EEG signals, addressing challenges such as signal noise, inter-subject variability, and real-time processing demands that hinder existing technologies. The proposed approach incorporates a hybrid preprocessing stage that utilizes empirical mode decomposition (EMD) for intrinsic signal mode extraction and continuous wavelet transform (CWT) for multi-resolution analysis. For enhanced spatial feature extraction, the model employs source power coherence (SPoC) in conjunction with common spatial patterns (CSP). The classification is performed using an adaptive deep belief network (ADBN), with parameters optimized through the Far and near optimization (FNO) algorithm.

The effectiveness of the proposed model is validated on benchmark datasets, achieving impressive results: 95.7% accuracy, 96.2% recall, 95.9% precision, and 97.5% specificity on the BCI competition IV Dataset 2a, and 94.1% accuracy, 94.0% recall, 93.6% precision, and 95.0% specificity on the PhysioNet dataset. These outcomes demonstrate superior performance compared to existing methods, including convolutional neural networks (CNN), long short-term memory (LSTM), and bidirectional LSTM (BiLSTM) algorithms, highlighting the model’s robustness and adaptability in EEG signal analysis.

Methods

In this section, the authors describe the methodologies employed in their research, highlighting the use of MSCARNet with multi-scale convolution and Riemannian embeddings, as well as ensemble transformers for multi-class EEG classification. The MSCARNet approach is noted for its computational intensity, particularly in subject-independent tasks, and its dependence on parameter tuning. Meanwhile, the ensemble transformers face high resource demands and have limited exploration of lightweight adaptations suitable for real-time applications.

The authors introduce a network optimized with the Fourier Neural Operator (FNO) algorithm, which employs a dual strategy of global exploration and local refinement. This approach facilitates efficient convergence and helps avoid local minima, outperforming traditional optimization methods such as Stochastic Gradient Descent (SGD) and Adam. The proposed model demonstrates enhanced classification accuracy and robustness against variability and noise, as illustrated in the accompanying process flow diagram (Figure 1).

Results

The results of the study validate the proposed model’s performance using the BCI Competition IV Dataset 2a, which consists of multi-class motor imagery (MI) tasks with pre-segmented EEG signals. The experiments were conducted on a high-performance workstation featuring an Intel Core i7 processor, 32 GB of RAM, and an NVIDIA GeForce RTX 3080 GPU, facilitating efficient deep learning computations. The model was developed in Python 3.8, utilizing key libraries such as TensorFlow 2.9 for deep learning, Scikit-learn for machine learning utilities, and Matplotlib and Seaborn for data visualization.

The proposed model employs a multi-layer architecture designed to effectively integrate spatial, temporal, and frequency-domain features from the EEG data. Performance metrics, including accuracy, precision, recall, F1-score, and kappa, were computed for a comprehensive comparative analysis against conventional machine learning (ML) and deep learning (DL) models, such as CNN, LSTM, BiLSTM, and Random Forest (RF). Additionally, hybrid models combining CNN with SVM, CNN with RF, and DNN with SVM were also evaluated. The hyperparameters used in the simulations are detailed in Table 2, providing a basis for comparison with the conventional models.

Discussion

The discussion section of the paper provides a comprehensive overview of existing research in motor imagery (MI) EEG signal classification, highlighting various methodologies and their respective strengths and limitations. Notably, several models utilize advanced techniques such as neural networks, wavelet transforms, and Riemannian geometry to enhance classification accuracy. For instance, the MSCARNet model employs multi-scale convolutional attention and Riemannian geometry embedding, achieving improved performance in subject-dependent classifications, although it struggles with individual variability. Other models, such as those using wavelet packet decomposition and ensemble empirical mode decomposition, demonstrate superior performance over traditional methods but face challenges related to computational complexity and feature redundancy.

The literature review identifies critical research gaps, particularly regarding the robustness of feature extraction methods to inter-subject variability and the computational inefficiency of existing algorithms. Many studies focus on binary classification, neglecting the complexities of real-world BCI tasks. The proposed work aims to address these gaps by integrating a hybrid preprocessing stage, advanced feature extraction techniques, and an optimized classification model utilizing the Far and Near Optimization (FNO) algorithm. This approach seeks to enhance classification performance while maintaining low computational overhead, thereby improving the adaptability and scalability of MI-based EEG classification systems.