تعزيز تصنيف إشارات EEG في واجهات الدماغ والحاسوب باستخدام نماذج التعلم العميق الهجينة Enhanced EEG signal classification in brain computer interfaces using hybrid deep learning models

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-07427-2
PMID: https://pubmed.ncbi.nlm.nih.gov/40715225
تاريخ النشر: 2025-07-25
المؤلف: Abir Das وآخرون
الموضوع الرئيسي: تخطيط الدماغ وواجهات الدماغ-الكمبيوتر

نظرة عامة

تبحث هذه الدراسة في تحسين تصنيف تخيل الحركة (MI) في واجهات الدماغ-الكمبيوتر (BCIs) من خلال تقنيات التعلم الآلي المتقدمة والتعلم العميق. تؤكد الدراسة على أهمية تصنيف بيانات تخطيط الدماغ الكهربائي (EEG) بدقة، والتي تتم معالجتها من خلال ثلاث مراحل: معالجة البيانات المسبقة، استخراج الميزات، والتصنيف. من خلال تقييم خمسة مصنفات تقليدية—أقرب الجيران (KNN)، مصنف الدعم المتجه (SVC)، الانحدار اللوجستي (LR)، الغابة العشوائية (RF)، ونايف بايز (NB)—باستخدام مجموعة بيانات “PhysioNet EEG Motor Movement/Imagery Dataset”، حقق مصنف الغابة العشوائية أعلى دقة بنسبة 91%. بالمقابل، أسفرت أساليب التعلم العميق، وبشكل خاص الشبكات العصبية التلافيفية (CNN) وشبكات الذاكرة قصيرة وطويلة الأمد (LSTM)، عن دقة أقل بلغت 88.18% و16.13% على التوالي. ومن الجدير بالذكر أن نموذج هجين يجمع بين CNN وLSTM تفوق بشكل كبير على جميع الطرق الأخرى، محققًا دقة بلغت 96.06%.

تؤكد النتائج على مزايا الهياكل الهجينة في أنظمة BCI المعتمدة على EEG، وخاصة نموذج Hybrid CNN + LSTM، الذي لم يحقق فقط دقة متفوقة ولكن أيضًا أظهر تدريبًا فعالًا، حيث تطلب فقط 30-50 دورة تدريبية من 5 ثوانٍ لكل منها. هذه الكفاءة تجعل النموذج مناسبًا جدًا للتطبيقات في الوقت الحقيقي. تقترح الدراسة عدة مجالات للبحث المستقبلي، بما في ذلك استكشاف هياكل الشبكات البديلة مثل النماذج المعتمدة على المحولات، وتحسين خط أنابيب المعالجة المسبقة باستخدام طرق مثل تحليل المكونات المستقلة (ICA)، وتطوير تطبيقات عملية لأجهزة BCI المحمولة. تهدف هذه الاتجاهات إلى تعزيز دقة التصنيف وكفاءة الحوسبة، مما يسهل في النهاية اعتماد تقنيات BCI بشكل أوسع.

مقدمة

تستعرض مقدمة ورقة البحث أهمية تحليل كل من ميزات المجال الزمني وميزات المجال الترددي في سياق الدراسة. تشير ميزات المجال الزمني إلى خصائص الإشارات كما تتغير مع مرور الوقت، مما يمكن أن يوفر رؤى حول السلوك الزمني للبيانات. بالمقابل، تتضمن ميزات المجال الترددي تحليل الإشارات من حيث مكوناتها الترددية، مما يسمح بتحديد الدورات والخصائص الطيفية التي قد لا تكون واضحة في المجال الزمني.

تؤكد الورقة على أن الفهم الشامل للظواهر قيد التحقيق يتطلب دمج كلا النوعين من الميزات. من خلال الاستفادة من تحليل المجال الزمني جنبًا إلى جنب مع تقنيات المجال الترددي، تهدف الدراسة إلى تعزيز دقة وموثوقية النتائج، مما يسهم في النهاية في تفسير أكثر دقة للعمليات الأساسية. يُعتبر هذا النهج المزدوج عاملاً حاسمًا في تقدم المجال ومعالجة الفجوات الموجودة في الأدبيات.

الطرق

تستعرض قسم المنهجية في الدراسة النهج الشامل المتبع لتصنيف الحركات التخيلية، مع التركيز بشكل خاص على التمييز بين حركات اليد اليمنى واليسرى باستخدام إشارات EEG. شملت الدراسة عملية منهجية تضمنت إعداد مجموعة البيانات، استخراج الميزات، وتنفيذ خوارزميات تصنيف متنوعة. تم إجراء التجارب على جهاز كمبيوتر شخصي مزود بشريحة M1، باستخدام Python 3.7 على Google Colab، وركزت على تقييم أداء كل من مصنفات التعلم الآلي التقليدية (ML) والتعلم العميق (DL).

كشفت النتائج أنه من بين خوارزميات ML التقليدية، حققت الغابة العشوائية (RF) أعلى دقة تصنيف بلغت 91.04%، متجاوزة بشكل كبير طرقًا أخرى مثل نايف بايز (NB) بنسبة 84.23% والانحدار اللوجستي (LR) بنسبة 72.54%. بالمقابل، أدت خوارزميات أقرب الجيران (KNN) وآلة الدعم المتجه (SVM) أداءً ضعيفًا، بدقة بلغت 46.03% و54.93% على التوالي. كما استكشفت الدراسة خوارزميات DL، حيث حققت شبكة عصبية تلافيفية (CNN) دقة بلغت 88.18%، بينما كان أداء نموذج الذاكرة قصيرة وطويلة الأمد (LSTM) ضعيفًا عند 16.13%. لتعزيز أداء التصنيف، تم اقتراح نموذج هجين يجمع بين CNN وLSTM، مما أسفر عن دقة مثيرة للإعجاب بلغت 96.06%، متفوقًا على جميع النماذج الأخرى المختبرة. استغل هذا النهج الهجين بشكل فعال نقاط القوة في كلا الهيكلين، مما أظهر أداءً متفوقًا عبر مناطق الاهتمام المختلفة (ROIs) في بيانات EEG.

النتائج

في تقييم نماذج التصنيف التقليدية، حقق نموذج الانحدار اللوجستي دقة بلغت 0.860، لكنه واجه تحديات مع 18 إيجابية خاطئة و27 سلبية خاطئة، مما يشير إلى الحاجة إلى مزيد من الضبط لتحسين الدقة والاسترجاع. أظهر نموذج آلة الدعم المتجه (SVM) دقة معتدلة بلغت 0.842، مع معدلات تصنيف خاطئة أعلى (21 إيجابية خاطئة و29 سلبية خاطئة)، مما يشير إلى مشاكل محتملة في الإفراط في التكيف أو نقص التكيف. كان لدى نموذج أقرب الجيران (KNN) دقة أقل بلغت 0.818، مع 21 إيجابية خاطئة و38 سلبية خاطئة، مما يعكس صعوبات في تمييز الفئات. سجل نموذج الشبكة العصبية التلافيفية (CNN) دقة بلغت 0.810، مع مواجهة تصنيفات خاطئة كبيرة (28 إيجابية خاطئة و32 سلبية خاطئة). كان أداء نموذج الذاكرة قصيرة وطويلة الأمد (LSTM) هو الأسوأ، بدقة بلغت 0.706 ومعدلات تصنيف خاطئة عالية (39 إيجابية خاطئة و52 سلبية خاطئة).

بالمقابل، أظهر النموذج الهجين المقترح CNN-LSTM أداءً متفوقًا، حيث انخفضت خسائر التدريب والتحقق بشكل كبير على مدى 50 دورة تدريبية، مما يدل على تقليل فعال للأخطاء. انخفضت خسارة التدريب من حوالي 0.04 إلى أقل من 0.01، بينما انخفضت خسارة التحقق من حوالي 0.05 إلى حوالي 0.015، مما يظهر قدرة النموذج على التعميم بشكل جيد دون الإفراط في التكيف. أظهرت مقاييس الدقة تحسنًا قويًا، حيث ارتفعت دقة التدريب من حوالي 85% إلى أكثر من 96%، وزادت دقة التحقق من 84% إلى حوالي 96%. تسلط هذه النتائج الضوء على قوة النموذج الهجين ودقته في مهام تصنيف تخيل الحركة، متفوقًا بشكل كبير على المصنفات التقليدية وغيرها من أساليب التعلم العميق التي تم تقييمها في هذه الدراسة.

المناقشة

تؤكد قسم المناقشة في هذه الورقة البحثية على الحاجة الملحة لتحسين أنظمة واجهة الدماغ-الكمبيوتر (BCI) لتعزيز جودة الحياة للأفراد ذوي الإعاقات الحركية. يحدد المؤلفون فجوات كبيرة في الأساليب الحالية المعتمدة على EEG، لا سيما فيما يتعلق بحدود البيانات وعدم كفاية تقنيات التعلم الآلي الحالية في تصنيف إشارات EEG المعقدة بشكل فعال. يبرزون إمكانية استخدام الشبكات التنافسية التوليدية (GANs) لزيادة البيانات والحاجة إلى نماذج التعلم العميق الأكثر قابلية للتفسير لتعزيز الثقة بين الممارسين الصحيين.

تقدم الدراسة نموذجًا هجينًا جديدًا للتعلم العميق يجمع بين الشبكات العصبية التلافيفية (CNNs) وشبكات الذاكرة قصيرة وطويلة الأمد (LSTM)، محققًا دقة تصنيف بلغت 96.06%، والتي تتجاوز نماذج التعلم الآلي التقليدية التي حققت 91%. يلتقط هذا النهج الهجين بشكل فعال كل من الميزات المكانية والزمنية لبيانات EEG، بينما تعزز تقنيات المعالجة المسبقة واستخراج الميزات المتقدمة، بما في ذلك تحويل الموجات والهندسة ريمان، أداء التصنيف. تشير النتائج إلى أن النموذج المقترح يمكن أن يحسن بشكل كبير نتائج المرضى في إعادة التأهيل العصبي والتقنيات المساعدة، مما يمهد الطريق لتطبيقات BCI أكثر فعالية في البيئات السريرية.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-07427-2
PMID: https://pubmed.ncbi.nlm.nih.gov/40715225
Publication Date: 2025-07-25
Author(s): Abir Das et al.
Primary Topic: EEG and Brain-Computer Interfaces

Overview

This research investigates the enhancement of Motor Imagery (MI) classification in brain-computer interfaces (BCIs) through advanced machine learning and deep learning techniques. The study emphasizes the importance of accurately classifying electroencephalogram (EEG) data, which is processed through three stages: data pre-processing, feature extraction, and classification. Evaluating five traditional classifiers—K-Nearest Neighbors (KNN), Support Vector Classifier (SVC), Logistic Regression (LR), Random Forest (RF), and Naive Bayes (NB)—using the “PhysioNet EEG Motor Movement/Imagery Dataset,” the Random Forest classifier achieved the highest accuracy of 91%. In contrast, deep learning approaches, specifically Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) networks, yielded lower accuracies of 88.18% and 16.13%, respectively. Notably, a hybrid model combining CNN and LSTM significantly outperformed all other methods, achieving an accuracy of 96.06%.

The findings underscore the advantages of hybrid architectures in EEG-based BCI systems, particularly the Hybrid CNN + LSTM model, which not only achieved superior accuracy but also demonstrated efficient training, requiring only 30-50 epochs of 5 seconds each. This efficiency positions the model as highly suitable for real-time applications. The study suggests several avenues for future research, including exploring alternative network architectures like Transformer-based models, optimizing the preprocessing pipeline with methods such as Independent Component Analysis (ICA), and developing practical implementations for portable BCI devices. These directions aim to enhance classification accuracy and computational efficiency, ultimately facilitating broader adoption of BCI technologies.

Introduction

The introduction of the research paper outlines the significance of analyzing both time domain and frequency domain features in the context of the study. Time domain features refer to characteristics of signals as they vary over time, which can provide insights into the temporal behavior of the data. Conversely, frequency domain features involve the analysis of signals in terms of their frequency components, allowing for the identification of periodicities and spectral characteristics that may not be evident in the time domain.

The paper emphasizes that a comprehensive understanding of the phenomena under investigation requires the integration of both types of features. By leveraging time domain analysis alongside frequency domain techniques, the research aims to enhance the accuracy and robustness of the findings, ultimately contributing to a more nuanced interpretation of the underlying processes. This dual approach is posited as a critical factor in advancing the field and addressing existing gaps in the literature.

Methods

The methodology section of the study outlines the comprehensive approach taken to classify imaginary movements, specifically distinguishing between right and left hand movements using EEG signals. The research involved a systematic process that included dataset preparation, feature extraction, and the implementation of various classification algorithms. The experiments were conducted on a personal computer with an M1 chip, utilizing Python 3.7 on Google Colab, and focused on evaluating the performance of both traditional machine learning (ML) and deep learning (DL) classifiers.

The findings revealed that among the traditional ML algorithms, Random Forest (RF) achieved the highest classification accuracy of 91.04%, significantly surpassing other methods such as Naive Bayes (NB) at 84.23% and Logistic Regression (LR) at 72.54%. In contrast, K-Nearest Neighbors (KNN) and Support Vector Machine (SVM) performed poorly, with accuracies of 46.03% and 54.93%, respectively. The study also explored DL algorithms, where a Convolutional Neural Network (CNN) achieved an accuracy of 88.18%, while a Long Short-Term Memory (LSTM) model underperformed at 16.13%. To enhance classification performance, a hybrid model combining CNN and LSTM was proposed, resulting in an impressive accuracy of 96.06%, outperforming all other tested models. This hybrid approach effectively leveraged the strengths of both architectures, demonstrating superior performance across various regions of interest (ROIs) in the EEG data.

Results

In the evaluation of traditional classification models, the Logistic Regression model achieved an accuracy of 0.860, but it faced challenges with 18 false positives and 27 false negatives, indicating a need for further tuning to improve precision and recall. The Support Vector Machine (SVM) model demonstrated a moderate accuracy of 0.842, with higher misclassification rates (21 false positives and 29 false negatives), suggesting potential overfitting or underfitting issues. The K-Nearest Neighbors (KNN) model had a lower accuracy of 0.818, coupled with 21 false positives and 38 false negatives, reflecting difficulties in class distinction. The Convolutional Neural Network (CNN) model recorded an accuracy of 0.810, facing significant misclassifications (28 false positives and 32 false negatives). The Long Short-Term Memory (LSTM) model performed the worst, with an accuracy of 0.706 and high misclassification rates (39 false positives and 52 false negatives).

In contrast, the proposed hybrid CNN-LSTM model exhibited superior performance, with training and validation losses decreasing significantly over 50 epochs, indicating effective error minimization. The training loss dropped from approximately 0.04 to under 0.01, while validation loss decreased from around 0.05 to about 0.015, demonstrating the model’s ability to generalize well without overfitting. The accuracy metrics showed a strong improvement, with training accuracy rising from about 85% to over 96%, and validation accuracy increasing from 84% to approximately 96%. These findings highlight the hybrid model’s robustness and precision in motor imagery classification tasks, significantly outperforming traditional classifiers and other deep learning approaches assessed in this study.

Discussion

The discussion section of this research paper emphasizes the urgent need for improved brain-computer interface (BCI) systems to enhance the quality of life for individuals with motor disabilities. The authors identify significant gaps in current EEG-based BCI approaches, particularly regarding data limitations and the inadequacy of existing machine learning techniques in effectively classifying complex EEG signals. They highlight the potential of using Generative Adversarial Networks (GANs) for data augmentation and the necessity for more interpretable deep learning models to foster trust among healthcare practitioners.

The research introduces a novel hybrid deep learning model that combines Convolutional Neural Networks (CNNs) and Long Short-Term Memory (LSTM) networks, achieving a classification accuracy of 96.06%, which surpasses traditional machine learning models that achieved 91%. This hybrid approach effectively captures both spatial and temporal features of EEG data, while advanced preprocessing and feature extraction techniques, including Wavelet Transform and Riemannian Geometry, enhance classification performance. The findings suggest that the proposed model can significantly improve patient outcomes in neurorehabilitation and assistive technologies, paving the way for more effective BCI applications in clinical settings.