تصنيف إشارة EEG باستخدام هياكل التعلم العميق بناءً على تخيل الحركة لإعادة تأهيل الأطراف العلوية Classification of EEG Signal Using Deep Learning Architectures Based Motor-Imagery for an Upper-Limb Rehabilitation Exoskeleton

المجلة: SN Computer Science، المجلد: 6، العدد: 3
DOI: https://doi.org/10.1007/s42979-025-03743-6
تاريخ النشر: 2025-02-18
المؤلف: Maryam Khoshkhooy Titkanlou وآخرون
الموضوع الرئيسي: تخطيط الدماغ وواجهات الدماغ-الكمبيوتر

نظرة عامة

تتناول ورقة البحث تطوير واجهة دماغ-كمبيوتر (BCI) تتيح للأفراد ذوي الإعاقات الجسدية التحكم في الأجهزة باستخدام أفكارهم فقط، وبشكل خاص من خلال تخيل الحركة (MI) استنادًا إلى تخطيط الدماغ الكهربائي (EEGs). تركز الدراسة على تحسين دقة تصنيف إشارات MI باستخدام خوارزميتين للتعلم العميق: CNN-LSTM و CNN-Transformer. لمعالجة التحديات في تصنيف إشارات EEG، نفذ المؤلفون ثلاث تقنيات لتكبير البيانات – حقن الضوضاء (NI)، ومشفّر تلقائي متغير شرطي (cVAE)، وGAN شرطي مع دالة سعر فاسرشتاين وعقوبة التدرج (cWGAN-GP). حقق نموذج CNN-LSTM أعلى دقة بلغت 79.06% على مجموعة بيانات تضم 29 مشاركًا صحيًا يقومون بحركات اليد.

في الختام، وُجد أن الطرق المقترحة، بما في ذلك CNN-LSTM و CNN-Transformer و EEG-ITNet، مناسبة لتصنيف حركات اليد من إشارات EEG. على الرغم من أن مجموعة البيانات تم تكبيرها لتعزيز حجمها وتحسين دقة التصنيف، إلا أن النتائج أشارت إلى أن تكبير البيانات لم يؤثر بشكل كبير على الأداء. يقترح المؤلفون أن البحث المستقبلي يمكن أن يستكشف نماذج تعلم عميق إضافية أو نهج هجينة لتعزيز نتائج التصنيف. مجموعة البيانات المستخدمة في هذه الدراسة متاحة للجهود البحثية المستقبلية.

مقدمة

تناقش مقدمة ورقة البحث تطوير وتطبيق واجهات الدماغ-الكمبيوتر (BCIs)، التي تمكن التواصل بين الدماغ البشري والأجهزة الخارجية من خلال تسجيل وفك تشفير إشارات الدماغ. تُستخدم BCIs بشكل أساسي في سياقين: أولاً، للتحقيق في نشاط الدماغ للتحكم في الأجهزة دون التركيز على إعادة التأهيل، وثانيًا، في أنظمة الحلقة المغلقة الموجهة نحو إعادة التأهيل العصبي، حيث يكون التغذية الراجعة أمرًا حاسمًا لتعزيز المرونة العصبية وتنظيم وظائف الدماغ.

تنقسم طرق تسجيل نشاط الدماغ إلى تقنيات جراحية وغير جراحية. تُفضل الطرق غير الجراحية، وخاصة تخطيط الدماغ الكهربائي (EEG)، نظرًا لفعاليتها من حيث التكلفة، وقابليتها للنقل، وقياسها المباشر للنشاط العصبي، على الرغم من أن تقنيات أخرى مثل التصوير بالرنين المغناطيسي الوظيفي (fMRI) تقدم دقة مكانية أفضل. تم استخدام EEG بفعالية في التحكم في الأجهزة المساعدة وأجهزة إعادة التأهيل. في المقابل، تتضمن واجهات الدماغ-الكمبيوتر الجراحية أقطابًا كهربائية مزروعة جراحيًا تلتقط النشاط من الخلايا العصبية في القشرة أو الهياكل العميقة في الدماغ. تسلط الورقة الضوء على ظاهرة التزامن المرتبط بالحدث (ERD) التي لوحظت أثناء تخيل الحركة (MI)، حيث يشير انخفاض سعة إشارة EEG في نطاقات ألفا (8-12 هرتز) وبيتا (13-30 هرتز) إلى تنشيط الدماغ أثناء المحاكاة الذهنية للحركة، حتى في غياب حركة العضلات الفعلية.

طرق

في قسم “المواد والطرق”، يحدد المؤلفون منهجيات التعلم العميق المستخدمة في بحثهم. يوضحون بنية النماذج المقترحة، بما في ذلك الطبقات المحددة ووظائف التنشيط المستخدمة، بالإضافة إلى تقنيات التحسين المطبقة أثناء التدريب. تم تصميم النماذج لمعالجة مشكلة البحث بفعالية من خلال الاستفادة من هياكل الشبكات العصبية المتقدمة.

بالإضافة إلى ذلك، يصف المؤلفون مجموعة البيانات المستخدمة للتدريب والتحقق، بما في ذلك حجمها ومصدرها وأي خطوات معالجة مسبقة تم اتخاذها لتعزيز جودة البيانات. كما يحددون مقاييس التقييم المستخدمة لتقييم أداء النموذج، مما يضمن فهمًا شاملاً لفعالية الطرق في تحقيق أهداف الدراسة.

نتائج

تسلط النتائج المقدمة في هذا القسم الضوء على مقاييس الأداء لمختلف النماذج، بما في ذلك CNN-LSTM و CNN-Transformer و EEG-ITNet و CNN و CNN-Transformer-LSTM، جنبًا إلى جنب مع تركيباتها مع طرق التكبير. حقق نموذج CNN-LSTM أعلى دقة تصنيف بلغت 79.06%، تلاه عن كثب نموذج NI CNN-Transformer بنسبة 78.64%، وكلاهما تنافسي مع نموذج EEG-ITNet. يتم توضيح اتجاهات الدقة والخسارة لنماذج CNN-LSTM و NI CNN-Transformer و NI EEG-ITNet في الأشكال 4 و 5 و 6، بينما توفر الشكل 7 مصفوفات الارتباك لنماذج CNN-LSTM و NI CNN-Transformer، مما يعمل كأداة لتقييم الأداء.

تشير النتائج إلى أن طريقة تكبير حقن الضوضاء حسنت بشكل طفيف دقة نموذج CNN-Transformer من 77.93% إلى 78.64% ونموذج EEG-ITNet من 75.45% إلى 75.86%. ومع ذلك، يبدو أن التأثير العام لتكبير البيانات على دقة التصنيف لهذه المجموعة من البيانات ضئيل. وفقًا للجدول 2، تفوق نموذج CNN-LSTM على النماذج المتطورة من حيث الدقة والدقة والاسترجاع ودرجة F1، مما يؤكد فعاليته في سياق تصنيف تخيل الحركة.

مناقشة

في قسم المناقشة من الورقة، يقدم دوك ثين فام ورومان موشيك نتائجهم حول فعالية هياكل التعلم العميق المختلفة في تصنيف حركات اليد استنادًا إلى إشارات EEG. تسلط الدراسة الضوء على أن نموذج CNN-LSTM حقق أعلى دقة ودقة، بقيم بلغت 79.06% و 79.13%، على التوالي. يدمج هذا النموذج بشكل فعال استخراج الميزات المكانية من خلال طبقات CNN مع التقاط الاعتماد الزمني عبر طبقات LSTM. بالمقارنة، حققت نماذج CNN-Transformer و EEG-ITNet أداءً أقل، بدقة بلغت 77.93% و 75.45%، على التوالي. ومن الجدير بالذكر أن تقنيات تكبير البيانات، وخاصة حقن الضوضاء، قدمت تحسينات طفيفة في دقة التصنيف ولكن لم تعزز النتائج بشكل كبير مقارنة بالنماذج المدربة بدون تكبير.

يؤكد المؤلفون على أهمية تعقيد النموذج وإمكانية الأداء العالي دون تكبير بيانات موسع، مشيرين إلى أن هياكل التعلم الآلي والتعلم العميق الأكثر تعقيدًا يمكن أن تلتقط خصائص مجموعة البيانات بشكل فعال. كما يلاحظون قيد حجم مجموعة بياناتهم، التي تتكون من 29 مشاركًا صحيًا، ويقترحون أبحاثًا مستقبلية لاستكشاف نماذج داخل الموضوع، حيث استخدمت مقاربتهم الحالية بشكل أساسي نماذج بين الموضوعات. بشكل عام، تشير النتائج إلى أنه بينما تعتبر الهياكل المقترحة مناسبة لتصنيف حركات اليد المعتمدة على EEG، فإن المزيد من الاستكشاف مع مجموعات بيانات أكبر ونماذج متنوعة مطلوب لتعزيز دقة التصنيف.

Journal: SN Computer Science, Volume: 6, Issue: 3
DOI: https://doi.org/10.1007/s42979-025-03743-6
Publication Date: 2025-02-18
Author(s): Maryam Khoshkhooy Titkanlou et al.
Primary Topic: EEG and Brain-Computer Interfaces

Overview

The research paper discusses the development of a brain-computer interface (BCI) that allows individuals with physical disabilities to control devices using only their thoughts, specifically through motor imagery (MI) based on electroencephalograms (EEGs). The study focuses on enhancing the classification accuracy of MI signals using two deep learning algorithms: CNN-LSTM and CNN-Transformer. To address challenges in classifying EEG signals, the authors implemented three data augmentation techniques—noise injection (NI), conditional variational autoencoder (cVAE), and conditional GAN with Wasserstein price function and gradient penalty (cWGAN-GP). The CNN-LSTM model achieved the highest accuracy of 79.06% on a dataset comprising 29 healthy subjects performing hand movements.

In conclusion, the proposed methods, including CNN-LSTM, CNN-Transformer, and EEG-ITNet, were found suitable for classifying hand movements from EEG signals. Although the dataset was augmented to enhance its size and improve classification accuracy, the results indicated that data augmentation did not significantly impact performance. The authors suggest that further research could explore additional deep learning models or hybrid approaches to enhance classification outcomes. The dataset used in this study is made available for future research endeavors.

Introduction

The introduction of the research paper discusses the development and application of brain-computer interfaces (BCIs), which enable communication between the human brain and external devices through the recording and decoding of brain signals. BCIs are primarily utilized in two contexts: first, to investigate brain activity for controlling devices without a focus on rehabilitation, and second, in closed-loop systems aimed at neurorehabilitation, where feedback is crucial for enhancing neural plasticity and regulating brain functions.

The methods for recording brain activity are divided into invasive and non-invasive techniques. Non-invasive methods, particularly electroencephalography (EEG), are favored for their cost-effectiveness, portability, and direct measurement of neural activity, despite other technologies like fMRI offering better spatial resolution. EEG has been effectively employed in controlling assistive and rehabilitation devices. In contrast, invasive BCIs involve surgically implanted electrodes that capture activity from neurons in the cortex or deep brain structures. The paper highlights the phenomenon of event-related desynchronization (ERD) observed during motor imagery (MI), where a decrease in EEG signal amplitude in the alpha (8-12 Hz) and beta (13-30 Hz) bands indicates brain activation during the mental simulation of movement, even in the absence of actual muscle movement.

Methods

In the “Materials and Methods” section, the authors outline the deep learning methodologies employed in their research. They detail the architecture of the proposed models, including the specific layers and activation functions utilized, as well as the optimization techniques applied during training. The models are designed to address the research problem effectively by leveraging advanced neural network structures.

Additionally, the authors describe the dataset used for training and validation, including its size, source, and any preprocessing steps undertaken to enhance data quality. They also specify the evaluation metrics employed to assess model performance, ensuring a comprehensive understanding of the methods’ effectiveness in achieving the study’s objectives.

Results

The results presented in this section highlight the performance metrics of various models, including CNN-LSTM, CNN-Transformer, EEG-ITNet, CNN, and CNN-Transformer-LSTM, along with their combinations with augmentation methods. The CNN-LSTM model achieved the highest classification accuracy at 79.06%, closely followed by the NI CNN-Transformer at 78.64%, both of which are competitive with the EEG-ITNet model. The accuracy and loss trends for the CNN-LSTM, NI CNN-Transformer, and NI EEG-ITNet models are illustrated in Figures 4, 5, and 6, while Figure 7 provides confusion matrices for the CNN-LSTM and NI CNN-Transformer models, serving as a performance evaluation tool.

The findings indicate that the noise injection augmentation method marginally improved the accuracy of the CNN-Transformer model from 77.93% to 78.64% and the EEG-ITNet model from 75.45% to 75.86%. However, the overall impact of data augmentation on classification accuracy for this dataset appears to be minimal. According to Table 2, the CNN-LSTM model outperformed the state-of-the-art models in terms of accuracy, precision, recall, and F1 score, confirming its effectiveness in the context of motor imagery classification.

Discussion

In the discussion section of the paper, Duc Thien Pham and Roman Mouček present their findings on the effectiveness of various deep learning architectures for classifying hand movements based on EEG signals. The study highlights that the CNN-LSTM model achieved the highest accuracy and precision, with values of 79.06% and 79.13%, respectively. This model effectively integrates spatial feature extraction through CNN layers with temporal dependency capture via LSTM layers. Comparatively, the CNN-Transformer and EEG-ITNet models yielded lower performance, with accuracies of 77.93% and 75.45%, respectively. Notably, data augmentation techniques, particularly noise injection, provided slight improvements in classification accuracy but did not significantly enhance results compared to models trained without augmentation.

The authors emphasize the importance of model complexity and the potential for high performance without extensive data augmentation, suggesting that more intricate machine learning and deep learning architectures can effectively capture the dataset’s characteristics. They also note the limitation of their dataset size, which consisted of 29 healthy participants, and propose future research to explore intra-subject models, as their current approach primarily utilized inter-subject models. Overall, the findings indicate that while the proposed architectures are suitable for EEG-based hand movement classification, further exploration with larger datasets and diverse models is warranted to enhance classification accuracy.