نموذج الكشف والتعرف عن EEG للصرع يعتمد على آلية الانتباه المزدوج EEG detection and recognition model for epilepsy based on dual attention mechanism

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-90315-6
PMID: https://pubmed.ncbi.nlm.nih.gov/40108237
تاريخ النشر: 2025-03-19
المؤلف: Zhentao Huang وآخرون
الموضوع الرئيسي: تخطيط الدماغ وواجهات الدماغ-الكمبيوتر

نظرة عامة

في مجال الأعصاب السريرية، يعد الكشف التلقائي عن نوبات الصرع من إشارات تخطيط الدماغ الكهربائي (EEG) أمرًا حيويًا لتسريع تشخيص الصرع وتعزيز رعاية المرضى. تقدم هذه الورقة نموذج دمج الميزات الزمانية والمكانية للكشف عن نوبات الصرع من EEG مع آلية انتباه مزدوج (STFFDA)، الذي يستفيد من تقنيات التعلم العميق لتفسير حالات الصرع مباشرة من بيانات EEG الخام. من خلال تجاوز معالجة البيانات الشاملة واستخراج الميزات، يحقق STFFDA معدلات دقة ملحوظة تبلغ 95.18% و77.65% في اختبارات التحقق الفردية لمجموعات بيانات CHB-MIT وجامعة بون، على التوالي، و92.42% و67.24% في اختبارات التحقق المتقاطع 10-fold.

يظهر نموذج STFFDA أداءً متفوقًا مقارنةً بالطرق الحالية، خاصةً في قدرته على دمج الميزات الزمانية والمكانية، مما يعزز دقة وموثوقية الكشف. بينما تركز الدراسة الحالية على نوع واحد من النوبات، فإنها تعترف بتعقيد تصنيفات النوبات في العالم الحقيقي، مما يشير إلى أن الأبحاث المستقبلية يجب أن تستكشف الكشف عن أنواع متعددة من النوبات وتحسين عملية استخراج الميزات أو المصنفات لتحسين التمييز بينها. بشكل عام، تشير النتائج إلى أن STFFDA يحمل وعدًا كبيرًا في تعزيز الكشف عن النوبات وتحسين نتائج المرضى في إدارة الصرع.

طرق

يدمج نموذج STFFDA للكشف عن EEG للصرع عدة مكونات متقدمة، بما في ذلك وحدة الشبكة العصبية التلافيفية (CNN)، وحدة الضغط والتحفيز أحادية البعد (1D SE)، وحدة الذاكرة الطويلة والقصيرة المزدوجة الاتجاه (Bi-LSTM)، وآلية الانتباه. تتحمل وحدة CNN مسؤولية استخراج الميزات المكانية من إشارات EEG، باستخدام طبقات تلافيفية مع 32 نواة وفلتر 1 × 3 لالتقاط الميزات المتسلسلة عبر قنوات الأقطاب الكهربائية المتعددة. تعزز وحدة 1D SE ميزات القناة مع الحفاظ على حجم المخرجات، مما يسهل التكامل مع النموذج. تلتقط وحدة Bi-LSTM الاعتماديات الزمنية، مما يسمح للنموذج بتعلم الارتباطات المعقدة المتعلقة بالوقت في نشاط الدماغ. يتم دمج المخرجات من هذه الوحدات ومعالجتها من خلال آلية الانتباه، التي تعين أوزانًا تفاضلية للميزات قبل تمريرها إلى طبقة متصلة بالكامل للإخراج النهائي.

تشمل مجموعات البيانات المستخدمة في هذه الدراسة تسجيلات من قاعدة بيانات CHB-MIT، التي تشمل بيانات EEG من 22 مريضًا أطفال مصابين بالصرع، وقاعدة بيانات EEG لجامعة بون، التي تتكون من خمسة مجموعات بيانات مصنفة (A إلى E) مع تسجيلات من فروة الرأس والداخل الجمجمي. تحتوي مجموعة بيانات CHB-MIT على 23 قناة مسجلة بمعدل أخذ عينات يبلغ 256 هرتز، بينما تتكون مجموعة بيانات بون من 11,500 عينة عبر 178 قناة تم أخذ عينات منها بمعدل 173.61 هرتز. للتحقق من أداء النموذج، تم إجراء اختبار تحقق فردي واختبار تحقق متقاطع 10-fold. تضمنت الأخيرة تقسيم مجموعة البيانات إلى 10 مجموعات فرعية، مما يسمح بتقييم شامل لفعالية النموذج. لمعالجة عدم التوازن المحتمل في الفئات بسبب المدة القصيرة للنوبات، تم استخدام خسارة الانتروبيا المتقاطعة الموزونة، مما يعين أهمية أكبر للفئة الأقل (نوبات الصرع) لتعزيز أداء الكشف.

مناقشة

في السنوات الأخيرة، ظهرت طرق متنوعة للتنبؤ التلقائي بنوبات الصرع، تعتمد بشكل أساسي على بيانات تخطيط الدماغ الكهربائي (EEG) نظرًا لدقتها الزمنية العالية. تم استخدام خوارزميات التعلم الآلي التقليدية، مثل نماذج المزيج الغاوسي (GMM)، وآلات الدعم الناقل (SVM)، والغابات العشوائية (RF)، لتصنيف النوبات. ومع ذلك، غالبًا ما تتطلب هذه الطرق اختيار الميزات يدويًا وقد تكون مستهلكة للوقت، مما يحد من تطبيقها العملي. لقد حسنت تقنيات التعلم العميق بشكل كبير من قدرات الكشف عن النوبات من خلال أتمتة عمليات استخراج الميزات والتصنيف. تشمل التقدمات الملحوظة استخدام الشبكات العصبية التلافيفية (CNNs) وشبكات الذاكرة الطويلة والقصيرة (LSTM)، التي أظهرت أداءً متفوقًا في التقاط الاعتماديات المكانية والزمنية في إشارات EEG.

تقدم هذه الورقة نموذج تعلم عميق جديد، يسمى دمج الميزات الزمانية والمكانية مع انتباه مزدوج (STFFDA)، يهدف إلى تعزيز دقة الكشف عن النوبات. يعالج نموذج STFFDA بيانات سلسلة زمنية EEG الخام بشكل فريد دون الحاجة إلى تحويلها مسبقًا إلى مجال التردد الزمني، مما يبسط عملية التنبؤ. يستخدم مزيجًا من هياكل Bi-LSTM وCNN لاستخراج الميزات المكانية والزمنية بفعالية، بينما يدمج آليات الضغط والتحفيز (SE) وآليات الانتباه ذات المنتج النقطي لإعطاء الأولوية للمعلومات ذات الصلة. يتم التحقق من فعالية النموذج عبر مجموعتين من البيانات، CHB-MIT وجامعة بون، مما يظهر أداءً قويًا في التعميم ودقة محسنة في تحديد أنماط النوبات مقارنةً بالطرق الحالية.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-90315-6
PMID: https://pubmed.ncbi.nlm.nih.gov/40108237
Publication Date: 2025-03-19
Author(s): Zhentao Huang et al.
Primary Topic: EEG and Brain-Computer Interfaces

Overview

In the realm of clinical neurology, the automated detection of epileptic seizures from electroencephalogram (EEG) signals is crucial for expediting epilepsy diagnosis and enhancing patient care. This paper introduces the Spatio-temporal Feature Fusion Epilepsy EEG Recognition model with Dual Attention Mechanism (STFFDA), which leverages deep learning techniques to interpret epileptic states directly from raw EEG data. By circumventing extensive data preprocessing and feature extraction, STFFDA achieves notable accuracy rates of 95.18% and 77.65% on single-validation tests for the CHB-MIT and Bonn University datasets, respectively, and 92.42% and 67.24% in 10-fold cross-validation tests.

The STFFDA model demonstrates superior performance compared to existing methods, particularly in its ability to fuse spatio-temporal features, thereby enhancing detection accuracy and reliability. While the current study focuses on a single type of seizure, it acknowledges the complexity of real-world seizure classifications, suggesting that future research should explore the detection of multiple seizure types and refine the feature extraction process or classifiers to improve differentiation among them. Overall, the findings indicate that STFFDA holds significant promise for advancing seizure detection and improving patient outcomes in epilepsy management.

Methods

The STFFDA model for epilepsy EEG recognition integrates several advanced components, including a Convolutional Neural Network (CNN) module, a one-dimensional Squeeze-and-Excitation (1D SE) module, a Bidirectional Long Short-Term Memory (Bi-LSTM) module, and an Attention mechanism. The CNN module is responsible for spatial feature extraction from EEG signals, utilizing convolutional layers with 32 kernels and 1 × 3 filters to capture sequential features across multiple electrode channels. The 1D SE module enhances channel features while maintaining the output size, facilitating integration with the model. The Bi-LSTM module captures temporal dependencies, allowing the model to learn complex time-related correlations in brain activity. The outputs from these modules are concatenated and processed through an attention mechanism, which assigns differential weights to the features before being passed to a fully connected layer for final output.

The datasets utilized in this study include recordings from the CHB-MIT database, which encompasses EEG data from 22 pediatric patients with epilepsy, and the University of Bonn EEG database, comprising five labeled datasets (A to E) with both scalp and intracranial recordings. The CHB-MIT dataset features 23 channels recorded at a sampling rate of 256 Hz, while the Bonn dataset consists of 11,500 samples across 178 channels sampled at 173.61 Hz. To validate the model’s performance, both a single validation test and a 10-fold cross-validation were conducted. The latter involved partitioning the dataset into 10 subsets, allowing for a comprehensive evaluation of the model’s efficacy. To address potential class imbalance due to the short duration of seizures, a Weighted Cross-Entropy Loss was employed, assigning greater importance to the minority class (epileptic seizures) to enhance detection performance.

Discussion

In recent years, various methods have emerged for the automatic prediction of epileptic seizures, predominantly utilizing electroencephalogram (EEG) data due to its high temporal resolution. Traditional machine learning algorithms, such as Gaussian Mixture Models (GMM), Support Vector Machines (SVM), and Random Forests (RF), have been employed for seizure classification. However, these methods often require manual feature selection and can be time-consuming, limiting their practical application. The introduction of deep learning techniques has significantly improved seizure detection capabilities by automating feature extraction and classification processes. Notable advancements include the use of Convolutional Neural Networks (CNNs) and Long Short-Term Memory (LSTM) networks, which have demonstrated superior performance in capturing both spatial and temporal dependencies in EEG signals.

This paper presents a novel deep learning model, termed Space-time Feature Fusion with Dual Attention (STFFDA), aimed at enhancing seizure detection accuracy. The STFFDA model uniquely processes raw EEG time series data without requiring prior transformation into the time-frequency domain, thereby streamlining the prediction process. It employs a combination of Bidirectional LSTM and CNN architectures to effectively extract spatial and temporal features, while integrating Squeeze-and-Excitation (SE) and dot-product attention mechanisms to prioritize relevant information. The model’s efficacy is validated across two datasets, the CHB-MIT and the University of Bonn, showcasing robust generalization performance and improved accuracy in identifying seizure patterns compared to existing methods.