توقع نوبات الصرع عبر دمج الشبكات العصبية المتكررة والمحولات متعددة الأبعاد Epileptic seizure prediction via multidimensional transformer and recurrent neural network fusion

المجلة: Journal of Translational Medicine، المجلد: 22، العدد: 1
DOI: https://doi.org/10.1186/s12967-024-05678-7
PMID: https://pubmed.ncbi.nlm.nih.gov/39367475
تاريخ النشر: 2024-10-04
المؤلف: Rong Zhu وآخرون
الموضوع الرئيسي: تخطيط الدماغ وواجهات الدماغ-الكمبيوتر

نظرة عامة

تتناول ورقة البحث التحدي الحاسم في توقع نوبات الصرع، وهي حالة تؤثر بشكل كبير على جودة حياة المرضى. يقترح المؤلفون نموذجًا هجينًا جديدًا للتعلم العميق يدمج بين المحولات متعددة الأبعاد والشبكات العصبية التكرارية (LSTM و GRU) لتصنيف إشارات تخطيط الدماغ الكهربائي (EEG). تتضمن المنهجية استخراج ميزات الوقت والتردد من إشارات EEG باستخدام تحويل فورييه قصير المدى، تليها تعلم هذه الميزات من خلال نموذج المحول. تقوم شبكات LSTM و GRU بمزيد من تحسين الخصائص الزمنية والترددية، مع استخدام آلية بوابة لدمج الميزات وتصنيفها.

تم تقييم النموذج على مجموعتين من البيانات: مجموعة بيانات EEG بون ومجموعة بيانات CHB-MIT، محققًا نتائج مثيرة للإعجاب بمتوسط حساسية 98.24% وخصوصية 97.27% على مجموعة بيانات CHB-MIT، ودقة تقارب 99% في مهام التصنيف الثنائي و98% في مهام التصنيف الثلاثي لمجموعة بيانات بون. تشير النتائج إلى أن النموذج المقترح يلتقط بشكل فعال الديناميات الزمنية والترددية المعقدة لإشارات EEG، متفوقًا على نماذج الشبكات العصبية التكرارية التقليدية أو نماذج الانتباه. يعترف المؤلفون بالحاجة إلى مزيد من البحث لتعزيز دقة وموثوقية التوقع، مقترحين أن تركز الأعمال المستقبلية على تحسين هيكل النموذج واستكشاف تقنيات استخراج الميزات المتقدمة لتحسين جودة حياة مرضى الصرع.

مقدمة

تسلط مقدمة ورقة البحث هذه الضوء على انتشار الصرع عالميًا، حيث يؤثر على حوالي 50 مليون فرد، وتبرز التحديات التي تطرحها النوبات غير المتوقعة، والتي يمكن أن تعطل بشكل كبير حياة المرضى اليومية. تم تطوير تقنيات مختلفة لاكتشاف النوبات وتوقعها، حيث تعتبر تخطيط الدماغ الكهربائي (EEG) هي الطريقة الأكثر دراسة. تم استخدام أساليب معالجة الإشارات التقليدية، مثل تحويل فورييه قصير المدى (STFT) وتحويل الموجات (WT)، لتحليل إشارات EEG، مما أدى إلى تطبيق تقنيات التعلم الآلي للتصنيف. ومع ذلك، تواجه طرق التعلم الآلي التقليدية قيودًا في تعلم التمثيل وكفاءة الحوسبة، خاصة عند التعامل مع بيانات السلاسل الزمنية.

لمعالجة هذه التحديات، تقترح الورقة نموذجًا جديدًا يعتمد على المحولات للشبكات العصبية التكرارية (RNN) يدمج بين المحول وذاكرة طويلة وقصيرة المدى (LSTM) ووحدات تكرارية بوابة (GRU) لتحسين توقع النوبات من إشارات EEG. تهدف هذه المقاربة الهجينة إلى الاستفادة من نقاط القوة في المحولات في التقاط الاعتماديات العالمية وقدرات استخراج الميزات المحلية لـ LSTM و GRU. يظهر النموذج المقترح أداءً متفوقًا في المقاييس الرئيسية مثل الدقة والحساسية والخصوصية عند تقييمه على مجموعات بيانات متعددة، بما في ذلك مجموعة بيانات EEG بون ومجموعة بيانات CHB-MIT. توضح الأقسام اللاحقة من الورقة إعداد التجارب والنتائج والمقارنات مع النماذج الحالية الرائدة، مما يؤكد في النهاية فعالية وموثوقية الطريقة المقترحة في سياق توقع الصرع.

الطرق

في هذه الدراسة، يتم توضيح النموذج المقترح لتوقع نوبات الصرع من خلال منهجية منهجية. في البداية، تم استخراج إشارات EEG من مراحل ما قبل النوبة وما بين النوبات وتم تصنيفها. تم تحليل الخصائص الزمنية والترددية لهذه الإشارات باستخدام تحويل فورييه قصير المدى (STFT)، مما أدى إلى توليد طيف الترددات. ثم تم إدخال هذه الميزات الطيفية إلى مشفر لتسهيل تعلم الخصائص الزمنية والترددية.

بعد ذلك، تم إدخال الميزات الزمنية والترددية المعالجة إلى شبكات ذاكرة طويلة وقصيرة المدى (LSTM) ووحدات تكرارية بوابة (GRU) لتوضيح الديناميات الزمنية لإشارات EEG بشكل أكبر. تم استخدام آلية بوابة لدمج الميزات ببساطة وتصنيف المخرجات من كل من LSTM و GRU. تم تقييم أداء النموذج باستخدام مجموعة بيانات بون، بينما كانت مجموعة بيانات CHB-MIT مثالًا تمثيليًا لتوضيح أكثر شمولاً للمنهجية. تهدف هذه المقاربة إلى تعزيز توقع نوبات الصرع من خلال تقنيات معالجة الإشارات المتقدمة وتقنيات التعلم الآلي.

المناقشة

في هذا القسم، يناقش المؤلفون مجموعات البيانات المستخدمة في بحثهم حول توقع الصرع، وبشكل خاص مجموعتي بيانات CHB-MIT و EEG بون، بالإضافة إلى تقنيات المعالجة والنمذجة المستخدمة. تتكون مجموعة بيانات CHB-MIT من 23 حالة من مرضى الصرع الأطفال، بإجمالي حوالي 844 ساعة من تسجيلات EEG مع 163 نوبة. قام المؤلفون بتوحيد بيانات EEG باستخدام 18 قطبًا وعالجوا قضايا عدم التوازن في البيانات بين مراحل ما بين النوبات وما قبل النوبة من خلال نوافذ انزلاقية متداخلة. من ناحية أخرى، تتكون مجموعة بيانات بون من مقاطع EEG أحادية القناة من كل من الأفراد الأصحاء ومرضى الصرع، مما يسمح بتقييم شامل لأداء النموذج عبر ظروف مختلفة.

استخدم المؤلفون هيكل نموذج جديد يدمج بين مشفرات المحولات مع شبكات LSTM و GRU لتعزيز تحليل إشارات EEG. يلتقط هذا الهيكل الميزات الزمنية والترددية بشكل فعال، مما يحسن من حساسية النموذج وخصوصيته في توقع النوبات. تشير النتائج إلى أن النموذج المقترح يتفوق على النماذج الحالية، محققًا متوسط حساسية 98.24% ومتوسط خصوصية 97.27% على مجموعة بيانات CHB-MIT، بينما يظهر أيضًا أداءً قويًا على مجموعة بيانات بون. يبرز المؤلفون أهمية تصميم نموذجهم في تحسين قدرات اكتشاف النوبات وتقليل معدلات الإنذار الكاذب، مما يسهم بشكل كبير في مجال أبحاث الصرع.

القيود

تسلط قسم القيود الضوء على عدة تحديات واجهت البحث الحالي حول اكتشاف الصرع باستخدام إشارات EEG. من الجدير بالذكر أن عدم القدرة على الحصول على إشارات EEG للنوبات في الوقت الحقيقي قيدت التحقق من أداء النموذج في التطبيقات العملية، خاصة في الأجهزة القابلة للارتداء. تثير هذه القيود مخاوف بشأن موثوقية النموذج في السيناريوهات الواقعية، على الرغم من نجاحه مع مجموعات البيانات غير المتصلة. بالإضافة إلى ذلك، فإن التعقيد الهيكلي للنموذج، الذي يتضمن مكونات مثل شبكات ذاكرة طويلة وقصيرة المدى (LSTM) ووحدات تكرارية بوابة (GRU)، يعقد عمليات تصحيح الأخطاء والتدريب. علاوة على ذلك، كانت تقنيات المعالجة المسبقة المستخدمة لإشارات EEG غير كافية لبعض الإشارات التي يصعب تحليلها.

لمعالجة هذه القيود، ستركز الأعمال المستقبلية على عدة مجالات رئيسية. يخطط الباحثون للتعاون مع المستشفيات والمؤسسات البحثية لجمع بيانات EEG حقيقية متنوعة، مما يعزز من قدرات تعميم النموذج. كما يهدفون إلى استكشاف تطبيقات معالجة الإشارات في الوقت الحقيقي والتحقق من النموذج على الأجهزة القابلة للارتداء. يعد تبسيط هيكل النموذج لتقليل الأعباء الحاسوبية وتحديات تصحيح الأخطاء أولوية أخرى، إلى جانب تحسين تقنيات المعالجة المسبقة لإشارات EEG من خلال طرق متقدمة لإزالة الضوضاء واستخراج الميزات. في النهاية، يعتزم الباحثون إجراء تجارب سريرية صغيرة النطاق لتقييم أداء النموذج في البيئات السريرية الحقيقية، مما يعزز من فائدته وموثوقيته للكشف المبكر عن الصرع وعلاجه.

Journal: Journal of Translational Medicine, Volume: 22, Issue: 1
DOI: https://doi.org/10.1186/s12967-024-05678-7
PMID: https://pubmed.ncbi.nlm.nih.gov/39367475
Publication Date: 2024-10-04
Author(s): Rong Zhu et al.
Primary Topic: EEG and Brain-Computer Interfaces

Overview

The research paper addresses the critical challenge of predicting epileptic seizures, a condition that significantly impacts patients’ quality of life. The authors propose a novel hybrid deep learning model that integrates a multidimensional Transformer with recurrent neural networks (LSTM and GRU) to classify electroencephalogram (EEG) signals. The methodology involves extracting time-frequency features from EEG signals using short-time Fourier transform, followed by learning these features through the Transformer model. The LSTM and GRU networks further refine the temporal and frequency characteristics, with a gating mechanism employed for feature splicing and classification.

The model was evaluated on two datasets: the Bonn EEG dataset and the CHB-MIT dataset, achieving impressive results with an average sensitivity of 98.24% and specificity of 97.27% on the CHB-MIT dataset, and approximately 99% accuracy on binary classification and 98% on tertiary classification tasks for the Bonn dataset. The findings indicate that the proposed model effectively captures the complex temporal and frequency dynamics of EEG signals, outperforming traditional single recurrent neural network or attention models. The authors acknowledge the need for further research to enhance prediction accuracy and reliability, suggesting future work will focus on refining model architecture and exploring advanced feature extraction techniques to improve the quality of life for epilepsy patients.

Introduction

The introduction of this research paper highlights the global prevalence of epilepsy, affecting approximately 50 million individuals, and underscores the challenges posed by unpredictable seizures, which can significantly disrupt patients’ daily lives. Various techniques for seizure detection and prediction have been developed, with electroencephalography (EEG) being the most extensively studied method. Traditional signal processing approaches, such as short-time Fourier transform (STFT) and wavelet transform (WT), have been employed to analyze EEG signals, leading to the application of machine learning techniques for classification. However, conventional machine learning methods face limitations in representation learning and computational efficiency, particularly when handling time-series data.

To address these challenges, the paper proposes a novel transformer-based recurrent neural network (RNN) model that integrates a transformer with long short-term memory (LSTM) and gated recurrent units (GRU) for improved seizure prediction from EEG signals. This hybrid approach aims to leverage the strengths of transformers in capturing global dependencies and the local feature extraction capabilities of LSTM and GRU. The proposed model demonstrates superior performance in key metrics such as accuracy, sensitivity, and specificity when evaluated on multiple datasets, including the Bonn EEG dataset and the CHB-MIT dataset. The subsequent sections of the paper detail the experimental setup, results, and comparisons with existing state-of-the-art models, ultimately affirming the effectiveness and reliability of the proposed method in the context of epilepsy prediction.

Methods

In this study, the proposed model for predicting epileptic seizures is outlined through a systematic methodology. Initially, EEG signals from the preictal and interictal stages were extracted and labeled. The temporal and frequency characteristics of these signals were analyzed using Short-Time Fourier Transform (STFT), leading to the generation of frequency spectra. These spectral features were then fed into an encoder to facilitate the learning of time-frequency characteristics.

Subsequently, the processed time and frequency features were input into Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) networks to further elucidate the temporal dynamics of the EEG signals. A gating mechanism was employed for simple feature fusion and classification of the outputs from both LSTM and GRU. The model’s performance was evaluated using the Bonn dataset, while the CHB-MIT dataset served as a representative example for a more comprehensive illustration of the methodology. This approach aims to enhance the prediction of epileptic seizures through advanced signal processing and machine learning techniques.

Discussion

In this section, the authors discuss the datasets utilized for their research on epilepsy prediction, specifically the CHB-MIT and Bonn EEG datasets, along with the preprocessing and modeling techniques employed. The CHB-MIT dataset comprises 23 cases from pediatric epilepsy patients, totaling approximately 844 hours of EEG recordings with 163 seizures. The authors standardized the EEG data by using 18 electrodes and addressed issues of data imbalance between interictal and preictal phases through overlapping sliding windows. The Bonn dataset, on the other hand, consists of single-channel EEG segments from both healthy individuals and epileptic patients, allowing for a comprehensive evaluation of the model’s performance across different conditions.

The authors employed a novel model architecture that integrates Transformer encoders with LSTM and GRU networks to enhance the analysis of EEG signals. This architecture captures both temporal and frequency features effectively, improving the model’s sensitivity and specificity in predicting seizures. The results indicate that the proposed model outperforms existing models, achieving a mean sensitivity of 98.24% and a mean specificity of 97.27% on the CHB-MIT dataset, while also demonstrating robust performance on the Bonn dataset. The authors highlight the importance of their model’s design in optimizing seizure detection capabilities and reducing false alarm rates, thereby contributing significantly to the field of epilepsy research.

Limitations

The section on limitations highlights several challenges faced in the current research on epilepsy detection using EEG signals. Notably, the inability to acquire real-time seizure EEG signals restricted the validation of the model’s performance in practical applications, particularly in wearable devices. This limitation raises concerns about the model’s reliability in real-world scenarios, despite its success with offline datasets. Additionally, the model’s structural complexity, which includes components such as Long Short-Term Memory (LSTM) networks and Gated Recurrent Units (GRU), complicates debugging and training processes. Furthermore, the preprocessing techniques employed for EEG signals were insufficient for certain difficult-to-analyze signals.

To address these limitations, future work will focus on several key areas. The researchers plan to collaborate with hospitals and research institutions to gather diverse real-time EEG data, thereby enhancing the model’s generalization capabilities. They also aim to explore real-time signal processing applications and validate the model on wearable devices. Simplifying the model structure to reduce computational overhead and debugging challenges is another priority, alongside improving preprocessing techniques for EEG signals through advanced denoising and feature extraction methods. Ultimately, the researchers intend to conduct small-scale clinical trials to assess the model’s performance in real clinical environments, thereby enhancing its utility and reliability for early epilepsy detection and treatment.