LSTM المتبقي والثنائي الاتجاه لاكتشاف نوبات الصرع Residual and bidirectional LSTM for epileptic seizure detection

المجلة: Frontiers in Computational Neuroscience، المجلد: 18
DOI: https://doi.org/10.3389/fncom.2024.1415967
PMID: https://pubmed.ncbi.nlm.nih.gov/38952709
تاريخ النشر: 2024-06-17
المؤلف: Wei Zhao وآخرون
الموضوع الرئيسي: تخطيط الدماغ وواجهات الدماغ-الكمبيوتر

نظرة عامة

تقدم هذه الدراسة نموذجًا هجينًا جديدًا للتعلم العميق، ResBiLSTM، مصممًا لاكتشاف نوبات الصرع من إشارات تخطيط الدماغ الكهربائي (EEG)، مع معالجة التحديات التي تطرحها التفسيرات اليدوية لهذه الإشارات. يدمج النموذج شبكة عصبية متبقية أحادية البعد (ResNet) لاستخراج الميزات المكانية المحلية من بيانات EEG، والتي تتم معالجتها بعد ذلك بواسطة طبقة الذاكرة طويلة وقصيرة المدى ثنائية الاتجاه (BiLSTM) لالتقاط الاعتماديات الزمنية. يتم تحسين الناتج النهائي من خلال طبقتين متصلتين بالكامل لاكتشاف النوبات بدقة. تظهر التقييمات على مجموعات البيانات من جامعة بون ومستشفى جامعة تمبل (TUH) أن ResBiLSTM يحقق معدلات دقة استثنائية تتراوح بين 98.88-100% في التصنيفات الثنائية والثلاثية على مجموعة بيانات بون، ودقة تصنيف تبلغ 95.03% مع درجة F1 مرجحة تبلغ 95.03% على مجموعة بيانات TUH.

تسلط النتائج الضوء على فعالية نموذج ResBiLSTM في إدارة تعقيدات بيانات EEG، مما يظهر تفوقه على العديد من طرق التعلم العميق المعاصرة. تسهم هذه الدراسة ليس فقط في مجال اكتشاف الصرع ولكن أيضًا تقترح تطبيقات محتملة للمنهجية المقترحة في تشخيص الاضطرابات العصبية ذات الصلة، مما يعزز من التحليل السريري وعمليات اتخاذ القرار.

مقدمة

تتناول مقدمة هذه الورقة البحثية القضية الملحة للصرع، وهو اضطراب عصبي يؤثر على أكثر من 70 مليون فرد حول العالم. يؤكد المؤلفون على الدور الحاسم لتخطيط الدماغ الكهربائي (EEG) في تشخيص الصرع، حيث يوفر رؤى أساسية حول الأنشطة الكهربية للدماغ. ومع ذلك، فإن الطرق التقليدية لتحليل بيانات EEG تتطلب جهدًا كبيرًا وتكون ذات طابع شخصي، مما يستلزم تطوير أنظمة كشف آلية لتعزيز كفاءة التشخيص ورعاية المرضى. تبرز الورقة تطور منهجيات كشف النوبات، من تقنيات معالجة الإشارات التقليدية وطرق التعلم الآلي إلى أساليب التعلم العميق المتقدمة.

يناقش المؤلفون مختلف هياكل التعلم العميق، بما في ذلك الشبكات العصبية التلافيفية (CNNs) والشبكات العصبية المتكررة (RNNs)، التي أظهرت وعدًا في استخراج الميزات تلقائيًا من إشارات EEG لاكتشاف النوبات. يتم الاستشهاد بدراسات بارزة، توضح فعالية هذه النماذج في تحقيق معدلات دقة عالية في تصنيف أنواع النوبات. علاوة على ذلك، تقدم الورقة نماذج هجينة تجمع بين CNNs وRNNs للاستفادة من نقاط قوتها في التقاط الديناميات المكانية والزمنية لبيانات EEG. يهدف نموذج الشبكة الهجينة الجديد المقترح، ResBiLSTM، إلى تحسين دقة التعرف عبر تحديات التصنيف المتعددة، متفوقًا على الطرق الحالية. ستفصل الأقسام التالية من الورقة مجموعات البيانات المستخدمة، والمنهجية المقترحة، والنتائج التجريبية، وتحليل مقارن للأداء مع خوارزميات التعلم العميق الأخرى.

طرق

تحدد قسم “المواد والطرق” تصميم التجربة والإجراءات المستخدمة في الدراسة. يوضح المواد المحددة المستخدمة، بما في ذلك أي مواد كيميائية، ومعدات، وعينات بيولوجية، مما يضمن إمكانية تكرار التجارب. تشمل المنهجية التقنيات المطبقة لجمع البيانات وتحليلها، مثل الطرق الإحصائية، والبروتوكولات التجريبية، وأي أدوات حسابية تم استخدامها.

بالإضافة إلى ذلك، قد يصف القسم حجم العينة ومعايير الاختيار، بالإضافة إلى أي ضوابط تم تنفيذها للتحقق من النتائج. إن الصرامة في النهج المنهجي أمر حاسم لتأسيس موثوقية النتائج ولتسهيل المقارنات مع الأبحاث المستقبلية في هذا المجال.

نتائج

في قسم النتائج، يظهر نموذج ResBiLSTM المقترح أداءً متفوقًا في مهام التصنيف المختلفة المتعلقة باكتشاف الصرع، حيث يتفوق بشكل خاص في التصنيفات ذات الفئتين والثلاث فئات (A-E، C-E، D-E، AB-E، CD-E، ABCD-E). بينما يحقق دقة عالية تبلغ 99.88% في مهمة تصنيف B-E، فإنه يتخلف بنسبة 10.58% مقارنة بالطريقة التي قدمها تونجر وبولات (2022b) في مهمة تصنيف C-DE، التي تركز على تحديد المنطقة المسببة للنوبات. ومع ذلك، يتفوق نموذج ResBiLSTM على تونجر وبولات بنسبة 11.23% في مهمة التصنيف الثلاثي AB-CD-E، مما يبرز قوته عبر مجموعة أوسع من تحديات التصنيف، بما في ذلك المهام المعقدة ذات الخمس فئات.

يمكن أن تُعزى الفجوات في الدقة بين الدراسات إلى عاملين رئيسيين: تقنيات استخراج الميزات المستخدمة من قبل تونجر وبولات، التي تم تصميمها للتصنيف الثنائي، والتعقيد الفطري لمهام التصنيف. بينما يتم تحسين نموذج تونجر وبولات للمهام الثنائية، تم تصميم نموذج ResBiLSTM للتعامل مع سيناريوهات متعددة الفئات الأكثر تعقيدًا. في المهام ذات الثلاث فئات، يحقق النموذج المقترح أفضل أداء، وفي التصنيفات ذات الخمس فئات، يتبع عن كثب أعلى دقة تم الإبلاغ عنها من قبل شانموغام ودارمار (2023). بشكل عام، يظهر نموذج ResBiLSTM فعالية أكبر وقدرة على التكيف في التعرف على الصرع وتصنيفه مقارنة بالطرق الحالية، كما هو مفصل في التحليل المقارن المقدم في الجدول 6.

مناقشة

في هذا القسم، يناقش المؤلفون مجموعات البيانات والمنهجيات المستخدمة لاكتشاف نوبات الصرع، مع التركيز على مجموعات بيانات بون وTUSZ. تتكون مجموعة بيانات بون من خمس مجموعات من مقاطع EEG، تحتوي كل منها على 100 تسجيل أحادي القناة من متطوعين أصحاء ومرضى، بينما تعتبر مجموعة بيانات TUSZ واحدة من أكبر المجموعات المفتوحة المصدر، حيث تحتوي على 3,050 حالة نوبة عبر ثمانية أنواع. نفذ المؤلفون نظامين لزيادة البيانات لتعزيز بيانات التدريب، مما أدى إلى إجمالي 2,240 حالة لكل فئة لمجموعة بيانات بون، مما حسن من فعالية تدريب النموذج.

يدمج النموذج المقترح شبكة عصبية متبقية (ResNet) مع شبكة ذاكرة طويلة وقصيرة المدى ثنائية الاتجاه (BiLSTM) لالتقاط كل من الاعتماديات المكانية والزمنية في بيانات EEG. تستخدم ResNet كتلًا متبقية للتخفيف من مشاكل التدرج، بينما تعزز BiLSTM قدرة النموذج على التعلم من البيانات المتسلسلة. أجرى المؤلفون تجارب واسعة لتحسين المعلمات الفائقة وتكوينات النموذج، وفي النهاية اختاروا نموذجًا يحتوي على ثلاث كتل متبقية وطبقة واحدة من BiLSTM كأفضل خيار، محققين مقاييس أداء عالية (دقة 96.75%، دقة 96.85%، استرجاع 96.73%، ودرجة F1 96.76%). أظهرت المقارنات مع النماذج الأساسية تفوق نموذج ResBiLSTM، خاصة في مهام التصنيف المعقدة، مما يبرز فعاليته في التمييز بين أنواع النوبات المختلفة والظروف.

القيود

تسلط القيود المفروضة على الدراسات السابقة حول اكتشاف نوبات الصرع الضوء على التحديات التي تواجهها الخوارزميات التقليدية، التي تعتمد على مستخلصات الميزات المصممة يدويًا وتفتقر إلى قدرات التعلم من البداية إلى النهاية. غالبًا ما تكافح هذه الطرق مع القابلية للتعميم عبر مجموعات بيانات مختلفة بسبب اعتمادها على خبرة المصمم. بينما حسنت الأساليب الحديثة في التعلم العميق (DL)، بما في ذلك الشبكات العصبية التلافيفية (CNNs) والشبكات العصبية المتكررة (RNNs)، الأداء، إلا أنها تقدم أيضًا قيودها الخاصة. تتفوق CNNs في استخراج الميزات المكانية لكنها أقل فعالية مع بيانات السلاسل الزمنية، بينما يمكن لـ RNNs، وخاصة الشبكات طويلة وقصيرة المدى (LSTM) وBiLSTM، التقاط الديناميات الزمنية ولكنها تعاني من تعقيد حسابي مرتفع ومشكلات مثل تلاشي التدرج.

يظهر نموذج ResBiLSTM المقدم في هذه الدراسة نتائج واعدة على مجموعات بيانات بون وTUSZ؛ ومع ذلك، فإنه يواجه تحديات تتعلق بالتغير الفطري وعدم الدقة لإشارات EEG في البيئات الواقعية. تشمل القيود الرئيسية معالجة النموذج الحتمية للبيانات المدخلة، مما قد يقلل من القوة في وجود الضوضاء والشواذ، والحاجة إلى مجموعة بيانات تدريب أكثر شمولاً لتعزيز القابلية للتعميم. ستركز الأبحاث المستقبلية على دمج المنطق الضبابي لإدارة عدم اليقين في البيانات بشكل أفضل وتوسيع مجموعة بيانات التدريب من خلال دمج مجموعات بيانات EEG العامة الإضافية وتقنيات توليد البيانات الاصطناعية، مثل الشبكات التنافسية التوليدية (GANs)، لتحسين أداء النموذج في سيناريوهات سريرية متنوعة.

Journal: Frontiers in Computational Neuroscience, Volume: 18
DOI: https://doi.org/10.3389/fncom.2024.1415967
PMID: https://pubmed.ncbi.nlm.nih.gov/38952709
Publication Date: 2024-06-17
Author(s): Wei Zhao et al.
Primary Topic: EEG and Brain-Computer Interfaces

Overview

The research presents a novel hybrid deep learning model, ResBiLSTM, designed for the detection of epileptic seizures from electroencephalogram (EEG) signals, addressing the challenges posed by the manual interpretation of these signals. The model integrates a one-dimensional residual neural network (ResNet) to extract local spatial features from the EEG data, which are then processed by a bidirectional long short-term memory (BiLSTM) layer to capture temporal dependencies. The final output is refined through two fully connected layers for accurate seizure detection. Evaluations on datasets from the University of Bonn and Temple University Hospital (TUH) demonstrate that ResBiLSTM achieves exceptional accuracy rates of 98.88-100% in binary and ternary classifications on the Bonn dataset, and a classification accuracy of 95.03% with a weighted F1 score of 95.03% on the TUH dataset.

The findings highlight the effectiveness of the ResBiLSTM model in managing the complexities of EEG data, showcasing its superiority over several contemporary deep learning methods. This research not only contributes to the field of epilepsy detection but also suggests potential applications of the proposed methodology in diagnosing related neurological disorders, thereby enhancing clinical analysis and decision-making processes.

Introduction

The introduction of this research paper addresses the pressing issue of epilepsy, a neurological disorder affecting over 70 million individuals worldwide. The authors emphasize the critical role of electroencephalogram (EEG) in diagnosing epilepsy, as it provides essential insights into the brain’s electrophysiological activities. However, traditional methods of analyzing EEG data are labor-intensive and subjective, necessitating the development of automated detection systems to enhance diagnostic efficiency and patient care. The paper highlights the evolution of seizure detection methodologies, transitioning from conventional signal processing and machine learning techniques to advanced deep learning (DL) approaches.

The authors discuss various DL architectures, including convolutional neural networks (CNNs) and recurrent neural networks (RNNs), which have shown promise in automatically extracting features from EEG signals for seizure detection. Notable studies are cited, demonstrating the effectiveness of these models in achieving high accuracy rates in classifying seizure types. Furthermore, the paper introduces hybrid models that combine CNNs and RNNs to leverage their strengths in capturing both spatial and temporal dynamics of EEG data. The proposed novel hybrid network model, ResBiLSTM, aims to improve recognition accuracy across multiple classification challenges, outperforming existing methods. The subsequent sections of the paper will detail the datasets used, the proposed methodology, experimental results, and a comparative analysis of performance with other DL algorithms.

Methods

The “Materials and Methods” section outlines the experimental design and procedures employed in the study. It details the specific materials used, including any reagents, equipment, and biological samples, ensuring reproducibility of the experiments. The methodology encompasses the techniques applied for data collection and analysis, such as statistical methods, experimental protocols, and any computational tools utilized.

Additionally, the section may describe the sample size and selection criteria, as well as any controls implemented to validate the results. The rigor in the methodological approach is crucial for establishing the reliability of the findings and for facilitating comparisons with future research in the field.

Results

In the results section, the proposed ResBiLSTM model demonstrates superior performance in various classification tasks related to epilepsy detection, particularly excelling in two-category and three-category classifications (A-E, C-E, D-E, AB-E, CD-E, ABCD-E). While it achieves a high accuracy of 99.88% in the B-E classification task, it falls short by 10.58% compared to the method by Tuncer and Bolat (2022b) in the C-DE classification task, which focuses on identifying the epileptogenic zone. However, the ResBiLSTM model outperforms Tuncer and Bolat by 11.23% in the AB-CD-E three-classification task, highlighting its robustness across a broader range of classification challenges, including complex five-category tasks.

The discrepancies in accuracy between the studies can be attributed to two main factors: the feature extraction techniques employed by Tuncer and Bolat, which are tailored for binary classification, and the inherent complexity of the classification tasks. While Tuncer and Bolat’s model is optimized for binary tasks, the ResBiLSTM model is designed to tackle more intricate multi-class scenarios. In three-category tasks, the proposed model achieves the best performance, and in five-category classifications, it closely follows the highest accuracy reported by Shanmugam and Dharmar (2023). Overall, the ResBiLSTM model exhibits greater efficacy and adaptability in epilepsy recognition and classification compared to existing methods, as detailed in the comparative analysis presented in Table 6.

Discussion

In this section, the authors discuss the datasets and methodologies used for epilepsy seizure detection, focusing on the Bonn and TUSZ datasets. The Bonn dataset comprises five sets of EEG segments, each containing 100 single-channel recordings from healthy volunteers and patients, while the TUSZ dataset is one of the largest open-source collections, featuring 3,050 seizure cases across eight types. The authors implemented two data augmentation schemes to enhance training data, resulting in a total of 2,240 instances per category for the Bonn dataset, which improved model training efficacy.

The proposed model integrates a residual neural network (ResNet) with a bidirectional long short-term memory (BiLSTM) network to capture both spatial and temporal dependencies in EEG data. The ResNet employs residual blocks to mitigate gradient issues, while the BiLSTM enhances the model’s ability to learn from sequential data. The authors conducted extensive experiments to optimize hyperparameters and model configurations, ultimately selecting a model with three residual blocks and one BiLSTM layer as optimal, achieving high performance metrics (accuracy of 96.75%, precision of 96.85%, recall of 96.73%, and F1 score of 96.76%). Comparisons with baseline models demonstrated the superiority of the ResBiLSTM model, particularly in complex classification tasks, highlighting its effectiveness in distinguishing between various seizure types and conditions.

Limitations

The limitations of previous studies on epileptic seizure detection highlight the challenges faced by traditional algorithms, which rely on hand-crafted feature extractors and lack end-to-end learning capabilities. These methods often struggle with generalizability across different datasets due to their dependence on the designer’s expertise. While recent deep learning (DL) approaches, including convolutional neural networks (CNNs) and recurrent neural networks (RNNs), have improved performance, they also present their own limitations. CNNs excel at spatial feature extraction but are less effective with time-series data, while RNNs, particularly long short-term memory (LSTM) and bidirectional LSTM (BiLSTM) networks, can capture temporal dynamics but suffer from high computational complexity and issues like gradient vanishing.

The ResBiLSTM model introduced in this study shows promising results on the Bonn and TUSZ datasets; however, it faces challenges related to the inherent variability and imprecision of EEG signals in real-world settings. Key limitations include the model’s deterministic handling of input data, which may reduce robustness in the presence of noise and outliers, and the need for a more extensive training dataset to enhance generalizability. Future research will focus on integrating fuzzy logic to better manage data uncertainty and expanding the training dataset by incorporating additional public EEG datasets and synthetic data generation techniques, such as generative adversarial networks (GANs), to improve the model’s performance in diverse clinical scenarios.