تحليل ERP القائم على قيمة SHAP (SHERPA): زيادة حساسية إشارات EEG باستخدام طرق الذكاء الاصطناعي القابلة للتفسير SHAP value-based ERP analysis (SHERPA): Increasing the sensitivity of EEG signals with explainable AI methods

المجلة: Behavior Research Methods، المجلد: 56، العدد: 6
DOI: https://doi.org/10.3758/s13428-023-02335-7
PMID: https://pubmed.ncbi.nlm.nih.gov/38453828
تاريخ النشر: 2024-03-07
المؤلف: Sophia Sylvester وآخرون
الموضوع الرئيسي: تخطيط الدماغ وواجهات الدماغ-الكمبيوتر

نظرة عامة

يقدم القسم SHERPA، وهو نهج جديد يعتمد على البيانات لتحليل إمكانيات الأحداث المرتبطة (ERPs) في بيانات تخطيط الدماغ الكهربائي (EEG)، والذي يستفيد من الذكاء الاصطناعي القابل للتفسير (XAI). غالبًا ما يعتمد تحليل ERP التقليدي على أجهزة استشعار ونقاط زمنية محددة مسبقًا، مما يؤدي إلى تحيزات محتملة واكتشاف محدود للتأثيرات غير المتوقعة. يتناول SHERPA هذه القيود من خلال استخدام شبكة عصبية تلافيفية (CNN) لتصنيف الظروف التجريبية واستخدام SHapley Additive exPlanations (SHAP) لتحديد الميزات الزمنية والمكانية الهامة. أثبتت التحقق من صحة من خلال تجربة تقليدية لتصور الوجه باستخدام EEG قدرة SHERPA على تحديد مجموعة قذالية مرتبطة بتأثير N170، بينما قامت أيضًا بتquantifying أهمية ERPs من خلال “درجة الأهمية”.

في الختام، يمثل SHERPA جهدًا رائدًا لاستخدام XAI لتحديد الأنماط الحرجة في بيانات EEG، مما يوفر بديلاً بسيطًا للطرق التقليدية. يعزز هذا النهج الحساسية والخصوصية في تحليل ERP، مما يسمح للباحثين باستكشاف بيانات EEG بما يتجاوز اختبار الفرضيات الصفرية. من خلال الاستفادة الكاملة من التوزيع المكاني لأجهزة استشعار EEG، يقلل SHERPA من خطر تقدير التأثيرات بشكل مفرط ويوفر فهمًا أكثر دقة للعمليات العصبية من خلال قيم SHAP. في النهاية، يحمل هذا الأسلوب وعدًا لتقدم فهم نشاط الدماغ والمعالجة العصبية.

مقدمة

تناقش مقدمة الورقة فائدة تخطيط الدماغ الكهربائي (EEG) في استكشاف الأسس العصبية للعمليات المعرفية والعاطفية من خلال قياس غير جراحي للنشاط الكهربائي في الدماغ. تسلط الضوء على مزايا النهج المدفوع من قبل الباحثين، لا سيما ثروة الأدبيات حول إمكانيات الأحداث المرتبطة (ERPs) وصلاتها بالظواهر النفسية، مثل مكون P1 المرتبط بتخصيص الموارد الانتباهية. ومع ذلك، تتناول أيضًا القيود الكبيرة، بما في ذلك التحيزات المحتملة من التركيز الانتقائي للباحثين والتحديات في فرض الفرضيات حول مفاهيم أو نماذج نفسية جديدة. للتخفيف من هذه القضايا، يقترح المؤلفون نهجًا يعتمد على البيانات باستخدام الشبكات العصبية التلافيفية (CNNs) وSHapley Additive exPlanations (SHAP) لتحليل أكثر موضوعية لبيانات EEG.

تركز الورقة بشكل خاص على مكون N170، وهو علامة معروفة لتصور الوجه، لإظهار فعالية تحليل ERP المدفوع بـ SHAP (SHERPA). يهدف هذا الأسلوب إلى تحديد النقاط الزمنية والأقطاب ذات الصلة في بيانات ERP، مما يعزز فهم الآليات المعرفية والعصبية. يحدد المؤلفون مساهماتهم، بما في ذلك تطوير مصنف تعلم عميق لتمييز بين ظروف تصور الوجه المختلفة وتطبيق SHAP لتوضيح الميزات المكانية والزمنية الحرجة للتصنيف. تم هيكلة بقية الورقة لتوفير خلفية حول مفاهيم التعلم الآلي، ومناقشة الأعمال ذات الصلة، وتقديم الأساليب والنتائج وتفسيرات نتائجهم.

الطرق

يحدد قسم “الطرق” تصميم التجربة والتقنيات التحليلية المستخدمة في الدراسة. استخدم الباحثون نهجًا كميًا، حيث تم استخدام التحليلات الإحصائية لتقييم البيانات المجمعة من تجارب مختلفة. شملت المنهجيات المحددة تجارب محكومة، حيث تم التلاعب بالمتغيرات بشكل منهجي لملاحظة تأثيراتها على النتائج ذات الاهتمام.

شملت جمع البيانات أدوات موحدة لضمان الموثوقية والصلاحية، تلتها اختبارات إحصائية صارمة لتقييم دلالة النتائج. تم تطبيق تقنيات مثل تحليل الانحدار وANOVA لتفسير النتائج، مما يسمح بفهم شامل للعلاقات بين المتغيرات المدروسة. بشكل عام، تم تصميم الطرق لتوفير أدلة قوية تدعم فرضيات البحث.

النتائج

في قسم النتائج، يستخدم إطار عمل SHERPA قيم SHAP لتفكيك مخرجات النموذج ونسبتها إلى ميزات فردية، وبشكل خاص مكونات ERP. يتيح هذا الأسلوب تحديد مكونات ERP الحرجة التي تؤثر بشكل كبير على توقع المتغيرات الناتجة، مثل الأداء السلوكي. تقوم “درجة الأهمية” لـ SHERPA بتquantifying مساهمة كل ميزة في الفرق بين الاحتمالات المتوقعة والمتوقعة، مع تفسير الدرجات على مقياس احتمالي يتراوح من 0 إلى 1. تشير درجة الصفر إلى عدم وجود تأثير، بينما تشير القيم الأعلى إلى مساهمات أقوى في النتائج التنبؤية.

تسمح الطبيعة الخطية لمخرجات SHERPA بإجراء مقارنات مباشرة بين مكونات ERP، حيث يُعتبر المكون الذي يحمل درجة SHERPA ضعف درجة مكون آخر أكثر أهمية مرتين. يسهل ذلك تقييم ما إذا كانت مكونات معينة تشير إلى آليات عصبية وإلى أي مدى. يوصي المؤلفون بتقديم نتائج SHERPA في شكلها الأصلي من أجل الوضوح، مع توفير درجات SHAP محددة لظروف مختلفة لتوضيح التأثيرات المتفاوتة لمكونات ERP على القياسات المعرفية أو السلوكية. يُشجع أيضًا تقديم معلومات سياقية إضافية حول مكونات ERP، مثل الكمون والتوبوغرافيا، لتعزيز فهم علاقاتها بالنتائج المقاسة.

المناقشة

في هذه المناقشة، يبرز المؤلفون التقدم في تطبيقات التعلم الآلي لتحليل EEG، لا سيما في سياق إمكانيات الأحداث المرتبطة (ERPs) مثل P300 وN170. يستعرضون دراسات مختلفة استخدمت تقنيات تعلم آلي مختلفة، مثل آلات بولتزمان المقيدة والشبكات العصبية التلافيفية، لتعزيز اكتشاف P300 في واجهات الدماغ-الكمبيوتر. على الرغم من النتائج الواعدة في اكتشاف P300، يشير المؤلفون إلى فجوة كبيرة في البحث بشأن مكونات ERP الأخرى، لا سيما N170، التي يُعتقد أنها تعكس عمليات معرفية متميزة تتعلق بتصور الوجه.

يقدم المؤلفون نهجهم الجديد باستخدام طرق الذكاء الاصطناعي القابل للتفسير (XAI) لاكتشاف مكون N170 دون الاعتماد على المعرفة السابقة بخصائصه. يتحققون من صحة طريقتهم مقابل نموذج تقليدي لتصور الوجه، مما يوضح أن خطتهم يمكن أن تحدد بدقة الإحداثيات الزمنية والمكانية لـ N170. تكشف النتائج أنه بينما يرتبط N170 بمعالجة الوجه، قد يشير أيضًا إلى عملية اختيار سلبية، مما يقترح أن دور المكون في التعرف على الوجه أكثر تعقيدًا مما كان يُفهم سابقًا. بشكل عام، تؤكد الدراسة على إمكانيات طريقتهم المعتمدة على XAI، SHERPA، لتوفير رؤى أعمق حول الآليات العصبية الكامنة وراء العمليات المعرفية، مما يعزز مجال تحليل EEG.

القيود

تتعدد قيود خط الأنابيب المقترح الذي يستخدم شبكة عصبية تلافيفية (CNN) والذكاء الاصطناعي القابل للتفسير (XAI) لتحليل EEG. إحدى القضايا الرئيسية هي أن مقاييس أهمية الميزات، مثل SHAP، ليست مصممة بطبيعتها لبيانات السلاسل الزمنية، مما يؤدي إلى مشكلتين كبيرتين: عدم القدرة على تفسير جميع النقاط الزمنية لميزة واحدة أو جميع الميزات لنقطة زمنية معينة، مما يعيق السياق العام؛ والتحدي الذي تطرحه الميزات المرتبطة، التي تكافح SHAP لمعالجتها نظريًا. على الرغم من هذه القيود، فقد حقق SHAP gradient explainer نتائج معقولة في هذه الدراسة، متماشية مع النتائج من أبحاث أخرى تشير إلى فعاليته في سياقات السلاسل الزمنية.

عيب آخر هو التكلفة الحاسوبية العالية المرتبطة باستخدام تقنيات التعلم العميق وSHAP، مما يتطلب الوصول إلى أجهزة قوية، بما في ذلك وحدات معالجة الرسوميات (GPUs) وموارد وحدة المعالجة المركزية الكبيرة. بينما يمكن أن يقلل تجميع الميزات من المتطلبات الحاسوبية، فإن هذا النهج يخاطر بالتضحية بالدقة الزمنية العالية التي تعد ميزة رئيسية لتحليل EEG مقارنة بطرق أخرى مثل fMRI. على الرغم من أن مجموعات بيانات EEG في علم النفس التجريبي تميل إلى أن تكون أصغر، مما يعقد تعميم نتائج التعلم الآلي، فإن العدد الكبير من التجارب لكل مشارك يمكن أن يعزز دقة التصنيف. ومع ذلك، بالنسبة للعمليات المعرفية الأساسية مثل تصور الوجه، قد تكون الفروق الفردية ضئيلة، مما قد يخفف من المخاوف المتعلقة بتنوع المشاركين.

Journal: Behavior Research Methods, Volume: 56, Issue: 6
DOI: https://doi.org/10.3758/s13428-023-02335-7
PMID: https://pubmed.ncbi.nlm.nih.gov/38453828
Publication Date: 2024-03-07
Author(s): Sophia Sylvester et al.
Primary Topic: EEG and Brain-Computer Interfaces

Overview

The section presents SHERPA, a novel data-driven approach for analyzing event-related potentials (ERPs) in electroencephalography (EEG) data, which leverages explainable artificial intelligence (XAI). Traditional ERP analysis often relies on predetermined sensors and time points, leading to potential biases and limited detection of unexpected effects. SHERPA addresses these limitations by employing a convolutional neural network (CNN) to classify experimental conditions and using SHapley Additive exPlanations (SHAP) to identify significant temporal and spatial features. Validation through a classical EEG face perception experiment demonstrated SHERPA’s capability to pinpoint an occipital cluster associated with the N170 effect, while also quantifying the relevance of ERPs through an “importance score.”

In conclusion, SHERPA represents a pioneering effort to utilize XAI for identifying critical patterns in EEG data, offering a straightforward alternative to traditional methods. This approach enhances sensitivity and specificity in ERP analysis, allowing researchers to explore EEG data beyond null-hypothesis testing. By fully utilizing the spatial distribution of EEG sensors, SHERPA minimizes the risk of overestimating effects and provides a more nuanced understanding of neural processes through SHAP values. Ultimately, this method holds promise for advancing the understanding of brain activity and neural processing.

Introduction

The introduction of the paper discusses the utility of Electroencephalography (EEG) in exploring the neural underpinnings of cognitive and emotional processes through non-invasive measurement of brain electrical activity. It highlights the advantages of a researcher-driven approach, particularly the wealth of literature on Event-Related Potentials (ERPs) and their connections to psychological phenomena, such as the P1 component associated with attentional resource allocation. However, it also addresses significant limitations, including potential biases from researchers’ selective focus and challenges in hypothesizing about new psychological concepts or paradigms. To mitigate these issues, the authors propose a data-driven approach utilizing convolutional neural networks (CNNs) and SHapley Additive exPlanations (SHAP) for a more objective analysis of EEG data.

The paper specifically focuses on the N170 component, a well-established marker of face perception, to demonstrate the effectiveness of their SHAP-driven ERP analysis (SHERPA). This method aims to identify relevant time points and electrodes in ERP data, enhancing the understanding of cognitive and neural mechanisms. The authors outline their contributions, including the development of a deep learning classifier to distinguish between different face perception conditions and the application of SHAP to elucidate the spatial and temporal features critical for classification. The remainder of the paper is structured to provide background on machine learning concepts, discuss related work, and present methods, results, and interpretations of their findings.

Methods

The “Methods” section outlines the experimental design and analytical techniques employed in the study. The researchers utilized a quantitative approach, employing statistical analyses to evaluate the data collected from various experiments. Specific methodologies included controlled trials, where variables were systematically manipulated to observe their effects on the outcomes of interest.

Data collection involved standardized instruments to ensure reliability and validity, followed by rigorous statistical testing to assess the significance of the findings. Techniques such as regression analysis and ANOVA were applied to interpret the results, allowing for a comprehensive understanding of the relationships between the variables studied. Overall, the methods were designed to provide robust evidence supporting the research hypotheses.

Results

In the results section, the SHERPA framework utilizes SHAP values to decompose model outputs and attribute them to individual features, specifically ERP components. This method enables the identification of critical ERP components that significantly influence the prediction of outcome variables, such as behavioral performance. The SHERPA “importance score” quantifies each feature’s contribution to the difference between predicted and expected probabilities, with scores interpreted on a probability scale ranging from 0 to 1. A score of zero indicates no effect, while higher values signify stronger contributions to the predictive outcomes.

The linear nature of the SHERPA output allows for direct comparisons between ERP components, where a component with a SHERPA score twice as high as another is deemed twice as important. This facilitates the assessment of whether specific components index neural mechanisms and to what extent. The authors recommend presenting SHERPA results in their original form for clarity, providing specific SHAP scores for different conditions to illustrate the varying influence of ERP components on cognitive or behavioral measures. Additional contextual information about the ERP components, such as latency and topography, is encouraged to enhance understanding of their relationships with the measured outcomes.

Discussion

In this discussion, the authors highlight the advancements in machine learning applications for EEG analysis, particularly in the context of event-related potentials (ERPs) like the P300 and N170. They review various studies that have employed different machine learning techniques, such as restricted Boltzmann machines and convolutional neural networks, to enhance P300 detection in brain-computer interfaces. Despite the promising results in P300 detection, the authors note a significant gap in research concerning other ERP components, particularly the N170, which is believed to reflect distinct cognitive processes related to face perception.

The authors present their novel approach using Explainable AI (XAI) methods to detect the N170 component without relying on prior knowledge of its characteristics. They validate their method against a conventional face perception paradigm, demonstrating that their pipeline can accurately identify the N170’s temporal and spatial coordinates. The findings reveal that while the N170 is associated with face processing, it may also indicate a negative selection process, suggesting that the component’s role in face recognition is more complex than previously understood. Overall, the study underscores the potential of their XAI-based method, SHERPA, to provide deeper insights into the neural mechanisms underlying cognitive processes, thereby advancing the field of EEG analysis.

Limitations

The limitations of the proposed pipeline utilizing a convolutional neural network (CNN) and explainable artificial intelligence (XAI) for EEG analysis are multifaceted. A primary concern is that feature importance measures, such as SHAP, are not inherently designed for time series data, leading to two significant issues: the inability to simultaneously explain all time points for a single feature or all features for a specific time point, which obscures the overall context; and the challenge posed by correlated features, which SHAP struggles to address theoretically. Despite these limitations, the SHAP gradient explainer has yielded plausible results in this study, aligning with findings from other research indicating its effectiveness in time series contexts.

Another drawback is the high computational cost associated with employing deep learning techniques and SHAP, necessitating access to powerful hardware, including GPUs and substantial CPU resources. While aggregating features could reduce computational demands, this approach risks compromising the high temporal resolution that is a key advantage of EEG analysis compared to other methods like fMRI. Although EEG datasets in experimental psychology tend to be smaller, which complicates the generalizability of machine learning results, the high number of trials per participant can enhance classification accuracy. Nonetheless, for fundamental cognitive processes such as face perception, individual differences may be minimal, potentially mitigating concerns regarding participant variability.