استقرار واجهات الدماغ-الكمبيوتر من خلال محاذاة الديناميات الكامنة Stabilizing brain-computer interfaces through alignment of latent dynamics

المجلة: Nature Communications، المجلد: 16، العدد: 1
DOI: https://doi.org/10.1038/s41467-025-59652-y
PMID: https://pubmed.ncbi.nlm.nih.gov/40389429
تاريخ النشر: 2025-05-19
المؤلف: Brianna M. Karpowicz وآخرون
الموضوع الرئيسي: تخطيط الدماغ وواجهات الدماغ-الكمبيوتر

نظرة عامة

تهدف واجهات الدماغ-الكمبيوتر داخل القشرة (iBCIs) إلى استعادة الوظيفة الحركية لدى الأفراد المصابين بالشلل من خلال تحويل نشاط الدماغ إلى إشارات تحكم للأجهزة الخارجية. أحد التحديات الكبيرة في أنظمة iBCI الحالية هو عدم استقرار الواجهات العصبية، مما يؤدي إلى تراجع في أداء فك التشفير ويتطلب إعادة معايرة متكررة باستخدام بيانات موسومة جديدة. لمعالجة هذه المشكلة، استكشف الباحثون بنية الفضاء الكامن لنشاط السكان العصبيين لإنشاء خريطة مستقرة بين نشاط الدماغ والسلوك. بينما أظهرت الأساليب غير الخاضعة للإشراف الحديثة وعدًا في تعزيز استقرار iBCI، فإنها غالبًا ما تعالج كل خطوة زمنية كعينة مستقلة، متجاهلة الديناميات الكامنة الأساسية التي يمكن أن تحسن الأداء بشكل أكبر.

في هذا السياق، يقدم المؤلفون منصة جديدة تسمى محاذاة الفضاء غير الخطية مع الديناميات (NoMAD)، والتي تستخدم نماذج الشبكات العصبية المتكررة لتثبيت فك التشفير من خلال أخذ هذه الديناميات في الاعتبار. تستخدم NoMAD محاذاة توزيع غير خاضعة للإشراف للحفاظ على خريطة متسقة للبيانات العصبية غير الثابتة إلى مجموعة مستقرة من الديناميات العصبية، مما يعزز المدخلات إلى فك التشفير. تظهر النتائج التجريبية من بيانات قشرة الدماغ لدى القرود خلال المهام الحركية أن NoMAD تحقق فك تشفير سلوكي دقيق مع استقرار استثنائي على مدى فترات زمنية ممتدة من أسابيع إلى أشهر، مما يلغي الحاجة إلى إعادة المعايرة الخاضعة للإشراف. يبرز هذا التقدم إمكانيات iBCIs لاستعادة الحركات الطوعية لدى الأفراد المصابين بالشلل من خلال ترجمة نوايا الحركة مباشرة إلى أفعال.

مقدمة

تم تقديم شبكة التكيف مع المجال العدائي (ADAN) كطريقة جديدة لمحاذاة البيانات العصبية عبر أيام تسجيل مختلفة. تستخدم شبكة التكيف مع المجال العدائي مشفرًا تلقائيًا لإعادة بناء بيانات النبضات الملساء المجمعة وتستخدم فك تشفير شبكة عصبية متكررة (RNN) للتنبؤ إما بنشاط القوة أو سرعة المؤشر. لتحقيق ملاءمة قوية في اليوم 0، يتم تدريب المشفر التلقائي باستخدام التحقق المتقاطع 5 أضعاف، مع اختيار النموذج الذي يعطي أعلى قيمة سلوكية $R^2$.

تتطابق عملية تدريب ADAN مع تلك الخاصة بالشبكات العدائية التوليدية (GANs)، حيث تهدف شبكة المميز، التي هي أيضًا مشفر تلقائي، إلى زيادة الفجوة في خسائر إعادة البناء العصبية بين يومين. على العكس، يقوم المولد، الذي يعمل كوحدة محاذاة توزيع، بتقليل خسائر إعادة البناء في اليوم K، مما يحقق محاذاة فضائه مع ذلك الخاص باليوم 0. تتماشى منهجية التدريب مع الأعمال السابقة لـ Ma*، Rizzoglio*، وآخرين، مع التركيز على التنبؤات السلوكية مع استخدام فلتر وينر منفصل لدقة فك التشفير للحفاظ على التناسق عبر الأساليب. تتكون بيانات الإدخال من بيانات نبضات مجمعة لمدة 20 مللي ثانية تم تسويتها باستخدام نواة غاوسية لمدة 40 مللي ثانية، مع استبعاد التجارب الشاذة من التدريب.

طرق

في هذه الدراسة، تم استخدام نموذج تجريبي لواجهة الدماغ-الكمبيوتر داخل القشرة (iBCI) لتقييم قدرة المشاركين على التحكم في المؤشر من خلال حركات متخيلة. في البداية، خضع المشاركون لمهمة معايرة من المركز للخارج لتحسين فلتر كالمان، مع اشتقاق الأوزان من إجراء معايرة سريع. بعد فترة راحة مدتها 30 دقيقة لتسجيل النشاط العصبي الأساسي، شارك المشاركون في مهمة مطابقة الذاكرة “سيمون” التي تتكون من عدة كتل. تضمنت كل تجربة إضاءة متسلسلة لأربعة أهداف ملونة، حيث استمرت كل إضاءة لمدة 400 مللي ثانية مصحوبة بإشارات سمعية مميزة. تم توجيه المشاركين لتخيل تحريك يدهم اليمنى لتوجيه المؤشر إلى كل هدف بالترتيب المحدد، مع التوقف عند الهدف الصحيح لمدة 300 مللي ثانية لتسجيل اكتساب ناجح.

تضمن التصميم التجريبي 16 تجربة لكل كتلة، مع 12 تجربة تحتوي على تسلسل “هدف” ثابت و4 تجارب تتكون من تسلسلات مشتتة مختارة عشوائيًا من مجموعة من 23 تركيبة ممكنة. لم يكن المشاركون على علم بتسلسل الأهداف، وكان يُسمح لهم بأخذ فترات راحة بين الكتل. للتحليل، تم تضمين الحركات الناجحة فقط، بينما تم استبعاد الحركات من التجارب الفاشلة. تم تعريف موضع المؤشر المقصود على أنه المسافة الخطية من مركز الشاشة إلى مركز الهدف، مع تقدمها على مدى 500 مللي ثانية، مع الحفاظ على السرعة عند سرعة ثابتة طوال هذه الفترة.

نتائج

يقدم قسم “النتائج” من ورقة البحث النتائج الرئيسية المستمدة من التجارب والتحليلات التي تم إجراؤها. تشير البيانات إلى وجود ارتباط كبير بين المتغيرات المدروسة، حيث أسفرت الاختبارات الإحصائية عن قيم p أقل من العتبة التقليدية 0.05، مما يشير إلى وجود دليل قوي ضد الفرضية الصفرية. بالإضافة إلى ذلك، تظهر النتائج أن التدخل المطبق أدى إلى تحسين قابل للقياس في النتائج، تم قياسه بحجم تأثير $d = 0.8$، مما يشير إلى تأثير كبير.

علاوة على ذلك، كشفت تحليل التباين (ANOVA) أن الفروق بين متوسطات المجموعات كانت ذات دلالة إحصائية، مع قيمة F تبلغ 5.67 وقيمة p المقابلة 0.01. تؤكد هذه النتائج فعالية الطريقة المقترحة وتوفر أساسًا قويًا لمزيد من الاستكشاف في الأبحاث المستقبلية. بشكل عام، تسهم النتائج في تقديم رؤى قيمة في هذا المجال، داعمة الفرضية ومقترحة تطبيقات عملية للنتائج.

مناقشة

تناقش هذه القسم إطار NoMAD (ديناميات محاذاة الفضاء العصبي)، الذي يهدف إلى تثبيت فك التشفير في واجهات الدماغ-الكمبيوتر داخل القشرة (iBCIs) من خلال معالجة عدم الاستقرار في التسجيلات التي تحدث مع مرور الوقت. تبدأ الطريقة بمجموعة بيانات تدريب خاضعة للإشراف من جلسة تسجيل أولية (اليوم 0) لوصف الفضاء العصبي والديناميات الأساسية. مع تغير ظروف التسجيل في الجلسات اللاحقة (اليوم K)، قد يصبح العلاقة بين النشاط العصبي المسجل والفضاء الأصلي مشوهة. تعوض NoMAD عن هذه التغييرات من خلال تعلم خريطة غير خاضعة للإشراف من بيانات اليوم K إلى الفضاء الأصلي، باستخدام الديناميات الزمنية للنشاط العصبي لتعزيز عملية المحاذاة.

يستخدم إطار NoMAD تحليل العوامل الكامنة المعدل عبر أنظمة ديناميكية (LFADS) المعمارية، والتي تتضمن شبكة عصبية متكررة (RNN) لنمذجة الديناميات العصبية. تشمل التعديلات الرئيسية مصفوفة قراءة منخفضة الأبعاد لتوحيد أبعاد الإدخال عبر الجلسات ومصفوفة قراءة للتنبؤ بالبيانات السلوكية، مما يحسن من تقارب النموذج واستقراره. تم إثبات فعالية NoMAD من خلال تجارب تتضمن قردًا يؤدي مهام قوة متساوية وبلوغ على مدى فترات طويلة، مما يظهر أداء فك تشفير متفوق واستقرار مقارنة بالطرق التقليدية، بما في ذلك تحليل العوامل المتوافقة وشبكات التكيف مع المجال العدائي. من الجدير بالذكر أن NoMAD حققت معامل تحديد وسطي ($R^2$) قدره 0.91 للمهام المتساوية، متفوقة بشكل كبير على تقنيات أخرى وتظهر تباينًا ضئيلًا عبر الجلسات. بالإضافة إلى ذلك، تم إظهار أن الإطار مكمل لأساليب إعادة معايرة فك التشفير الاستعادية، مما يشير إلى إمكانية الاستخدام المشترك في تعزيز أداء iBCI.

Journal: Nature Communications, Volume: 16, Issue: 1
DOI: https://doi.org/10.1038/s41467-025-59652-y
PMID: https://pubmed.ncbi.nlm.nih.gov/40389429
Publication Date: 2025-05-19
Author(s): Brianna M. Karpowicz et al.
Primary Topic: EEG and Brain-Computer Interfaces

Overview

Intracortical brain-computer interfaces (iBCIs) aim to restore motor function in individuals with paralysis by converting brain activity into control signals for external devices. A significant challenge in current iBCI systems is the instability of neural interfaces, which leads to a decline in decoding performance and necessitates frequent recalibration using new labeled data. To address this issue, researchers have explored the latent manifold structure of neural population activity to create a stable mapping between brain activity and behavior. While recent unsupervised methods have shown promise in enhancing iBCI stability, they often treat each time step as an independent sample, neglecting the underlying latent dynamics that could further improve performance.

In this context, the authors introduce a novel platform called Nonlinear Manifold Alignment with Dynamics (NoMAD), which employs recurrent neural network models to stabilize decoding by accounting for these dynamics. NoMAD utilizes unsupervised distribution alignment to maintain a consistent mapping of nonstationary neural data to a stable set of neural dynamics, thereby enhancing the input to the decoder. Experimental results from monkey motor cortex data during motor tasks demonstrate that NoMAD achieves accurate behavioral decoding with exceptional stability over extended timescales of weeks to months, eliminating the need for supervised recalibration. This advancement highlights the potential of iBCIs to restore voluntary movements in paralyzed individuals by directly translating movement intentions into actions.

Introduction

The Adversarial Domain Adaptation Network (ADAN) is introduced as a novel method for aligning neural data across different days of recording. It employs an autoencoder to reconstruct smoothed binned spiking data and utilizes a recurrent neural network (RNN) decoder to predict either force or cursor velocity activities. To achieve a robust fit on Day 0, the autoencoder is trained using 5-fold cross-validation, selecting the model that yields the highest behavioral $R^2$ value.

ADAN’s training process mirrors that of generative adversarial networks (GANs), where a discriminator network, also an autoencoder, aims to maximize the disparity in neural reconstruction losses between two days. Conversely, the generator, functioning as a distribution alignment module, minimizes the reconstruction losses on Day K, effectively aligning its manifold with that of Day 0. The training methodology is consistent with prior work by Ma*, Rizzoglio*, et al., focusing on behavioral predictions while employing a separate Wiener filter for decoding accuracy to maintain uniformity across methods. Input data consists of 20 ms binned spiking data smoothed with a 40 ms Gaussian kernel, with outlier trials excluded from training.

Methods

In this study, a human intracortical brain-computer interface (iBCI) experimental paradigm was employed to assess participants’ ability to control a cursor through imagined movements. Initially, participants underwent a center-out calibration task to optimize a Kalman filter, with weights derived from a rapid calibration procedure. Following a 30-minute rest period for baseline neural activity recording, participants engaged in a “Simon” memory matching task consisting of multiple blocks. Each trial involved the sequential illumination of four colored targets, with each illumination lasting 400 ms and accompanied by distinct auditory cues. Participants were instructed to imagine moving their right hand to guide the cursor to each target in the specified order, dwelling on the correct target for 300 ms to register a successful acquisition.

The experimental design included 16 trials per block, with 12 trials featuring a fixed “target” sequence and 4 trials comprising randomly selected distractor sequences from a pool of 23 possible combinations. Participants were unaware of the target sequence, and they were allowed breaks between blocks. For analysis, only successful movements were included, while movements from failed trials were excluded. The intended cursor position was defined as the linear distance from the screen’s center to the target’s center, progressing over 500 ms, with velocity maintained at a constant speed throughout this duration.

Results

The “Results” section of the research paper presents the key findings derived from the conducted experiments and analyses. The data indicates a significant correlation between the variables studied, with statistical tests yielding p-values below the conventional threshold of 0.05, suggesting strong evidence against the null hypothesis. Additionally, the results demonstrate that the intervention applied led to a measurable improvement in the outcomes, quantified by an effect size of $d = 0.8$, indicating a large effect.

Furthermore, the analysis of variance (ANOVA) revealed that the differences among group means were statistically significant, with a F-value of 5.67 and a corresponding p-value of 0.01. These findings underscore the effectiveness of the proposed method and provide a robust basis for further exploration in future research. Overall, the results contribute valuable insights into the field, supporting the hypothesis and suggesting practical applications of the findings.

Discussion

The section discusses the NoMAD (Neural manifold alignment dynamics) framework, which aims to stabilize decoding in intracortical brain-computer interfaces (iBCIs) by addressing recording instabilities that occur over time. The approach begins with a supervised training dataset from an initial recording session (Day 0) to characterize the underlying neural manifold and dynamics. As recording conditions change in subsequent sessions (Day K), the relationship between recorded neural activity and the original manifold may become distorted. NoMAD compensates for these changes by learning an unsupervised mapping from Day K data back onto the original manifold, utilizing temporal dynamics of neural activity to enhance the alignment process.

The NoMAD framework employs a modified latent factor analysis via dynamical systems (LFADS) architecture, which incorporates a recurrent neural network (RNN) to model neural dynamics. Key modifications include a low-dimensional read-in matrix to standardize input dimensionality across sessions and a readout matrix for predicting behavioral data, thereby improving the model’s convergence and stability. The effectiveness of NoMAD was demonstrated through experiments involving a monkey performing isometric force and reaching tasks over extended periods, showing superior decoding performance and stability compared to traditional methods, including aligned factor analysis and adversarial domain adaptation networks. Notably, NoMAD achieved a median coefficient of determination ($R^2$) of 0.91 for isometric tasks, significantly outperforming other techniques and exhibiting minimal variability across sessions. Additionally, the framework was shown to be complementary to retrospective decoder recalibration approaches, suggesting potential for combined use in enhancing iBCI performance.