اكتشاف استراتيجيات معرفية باستخدام شبكات عصبية متكررة صغيرة Discovering cognitive strategies with tiny recurrent neural networks

المجلة: Nature، المجلد: 644، العدد: 8078
DOI: https://doi.org/10.1038/s41586-025-09142-4
PMID: https://pubmed.ncbi.nlm.nih.gov/40604278
تاريخ النشر: 2025-07-02
المؤلف: Jian Li وآخرون
الموضوع الرئيسي: ديناميات الأعصاب ووظيفة الدماغ

نظرة عامة

تناقش هذه القسم نهجًا جديدًا لفهم اتخاذ القرار في الحيوانات والبشر من خلال استخدام الشبكات العصبية المتكررة (RNNs) لنمذجة الخوارزميات المعرفية. الأطر التقليدية المعيارية، مثل الاستدلال بايزي والتعلم المعزز، على الرغم من تأثيرها، غالبًا ما تبسط السلوك البيولوجي بشكل مفرط وتكون عرضة لتحيز الباحثين. بالمقابل، تستخدم الإطار المقترح شبكات RNN صغيرة، تتكون عادةً من وحدة واحدة إلى أربع وحدات، والتي لا تتفوق فقط على النماذج المعرفية الكلاسيكية في التنبؤ بالاختيارات عبر مهام التعلم المرتبط بالمكافآت المختلفة، ولكنها أيضًا تعزز القابلية للتفسير من خلال تصور ديناميات RNN كنظم ديناميكية منفصلة.

تظهر الدراسة أن هذه الشبكات العصبية الصغيرة يمكن أن تلتقط بفعالية أنماط سلوكية معقدة، بما في ذلك معدلات التعلم المتغيرة والاستمرارية المعتمدة على الحالة، والتي غالبًا ما يتم تجاهلها من قبل النماذج التقليدية. من خلال تحليل سلوك اتخاذ القرار في ست مهام معروفة للتعلم المرتبط بالمكافآت، يكشف المؤلفون أن نهجهم لا يحسن فقط دقة التنبؤ ولكن أيضًا يوفر رؤى أعمق حول الآليات المعرفية الأساسية. يعالج هذا الإطار قيود كل من الشبكات العصبية الأكبر والنماذج الكلاسيكية، مقدماً طريقة منهجية لكشف الاستراتيجيات المعرفية القابلة للتفسير في اتخاذ القرار ويساهم في فهم كل من الإدراك الصحي وغير السليم.

نقاش

في هذا القسم، يناقش المؤلفون فعالية الشبكات العصبية المتكررة الصغيرة (RNNs) في نمذجة عمليات اتخاذ القرار مقارنة بالنماذج المعرفية الكلاسيكية. يصنفون النماذج المعرفية إلى ثلاث عائلات: التعلم المعزز بدون نموذج (RL)، التعلم المعزز القائم على النموذج، والاستدلال بايزي، مع تسليط الضوء على الاختلافات في كيفية تحديث هذه النماذج لقيم الإجراءات وتمثيل حالات الاعتقاد. تظهر الدراسة أن الشبكات العصبية الصغيرة، التي تتكون غالبًا من 1 إلى 4 وحدات، تتفوق بشكل كبير على النماذج الكلاسيكية في التنبؤ باختيارات الحيوانات والبشر عبر مهام مختلفة، مما يظهر مرونتها وقدرتها على التقاط سلوكيات معقدة مع عدد أكبر من المعلمات الحرة.

كما يقدم المؤلفون إطارًا لتقطير المعرفة يعزز أداء الشبكات العصبية المتكررة في الدراسات البشرية، خاصة عندما تكون البيانات لكل مشارك محدودة. يسمح هذا النهج للشبكات العصبية الاستفادة من بيانات متعددة المشاركين، مما يحقق دقة تنبؤية قوية مع عدد أقل من التجارب. علاوة على ذلك، يقدم المؤلفون إطارًا تفسيريًا قائمًا على نظرية النظم الديناميكية، والذي يوضح تطور الحالة الداخلية للشبكات العصبية المتكررة استجابةً للمكافآت والإجراءات. يكشف هذا الإطار عن استراتيجيات معرفية جديدة، مثل معدلات التعلم المعتمدة على الحالة وتأثيرات اللامبالاة الناتجة عن المكافآت، والتي لم يتم التقاطها من قبل النماذج التقليدية. بشكل عام، تؤكد النتائج على إمكانيات الشبكات العصبية الصغيرة كنماذج متعددة الاستخدامات وقابلة للتفسير لفهم اتخاذ القرار عبر الأنواع والمهام.

Journal: Nature, Volume: 644, Issue: 8078
DOI: https://doi.org/10.1038/s41586-025-09142-4
PMID: https://pubmed.ncbi.nlm.nih.gov/40604278
Publication Date: 2025-07-02
Author(s): Jian Li et al.
Primary Topic: Neural dynamics and brain function

Overview

This section discusses a novel approach to understanding decision-making in animals and humans by utilizing recurrent neural networks (RNNs) to model cognitive algorithms. Traditional normative frameworks, such as Bayesian inference and reinforcement learning, while influential, often oversimplify biological behavior and are subject to researcher bias. In contrast, the proposed framework employs small RNNs, typically consisting of just one to four units, which not only outperform classical cognitive models in predicting choices across various reward-learning tasks but also enhance interpretability through the visualization of RNN dynamics as discrete dynamical systems.

The study demonstrates that these tiny RNNs can effectively capture complex behavioral patterns, including variable learning rates and state-dependent perseveration, which are often overlooked by traditional models. By analyzing decision-making behavior in six well-established reward-learning tasks, the authors reveal that their approach not only improves predictive accuracy but also provides deeper insights into the underlying cognitive mechanisms. This framework addresses the limitations of both larger neural networks and classical models, offering a systematic method for uncovering interpretable cognitive strategies in decision-making and contributing to the understanding of both healthy and dysfunctional cognition.

Discussion

In this section, the authors discuss the efficacy of tiny recurrent neural networks (RNNs) in modeling decision-making processes compared to classical cognitive models. They categorize cognitive models into three families: model-free reinforcement learning (RL), model-based RL, and Bayesian inference, highlighting the differences in how these models update action values and represent belief states. The study demonstrates that tiny RNNs, often comprising just 1 to 4 units, significantly outperform classical models in predicting animal and human choices across various tasks, showcasing their flexibility and ability to capture complex behaviors with a larger number of free parameters.

The authors also introduce a knowledge distillation framework that enhances the performance of RNNs in human studies, particularly when data per participant is limited. This approach allows RNNs to leverage multi-participant data, achieving robust predictive accuracy with fewer trials. Furthermore, the authors present an interpretative framework based on dynamical systems theory, which elucidates the internal state evolution of RNNs in response to rewards and actions. This framework reveals novel cognitive strategies, such as state-dependent learning rates and reward-induced indifference effects, which were not captured by traditional models. Overall, the findings underscore the potential of tiny RNNs as versatile and interpretable models for understanding decision-making across species and tasks.

كلمات مفتاحية: أنثى، إدراك، اتخاذ القرار، استنتاج، الإدراك، التعلم، الذكاء الاصطناعي، الشبكات العصبية المتكررة، بشر، تعزيز، علم النفس، تعلم الآلة، تعلم التعزيز، حيوانات، خوارزميات، ذكر، سلوك الاختيار، شبكات عصبية، حاسوبية، شبكة عصبية اصطناعية، علم المعرفة، علم النفس، علوم الحاسوب، لعنة الأبعاد، معياري، مكافأة، نظرية بايز، نماذج، عصبية