الشبكات الطبوغرافية من النهاية إلى النهاية كنماذج لتشكيل الخرائط القشرية والسلوك البصري البشري End-to-end topographic networks as models of cortical map formation and human visual behaviour

المجلة: Nature Human Behaviour، المجلد: 9، العدد: 9
DOI: https://doi.org/10.1038/s41562-025-02220-7
PMID: https://pubmed.ncbi.nlm.nih.gov/40481218
تاريخ النشر: 2025-06-06
المؤلف: Zejin Lu وآخرون
الموضوع الرئيسي: التعرف على الوجه والإدراك

نظرة عامة

يتناول هذا القسم أهمية التنظيم الطبوغرافي في نظام الرؤية لدى الرئيسيات ويقدم نموذجًا حسابيًا جديدًا يسمى الشبكات العصبية الطبوغرافية الكاملة (All-TNNs) لمعالجة قيود الشبكات العصبية التلافيفية التقليدية (CNNs). على عكس CNNs، التي تتعلم ميزات موحدة عبر الأبعاد المكانية، تم تصميم All-TNNs لتطوير ميزات تعكس الخرائط السلسة للتوجه وانتقائية الفئة المميزة لرؤية الرئيسيات. لا يعزز هذا النموذج كفاءة المعالجة للمعلومات ذات الصلة بالمهام فحسب، بل يعمل أيضًا على ميزانية طاقة منخفضة، مما يشير إلى مزايا أيض محتملة مرتبطة بالتنظيم الطبوغرافي.

للتحقق من فعالية All-TNNs، جمع الباحثون مجموعة بيانات حول التحيزات المكانية البشرية في التعرف على الأشياء، مما يظهر أن All-TNNs تتفوق بشكل كبير على النماذج التقليدية. وهذا يشير إلى أن All-TNNs هي إطار واعد لنمذجة الترتيب المكاني المنظم للخلايا العصبية في نظام الرؤية لدى الرئيسيات وتأثيراته على السلوك. تسلط النتائج الضوء على أهمية النماذج الحسابية في توضيح أصول وأدوار التنظيم الطبوغرافي في المعالجة البصرية والسلوك.

الطرق

يستعرض قسم “الطرق” في ورقة البحث التصميم التجريبي والتقنيات التحليلية المستخدمة للتحقيق في أسئلة البحث. استخدمت الدراسة نهجًا كميًا، حيث تم دمج التحليلات الإحصائية لتقييم البيانات التي تم جمعها من تجارب مختلفة. تضمنت المنهجيات المحددة تجارب مختبرية محكومة، حيث تم التلاعب بالمتغيرات بشكل منهجي لمراقبة تأثيراتها على النتائج ذات الصلة.

شملت جمع البيانات استخدام أدوات موحدة لضمان الموثوقية والصلاحية. تم إجراء التحليل باستخدام أدوات برمجية قادرة على إجراء اختبارات إحصائية معقدة، مثل تحليل الانحدار وANOVA، لتحديد أهمية النتائج. يبرز القسم أهمية القابلية للتكرار والشفافية في الطرق المستخدمة، حيث يقدم أوصافًا تفصيلية للإجراءات المتبعة لتسهيل البحث المستقبلي في هذا المجال.

النتائج

في هذه الدراسة، قدم المؤلفون الشبكات العصبية الطبوغرافية الكاملة (All-TNNs) للتحقيق في ظهور الطبوغرافيات المميزة وتأثيرها على السلوك. على عكس الشبكات العصبية التلافيفية التقليدية (CNNs)، لا تستخدم All-TNNs مشاركة الوزن، مما يسمح للوحدات في مواقع مكانية مختلفة بتعلم أوزان مميزة. يتم تنظيم وحدات كل طبقة على ورقة قشرية ثنائية الأبعاد، حيث تشترك الوحدات المجاورة في نفس المجال الاستقبالي، وتتلقى المدخلات من منطقة محلية في الطبقة السابقة.

لتحسين عملية التعلم، نفذ المؤلفون دالة خسارة السلاسة التي تعزز تشابه الوزن بين الوحدات المجاورة. من خلال ضبط قوة هذه الخسارة السلسة كمعامل هايبر، يُشار إليه بـ $\alpha$، يمكن للباحثين تعديل انتقائية الميزات لـ All-TNNs، بدءًا من التكوينات غير المتصلة مكانيًا إلى التكوينات الموحدة مكانيًا. هذه المرونة في توزيع الوزن وتمثيل الميزات ضرورية لفهم الآليات العصبية الكامنة وراء السلوك.

المناقشة

في هذا القسم، يناقش المؤلفون قدرات الشبكات العصبية الطبوغرافية الكاملة (All-TNNs) في تكرار الميزات الطبوغرافية لنظام الرؤية لدى الرئيسيات، مع التركيز بشكل خاص على انتقائية التوجه والتنظيم المكاني. أظهرت الطبقة الأولى من All-TNNs خرائط انتقائية سلسة للتوجه تشبه تلك الموجودة في V1 للرئيسيات، والتي تتميز بظهور انقطاعات دوارة وتنظيم مركزي-محيطي في انتقائية الميزات. يبرز المؤلفون أنه، على عكس الشبكات العصبية التلافيفية التقليدية (CNNs)، التي تعتمد على مشاركة الوزن وتنتج خرائط انتقائية منفصلة، تسمح All-TNNs بالتعلم الفردي لأوزان الميزات في كل موقع مكاني، مما يؤدي إلى تمثيل أكثر واقعية بيولوجيًا للطبوغرافيا العصبية.

تكشف الدراسة أيضًا أن All-TNNs توزع الطاقة بكفاءة على المناطق ذات الصلة بالمهام، مما يظهر انخفاضًا كبيرًا في استهلاك الطاقة مقارنة بـ CNNs، بينما تتماشى بشكل أفضل مع السلوك البصري المكاني البشري. أجرى المؤلفون تجارب تظهر أن All-TNNs تظهر تحيزات مكانية تعتمد على الفئة تعكس أداء البشر في مهام التعرف على الأشياء. بالإضافة إلى ذلك، ساهم دمج خسارة السلاسة أثناء التدريب في تعزيز توافق All-TNNs مع السلوك البشري، مما يشير إلى أن تنظيمها الطبوغرافي يساهم في تحسين الأداء في المهام البصرية. بشكل عام، تشير النتائج إلى أن All-TNNs لا تكرر فقط الخصائص الرئيسية للمعالجة البصرية لدى الرئيسيات، ولكنها أيضًا توفر إطارًا لفهم الفوائد الحسابية للتنظيم الطبوغرافي في الشبكات العصبية.

Journal: Nature Human Behaviour, Volume: 9, Issue: 9
DOI: https://doi.org/10.1038/s41562-025-02220-7
PMID: https://pubmed.ncbi.nlm.nih.gov/40481218
Publication Date: 2025-06-06
Author(s): Zejin Lu et al.
Primary Topic: Face Recognition and Perception

Overview

The section discusses the significance of topographic organization in the primate visual system and introduces a novel computational model called all-topographic neural networks (All-TNNs) to address the limitations of traditional convolutional neural networks (CNNs). Unlike CNNs, which learn uniform features across spatial dimensions, All-TNNs are designed to develop features that reflect the smooth orientation and category selectivity maps characteristic of primate vision. This model not only enhances processing efficiency for task-relevant information but also operates on a low energy budget, indicating potential metabolic advantages associated with topographic organization.

To validate the effectiveness of All-TNNs, the researchers collected a dataset on human spatial biases in object recognition, demonstrating that All-TNNs significantly outperform conventional models. This suggests that All-TNNs are a promising framework for modeling the structured spatial arrangement of neurons in the primate visual system and its implications for behavior. The findings highlight the importance of computational models in elucidating the origins and roles of topographic organization in visual processing and behavior.

Methods

The “Methods” section of the research paper outlines the experimental design and analytical techniques employed to investigate the research questions. The study utilized a quantitative approach, incorporating statistical analyses to evaluate the data collected from various experiments. Specific methodologies included controlled laboratory experiments, where variables were systematically manipulated to observe their effects on the outcomes of interest.

Data collection involved the use of standardized instruments to ensure reliability and validity. The analysis was conducted using software tools capable of performing complex statistical tests, such as regression analysis and ANOVA, to determine the significance of the findings. The section emphasizes the importance of replicability and transparency in the methods used, providing detailed descriptions of the procedures followed to facilitate future research in the field.

Results

In this study, the authors introduced fully topographic neural networks (All-TNNs) to investigate the emergence of feature topographies and their influence on behavior. Unlike conventional convolutional neural networks (CNNs), All-TNNs do not employ weight sharing, allowing units at different spatial locations to learn distinct weights. Each layer’s units are organized on a 2D cortical sheet, where neighboring units share the same receptive field, receiving inputs from a localized area in the preceding layer.

To enhance the learning process, the authors implemented a smoothness loss function that promotes weight similarity among neighboring units. By adjusting the strength of this smoothness loss as a hyperparameter, denoted as $\alpha$, the researchers can modulate the feature selectivity of All-TNNs, ranging from spatially discontinuous to spatially uniform configurations. This flexibility in weight distribution and feature representation is crucial for understanding the neural mechanisms underlying behavior.

Discussion

In this section, the authors discuss the capabilities of All-Topographic Neural Networks (All-TNNs) in replicating topographic features of the primate visual system, particularly focusing on orientation selectivity and spatial organization. The first layer of All-TNNs demonstrated smooth orientation selectivity maps akin to those found in primate V1, characterized by the emergence of pinwheel discontinuities and a center-periphery organization in feature selectivity. The authors highlight that, unlike traditional Convolutional Neural Networks (CNNs), which rely on weight sharing and produce discrete selectivity maps, All-TNNs allow for individual learning of feature weights at each spatial location, leading to a more biologically plausible representation of neural topography.

The study further reveals that All-TNNs allocate energy efficiently to task-relevant regions, showing a significant reduction in energy consumption compared to CNNs, while also aligning better with human spatial visual behavior. The authors conducted experiments demonstrating that All-TNNs exhibit category-dependent spatial biases that closely mirror human performance in object recognition tasks. Additionally, the incorporation of a smoothness loss during training enhanced the alignment of All-TNNs with human behavior, suggesting that their topographic organization contributes to improved performance in visual tasks. Overall, the findings indicate that All-TNNs not only replicate key characteristics of primate visual processing but also provide a framework for understanding the computational benefits of topographic organization in neural networks.