الترميز الدلالي أثناء فهم اللغة بدقة خلوية فردية Semantic encoding during language comprehension at single-cell resolution

المجلة: Nature، المجلد: 631، العدد: 8021
DOI: https://doi.org/10.1038/s41586-024-07643-2
PMID: https://pubmed.ncbi.nlm.nih.gov/38961302
تاريخ النشر: 2024-07-03
المؤلف: Mohsen Jamali وآخرون
الموضوع الرئيسي: علم الأعصاب للغة ثنائية اللغة

نظرة عامة

يتناول هذا القسم من ورقة البحث العلاقة المعقدة بين النشاط العصبي وفهم اللغة، مع التركيز بشكل خاص على كيفية استجابة الخلايا العصبية الفردية في قشرة الفص الجبهي اليسرى المهيمنة لغويًا للمعلومات الدلالية أثناء معالجة الكلام. استخدمت الدراسة تسجيلات للخلايا العصبية الفردية بينما استمع المشاركون إلى جمل ذات دلالات متنوعة وقصص طبيعية. تكشف النتائج أن خلايا عصبية معينة تظهر استجابات انتقائية لمعاني كلمات معينة، مميزة بين الكلمات وغير الكلمات. ومن الجدير بالذكر أن هذه الخلايا العصبية تظهر نشاطًا ديناميكيًا يعكس المعاني السياقية بدلاً من التمثيلات الثابتة، مما يشير إلى تنظيم قشري متطور للمعلومات الدلالية.

تسلط الأبحاث الضوء على أهمية فهم كيفية معالجة المعنى اللغوي على المستوى الخلوي، وهي منطقة لا تزال غير مستكشفة إلى حد كبير على الرغم من التقدم في دراسات التصوير. يؤكد المؤلفون أنه بينما يحدث المعالجة اللغوية الأولية في القشرة السمعية وتتدفق إلى شبكة مختارة لغويًا، فإن الآليات الخلوية التي تكمن وراء معالجة المعاني أثناء فهم اللغة الطبيعية لا تزال غير محددة. تهدف الدراسة إلى سد هذه الفجوة من خلال تقديم رؤى حول الديناميات الزمنية الفعلية لفهم الكلمات والجمل من خلال تسجيلات للخلايا العصبية الفردية، مما يعزز فهمنا للتنظيم الوظيفي للتمثيلات الدلالية في الدماغ البشري.

مقدمة

في مقدمة ورقة البحث، يوضح المؤلفون منهجيتهم لتأكيد جودة وقابلية فصل المجالات الدلالية من خلال مقاييس النقاء وتحليل d′. استخدموا إجراء تجميع كروي على عينة عشوائية من 60% من الجمل عبر 100 تكرار، حيث تم تجميع الكلمات في مجموعات ومطابقتها مع المجموعات الأصلية بناءً على أقصى تداخل للكلمات. تم تقييم جودة التجميع باستخدام مقياس النقاء، الذي يقيس نسبة الكلمات المصنفة بشكل صحيح، مما يعطي قيمًا تتراوح بين 0 (تجميع ضعيف) و1 (تجميع مثالي). بالإضافة إلى ذلك، تم تقييم قابلية فصل المجموعات باستخدام تحليل d′، الذي يقارن المسافات الكوسينية المتجهة للكلمات داخل مجموعة ما بتلك الموجودة في مجموعات أخرى، مما يؤدي إلى تحديد المجالات الدلالية بناءً على قرب الكلمات من مراكز المجموعات.

كما درس المؤلفون استجابات الخلايا العصبية لهذه المجالات الدلالية من خلال حساب معدلات إطلاق النار المتوافقة مع بدايات الكلمات. استخدموا اختبار مجموع الرتب ذو الجانبين لتقييم دلالة النشاط العصبي للكلمات ضمن مجالات معينة مقارنةً بأخرى، مع تعديل المقارنات المتعددة باستخدام طريقة بنجاميني-هوشبرغ. تم تعريف مؤشر الانتقائية (SI) لقياس درجة استجابة الخلايا العصبية للكلمات في مجال معين مقابل المجالات الأخرى، مع قيم تتراوح من 0 (عدم الانتقائية) إلى 1 (انتقائية حصرية). تم تأكيد موثوقية SI من خلال تحليل bootstrap، الذي تضمن تقسيم الكلمات عشوائيًا ومقارنة الانتقائية العصبية عبر المجموعات الفرعية. تم تحسين المجالات الدلالية بشكل أكبر بناءً على التقييمات الذاتية لعلاقات الكلمات، مما أسفر عن تسع مجالات متميزة مصنفة حسب تمثيلها في 300 بعد.

طرق

في هذه الدراسة، تم تقديم جمل مكونة من ثمانية كلمات للمشاركين مصممة لتشمل مجموعة واسعة من الكلمات المتنوعة دلاليًا عبر سياقات موضوعية مختلفة. لضمان تفاعل المشاركين، تم تقديم مطالبات قصيرة كل 10-15 جملة، تسأل عما إذا كانوا مستعدين للمضي قدمًا، مع حدوث الردود عادةً في غضون 1-2 ثانية.

بالإضافة إلى ذلك، تم استخدام أزواج من الكلمات المتجانسة لتقييم التغيرات المحددة في المعنى في النشاط العصبي مع التحكم في المحتوى الصوتي. تم اختيار الكلمات المتجانسة، مثل “شمس” و”ابن”، من تجارب جمل سابقة وتمثل مجالات دلالية متميزة. تم اتخاذ هذا الاختيار للاستفادة من التمثيلات المتجهة الفريدة التي تنتجها تمثيلات الكلمات لكل رمز، مما يسمح بتحليل أوضح لمعالجة المعاني بشكل مستقل عن التشابهات الصوتية.

مناقشة

تستكشف الأبحاث المقدمة في هذا القسم انتقائية الخلايا العصبية في القشرة الجبهية لمعاني كلمات معينة أثناء معالجة الكلام. باستخدام نهج قائم على البيانات لتمثيل الكلمات، تظهر الدراسة أن الخلايا العصبية تظهر أنماط إطلاق متميزة استجابةً لكلمات تنتمي إلى مجالات دلالية معينة، مثل الأفعال، والحالات، والأشياء. أظهرت حوالي 14% من الخلايا العصبية المسجلة انتقائية دلالية ملحوظة، مع تفضيل ملحوظ لمجال “الأفعال”. تكشف الدراسة أيضًا أن انتقائية استجابة الخلايا العصبية تعتمد على السياق، كما يتضح من الانخفاض الملحوظ في الانتقائية عندما تم تقديم الكلمات بترتيب عشوائي، خالية من الإشارات السياقية. وهذا يشير إلى أن الخلايا العصبية تمثل معاني الكلمات بشكل تكيفي بناءً على سياقات جملها، بدلاً من كونها كيانات ثابتة.

علاوة على ذلك، تشير النتائج إلى أن مجموعة الخلايا العصبية يمكن أن تعمم التمثيلات الدلالية عبر مواد لغوية مختلفة، كما يتضح من فك تشفير ناجح للمجالات الدلالية من النشاط العصبي المسجل أثناء تقديم الجمل والذي تم تطبيقه لاحقًا على سياقات سردية جديدة. تسلط الدراسة أيضًا الضوء على التنظيم الهرمي للعلاقات الدلالية بين الكلمات، حيث يرتبط النشاط العصبي بالمسافات الكوفينية في تمثيلات الكلمات. بشكل جماعي، تؤكد هذه النتائج قدرة خلايا القشرة الجبهية على ترميز معلومات دلالية معقدة بشكل ديناميكي وسياقي، مما يوفر رؤى حول الآليات العصبية الكامنة وراء فهم اللغة.

Journal: Nature, Volume: 631, Issue: 8021
DOI: https://doi.org/10.1038/s41586-024-07643-2
PMID: https://pubmed.ncbi.nlm.nih.gov/38961302
Publication Date: 2024-07-03
Author(s): Mohsen Jamali et al.
Primary Topic: Neurobiology of Language and Bilingualism

Overview

This section of the research paper discusses the intricate relationship between neural activity and language comprehension, particularly focusing on how individual neurons in the left language-dominant prefrontal cortex respond to semantic information during speech processing. The study utilized single-neuron recordings while participants listened to semantically diverse sentences and naturalistic stories. The findings reveal that specific neurons exhibit selective responses to particular word meanings, distinguishing between words and nonwords. Notably, these neurons demonstrate dynamic activity that reflects contextual meanings rather than fixed representations, indicating a sophisticated cortical organization of semantic information.

The research highlights the importance of understanding how linguistic meaning is processed at the cellular level, an area that remains largely unexplored despite advancements in imaging studies. The authors emphasize that while initial linguistic processing occurs in the auditory cortex and flows to a language-selective network, the cellular mechanisms underlying semantic processing during natural language comprehension are still undefined. The study aims to bridge this gap by providing insights into the real-time dynamics of word and sentence comprehension through single-neuron recordings, thereby enhancing our understanding of the functional organization of semantic representations in the human brain.

Introduction

In the introduction of the research paper, the authors detail their methodology for confirming the quality and separability of semantic domains through purity measures and d′ analysis. They employed a spherical clustering procedure on a random sample of 60% of sentences across 100 iterations, grouping words into clusters and matching these to original clusters based on maximum word overlap. The clustering quality was assessed using a purity metric, which quantifies the proportion of correctly classified words, yielding values between 0 (poor clustering) and 1 (perfect clustering). Additionally, the separability of clusters was evaluated using d′ analysis, which compares vectoral cosine distances of words within a cluster to those in other clusters, leading to the identification of semantic domains based on word proximity to cluster centroids.

The authors also examined neuronal responses to these semantic domains by calculating firing rates aligned with word onsets. They employed a two-sided rank-sum test to assess the significance of neuronal activity for words within specific domains compared to others, adjusting for multiple comparisons using the Benjamini-Hochberg method. The selectivity index (SI) was defined to quantify the degree of neuronal response to words in a particular domain versus others, with values ranging from 0 (no selectivity) to 1 (exclusive selectivity). Reliability of the SI was confirmed through bootstrap analysis, which involved random splitting of words and comparison of neuronal selectivity across subsets. The semantic domains were further refined based on subjective assessments of word relationships, resulting in nine distinct domains categorized by their 300-dimensional embeddings.

Methods

In this study, participants were presented with eight-word sentences designed to encompass a wide range of semantically diverse words across various thematic contexts. To ensure participant engagement, brief prompts were administered every 10-15 sentences, inquiring whether they were ready to proceed, with responses typically occurring within 1-2 seconds.

Additionally, homophone pairs were employed to assess meaning-specific changes in neural activity while controlling for phonetic content. The homophones, such as “sun” and “son,” were selected from prior sentence experiments and represented distinct semantic domains. This choice was made to leverage the unique vector representations produced by word embeddings for each token, thereby allowing for a clearer analysis of semantic processing independent of phonological similarities.

Discussion

The research presented in this section investigates the selectivity of neurons in the prefrontal cortex for specific word meanings during speech processing. Utilizing a data-driven word embedding approach, the study demonstrates that neurons exhibit distinct firing patterns in response to words belonging to particular semantic domains, such as actions, states, and objects. Approximately 14% of the recorded neurons showed significant semantic selectivity, with a notable preference for the ‘actions’ domain. The study further reveals that the neurons’ response selectivity is context-dependent, as evidenced by a marked decrease in selectivity when words were presented in random order, devoid of contextual cues. This indicates that the neurons adaptively represent word meanings based on their sentence contexts, rather than as fixed entities.

Moreover, the findings suggest that the neuronal population can generalize semantic representations across different linguistic materials, as demonstrated by successful decoding of semantic domains from neuronal activity recorded during sentence presentations and later applied to new narrative contexts. The study also highlights the hierarchical organization of semantic relationships among words, with neuronal activity correlating with the cophenetic distances in word embeddings. Collectively, these results underscore the capacity of prefrontal cortical neurons to encode complex semantic information dynamically and contextually, providing insights into the neural mechanisms underlying language comprehension.