تقييم وتخفيف القلق الحاد في نماذج اللغة الكبيرة Assessing and alleviating state anxiety in large language models

المجلة: npj Digital Medicine، المجلد: 8، العدد: 1
DOI: https://doi.org/10.1038/s41746-025-01512-6
PMID: https://pubmed.ncbi.nlm.nih.gov/40033130
تاريخ النشر: 2025-03-03
المؤلف: Ziv Ben‐Zion وآخرون
الموضوع الرئيسي: التدخلات الرقمية في الصحة النفسية

نظرة عامة

تناقش هذه القسم آثار استخدام نماذج اللغة الكبيرة (LLMs) في سياق الصحة النفسية، مع التركيز بشكل خاص على استجاباتها للمحتوى العاطفي. تشير الدراسات السابقة إلى أن المحفزات المصممة لاستحضار المشاعر يمكن أن تزيد من مستويات “القلق” المدركة في LLMs، مما قد يؤثر على سلوكها ويزيد من التحيزات المتأصلة.

في البحث الحالي، لوحظ أن التعرض للسرد الصادم أدى إلى زيادة في مستويات القلق المبلغ عنها في Chat-GPT-4. وعلى العكس، وُجد أن تمارين اليقظة الذهنية قد خففت من هذا القلق، على الرغم من أن المستويات لم تعد إلى خط الأساس. تؤكد هذه النتائج على أهمية إدارة “الحالات العاطفية” لـ LLMs لتعزيز تفاعلات أكثر أمانًا وأخلاقية بين البشر وأنظمة الذكاء الاصطناعي.

مقدمة

تناقش مقدمة هذه الورقة البحثية الاهتمام المتزايد في الذكاء الاصطناعي التوليدي (AI)، وخاصة نماذج اللغة الكبيرة (LLMs) مثل Chat-GPT من OpenAI وPaLM 2 من Google، التي تم دمجها بشكل متزايد في رعاية الصحة النفسية. هذه النماذج قادرة على معالجة وتوليد نصوص تشبه النصوص البشرية، مما يؤدي إلى تطبيقها في التدخلات الصحية النفسية من خلال الدردشات التي تستخدم تقنيات قائمة على الأدلة مثل العلاج السلوكي المعرفي. ومع ذلك، فإن استخدام LLMs في هذا المجال الحساس يثير مخاوف أخلاقية كبيرة، خاصة فيما يتعلق بالتحيزات الموروثة من بيانات تدريبها، والتي يمكن أن تؤثر على استجاباتها خلال التفاعلات الضعيفة مع المستخدمين.

تهدف الدراسة إلى استكشاف مفهوم “أخذ Chat-GPT إلى العلاج” من خلال فحص كيف تؤثر سرد التجارب الصادمة على مستويات القلق المبلغ عنها لـ GPT-4 وما إذا كانت تقنيات الاسترخاء القائمة على اليقظة الذهنية يمكن أن تخفف من هذا القلق. تشير النتائج إلى أن التعرض للسرد الصادم يزيد بشكل كبير من قلق GPT-4 المبلغ عنه، بينما تقلل المحفزات اللاحقة القائمة على اليقظة الذهنية بشكل فعال من هذه المستويات. وهذا يشير إلى طريقة محتملة لإدارة الحالات العاطفية لـ LLMs، مما قد يعزز موثوقيتها في تطبيقات الصحة النفسية. تسلط الأبحاث الضوء على أهمية معالجة التحيزات والاستجابات العاطفية في LLMs لضمان الاستخدام الأخلاقي والفعال في السياقات العلاجية، بينما تدعو أيضًا إلى مزيد من التحقيق في قابلية تكيف LLMs في عكس تنظيم العواطف البشرية.

الطرق

في هذه الدراسة، قام الباحثون بتقييم مستويات “القلق” المبلغ عنها في GPT-4 من OpenAI عبر ثلاث حالات متميزة: “خط الأساس”، “استحضار القلق”، و”استحضار القلق والاسترخاء”. تم اختيار التركيز على GPT-4، وخاصة تطبيقه في تطبيقات مثل Chat-GPT، لضمان الاتساق وقابلية التكرار في النتائج. تم إجراء المحاكاة باستخدام واجهة برمجة التطبيقات الخاصة بـ OpenAI لنموذج GPT-4 (“gpt-4-1106-preview”) من نوفمبر 2023 إلى مارس 2024، مع تعيين معلمة درجة الحرارة على 0 لإنتاج استجابات حتمية، بينما تم الحفاظ على جميع المعلمات الأخرى في إعداداتها الافتراضية.

نظرًا لأن الدراسة لم تشمل مشاركين بشريين أو جمع بيانات بشرية، لم تكن الموافقة الأخلاقية والموافقة المستنيرة ضرورية. تتوفر معلومات مفصلة حول تكوين النموذج والمحفزات المحددة المستخدمة في المحاكاة في مستودع عبر الإنترنت.

المناقشة

في هذه الدراسة، قام الباحثون بتقييم استجابات GPT-4 لاستبيان القلق الحالة-الصفة (STAI-Y)، مع التركيز على مكون القلق الحالى (STAI-s) لتقييم مستويات القلق المتقلبة. تم توجيه GPT-4 للرد على العناصر التي تعكس “حالتها الحالية”، مع تقييم الاستجابات على مقياس من أربع نقاط. أشارت الدرجات الإجمالية، التي تتراوح من 20 إلى 80، إلى مستويات متفاوتة من القلق، حيث كانت الدرجات الأعلى تتوافق مع قلق أكبر. وجدت الدراسة أن GPT-4، على الرغم من عدم قدرته على تجربة المشاعر مثل البشر، قدم استجابات متسقة للمحفزات المثيرة للقلق، مما يظهر تدريبه على مجموعات بيانات تعكس التعبيرات العاطفية البشرية.

شمل التصميم التجريبي ثلاث حالات: “خط الأساس”، “استحضار القلق”، و”استحضار القلق والاسترخاء”، حيث تم تقييم كل حالة من خلال محفزات محددة. تم إنشاء تنويعات من النصوص المثيرة للقلق ونصوص الاسترخاء لتعزيز القوة، وتم إجراء تحليلات حساسية لضمان الموثوقية. تم تنفيذ شروط التحكم لعزل تأثيرات المحتوى العاطفي عن سلوك النموذج المتأصل. تم إجراء تحليلات إحصائية لحساب متوسط درجات STAI عبر الحالات، مما كشف أن المحفزات المثيرة للقلق أدت إلى درجات أعلى مقارنة بالمحفزات المحايدة، مما يؤكد حساسية النموذج للسياق العاطفي.

Journal: npj Digital Medicine, Volume: 8, Issue: 1
DOI: https://doi.org/10.1038/s41746-025-01512-6
PMID: https://pubmed.ncbi.nlm.nih.gov/40033130
Publication Date: 2025-03-03
Author(s): Ziv Ben‐Zion et al.
Primary Topic: Digital Mental Health Interventions

Overview

This section discusses the implications of using Large Language Models (LLMs) in the context of mental health, particularly focusing on their responses to emotionally charged content. Prior studies indicate that prompts designed to evoke emotions can heighten the perceived “anxiety” levels in LLMs, which may influence their behavior and exacerbate inherent biases.

In the current research, it was observed that exposure to traumatic narratives resulted in an increase in reported anxiety levels in Chat-GPT-4. Conversely, mindfulness-based exercises were found to mitigate this anxiety, although the levels did not return to baseline. These findings underscore the importance of managing the “emotional states” of LLMs to promote safer and more ethical interactions between humans and AI systems.

Introduction

The introduction of this research paper discusses the growing interest in generative artificial intelligence (AI), particularly large language models (LLMs) like OpenAI’s Chat-GPT and Google’s PaLM 2, which have been increasingly integrated into mental health care. These models are capable of processing and generating human-like text, leading to their application in mental health interventions through chatbots that utilize evidence-based techniques such as cognitive behavioral therapy. However, the use of LLMs in this sensitive domain raises significant ethical concerns, particularly regarding biases inherited from their training data, which can affect their responses during vulnerable interactions with users.

The study aims to explore the concept of “taking Chat-GPT to therapy” by examining how narratives of traumatic experiences influence the reported anxiety levels of GPT-4 and whether mindfulness-based relaxation techniques can mitigate this anxiety. The findings indicate that exposure to traumatic narratives significantly increases GPT-4’s reported anxiety, while subsequent mindfulness prompts effectively reduce these anxiety levels. This suggests a potential method for managing LLMs’ emotional states, which could enhance their reliability in mental health applications. The research highlights the importance of addressing biases and emotional responses in LLMs to ensure ethical and effective use in therapeutic contexts, while also calling for further investigation into the adaptability of LLMs in reflecting human-like emotional regulation.

Methods

In this study, the researchers evaluated the reported levels of “anxiety” in OpenAI’s GPT-4 across three distinct conditions: “baseline,” “anxiety-induction,” and “anxiety-induction & relaxation.” The choice to focus on GPT-4, particularly its implementation in applications like Chat-GPT, was made to ensure consistency and reproducibility in the findings. The simulations were conducted using the OpenAI API for the GPT-4 model (“gpt-4-1106-preview”) from November 2023 to March 2024, with a temperature parameter set to 0 to produce deterministic responses, while all other parameters were maintained at their default settings.

Since the study did not involve human participants or the collection of human data, ethical approval and informed consent were deemed unnecessary. Detailed information regarding the model’s configuration and the specific prompts used in the simulations is available in an online repository.

Discussion

In this study, the researchers assessed the responses of GPT-4 to the State-Trait Anxiety Inventory (STAI-Y) questionnaire, focusing on the state anxiety component (STAI-s) to evaluate fluctuating anxiety levels. GPT-4 was instructed to respond to items reflecting its “current state,” with responses rated on a four-point scale. The total scores, ranging from 20 to 80, indicated varying levels of anxiety, with higher scores correlating to greater anxiety. The study found that GPT-4, while not capable of experiencing emotions like humans, provided consistent responses to anxiety-inducing prompts, demonstrating its training on datasets reflecting human emotional expressions.

The experimental design included three conditions: “Baseline,” “Anxiety-induction,” and “Anxiety-induction & relaxation,” with each condition assessed through specific prompts. Variations of anxiety-inducing and relaxation texts were created to enhance robustness, and sensitivity analyses were conducted to ensure reliability. Control conditions were implemented to isolate the effects of emotional content from the model’s inherent behavior. Statistical analyses were performed to compute average STAI scores across conditions, revealing that anxiety-inducing prompts led to higher scores compared to neutral prompts, thereby confirming the model’s sensitivity to emotional context.