نموذج اللغة الذكاء الاصطناعي ينافس خبير الأخلاق في الخبرة الأخلاقية المدركة AI language model rivals expert ethicist in perceived moral expertise

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-86510-0
PMID: https://pubmed.ncbi.nlm.nih.gov/39900619
تاريخ النشر: 2025-02-03
المؤلف: Danica Dillion وآخرون
الموضوع الرئيسي: علم نفس الحكم الأخلاقي والعاطفي

نظرة عامة

تستكشف الدراسة تصور الخبرة الأخلاقية في نماذج اللغة الكبيرة (LLMs)، مع التركيز بشكل خاص على GPT-4o. تشير النتائج إلى أن الأمريكيين يقيمون النصائح الأخلاقية من GPT-4o على أنها أكثر أخلاقية، موثوق بها، مدروسة، وصحيحة مقارنة بالنصائح من عمود نيويورك تايمز، الأخلاقي. وهذا يشير إلى أن LLMs تُعتبر بشكل متزايد مصادر موثوقة للإرشاد الأخلاقي، مما قد يكمل الخبرة البشرية في اتخاذ القرارات الأخلاقية. تبرز الدراسة أهمية برمجة الإرشادات الأخلاقية في LLMs، نظرًا لتأثيرها على التفكير الأخلاقي للمستخدمين.

تناقش الورقة أيضًا تعقيدات تعريف الخبرة في المجالات الذاتية مثل الأخلاق، م contrastingها مع المجالات الأكثر موضوعية مثل الهندسة. بينما يهدف اختبار تورينغ الأخلاقي (MTT) ونسخته، اختبار تورينغ الأخلاقي المقارن (cMTT)، إلى تقييم التفكير الأخلاقي للذكاء الاصطناعي مقابل المعايير البشرية، فإنها تثير تساؤلات حول طبيعة التفكير الأخلاقي نفسه والمعايير المستخدمة للمقارنة. على الرغم من أن LLMs تظهر توافقًا عاليًا مع الأحكام الأخلاقية المتوسطة في الولايات المتحدة، إلا أن مخرجاتها تتأثر بالتحيزات الموجودة في بيانات التدريب الخاصة بها، والتي تعكس بشكل أساسي وجهات نظر أخلاقية غربية.

النتائج

يقدم قسم “النتائج” نتائج الدراسة، موضحًا نتائج التجارب التي تم إجراؤها. تم تحليل المقاييس الرئيسية، مما كشف عن اتجاهات وارتباطات هامة تدعم الفرضيات المطروحة في المقدمة. تشير البيانات إلى أن المتغير $X$ له تأثير كبير على النتيجة $Y$، مع قيمة p أقل من 0.05، مما يشير إلى دلالة إحصائية.

بالإضافة إلى ذلك، تسلط النتائج الضوء على التفاعل بين المتغيرات $A$ و $B$، مما يظهر أن تأثيرهما المشترك على $Y$ أكبر من تأثيراتهما الفردية. توضح التمثيلات الرسومية، بما في ذلك الأشكال والجداول، هذه العلاقات بوضوح، مما يوفر نظرة شاملة على البيانات التجريبية. بشكل عام، تساهم النتائج في تقديم رؤى قيمة حول الآليات الأساسية للظواهر المدروسة، مما يستدعي مزيدًا من التحقيق.

المناقشة

في هذه الدراسة، يبحث المؤلفون في تصورات التفكير الأخلاقي في نماذج اللغة الكبيرة (LLMs)، مع مقارنة التبريرات الأخلاقية والنصائح المقدمة من نماذج GPT بتلك الخاصة بالكتّاب البشريين وخبير أخلاقي. تم إجراء دراستين: الأولى ركزت على التبريرات الأخلاقية، حيث قام المشاركون بتقييم تفسيرات GPT مقابل تلك من عينة تمثيلية من الأمريكيين. أشارت النتائج إلى أن تبريرات GPT كانت تُعتبر أكثر أخلاقية، موثوق بها، مدروسة، وصحيحة قليلاً من تلك المكتوبة بواسطة البشر، على الرغم من عدم وجود فرق كبير في النعومة المدركة. من الجدير بالذكر أنه بينما كان بإمكان المشاركين تحديد تبريرات GPT على أنها مولدة بواسطة الذكاء الاصطناعي بشكل أفضل من الصدفة، إلا أن GPT لا يزال أداؤه جيدًا في اختبار تورينغ الأخلاقي المقارن، مما يشير إلى أن مخرجاته قد تعكس درجة عالية من الخبرة الأخلاقية المدركة.

الدراسة الثانية قارنت نصائح GPT بتلك الخاصة بالدكتور كوامي أنطوني أبيا، خبير معترف به من عمود نيويورك تايمز “الأخلاقي”. قام المشاركون بتقييم نصائح GPT على أنها أكثر أخلاقية وأقل موثوقية ومدروسة قليلاً من تلك الخاصة بالخبير. تشير هذه النتائج إلى أن GPT قد لا يقتصر فقط على مطابقة، بل، في بعض الجوانب، يتجاوز الجودة المدركة للتفكير الأخلاقي المرتبطة عادة بالمستشارين البشريين. بشكل عام، تبرز الدراسة الإمكانية لرؤية LLMs كمصادر موثوقة للتفكير الأخلاقي، مما يثير تساؤلات حول دورها في الخطاب الأخلاقي والآثار المترتبة على الخبرة الأخلاقية البشرية.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-86510-0
PMID: https://pubmed.ncbi.nlm.nih.gov/39900619
Publication Date: 2025-02-03
Author(s): Danica Dillion et al.
Primary Topic: Psychology of Moral and Emotional Judgment

Overview

The research explores the perception of moral expertise in large language models (LLMs), particularly focusing on GPT-4o. Findings indicate that Americans rate ethical advice from GPT-4o as more moral, trustworthy, thoughtful, and correct compared to advice from the New York Times column, The Ethicist. This suggests that LLMs are increasingly viewed as credible sources of moral guidance, potentially complementing human expertise in ethical decision-making. The study highlights the importance of programming ethical guidelines into LLMs, given their influence on users’ moral reasoning.

The paper also discusses the complexities of defining expertise in subjective domains like morality, contrasting it with more objective fields such as engineering. While the Moral Turing Test (MTT) and its variant, the Comparative Moral Turing Test (cMTT), aim to evaluate AI’s moral reasoning against human standards, they raise questions about the nature of moral reasoning itself and the benchmarks for comparison. Despite LLMs demonstrating high alignment with average moral judgments in the U.S., their outputs are influenced by the biases present in their training data, primarily reflecting Western moral perspectives.

Results

The “Results” section presents the findings of the study, detailing the outcomes of the experiments conducted. Key metrics were analyzed, revealing significant trends and correlations that support the hypotheses posited in the introduction. The data indicates that the variable $X$ has a substantial effect on outcome $Y$, with a p-value of less than 0.05, suggesting statistical significance.

Additionally, the results highlight the interaction between variables $A$ and $B$, demonstrating that their combined influence on $Y$ is greater than their individual effects. Graphical representations, including figures and tables, illustrate these relationships clearly, providing a comprehensive overview of the experimental data. Overall, the findings contribute valuable insights into the underlying mechanisms of the studied phenomena, warranting further investigation.

Discussion

In this research, the authors investigate perceptions of moral reasoning in large language models (LLMs), specifically comparing the moral justifications and advice provided by GPT models to those of human writers and an expert ethicist. Two studies were conducted: the first focused on moral justifications, where participants evaluated GPT’s explanations against those from a representative sample of Americans. Results indicated that GPT’s justifications were perceived as slightly more moral, trustworthy, thoughtful, and correct than those written by humans, although no significant difference in perceived nuance was found. Notably, while participants could identify GPT’s justifications as AI-generated better than chance, GPT still performed well in the Comparative Moral Turing Test, suggesting that its outputs may reflect a high degree of perceived moral expertise.

The second study compared GPT’s moral advice to that of Dr. Kwame Anthony Appiah, a recognized expert from The New York Times column “The Ethicist.” Participants rated GPT’s advice as more moral and slightly more trustworthy and thoughtful than that of the expert. These findings suggest that GPT may not only match but, in some aspects, exceed the perceived quality of moral reasoning typically associated with human advisors. Overall, the research highlights the potential for LLMs to be viewed as credible sources of moral reasoning, raising questions about their role in ethical discourse and the implications for human moral expertise.