فحص الأجواء: فهم تأثيرات شخصية ومواءمة وكلاء المحادثة المعتمدين على LLM على تصورات المستخدمين في المهام الموجهة نحو الهدف Vibe Check: Understanding the Effects of LLM-Based Conversational Agents’ Personality and Alignment on User Perceptions in Goal-Oriented Tasks

المجلة: Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems
DOI: https://doi.org/10.1145/3772318.3790388
تاريخ النشر: 2026-04-13
المؤلف: Zhenyun Du وآخرون
الموضوع الرئيسي: الذكاء الاصطناعي في تفاعلات الخدمة

نظرة عامة

تقدم هذه القسم نظرة عامة على دراسة تبحث في تأثير التعبير عن الشخصية في وكلاء المحادثة المعتمدين على نماذج اللغة الكبيرة (LLM) على تصورات المستخدمين أثناء المهام الموجهة نحو الأهداف. تشير النتائج إلى وجود علاقة على شكل U مقلوب، حيث يكون مستوى متوسط من التعبير عن الشخصية هو الأكثر فعالية، متفوقًا على كل من المستويات المنخفضة والعالية من حيث الذكاء المدرك، والمتعة، والتجسيد، ونية التبني، والثقة، والإعجاب. بالإضافة إلى ذلك، يؤثر توافق سمات الشخصية بين المستخدمين والوكلاء بشكل كبير على تجربة المستخدم، حيث تم تحديد الانفتاح والاستقرار العاطفي كسمات ذات تأثير خاص.

تقدم الدراسة إطار عمل TMK، الذي يسمح بالتحكم الدقيق في التعبير عن الشخصية في نماذج اللغة الكبيرة، مما يسهل الانتقال من التلاعب الثنائي إلى استكشاف أكثر منهجية لتصميم الشخصية. تشير النتائج إلى أن التعبير المتوسط يجب أن يكون الافتراضي في تصميم وكلاء المحادثة، مع التركيز على توافق سمات الشخصية، وخاصة الانفتاح والاستقرار العاطفي، لتعزيز توافق المستخدم. يدعو المؤلفون إلى استخدام طرق المعايرة التكيفية لتلبية ملفات تعريف المستخدمين المتنوعة ويؤكدون على الحاجة إلى مزيد من البحث لاستكشاف هذه الديناميات عبر مجالات وثقافات مختلفة، مع ضمان الشفافية واستقلالية المستخدم في التعديلات في الوقت الحقيقي. بشكل عام، تؤكد الأبحاث على الدور الحاسم للشخصية في تشكيل تصورات المستخدمين وتبني أنظمة الذكاء الاصطناعي المحادثة.

مقدمة

في مقدمة هذه الورقة البحثية، يعكس المؤلفون تطور وكلاء المحادثة (CAs) من الأنظمة المبكرة مثل ELIZA، التي استخدمت تقنيات مطابقة الأنماط البسيطة، إلى الوكلاء المعاصرين المعتمدين على نماذج اللغة الكبيرة (LLM) مثل ChatGPT. يبرزون ظاهرة نسبة المستخدمين البشريين الحضور الاجتماعي والشخصية إلى هذه الأنظمة، وهو مفهوم تم صياغته من خلال نموذج “الحواسيب كفاعلين اجتماعيين” (CASA). على الرغم من التقدم في تكنولوجيا وكلاء المحادثة، لا يزال هناك نقص في الفهم المنهجي حول كيفية تأثير التعبير عن الشخصية في هذه الوكلاء على تصورات المستخدمين، خاصة مع تحول وكلاء المحادثة إلى جزء أساسي من قطاعات مختلفة مثل التعليم والرعاية الصحية.

يقترح المؤلفون إطار عمل جديد، مفاتيح تعديل السمات (TMK)، مصمم للتحكم في التعبير عن سمات الشخصية الخمس الكبرى (الانفتاح، الضمير، الانبساط، التوافق، العصابية) بمستويات مختلفة (منخفضة، متوسطة، عالية) دون الاعتماد على خلفيات سيرة ذاتية. يقومون بإجراء دراسة محكومة بتصميم بين الموضوعات 3×1 تشمل 150 مشاركًا يتفاعلون مع مساعدي تخطيط السفر الذين يظهرون ملفات شخصية مختلفة. تهدف الدراسة إلى معالجة أربعة أسئلة بحثية تتعلق بتأثيرات التعبير عن الشخصية والتوافق على تصورات المستخدمين. تشير النتائج الأولية إلى أن مستويات التعبير عن الشخصية المتوسطة تحقق أكثر التصورات إيجابية من قبل المستخدمين، مما يشير إلى وجود علاقة على شكل U مقلوب بين التعبير عن الشخصية وتجربة المستخدم، بينما تؤكد أيضًا على توافق الشخصية كاعتبار تصميمي مهم.

النتائج

في هذه الدراسة، تم توزيع 150 مشاركًا بالتساوي على ثلاث حالات تجريبية بناءً على سمات شخصيتهم المتعلقة بالضمير والتوافق (CA)، مع 50 مشاركًا في كل حالة. أكمل 130 مشاركًا (86.7%) جميع المهام الفرعية السبع ضمن حد زمني قدره 10 دقائق، بينما أكمل 20 مشاركًا معظم المهام الفرعية ولكن ليس جميعها، وذلك بشكل أساسي بسبب قيود الوقت. كان عدد دورات المحادثة متسقًا عبر الحالات، مما يشير إلى أن الاختلافات في تصورات المستخدمين كانت على الأرجح بسبب اختلافات الشخصية بدلاً من مستويات الانخراط.

كانت ملفات الشخصية متشابهة عمومًا عبر الحالات، كما أكدت اختبارات كروسكال-واليس، التي كشفت عن عدم وجود اختلافات كبيرة في الانفتاح، والانبساط، والتوافق، والاستقرار العاطفي. ومع ذلك، أظهر الضمير اختلافًا كبيرًا عبر الحالات (????(2) = 6.51، ???? = .039). أشارت الإحصائيات الوصفية إلى أن توزيع السمات الإجمالي ظل متوازنًا، مما يقلل من خطر التداخل المنهجي. كشفت تحليل المكونات الرئيسية لدرجات الشخصية الخمس الكبرى عن بعد كامن واحد يمثل 34.3% من التباين الكلي، مع تحميلات إيجابية قوية للانبساط، والتوافق، والانفتاح. أظهرت جميع القياسات اتساقًا داخليًا جيدًا إلى ممتاز (???? = .81-.95)، وتم استخدام تحليلات غير معلمية بسبب انتهاكات افتراضات الطبيعية. المواد التكميلية التي تحتوي على جميع البيانات متاحة.

المناقشة

ت outlines قسم المناقشة في الورقة البحثية الجوانب الأساسية لتصميم شخصية وكلاء المحادثة (CA) وتأثيراتها على تصورات المستخدمين، خاصة في المهام الموجهة نحو الأهداف. يبرز أربعة مجالات حاسمة: الميل البشري الفطري لنسب سمات الشخصية إلى وكلاء المحادثة، توافق شخصيات المستخدمين والوكلاء، التقدم في تحفيز الشخصية المعتمد على نماذج اللغة الكبيرة (LLM)، وسياق المحادثات الموجهة نحو الأهداف. يوضح نموذج CASA أن المستخدمين يطبقون المعايير الاجتماعية على وكلاء المحادثة بشكل مشابه للبشر، حيث تشير الدراسات المبكرة إلى أن سمات الشخصية، وخاصة من إطار العمل الخمس الكبرى، تؤثر بشكل كبير على تفاعلات المستخدمين. ومع ذلك، ركزت الأبحاث الحالية إلى حد كبير على التلاعبات الفردية أو المتطرفة في السمات، مما ترك فجوة في فهم التعبيرات الأكثر دقة للشخصية وتأثيراتها على تجربة المستخدم.

تستكشف الورقة أيضًا مفهوم توافق الشخصية، مشيرة إلى أن المستخدمين يفضلون عمومًا وكلاء المحادثة الذين تعكس شخصياتهم شخصياتهم الخاصة، على الرغم من أن بعض الدراسات تشير إلى أن الشخصيات التكميلية قد تكون مفضلة أيضًا. تشير هذه الأدلة المختلطة إلى الحاجة إلى مزيد من البحث الشامل حول توافق الشخصية، خاصة باستخدام مقاييس مستمرة بدلاً من النهج الفئوية. يمثل إدخال نماذج اللغة الكبيرة تحولًا كبيرًا في أبحاث شخصية وكلاء المحادثة، مما يسمح بتعبير أكثر مرونة ومنهجية عن الشخصية من خلال التحفيز باللغة الطبيعية. يقترح المؤلفون إطار عمل جديد لمفاتيح تعديل السمات (TMK) يمكّن من التحكم المتزامن في جميع السمات الخمس الكبرى بمستويات تعبير مختلفة، مما يعالج القيود السابقة في طرق تحفيز الشخصية. يهدف هذا الإطار إلى تسهيل استكشاف كيفية تأثير الشخصية على تصورات المستخدمين في السياقات التطبيقية، خاصة في التفاعلات المنظمة والموجهة نحو الأهداف، وبالتالي ربط الرؤى النظرية بالتطبيقات العملية في تصميم وكلاء المحادثة.

القيود

تقدم الدراسة رؤى مهمة حول تأثير الشخصية على وكلاء المحادثة الموجهة نحو الأهداف (CAs)، ومع ذلك تعترف بعدة قيود تستدعي مزيدًا من التحقيق. العينة، التي تتكون من 150 بالغًا ناطقًا باللغة الإنجليزية من الولايات المتحدة، تحد من إمكانية تعميم النتائج عبر سياقات ثقافية متنوعة حيث قد يختلف التعبير عن الشخصية. بالإضافة إلى ذلك، قد لا يلتقط التركيز على تخطيط السفر كمجال مهمة محدد التباين في توقعات الشخصية وتصورات المستخدمين التي تنشأ في سياقات أخرى، مثل التفاعلات الاجتماعية أو ذات المخاطر العالية. لا يأخذ نافذة التفاعل التي مدتها 10 دقائق، على الرغم من فعاليتها في اكتشاف تأثيرات الشخصية، في الاعتبار تطور تصورات المستخدمين على مدى تفاعلات أطول أو متكررة.

من الناحية المنهجية، قد لا يعكس التصميم التجريبي القائم على السيناريوهات، على الرغم من كونه صارمًا، أنماط الاستخدام في العالم الحقيقي حيث يكون لدى المستخدمين مصالح حقيقية في النتائج. يحد نهج الدراسة في التلاعب بجميع السمات الخمس الكبرى في وقت واحد من خلال ملفات تعريف موحدة، على الرغم من كونه صالحًا بيئيًا، من القدرة على عزل السمات الفردية أو استكشاف تفاعلات السمات. علاوة على ذلك، قد يت overlook التشغيلية لمستويات الشخصية كمنخفضة-متوسطة-عالية التأثيرات الدقيقة، وتشير النتائج بشكل أساسي إلى تفضيل التعبير عن الشخصية المتوسطة دون معالجة الاستجابات لملفات التعريف المتنوعة. كشفت عملية التحقق من إطار عمل TMK عن تحديات في التحكم في السمات المستهدفة المتوسطة، مع حدوث أخطاء ضمن نطاقات قريبة. أخيرًا، تثير اعتماد الدراسة على واجهة نصية تساؤلات حول إمكانية تعميم تأثيرات الشخصية على التفاعلات الصوتية، التي قد تؤدي إلى تصورات وتفضيلات مختلفة.

Journal: Proceedings of the 2026 CHI Conference on Human Factors in Computing Systems
DOI: https://doi.org/10.1145/3772318.3790388
Publication Date: 2026-04-13
Author(s): Zhenyun Du et al.
Primary Topic: AI in Service Interactions

Overview

This section presents an overview of a study investigating the impact of personality expression in large language model (LLM)-based conversational agents (CAs) on user perceptions during goal-oriented tasks. The findings indicate an inverted-U relationship, where a medium level of personality expression is most effective, outperforming both low and high extremes in terms of perceived intelligence, enjoyment, anthropomorphism, intention to adopt, trust, and likability. Additionally, the alignment of personality traits between users and agents significantly influences user experience, with Extraversion and Emotional Stability identified as particularly impactful traits.

The study introduces the TMK framework, which allows for nuanced control of personality expression in LLMs, facilitating a shift from binary manipulations to a more systematic exploration of personality design. The results suggest that medium expression should be the default in CA design, with a focus on aligning personality traits, especially Extraversion and Emotional Stability, to enhance user compatibility. The authors advocate for adaptive calibration methods to cater to diverse user profiles and emphasize the need for future research to explore these dynamics across various domains and cultures, while ensuring transparency and user autonomy in real-time adaptations. Overall, the research underscores the critical role of personality in shaping user perceptions and the adoption of conversational AI systems.

Introduction

In the introduction of this research paper, the authors reflect on the evolution of conversational agents (CAs) from early systems like ELIZA, which utilized simple pattern-matching techniques, to contemporary large language model (LLM)-based agents such as ChatGPT. They highlight the phenomenon of human users attributing social presence and personality to these systems, a concept formalized by the Computers Are Social Actors (CASA) paradigm. Despite advancements in CA technology, there remains a lack of systematic understanding regarding how personality expression in these agents influences user perceptions, particularly as CAs become integral to various sectors like education and healthcare.

The authors propose a new framework, Trait Modulation Keys (TMK), designed to control the expression of the Big Five personality traits (Openness, Conscientiousness, Extraversion, Agreeableness, Neuroticism) at varying levels (low, medium, high) without relying on biographical backstories. They conduct a controlled study with a 3×1 between-subjects design involving 150 participants interacting with travel planning assistants exhibiting different personality profiles. The study aims to address four research questions related to the effects of personality expression and alignment on user perceptions. Preliminary findings suggest that medium personality expression levels yield the most favorable user perceptions, indicating an inverted-U relationship between personality expression and user experience, while also establishing personality alignment as a significant design consideration.

Results

In this study, 150 participants were evenly assigned to three experimental conditions based on their Conscientiousness and Agreeableness (CA) personality traits, with 50 participants in each condition. A total of 130 participants (86.7%) completed all seven subtasks within a 10-minute time limit, while 20 participants completed most but not all subtasks, primarily due to time constraints. The number of conversational turns was consistent across conditions, indicating that differences in user perceptions were likely due to personality variations rather than engagement levels.

Personality profiles were generally comparable across conditions, as confirmed by Kruskal-Wallis tests, which revealed no significant differences in Openness, Extraversion, Agreeableness, and Emotional Stability. However, Conscientiousness showed a significant difference across conditions (????(2) = 6.51, ???? = .039). Descriptive statistics indicated that the overall trait distribution remained balanced, minimizing the risk of systematic confounding. A Principal Component Analysis of the Big Five personality scores revealed a single latent dimension accounting for 34.3% of the total variance, with strong positive loadings for Extraversion, Agreeableness, and Openness. All measures exhibited good to excellent internal consistency (???? = .81-.95), and nonparametric analyses were employed due to violations of normality assumptions. Supplementary material containing all data is available.

Discussion

The discussion section of the research paper outlines the foundational aspects of conversational agent (CA) personality design and its implications for user perceptions, particularly in goal-oriented tasks. It highlights four critical domains: the inherent human tendency to ascribe personality traits to CAs, the alignment of user and agent personalities, advancements in large language model (LLM)-based personality prompting, and the context of goal-oriented conversations. The CASA paradigm illustrates that users apply social norms to CAs similarly to humans, with early studies indicating that personality traits, particularly from the Big Five framework, significantly influence user interactions. However, existing research has largely focused on single or extreme trait manipulations, leaving a gap in understanding more nuanced personality expressions and their effects on user experience.

The paper further explores the concept of personality alignment, suggesting that users generally prefer CAs whose personalities mirror their own, although some studies indicate that complementary personalities may also be favored. This mixed evidence points to the need for more comprehensive research on personality alignment, particularly using continuous measures rather than categorical approaches. The introduction of LLMs marks a significant shift in CA personality research, allowing for more flexible and systematic personality expression through natural language prompting. The authors propose a novel Trait Modulation Keys (TMK) framework that enables simultaneous control of all Big Five traits at varying levels of expression, addressing previous limitations in personality prompting methods. This framework aims to facilitate the exploration of how personality impacts user perceptions in applied contexts, particularly in structured, goal-oriented interactions, thus bridging theoretical insights with practical applications in CA design.

Limitations

The study presents significant insights into the influence of personality on goal-oriented conversational agents (CAs), yet it acknowledges several limitations that warrant further investigation. The sample, consisting of 150 native English-speaking adults from the US, restricts the generalizability of findings across diverse cultural contexts where personality expression may differ. Additionally, the focus on travel planning as a specific task domain may not capture the variability in personality expectations and user perceptions that arise in other contexts, such as social or high-stakes interactions. The 10-minute interaction window, while effective for detecting personality effects, does not account for the evolution of user perceptions over longer or repeated engagements.

Methodologically, the vignette-based experimental design, although rigorous, may not accurately reflect real-world usage patterns where users have genuine stakes in outcomes. The study’s approach to manipulating all five Big Five personality traits simultaneously through uniform profiles, while ecologically valid, limits the ability to isolate individual traits or explore trait interactions. Furthermore, the operationalization of personality levels as low-medium-high may overlook finer-grained effects, and the findings primarily indicate a preference for medium personality expression without addressing responses to heterogeneous profiles. The validation of the TMK framework revealed challenges in controlling medium-targeted traits, with misses occurring within close ranges. Lastly, the study’s reliance on a text-based interface raises questions about the generalizability of personality effects to voice-based interactions, which may yield different perceptions and preferences.