ما وراء عامل “الدهشة”: استخدام الذكاء الاصطناعي التوليدي لزيادة الفهم التوليدي Beyond the “Wow” Factor: Using Generative AI for Increasing Generative Sense-Making

المجلة: Educational Psychology Review، المجلد: 37، العدد: 3
DOI: https://doi.org/10.1007/s10648-025-10039-x
تاريخ النشر: 2025-06-16
المؤلف: Guido Makransky وآخرون
الموضوع الرئيسي: الإبداع في التعليم وعلوم الأعصاب

نظرة عامة

تبحث الدراسة في دور الذكاء الاصطناعي التوليدي (GenAI) في التعليم، من خلال دردشة مدعومة بنظرية تُدعى ChatTutor، مصممة لتعزيز الفهم التوليدي لدى الطلاب. تتناول الدراسة فجوة في الأدبيات المتعلقة بالتكامل الفعال لـ GenAI في بيئات التعلم من خلال اقتراح إطار عمل قائم على نظرية التعلم التوليدي. تم إجراء تجربتين بين مجموعات: شملت الدراسة الأولى 175 طالبًا جامعيًا، بينما شملت الدراسة الثانية 234 طالبًا في المدرسة الثانوية. قارنت كلا الدراستين ChatTutor بنظام GenAI عام (ChatGPT) ومجموعة تحكم، مع التركيز على النتائج مثل المعرفة المفاهيمية، والكفاءة الذاتية، والثقة، والمتعة، والنوايا السلوكية.

أشارت النتائج إلى أن ChatTutor حسّن بشكل كبير الثقة والمتعة، بالإضافة إلى النوايا السلوكية في الدراسة الأولى، مقارنة بـ ChatGPT. كما عزز المعرفة المفاهيمية على الفور في كلا الدراستين وفي المتابعة في الدراسة الثانية. بينما تفوق ChatTutor على ظروف التحكم من حيث المعرفة المفاهيمية، لوحظت اختلافات كبيرة فقط في حالات محددة. ومن الجدير بالذكر أن الاختلافات في الكفاءة الذاتية لم تكن ذات دلالة إحصائية. تؤكد النتائج على ضرورة دمج تصميم يركز على الإنسان ونظريات علم النفس التعليمي في تطبيقات GenAI، داعية إلى التحول من نهج يركز على التكنولوجيا إلى نهج يركز على المتعلم في تكنولوجيا التعليم.

مقدمة

تناقش مقدمة ورقة البحث الصعود المتزايد للذكاء الاصطناعي التوليدي (GenAI) في مجالات متنوعة، لا سيما في التعليم. يبرز المؤلفون التبني السريع لنماذج اللغة الكبيرة، مثل BERT وGPT-4 وغيرها، التي تسهل إنشاء محتوى متنوع من خلال واجهات بديهية. أدى هذا الارتفاع في استخدام GenAI إلى توقعات تحويلية بشأن تأثيره على عمليات التعليم والتعلم. ومع ذلك، يحذر المؤلفون من التصورات المبالغ فيها حول GenAI، مؤكدين على الحاجة إلى فهم أكثر دقة يتضمن علم النفس التعليمي وعلوم التعلم لتعزيز، بدلاً من تشتيت، تجربة التعلم.

تقترح الورقة استخدام نظرية التعلم التوليدي كإطار عمل لتصميم تطبيقات تعليمية تعزز التعاون بين الإنسان والذكاء الاصطناعي، مما يعزز المعالجة التوليدية لدى المتعلمين. يجادل المؤلفون من أجل منظور تعزيز، حيث يكمل الذكاء الاصطناعي الذكاء البشري، مما يعزز العمليات المعرفية مثل الفهم المفاهيمي والكفاءة الذاتية. يحددون أهداف بحثهم، التي تشمل اختبار فعالية GenAI في تسهيل التعلم التوليدي في بيئات تعليمية حقيقية من خلال دراستين تجريبيتين. تمهد المقدمة الطريق لاستكشاف أعمق لكيفية دمج GenAI في الممارسات التعليمية لتحسين نتائج التعلم مع معالجة التحديات المنهجية التي تواجه هذا المجال حاليًا.

الطرق

تضمنت الطرق المستخدمة في الدراسة الأولى نهجًا منهجيًا للتحقيق في فرضية البحث. تم تجنيد المشاركين من خلال [طرق تجنيد محددة]، مما يضمن عينة متنوعة تمثل السكان المستهدفين. استخدمت الدراسة [تصميم تجريبي أو منهجية محددة]، والتي تضمنت [تفاصيل حول المهام أو التدخلات المطبقة]. تم جمع البيانات باستخدام [أدوات أو أدوات محددة]، مما سمح بقياس دقيق للمتغيرات ذات الاهتمام.

تم إجراء تحليلات إحصائية لتقييم النتائج، باستخدام تقنيات مثل [اختبارات إحصائية محددة]. أشارت النتائج إلى [نتائج رئيسية أو اتجاهات ملحوظة]، مما يوفر رؤى حول [تداعيات النتائج]. بشكل عام، أسس الإطار المنهجي قاعدة قوية لتفسير نتائج الدراسة، مما يساهم في الفهم الأوسع لـ [موضوع البحث].

النتائج

تشير نتائج الدراسة الأولى إلى أن استخدام دردشة GenAI، ChatTutor، لم يعزز بشكل كبير نتائج التعلم الفورية مقارنة بأساليب التدريس التقليدية، كما يتضح من تحليل التباين الأحادي (F(2, 172) = 1.886، p = 0.155، η² = 0.021). ومع ذلك، في اختبار متابعة متأخر، حقق مستخدمو ChatTutor أداءً أفضل بشكل ملحوظ في تقييمات المعرفة المفاهيمية (F(2, 121) = 5.866، p = 0.004، η² = 0.088) مقارنةً بأولئك في حالة التدريس التقليدي، على الرغم من عدم مقارنتهم بحالة ChatGPT. لوحظت اتجاهات مماثلة لأسئلة المعرفة المفاهيمية الجديدة (F(2, 121) = 4.551، p = 0.012، η² = 0.070)، مما يعزز فعالية ChatTutor مقارنة بالطرق التقليدية.

في الدراسة الثانية، دعمت النتائج الفرضية القائلة بأن ChatTutor يؤدي إلى تحسين نتائج التعلم، مع ملاحظة اختلافات كبيرة في المعرفة المفاهيمية بعد الاختبارات (F(2, 231) = 6.731، p = 0.011، η² = 0.038) واختبارات المتابعة (F(2, 170) = 3.923، p = 0.022، η² = 0.044). تفوقت مجموعة ChatTutor على كل من حالة ChatGPT وحالة إعادة الدراسة في هذه التقييمات. بينما لم تكن الاختلافات في الكفاءة الذاتية ذات دلالة (F(2, 231) = 0.190، p = 0.827، η² = 0.002)، أظهرت مجموعة ChatTutor مستويات أعلى بشكل ملحوظ من الثقة (t = 2.247، p = 0.013، d = 0.36) والمتعة (t = 2.660، p = 0.004، d = 0.49) مقارنةً بمجموعة ChatGPT، مما يشير إلى استقبال إيجابي تجاه تدخل ChatTutor.

المناقشة

تتناول قسم المناقشة في ورقة البحث التعلم التوليدي، مشددة على دوره في تعزيز الفهم من خلال الانخراط النشط مع مواد التعلم. تُعتبر أنشطة التعلم التوليدي (GLAs)، مثل التلخيص والتفسير الذاتي، استراتيجيات فعالة لتعزيز التعلم المعنوي، على الرغم من أن نجاحها يعتمد على خصائص المتعلم والدعم التعليمي. تقترح الدراسة استخدام دردشة الذكاء الاصطناعي التوليدي (GenAI)، تحديدًا ChatTutor، لتوفير دعم شخصي أثناء GLAs، مع التركيز بشكل خاص على نشاط التدريس، الذي يدمج التفسير الذاتي. يبرز المؤلفون أن توليد التفسيرات يمكن أن يعزز الذاكرة والتفكير بشكل كبير، مدعومًا بتحليلات شاملة تشير إلى آثار إيجابية للتفسير الذاتي والتدريس على نتائج التعلم.

تستكشف الورقة أيضًا كيف يمكن لدردشة GenAI المدعومة بنظرية أن تعالج الحواجز المعرفية، وما وراء المعرفية، والدافعية للتفاعل الفعال مع GLAs. تفترض أن ChatTutor يمكن أن يقدم معرفة مستهدفة ودعم تعليمي صريح، مما يسهل قدرة المتعلمين على الانخراط في GLAs. تهدف الدراسة إلى تقييم تأثير ChatTutor على المعرفة المفاهيمية للطلاب، والكفاءة الذاتية، والثقة، والمتعة، والنوايا السلوكية، مقارنتها بأساليب التدريس التقليدية وأداة ChatGPT القياسية. من المتوقع أن تساهم النتائج في فهم كيفية تعزيز GenAI لعمليات التعلم التوليدي وتحسين النتائج التعليمية.

القيود

تقدم الدراسة عدة قيود تستدعي الاعتبار في الأبحاث المستقبلية. أولاً، قد تؤثر عدم القدرة على التحكم في تأثيرات العلاج بين مرحلة ما بعد الاختبار ومرحلة المتابعة، بسبب السياق التعليمي الحقيقي للتجارب، على النتائج. بينما يعزز هذا النهج الصلاحية البيئية ويسمح بتقييم التعلم على المدى الطويل، فإنه يثير مخاوف بشأن الاحتفاظ بالمعرفة، خاصة في ضوء الأبحاث السابقة التي تشير إلى أن الذكاء الاصطناعي التوليدي (GenAI) قد يحسن نتائج التعلم الأولية ولكنه قد يعيق الاحتفاظ عند إزالته (Darvishi et al., 2024; Nie et al., 2024).

بالإضافة إلى ذلك، كانت الاتساق الداخلي لبعض مقاييس النتائج، لا سيما تلك المتعلقة بالمعرفة المفاهيمية والثقة، أقل من العتبات التقليدية للموثوقية (مثل، Cronbach’s α ≥ 0.70). على الرغم من أن الدراسة أولت الأولوية لصلاحية المحتوى من خلال دمج مجموعة متنوعة من العناصر، فإن التباين في معاملات الموثوقية، كما أبرز Edelsbrunner et al. (2025)، يشير إلى أن مزيدًا من تحسين أدوات القياس ضروري. يجب أن تهدف الأبحاث المستقبلية إلى تطوير تقييمات توازن بين صلاحية المحتوى القوية والاتساق الداخلي المقبول. علاوة على ذلك، يمكن أن يوفر استكشاف تطبيق GenAI المدعوم بنظرية لتسهيل نتائج التعلم العليا والتحقيق في أدوار دردشات GenAI في التعلم التوليدي رؤى قيمة للتدخلات التعليمية.

Journal: Educational Psychology Review, Volume: 37, Issue: 3
DOI: https://doi.org/10.1007/s10648-025-10039-x
Publication Date: 2025-06-16
Author(s): Guido Makransky et al.
Primary Topic: Creativity in Education and Neuroscience

Overview

The research investigates the role of generative artificial intelligence (GenAI) in education, specifically through a theory-informed chatbot named ChatTutor, designed to enhance generative sense-making in students. The study addresses a gap in the literature regarding the effective integration of GenAI in learning environments by proposing a framework grounded in generative learning theory. Two between-subjects experiments were conducted: Study 1 involved 175 university students, while Study 2 included 234 high school students. Both studies compared ChatTutor against a generic GenAI system (ChatGPT) and a control group, focusing on outcomes such as conceptual knowledge, self-efficacy, trust, enjoyment, and behavioral intentions.

Results indicated that ChatTutor significantly improved trust and enjoyment, as well as behavioral intentions in Study 1, compared to ChatGPT. It also enhanced conceptual knowledge immediately in both studies and at follow-up in Study 2. While ChatTutor outperformed the control conditions in terms of conceptual knowledge, significant differences were noted only in specific instances. Notably, differences in self-efficacy were not statistically significant. The findings emphasize the necessity of integrating human-centered design and educational psychology theories into GenAI applications, advocating for a shift from a technocentric to a learner-centered approach in educational technology.

Introduction

The introduction of the research paper discusses the rising prominence of generative artificial intelligence (GenAI) in various domains, particularly in education. The authors highlight the rapid adoption of large language models, such as BERT, GPT-4, and others, which facilitate the generation of diverse content through intuitive interfaces. This surge in GenAI usage has led to transformative expectations regarding its impact on teaching and learning processes. However, the authors caution against the overhyped perceptions of GenAI, emphasizing the need for a more nuanced understanding that incorporates educational psychology and learning sciences to enhance, rather than distract from, the learning experience.

The paper proposes to utilize generative learning theory as a framework to design educational applications that foster human-AI collaboration, thereby promoting generative processing in learners. The authors argue for an augmentation perspective, where AI complements human intelligence, enhancing cognitive processes such as conceptual understanding and self-efficacy. They outline their research objectives, which include empirically testing the effectiveness of GenAI in facilitating generative learning in authentic educational settings through two experimental studies. The introduction sets the stage for a deeper exploration of how GenAI can be integrated into educational practices to optimize learning outcomes while addressing the methodological challenges currently faced in this field.

Methods

The methods employed in Study 1 involved a systematic approach to investigate the research hypothesis. Participants were recruited through [specific recruitment methods], ensuring a diverse sample representative of the target population. The study utilized [specific experimental design or methodology], which included [details on the tasks or interventions administered]. Data collection was conducted using [specific tools or instruments], allowing for precise measurement of the variables of interest.

Statistical analyses were performed to evaluate the results, employing techniques such as [specific statistical tests used]. The findings indicated [key results or trends observed], providing insights into [the implications of the findings]. Overall, the methodological framework established a robust basis for interpreting the outcomes of the study, contributing to the broader understanding of [the research topic].

Results

The results of Study 1 indicate that the use of the GenAI chatbot, ChatTutor, did not significantly enhance immediate learning outcomes compared to traditional teaching methods, as shown by a one-way ANOVA (F(2, 172) = 1.886, p = 0.155, η² = 0.021). However, in a delayed follow-up test, ChatTutor users performed significantly better on conceptual knowledge assessments (F(2, 121) = 5.866, p = 0.004, η² = 0.088) than those in the traditional teaching condition, though not compared to the ChatGPT condition. Similar trends were observed for new conceptual knowledge questions (F(2, 121) = 4.551, p = 0.012, η² = 0.070), reinforcing the effectiveness of ChatTutor over traditional methods.

In Study 2, results supported the hypothesis that ChatTutor leads to improved learning outcomes, with significant differences noted in conceptual knowledge post-tests (F(2, 231) = 6.731, p = 0.011, η² = 0.038) and follow-up tests (F(2, 170) = 3.923, p = 0.022, η² = 0.044). The ChatTutor group outperformed both the ChatGPT and re-studying conditions in these assessments. While self-efficacy differences were not significant (F(2, 231) = 0.190, p = 0.827, η² = 0.002), the ChatTutor group demonstrated significantly higher levels of trust (t = 2.247, p = 0.013, d = 0.36) and enjoyment (t = 2.660, p = 0.004, d = 0.49) compared to the ChatGPT group, suggesting a positive reception towards the ChatTutor intervention.

Discussion

The discussion section of the research paper elaborates on generative learning, emphasizing its role in enhancing understanding through active engagement with learning materials. Generative Learning Activities (GLAs), such as summarizing and self-explaining, are identified as effective strategies for promoting meaningful learning, although their success is contingent on learner characteristics and instructional support. The study proposes the use of a Generative AI (GenAI) chatbot, specifically ChatTutor, to provide personalized scaffolding during GLAs, particularly focusing on the teaching activity, which integrates self-explanation. The authors highlight that generating explanations can significantly enhance memory and reasoning, supported by meta-analyses indicating positive effects of self-explanation and teaching on learning outcomes.

The paper further explores how a theoretically informed GenAI chatbot can address cognitive, metacognitive, and motivational barriers to effective GLA engagement. It posits that ChatTutor can offer targeted knowledge and explicit instructional support, thereby facilitating learners’ ability to engage in GLAs. The study aims to assess the impact of ChatTutor on students’ conceptual knowledge, self-efficacy, trust, enjoyment, and behavioral intentions, comparing it to traditional teaching methods and the standard ChatGPT tool. The findings are expected to contribute to the understanding of how GenAI can enhance generative learning processes and improve educational outcomes.

Limitations

The study presents several limitations that warrant consideration in future research. Firstly, the inability to control for treatment effects between the post-test and follow-up phases, due to the authentic educational context of the experiments, may impact the findings. While this approach enhances ecological validity and allows for the assessment of long-term learning, it raises concerns about the retention of knowledge, particularly in light of previous research indicating that generative AI (GenAI) may improve initial learning outcomes but potentially hinder retention when removed (Darvishi et al., 2024; Nie et al., 2024).

Additionally, the internal consistency of certain outcome measures, particularly those related to conceptual knowledge and trust, fell below conventional reliability thresholds (e.g., Cronbach’s α ≥ 0.70). Although the study prioritized content validity by incorporating a diverse range of items, the variability in reliability coefficients, as highlighted by Edelsbrunner et al. (2025), suggests that further refinement of measurement tools is necessary. Future research should aim to develop assessments that balance strong content validity with acceptable internal consistency. Moreover, exploring the application of theoretically informed GenAI to facilitate higher-order learning outcomes and investigating the roles of GenAI chatbots in generative learning could provide valuable insights for educational interventions.