استخدام المراهقين وفائدتهم المدركة للذكاء الاصطناعي التوليدي في الأعمال المدرسية: استكشاف علاقاتهم بالوظائف التنفيذية والإنجاز الأكاديمي Adolescents’ use and perceived usefulness of generative AI for schoolwork: exploring their relationships with executive functioning and academic achievement

المجلة: Frontiers in Artificial Intelligence، المجلد: 7
DOI: https://doi.org/10.3389/frai.2024.1415782
PMID: https://pubmed.ncbi.nlm.nih.gov/39263526
تاريخ النشر: 2024-08-28
المؤلف: Johan Klarin وآخرون
الموضوع الرئيسي: التدخلات الرقمية في الصحة النفسية

نظرة عامة

تستكشف هذه الدراسة تكرار وفائدة نماذج الذكاء الاصطناعي التوليدية مثل ChatGPT، فيما يتعلق بالوظائف التنفيذية للمراهقين، والتي تشمل المهارات المعرفية الأساسية اللازمة للنجاح الأكاديمي. تم إجراء دراستين مع مراهقين أصغر سناً (N = 385، متوسط العمر 14 سنة) وكبار (N = 359، متوسط العمر 17 سنة). أظهرت النتائج أن 14.8% من المشاركين الأصغر سناً و52.6% من المشاركين الأكبر سناً أفادوا باستخدام الذكاء الاصطناعي التوليدي، مع كون ChatGPT الأداة المفضلة. ومن الجدير بالذكر أن المراهقين الذين يواجهون تحديات أكبر في الوظائف التنفيذية اعتبروا هذه الأدوات أكثر فائدة في إكمال الواجبات المدرسية، على الرغم من عدم وجود علاقة ذات دلالة إحصائية بين الإنجاز الأكاديمي واستخدام الذكاء الاصطناعي أو الفائدة المدركة في الدراسة 1.

تسلط الأبحاث الضوء على الحاجة إلى مزيد من الاستكشاف في العوامل التي تؤثر على استخدام الذكاء الاصطناعي التوليدي بين المراهقين، بما في ذلك الفروق المحتملة بين الجنسين في تفضيلات الأدوات وآثارها على العدالة الأكاديمية. تؤكد الدراسة على أهمية تقييم دمج الذكاء الاصطناعي التوليدي في البيئات التعليمية، خاصة بالنظر إلى المخاطر والفوائد المحتملة للطلاب الذين يعانون من صعوبات في الوظائف التنفيذية. مع استمرار تطور الذكاء الاصطناعي التوليدي، فإن اتخاذ تدابير استباقية وإجراء مزيد من الأبحاث أمر ضروري لضمان استخدامه الفعال والعادل في التعليم، مع معالجة الاعتبارات الأخلاقية المتعلقة بالنزاهة الأكاديمية والتطور المعرفي.

مقدمة

تناقش مقدمة هذه الورقة البحثية ظهور نقاش في السويد حول استخدام ChatGPT، وهو نموذج لغة كبير (LLM) للذكاء الاصطناعي التوليدي، في السياقات التعليمية بعد إصداره في ربيع 2023. يتركز النقاش حول المخاوف بشأن الغش المحتمل مقابل فوائد الذكاء الاصطناعي كمساعد تعليمي، خاصة للطلاب المعرضين لخطر الأداء الضعيف. تشير الأدلة من دراسة في ألمانيا إلى أن ثلثي طلاب الجامعات يستخدمون الذكاء الاصطناعي التوليدي لمهام أكاديمية متنوعة، بينما تكشف دراسة استقصائية في السويد أن 75% من المراهقين والشباب الذين تتراوح أعمارهم بين 15-24 يتفاعلون مع هذه الأدوات لأغراض تعليمية، مثل هيكلة العروض التقديمية والدراسة.

تسلط الورقة الضوء على آثار الذكاء الاصطناعي التوليدي على الوظائف التنفيذية، والتي تشمل العمليات المعرفية الأساسية للتخطيط والتركيز والانتباه. ترتبط العجز في الوظائف التنفيذية بتحقيق أكاديمي أقل ويمكن أن تؤدي إلى تحديات في المجالات الأكاديمية والاجتماعية. يشير المؤلفون إلى وجود فجوة في الأدبيات المتعلقة بالشروط الدقيقة التي تؤثر على استخدام وفعالية الذكاء الاصطناعي التوليدي، خاصة بين المراهقين الأصغر سناً الذين تتراوح أعمارهم بين 12-14. وبالتالي، تهدف الدراسة إلى استكشاف هدفين رئيسيين: (1) تكرار وفائدة أدوات الذكاء الاصطناعي التوليدي في الواجبات المدرسية، بما في ذلك الفروق المحتملة بين الجنسين، و(2) العلاقة بين أنماط الاستخدام هذه والوظائف التنفيذية والأداء الأكاديمي للمراهقين.

الطرق

توضح قسم “المواد والطرق” تصميم التجربة والإجراءات المستخدمة في الدراسة. يتفصل في المواد المحددة المستخدمة، بما في ذلك أي مواد كيميائية، معدات، وعينات بيولوجية، لضمان إمكانية تكرار النتائج. تشمل المنهجية البروتوكولات المتبعة لجمع البيانات، بما في ذلك أي تحليلات إحصائية تم إجراؤها لتفسير النتائج.

بالإضافة إلى ذلك، قد يصف القسم الظروف التجريبية، مثل درجة الحرارة، والمدة، والضوابط المطبقة للتحقق من النتائج. تعتبر وضوح وصرامة الطرق ضرورية لتقييم موثوقية استنتاجات الدراسة ولتمكين الباحثين الآخرين من تكرار التجارب.

النتائج

تشير النتائج من الدراسة 2 إلى نتائج هامة تساهم في الفهم العام لموضوع البحث. تظهر النتائج الرئيسية علاقة واضحة بين المتغيرات قيد التحقيق، مما يدعم الفرضيات الأولية. تكشف التحليلات الإحصائية أن التأثيرات الملحوظة قوية، مع قيم p تشير إلى دلالة قوية.

علاوة على ذلك، تسلط المناقشة الضوء على آثار هذه النتائج ضمن السياق الأوسع للمجال. لا تعزز النتائج الدراسات السابقة فحسب، بل تقترح أيضًا طرقًا جديدة للبحث المستقبلي. بشكل عام، توفر نتائج الدراسة 2 رؤى قيمة تعزز الإطار النظري والتطبيقات العملية للبحث.

المناقشة

تسلط قسم المناقشة في الورقة البحثية الضوء على المشهد المتطور لروبوتات الدردشة الذكاء الاصطناعي التوليدية، خاصة في السياقات التعليمية. تميز بين مطابقة الأنماط وروبوتات الدردشة المعتمدة على التعلم الآلي، مع التأكيد على قدرة الأخيرة على الانخراط في تفاعلات أكثر وعيًا بالسياق. بينما تظهر نماذج اللغة الكبيرة مثل GPT-3 قدرات متقدمة في توليد نصوص شبيهة بالبشر ودعم العمليات المعرفية، فإنها أيضًا تشكل مخاطر، مثل perpetuating biases والمساهمة في مشكلة “الصندوق الأسود”، حيث يكون السبب وراء مخرجات الذكاء الاصطناعي غير واضح. إن التخفيف من هذه المخاطر من خلال تصفية جودة البيانات وضبط المحاذاة أمر ضروري للاستفادة من إمكانيات هذه الأدوات في تحسين التجارب التعليمية.

تستكشف الورقة أيضًا العلاقة بين الوظائف التنفيذية واستخدام الذكاء الاصطناعي التوليدي بين المراهقين. ترتبط الوظائف التنفيذية، التي تشمل العمليات المعرفية الأساسية للنجاح الأكاديمي، بالقدرة على البدء، والتخطيط، وإكمال المهام. تشير النتائج الأولية إلى أن المراهقين الذين يعانون من وظائف تنفيذية أقل قد يستفيدون أكثر من أدوات الذكاء الاصطناعي التوليدي، حيث يمكن أن تساعد هذه الأدوات في تنظيم وإكمال الواجبات المدرسية. ومع ذلك، تثير الدراسة مخاوف بشأن إمكانية أن يؤدي الاعتماد المفرط على الذكاء الاصطناعي التوليدي إلى تقليل القدرات المعرفية. تهدف الأبحاث إلى التحقيق في تكرار وفائدة الذكاء الاصطناعي التوليدي في الواجبات المدرسية، فضلاً عن ارتباطه بالوظائف التنفيذية والإنجاز الأكاديمي، مما يبرز الحاجة إلى مزيد من الاستكشاف في هذا المجال الناشئ.

القيود

تقدم الدراسة نقاط قوة ملحوظة، بما في ذلك استخدام عينتين كبيرتين من المراهقين ومعدل استجابة مرتفع، مما يعزز صلاحيتها الخارجية وتمثيلها. تضيف دمج تقييمات المعلمين للوظائف التنفيذية في الدراسة 2 عمقًا للنتائج، خاصة فيما يتعلق بالعلاقة بين استخدام الذكاء الاصطناعي التوليدي والوظائف التنفيذية. ومع ذلك، يجب الاعتراف بعدة قيود. قد لا تعكس توقيت الاستطلاع في ربيع 2023، بالتزامن مع الإدخال المبكر لروبوتات الدردشة الذكاء الاصطناعي التوليدية، الاتجاهات الحالية للاستخدام بدقة، مما قد يربك النتائج. بالإضافة إلى ذلك، قد يحد العدد المنخفض لمستخدمي الذكاء الاصطناعي للواجبات المدرسية في الدراسة 1 من القوة الإحصائية ويزيد من احتمال حدوث أخطاء من النوع الثاني.

تشمل القيود الأخرى الاعتماد الحصري على مقاييس التقرير الذاتي للوظائف التنفيذية في الدراسة 1، مما يشير إلى الحاجة إلى أبحاث مستقبلية تتضمن تقييمات المعلمين والآباء، بالإضافة إلى التقييمات المعتمدة على الأداء. واجهت الدراسة أيضًا تحديات بسبب غياب مقاييس موثقة لتقييم فائدة أدوات الذكاء الاصطناعي التوليدي، مما يتطلب مزيدًا من التحقق من صحة المقاييس المخصصة المطورة. علاوة على ذلك، تثير جمع البيانات في الدراسة 2 من مدرسة واحدة مخاوف بشأن تحيز الاختيار، على الرغم من أن الاختيار العشوائي للفصول ومعدل استجابة 81% قد يخفف من هذه المشكلة. يجب أن تهدف الأبحاث المستقبلية إلى دراسات متعددة المواقع لتعزيز القابلية للتعميم واستكشاف متغيرات إضافية تتعلق باستخدام الذكاء الاصطناعي، والفائدة المدركة، وتأثيرها على الوظائف التنفيذية والإنجاز الأكاديمي. أخيرًا، تزيد التحليلات الواسعة التي تم إجراؤها من خطر حدوث أخطاء من النوع الأول، مما يتطلب تفسيرًا حذرًا للنتائج ويؤكد على الحاجة إلى مزيد من التحقق من الصحة في هذا المجال البحثي الناشئ.

Journal: Frontiers in Artificial Intelligence, Volume: 7
DOI: https://doi.org/10.3389/frai.2024.1415782
PMID: https://pubmed.ncbi.nlm.nih.gov/39263526
Publication Date: 2024-08-28
Author(s): Johan Klarin et al.
Primary Topic: Digital Mental Health Interventions

Overview

This study investigates the frequency and perceived usefulness of large language model (LLM) generative AI chatbots, such as ChatGPT, in relation to adolescents’ executive functioning (EF), which encompasses critical cognitive skills necessary for academic success. Two studies were conducted with younger (N = 385, mean age 14 years) and older (N = 359, mean age 17 years) adolescents. Findings revealed that 14.8% of younger participants and 52.6% of older participants reported using generative AI, with ChatGPT being the preferred tool. Notably, adolescents experiencing greater EF challenges perceived these AI tools as more beneficial for completing school assignments, although no significant correlation was found between academic achievement and AI usage or perceived usefulness in Study 1.

The research highlights the need for further exploration into the factors influencing generative AI usage among adolescents, including potential gender differences in tool preferences and their implications for academic equity. The study underscores the importance of evaluating the integration of generative AI in educational settings, particularly given the potential risks and benefits for students with EF difficulties. As generative AI continues to evolve, proactive measures and further research are essential to ensure its effective and equitable use in education, while addressing ethical considerations related to academic integrity and cognitive development.

Introduction

The introduction of this research paper discusses the emergence of a debate in Sweden regarding the use of ChatGPT, a Large Language Model (LLM) generative AI chatbot, in educational contexts following its release in spring 2023. The discourse centers on concerns about potential cheating versus the benefits of AI as an educational aid, particularly for students at risk of underperforming. Evidence from a study in Germany indicates that two-thirds of university students utilize generative AI for various academic tasks, while a survey in Sweden reveals that 75% of adolescents and young adults aged 15-24 engage with these tools for educational purposes, such as structuring presentations and studying.

The paper highlights the implications of generative AI on executive functioning (EF), which encompasses cognitive processes essential for planning, concentration, and attention. Deficits in EF are linked to lower academic achievement and can lead to challenges in both academic and social domains. The authors note a gap in the literature regarding the nuanced conditions affecting the use and effectiveness of generative AI, particularly among younger adolescents aged 12-14. Consequently, the study aims to explore two primary objectives: (1) the frequency and perceived usefulness of generative AI tools in schoolwork, including potential gender differences, and (2) the relationship between these usage patterns and adolescents’ EF and academic performance.

Methods

The section on “Materials and Methods” outlines the experimental design and procedures employed in the study. It details the specific materials used, including any reagents, equipment, and biological samples, ensuring reproducibility of the results. The methodology encompasses the protocols followed for data collection, including any statistical analyses performed to interpret the findings.

Additionally, the section may describe the experimental conditions, such as temperature, duration, and controls implemented to validate the results. The clarity and rigor of the methods are crucial for assessing the reliability of the study’s conclusions and for enabling other researchers to replicate the experiments.

Results

The results from Study 2 indicate significant findings that contribute to the overall understanding of the research topic. Key outcomes demonstrate a clear relationship between the variables under investigation, supporting the initial hypotheses. Statistical analyses reveal that the observed effects are robust, with p-values indicating strong significance.

Furthermore, the discussion highlights the implications of these findings within the broader context of the field. The results not only reinforce previous studies but also suggest new avenues for future research. Overall, the outcomes of Study 2 provide valuable insights that enhance the theoretical framework and practical applications of the research.

Discussion

The discussion section of the research paper highlights the evolving landscape of generative AI chatbots, particularly in educational contexts. It differentiates between pattern matching and machine learning-based chatbots, emphasizing the latter’s ability to engage in more context-aware interactions. While large language models (LLMs) like GPT-3 demonstrate advanced capabilities in generating human-like text and supporting cognitive processes, they also pose risks, such as perpetuating biases and contributing to the “black box” problem, where the reasoning behind AI outputs is unclear. Mitigating these risks through data quality filtering and alignment tuning is essential to harness the potential of these tools for enhancing educational experiences.

The paper also explores the relationship between executive functioning (EF) and the use of generative AI among adolescents. EF, which encompasses cognitive processes crucial for academic success, is linked to the ability to initiate, plan, and complete tasks. Preliminary findings suggest that adolescents with lower EF may benefit more from generative AI tools, as these tools can assist in organizing and completing schoolwork. However, the study raises concerns about the potential for generative AI to diminish cognitive abilities if over-relied upon. The research aims to investigate the frequency and perceived usefulness of generative AI in schoolwork, as well as its association with EF and academic achievement, highlighting the need for further exploration in this nascent field.

Limitations

The study presents notable strengths, including the use of two large adolescent samples and a high response rate, which enhance its external validity and representation. The incorporation of teacher ratings of executive function (EF) in Study 2 adds depth to the findings, particularly regarding the relationship between generative AI use and EF. However, several limitations must be acknowledged. The timing of the survey in spring 2023, coinciding with the early introduction of generative AI chatbots, may not accurately reflect current usage trends, potentially confounding the results. Additionally, the low number of AI users for schoolwork in Study 1 could limit the statistical power and increase the likelihood of Type II errors.

Further limitations include the exclusive reliance on self-report measures for EF in Study 1, suggesting a need for future research to incorporate teacher and parent ratings, as well as performance-based assessments. The study also faced challenges due to the absence of validated scales for evaluating the usefulness of generative AI tools, necessitating further validation of the custom scales developed. Moreover, the data collection in Study 2 from a single school raises concerns about selection bias, although the random selection of classes and an 81% response rate may mitigate this issue. Future research should aim for multi-site studies to enhance generalizability and explore additional covariates related to AI use, perceived usefulness, and their impact on EF and academic achievement. Lastly, the extensive analyses conducted increase the risk of Type I errors, warranting cautious interpretation of the findings and emphasizing the need for further validation in this emerging research area.