الذكاء الاصطناعي التوليدي القابل للتفسير (GenXAI): استعراض، تصوّر، وأجندة بحثية Explainable Generative AI (GenXAI): a survey, conceptualization, and research agenda

المجلة: Artificial Intelligence Review، المجلد: 57، العدد: 11
DOI: https://doi.org/10.1007/s10462-024-10916-x
تاريخ النشر: 2024-09-15
المؤلف: Johannes Schneider
الموضوع الرئيسي: الذكاء الاصطناعي القابل للتفسير (XAI)

نظرة عامة

تقدم هذه القسم نظرة عامة على المشهد المتطور للذكاء الاصطناعي التوليدي (GenAI)، مع التركيز على انتقاله من قدرات التعرف إلى قدرات التوليد عبر مهام متنوعة. لقد زاد هذا التحول من أهمية القابلية للتفسير في الذكاء الاصطناعي (XAI)، حيث تتطلب تعقيدات الحلول المولدة أهدافًا ومعايير جديدة لتفسيرات فعالة. يحدد المؤلفون المتطلبات الناشئة لـ XAI، بما في ذلك القابلية للتحقق، والتفاعلية، والأمان، واعتبارات التكلفة، ويقترحون تصنيفًا لتصنيف آليات XAI الحالية ذات الصلة بـ GenAI.

تستعرض الورقة الأدبيات الحالية حول XAI، مع التركيز على الأساليب التي تتراوح من بيانات التدريب إلى التحفيز، وتقدم خلفية تقنية عن GenAI مصممة للجماهير غير التقنية، خاصة في سياق توليد النصوص والصور. بينما لا تستكشف تقييم وتطبيق التفسيرات بعمق، تهدف إلى جذب كل من الخبراء الفنيين والمحترفين من مجالات متنوعة، مثل العلوم الاجتماعية وأنظمة المعلومات. بالإضافة إلى ذلك، يحدد المؤلفون خارطة طريق بحثية تحتوي على أكثر من عشرة اتجاهات محتملة للاستكشاف المستقبلي في مجال XAI وGenAI.

مقدمة

تناقش مقدمة هذه الورقة البحثية التأثير التحويلي للذكاء الاصطناعي التوليدي (GenAI) عبر قطاعات متنوعة، بما في ذلك التعليم، والأعمال، والرعاية الصحية، مع تسليط الضوء على إمكانيته في توليد قيمة اقتصادية كبيرة بينما يشكل أيضًا مخاطر على المجتمع. يمثل GenAI، المدعوم بنماذج أساسية متقدمة، تحولًا من الذكاء الاصطناعي التقليدي الذي يعترف بالبيانات بشكل أساسي إلى واحد يمكنه توليد مخرجات متنوعة مثل النصوص والصور والصوت، جميعها تحت السيطرة من خلال تحفيزات بشرية. تؤكد الورقة على القدرات غير المسبوقة لـ GenAI، بما في ذلك اجتياز امتحانات على مستوى الجامعة والتفوق في المهام الإبداعية، مع الإشارة أيضًا إلى إمكانية الوصول إلى الأنظمة التجارية مثل Chat-GPT وDall-E.

يجادل المؤلفون بضرورة تقنيات الذكاء الاصطناعي القابل للتفسير (XAI) المصممة لـ GenAI، والتي تُسمى GenXAI، لتعزيز الفهم والثقة في مخرجات الذكاء الاصطناعي، خاصة في معالجة قضايا مثل الهلوسة. على الرغم من التقدم في XAI، لا تزال هناك تحديات كبيرة، حيث أظهرت الأساليب الحالية فعالية محدودة في تحسين التفاعلات بين الإنسان والوكيل. تهدف الورقة إلى دفع المجال قدمًا من خلال تحديد التحديات والمتطلبات الرئيسية لـ GenXAI، باستخدام منهجية مراجعة سردية مع نهج تطوير تصنيفي. يعتزم المؤلفون تقديم نظرة شاملة على المتطلبات لـ XAI في سياق GenAI، مما يساهم في فهم أعمق للتفاعل بين النماذج التوليدية والقابلية للتفسير.

الطرق

تناقش قسم الطرق في الورقة البحثية تقنيات قائمة على الاستكشاف لتقييم المعرفة التي تلتقطها نماذج اللغة الكبيرة (LLMs) من خلال استخدام مصنفات، أو مجسات، تحلل تنشيطات النموذج. يمكن أن تحدد هذه المجسات خصائص متنوعة، مثل المعرفة النحوية والدلالية، من خلال تدريب مصنفات على مخرجات نماذج مثل BERT وMiniXX وT5. بالإضافة إلى ذلك، يمكن إنشاء مجموعات بيانات متخصصة لتقييم قدرات محددة، على الرغم من أنه يجب توخي الحذر لتجنب التعلم السريع، حيث تستغل النماذج التكرارات الخاصة بمجموعات البيانات بدلاً من التعلم الحقيقي للخصائص المقصودة. يبرز القسم أيضًا دراسات متنوعة استخدمت طرق الاستكشاف لفهم LLMs، بما في ذلك الاكتساب الزمني للميزات اللغوية وتطور قدرات التلخيص أثناء التدريب.

علاوة على ذلك، يحدد القسم طرق التفسير القائمة على المفاهيم، التي توفر درجات ملاءمة لمفاهيم محددة في المدخلات وتحقق في تنشيطات الخلايا العصبية لتفسير أدوار الخلايا العصبية الفردية. تشمل التطورات الحديثة استخدام نماذج مثل GPT-4 لتوليد تفسيرات لتنشيطات الخلايا العصبية في GPT-2 وتحديد “الخلايا العصبية المعرفية” التي تخزن حقائق محددة. تجمع منهجية البحث بين المراجعات النظامية والسردية لتلخيص الأدبيات الحالية حول الذكاء الاصطناعي القابل للتفسير (XAI) للذكاء الاصطناعي التوليدي (GenAI). يؤكد المؤلفون على حداثة هذا المجال والحاجة إلى تصور شامل للتحديات والمتطلبات في XAI لـ GenAI، مما يؤدي إلى تصنيف تم تطويره من خلال تحليل تكراري للمسوحات السابقة والبحث التجريبي.

المناقشة

تتوسع قسم المناقشة في الورقة البحثية حول بنية ووظائف نماذج الذكاء الاصطناعي التوليدي (GenAI)، مع التركيز على نشرها كتطبيقات مستقلة أو كجزء من أنظمة معقدة تدمج مصادر بيانات متنوعة ونماذج توليدية. يُبرز بشكل خاص أنماط التحويل كمعيار لنماذج اللغة الكبيرة (LLMs)، بينما يتم مناقشة نماذج توليدية أخرى مثل نماذج الانتشار، والمشفرات التلقائية التغيرية (VAEs)، والشبكات التنافسية التوليدية (GANs). تسمح مرونة المحولات لها بالتعامل مع بيانات إدخال متنوعة، على الرغم من أنها تتطلب عادةً المزيد من بيانات التدريب لتحقيق أداء مثالي مقارنة بالنماذج الأبسط. كما يوضح القسم الآليات التشغيلية لنماذج الانتشار، التي تولد مخرجات من خلال إعادة بناء بيانات مشوشة عبر عملية سلسلة ماركوف.

علاوة على ذلك، تؤكد الورقة على أهمية الذكاء الاصطناعي القابل للتفسير (XAI) في GenAI، خاصة مع اكتساب هذه التقنيات اعتمادًا واسع النطاق عبر قطاعات المجتمع المختلفة. يتم التأكيد على الحاجة إلى التحقق من المخرجات، والمساءلة، وفهم تعقيدات أنظمة GenAI، نظرًا لإمكاناتها في إنتاج مخرجات ضارة والتحديات في تقييم أدائها. تحدد المناقشة عدة متطلبات لتفسيرات فعالة في GenAI، بما في ذلك القابلية للتحقق، وتتبع النسب، والتفاعلية، والتفسيرات الديناميكية، واعتبارات التكلفة. هذه العوامل حاسمة لضمان توافق أنظمة GenAI مع احتياجات المستخدم والمعايير الأخلاقية، خاصة مع تحولها إلى جزء لا يتجزأ من التطبيقات ذات المخاطر العالية في التعليم، والصحة النفسية، وما بعدها.

Journal: Artificial Intelligence Review, Volume: 57, Issue: 11
DOI: https://doi.org/10.1007/s10462-024-10916-x
Publication Date: 2024-09-15
Author(s): Johannes Schneider
Primary Topic: Explainable Artificial Intelligence (XAI)

Overview

The section provides an overview of the evolving landscape of Generative AI (GenAI), emphasizing its transition from recognition to generation capabilities across various tasks. This shift has heightened the importance of explainability in AI (XAI), as the complexity of generated solutions necessitates new objectives and criteria for effective explanations. The authors identify emerging requirements for XAI, including verifiability, interactivity, security, and cost considerations, and propose a taxonomy to classify existing XAI mechanisms relevant to GenAI.

The paper surveys the current literature on XAI, focusing on methods ranging from training data to prompting, and offers a technical background on GenAI tailored for non-technical audiences, particularly in the context of text and image generation. While it does not explore the evaluation and application of explanations in depth, it aims to engage both technical experts and professionals from diverse fields, such as social sciences and information systems. Additionally, the authors outline a research roadmap with over ten potential directions for future exploration in the realm of XAI and GenAI.

Introduction

The introduction of this research paper discusses the transformative impact of Generative AI (GenAI) across various sectors, including education, business, and healthcare, highlighting its potential to generate significant economic value while also posing risks to society. GenAI, powered by advanced foundation models, represents a shift from traditional AI that primarily recognizes data to one that can generate diverse outputs such as text, images, and audio, all controlled through human prompts. The paper emphasizes the unprecedented capabilities of GenAI, including passing university-level exams and excelling in creative tasks, while also noting the accessibility of commercial systems like Chat-GPT and Dall-E.

The authors argue for the necessity of Explainable AI (XAI) techniques tailored for GenAI, termed GenXAI, to enhance understanding and trust in AI outputs, particularly in addressing issues like hallucinations. Despite the progress in XAI, significant challenges remain, as existing methods have shown limited effectiveness in improving human-agent interactions. The paper aims to advance the field by identifying key challenges and requirements for GenXAI, employing a narrative review methodology combined with a taxonomy development approach. The authors intend to provide a comprehensive overview of desiderata for XAI in the context of GenAI, contributing to a deeper understanding of the interplay between generative models and explainability.

Methods

The methods section of the research paper discusses probing-based techniques for evaluating the knowledge captured by large language models (LLMs) through the use of classifiers, or probes, that analyze model activations. These probes can identify various properties, such as syntax and semantic knowledge, by training classifiers on the outputs of models like BERT, MiniXX, and T5. Additionally, specialized datasets can be created to assess specific capabilities, although care must be taken to avoid shortcut learning, where models exploit dataset-specific regularities instead of genuinely learning the intended properties. The section also highlights various studies that have employed probing methods to understand LLMs, including the temporal acquisition of linguistic features and the evolution of summarization capabilities during training.

Furthermore, the section outlines concept-based explanation methods, which provide relevance scores for specific concepts in inputs and investigate neuron activations to interpret individual neurons’ roles. Recent advancements include using models like GPT-4 to generate explanations for neuron activations in GPT-2 and identifying “knowledge neurons” that store specific facts. The research methodology combines systematic and narrative reviews to synthesize existing literature on explainable AI (XAI) for generative AI (GenAI). The authors emphasize the novelty of this field and the need for a comprehensive conceptualization of challenges and desiderata in XAI for GenAI, culminating in a taxonomy developed through iterative analysis of prior surveys and empirical research.

Discussion

The discussion section of the research paper elaborates on the architecture and functionality of Generative AI (GenAI) models, emphasizing their deployment as stand-alone applications or as part of complex systems that integrate various data sources and generative models. Notably, transformer architectures are highlighted as the standard for large language models (LLMs), while other generative models like diffusion models, variational autoencoders (VAEs), and generative adversarial networks (GANs) are also discussed. The flexibility of transformers allows them to handle diverse input data, although they typically require more training data to achieve optimal performance compared to simpler models. The section also details the operational mechanisms of diffusion models, which generate outputs by reconstructing noisy data through a Markov chain process.

Furthermore, the paper underscores the importance of explainable AI (XAI) in GenAI, particularly as these technologies gain widespread adoption across various societal sectors. The need for output verification, accountability, and understanding the complexities of GenAI systems is emphasized, given their potential for harmful outputs and the challenges in evaluating their performance. The discussion identifies several desiderata for effective explanations in GenAI, including verifiability, lineage tracking, interactivity, dynamic explanations, and cost considerations. These factors are crucial for ensuring that GenAI systems align with user needs and ethical standards, particularly as they become integral to high-stakes applications in education, mental health, and beyond.