ChatGPT مذهل (في كونه متوسطًا) ChatGPT is incredible (at being average)

المجلة: Ethics and Information Technology، المجلد: 27، العدد: 3
DOI: https://doi.org/10.1007/s10676-025-09845-2
تاريخ النشر: 2025-07-18
المؤلف: Ihor Rudko وآخرون
الموضوع الرئيسي: الأخلاقيات والآثار الاجتماعية للذكاء الاصطناعي

نظرة عامة

تناقش هذه الفقرة المجال غير المستكشف بشكل كافٍ لتجانس المخرجات في سياق الذكاء الاصطناعي (AI) والتحيزات الخوارزمية. وتبرز أنه بينما تلمس الأبحاث حول تحيزات الذكاء الاصطناعي، كما أشار غيتشوا وآخرون ولي وجون (2025)، بشكل غير مباشر تجانس المخرجات، إلا أن الظاهرة نفسها لم تحظ باهتمام أكاديمي كبير. إن مشكلة انهيار النموذج، خاصة عندما يؤدي التدريب على بيانات اصطناعية إلى روبوتات محادثة غير فعالة (شوميلوف وآخرون، 2024)، تعقد فهم تجانس المخرجات. يجادل المؤلف بأن تجانس المخرجات هو محور هذه المناقشات، ومع ذلك فإنه لا يزال مهملًا نسبيًا، ربما بسبب التركيز السائد على تحيزات الذكاء الاصطناعي وانهيار النموذج في شرعنة تكنولوجيا روبوتات المحادثة (غريمليكهويسن وماير، 2022).

مقدمة

تؤكد مقدمة هذه الورقة البحثية على الحاجة الملحة لفهم وتخفيف الآثار السلبية للذكاء الاصطناعي التوليدي، خاصة في سياق روبوتات المحادثة. وتبرز الجسم المتزايد من الأدبيات التي تتناول عيوب هذه الأنظمة، ولا سيما الهلوسة الخوارزمية، التي تشير إلى توليد معلومات خاطئة أو مضللة بواسطة نماذج اللغة. ينتقد المؤلفون الفهم التقليدي للأكاذيب الخوارزمية، مقترحين إعادة تصوّر تُسمى “هراء” تتجاوز قيود الإطار التقليدي. تشير هذه النظرة الجديدة، كما أوضح هيكس وآخرون (2024) وفيشر (2024)، إلى أنه يجب تقييم المخرجات الخوارزمية ليس فقط من حيث دقتها الواقعية ولكن أيضًا من حيث تفاصيل تقديمها.

يجادل المؤلفون بضرورة التمييز بين نوعين من “هراء” الخوارزمية: “ما-هراء”، الذي يتعلق بواقع النص المولد، و”كيف-هراء”، الذي يتعلق بالجوانب الأسلوبية والبنائية للمخرجات. يؤكدون أن كلا النوعين متأصلان في وظيفة نماذج اللغة وليس مجرد عيوب. تفترض الورقة أنه بينما تنتج روبوتات المحادثة نصوصًا قد تبدو متماسكة ومنظمة، إلا أنها تفتقر إلى الفهم الأعمق والنية التي تميز الكتابة البشرية. هذا التمييز حاسم لفهم أكثر شمولاً لآثار المحتوى الذي ينتجه الذكاء الاصطناعي، حيث يتحدى تجسيد روبوتات المحادثة ويبرز تعقيد الحقيقة في اللغة.

مناقشة

في هذا القسم، يقدم المؤلفون إطارًا مفاهيميًا يميز بين نوعين من “هراء” الخوارزمية التي تنتجها نماذج اللغة الكبيرة (LLMs): هراء-ما (what-BS) وهراء-كيف (how-BS). يتعلق الأول بعدم الدقة الواقعية والتناقضات في مخرجات النموذج، وغالبًا ما يُشار إليه بالهلوسات الخوارزمية. يركز الثاني على تجانس المخرجات، حيث ينتج النموذج استجابات تظهر هياكل متوقعة ومبتذلة، بغض النظر عن اختلافات المدخلات. هذا التمييز حاسم لأنه يبرز الطبيعة المزدوجة للمخرجات الخوارزمية، حيث يلعب كلا النوعين من الهراء دورًا، على الرغم من أن لهما آثارًا مختلفة على الموثوقية والأثر الاجتماعي.

يجادل المؤلفون بأنه بينما يكون من الأسهل تحديد هراء-ما، الذي يظهر كأكاذيب صارخة، فإن هراء-كيف أكثر خفاء وغالبًا ما يتم تجاهله، على الرغم من آثاره التراكمية الكبيرة. يؤكدون أن قضايا مثل التحيز الخوارزمي وانهيار النموذج مرتبطة جوهريًا بتجانس المخرجات، الذي لا يمكن تفسيره بشكل كافٍ من خلال استعارة الهراء التقليدية. يدعو المؤلفون إلى فهم أعمق للتفاصيل في مخرجات LLM، مقترحين أن التمييز الواضح بين المحتوى (ما) والشكل (كيف) للغة أمر ضروري لمواجهة التحديات التي تطرحها هذه النماذج في التطبيقات الواقعية. تهدف هذه إعادة التصور إلى تعزيز النقاش حول مخرجات LLM وآثارها على الموثوقية والمساءلة في سياقات مختلفة.

Journal: Ethics and Information Technology, Volume: 27, Issue: 3
DOI: https://doi.org/10.1007/s10676-025-09845-2
Publication Date: 2025-07-18
Author(s): Ihor Rudko et al.
Primary Topic: Ethics and Social Impacts of AI

Overview

The section discusses the underexplored area of output homogenization in the context of artificial intelligence (AI) and algorithmic biases. It highlights that while research on AI biases, as noted by Gichoya et al. and Lee & Jeon (2025), indirectly touches upon output homogenization, the phenomenon itself has not garnered significant scholarly attention. The issue of model collapse, particularly when training on synthetic data leads to ineffective chatbots (Shumailov et al., 2024), complicates the understanding of output homogenization. The author argues that output homogenization is central to these discussions, yet it remains relatively neglected, possibly due to the prevailing focus on AI biases and model collapse in legitimizing chatbot technology (Grimmelikhuijsen & Meijer, 2022).

Introduction

The introduction of this research paper emphasizes the urgent need to comprehend and mitigate the negative impacts of generative AI, particularly in the context of chatbots. It highlights the growing body of literature addressing the flaws of these systems, notably algorithmic hallucination, which refers to the generation of false or misleading information by language models. The authors critique the conventional understanding of algorithmic falsehoods, proposing a re-conceptualization termed “bullshit” that moves beyond the limitations of the traditional framework. This new perspective, as articulated by Hicks et al. (2024) and Fisher (2024), suggests that algorithmic outputs should not only be evaluated for their factual accuracy but also for the nuances of their presentation.

The authors argue for a distinction between two types of algorithmic “bullshit”: “what-BS,” which pertains to the factuality of the generated text, and “how-BS,” which relates to the stylistic and structural aspects of the output. They assert that both types are inherent to the functionality of language models and not merely flaws. The paper posits that while chatbots produce text that may appear coherent and structured, they lack the deeper understanding and intentionality characteristic of human writing. This distinction is crucial for a more comprehensive understanding of the implications of AI-generated content, as it challenges the anthropomorphization of chatbots and underscores the complexity of truth in language.

Discussion

In this section, the authors introduce a conceptual framework distinguishing between two types of algorithmic “bullshit” produced by large language models (LLMs): what-bullshit (what-BS) and how-bullshit (how-BS). The former pertains to factual inaccuracies and inconsistencies in the model’s outputs, often referred to as algorithmic hallucinations. The latter focuses on the homogenization of outputs, where the model generates responses that exhibit predictable and clichéd structures, regardless of input variations. This distinction is crucial as it highlights the dual nature of algorithmic outputs, where both types of bullshit are at play, albeit with different implications for reliability and societal impact.

The authors argue that while what-BS is easier to identify—manifesting as blatant falsehoods—how-BS is more insidious and often overlooked, despite its significant cumulative effects. They emphasize that issues such as algorithmic bias and model collapse are intrinsically linked to output homogenization, which cannot be adequately explained by the conventional bullshit metaphor. The authors advocate for a deeper understanding of the nuances in LLM outputs, suggesting that a clear differentiation between the content (what) and the form (how) of language is essential for addressing the challenges posed by these models in real-world applications. This re-conceptualization aims to enhance the discourse surrounding LLM outputs and their implications for trustworthiness and accountability in various contexts.