انهار نماذج الذكاء الاصطناعي عند تدريبها على بيانات تم إنشاؤها بشكل متكرر AI models collapse when trained on recursively generated data

المجلة: Nature، المجلد: 631، العدد: 8022
DOI: https://doi.org/10.1038/s41586-024-07566-y
PMID: https://pubmed.ncbi.nlm.nih.gov/39048682
تاريخ النشر: 2024-07-24
المؤلف: Ilia Shumailov وآخرون
الموضوع الرئيسي: نمذجة الموضوعات

نظرة عامة

يتم تعريف انهيار النموذج كظاهرة تنكسية تؤثر على الأجيال المتعاقبة من النماذج التوليدية المتعلمة، حيث تلوث البيانات التي تنتجها هذه النماذج مجموعات بيانات التدريب للأجيال اللاحقة. يؤدي هذا التلوث إلى إدراك مشوه للواقع في النماذج، كما هو موضح في الشكل 1a. تحدد الورقة شكلين متميزين من انهيار النموذج: انهيار النموذج المبكر، الذي يتميز بفقدان المعلومات المتعلقة بذيل التوزيع، وانهيار النموذج المتأخر، حيث يتقارب النموذج إلى توزيع لا يشبه الأصل كثيرًا، وغالبًا ما يظهر تباينًا منخفضًا بشكل ملحوظ.

يُنسب حدوث انهيار النموذج إلى ثلاثة مصادر محددة من الأخطاء التي تتراكم على مر الأجيال، مما يؤدي إلى انحرافات عن النموذج الأصلي. المصدر الرئيسي هو خطأ التقريب الإحصائي، الذي ينشأ من قيود أحجام العينات المحدودة ويتناقص مع اقتراب عدد العينات من اللانهاية. هذا الفهم الأساسي لانهيار النموذج ضروري لمعالجة التحديات التي تواجه النمذجة التوليدية وضمان سلامة البيانات عبر الأجيال.

مناقشة

تسلط قسم المناقشة في الورقة الضوء على ظاهرة انهيار النموذج في النماذج التوليدية، لا سيما في سياق نماذج اللغة (LLMs). تحدد ثلاثة مصادر رئيسية من الأخطاء التي تسهم في هذا الانهيار: (1) **خطأ التعبير الوظيفي**، حيث قد تمثل الشبكات العصبية، كموحدات عالمية، التوزيعات بشكل خاطئ، مما يؤدي إلى احتمالات غير صفرية خارج الدعم الأصلي؛ (2) **خطأ التقريب الوظيفي**، الناجم عن قيود في إجراءات التعلم، والتي يمكن أن تفاقم من عدم دقة النموذج؛ و(3) **الأخطاء الإحصائية** الناتجة عن أخذ العينات، والتي يمكن أن تتسبب في فقدان النموذج للمعلومات على مر الأجيال. يوضح المؤلفون أنه مع تدريب النماذج على البيانات التي تم إنشاؤها بواسطة التكرارات السابقة، تميل إلى إنتاج مخرجات تنحرف بشكل متزايد عن التوزيع الأصلي، مما يؤدي في النهاية إلى انهيار إلى دالة دلتا، حيث يتم فقدان التنوع في المخرجات.

تعتبر تداعيات انهيار النموذج كبيرة، لا سيما فيما يتعلق بالحفاظ على الأحداث ذات الاحتمالية المنخفضة التي تعتبر حاسمة للتنبؤات العادلة وفهم الأنظمة المعقدة. يجادل المؤلفون بأن الطبيعة الآلية لـ LLMs تعزز من خطر هجمات التسمم على المدى الطويل، كما هو واضح في جهود التلاعب بالمحتوى التي يقودها البشر في السابق. يؤكدون على ضرورة الحفاظ على الوصول إلى مصادر البيانات الأصلية للتخفيف من آثار انهيار النموذج وضمان سلامة مخرجات LLM. علاوة على ذلك، تثير الورقة مخاوف بشأن أصل البيانات التي تم إنشاؤها بواسطة LLMs، مقترحة أن التنسيق على مستوى المجتمع قد يكون ضروريًا لتتبع المحتوى وإدارته بشكل فعال، وبالتالي الحفاظ على جودة وموثوقية تدريب النماذج المستقبلية.

Journal: Nature, Volume: 631, Issue: 8022
DOI: https://doi.org/10.1038/s41586-024-07566-y
PMID: https://pubmed.ncbi.nlm.nih.gov/39048682
Publication Date: 2024-07-24
Author(s): Ilia Shumailov et al.
Primary Topic: Topic Modeling

Overview

Model collapse is defined as a degenerative phenomenon impacting successive generations of learned generative models, where the data produced by these models contaminates the training datasets of subsequent generations. This contamination leads to a distorted perception of reality in the models, as illustrated in Figure 1a. The paper identifies two distinct forms of model collapse: early model collapse, characterized by a loss of information regarding the tails of the distribution, and late model collapse, where the model converges to a distribution that bears little resemblance to the original, often exhibiting significantly reduced variance.

The occurrence of model collapse is attributed to three specific sources of error that accumulate over generations, resulting in deviations from the original model. The primary source is statistical approximation error, which arises from the limitations of finite sample sizes and diminishes as the number of samples approaches infinity. This foundational understanding of model collapse is crucial for addressing the challenges faced in generative modeling and ensuring the integrity of data across generations.

Discussion

The discussion section of the paper highlights the phenomenon of model collapse in generative models, particularly in the context of language models (LLMs). It identifies three primary sources of error contributing to this collapse: (1) **Functional expressivity error**, where neural networks, as universal approximators, may misrepresent distributions, leading to non-zero likelihoods outside the original support; (2) **Functional approximation error**, stemming from limitations in learning procedures, which can exacerbate model inaccuracies; and (3) **Statistical errors** from sampling, which can cause the model to lose information over generations. The authors illustrate that as models are trained on data generated by previous iterations, they tend to produce outputs that increasingly deviate from the original distribution, ultimately leading to a collapse into a delta function, where diversity in outputs is lost.

The implications of model collapse are significant, particularly regarding the preservation of low-probability events that are crucial for fair predictions and understanding complex systems. The authors argue that the automated nature of LLMs amplifies the risk of long-term poisoning attacks, as seen in previous human-driven content manipulation efforts. They emphasize the necessity of maintaining access to original data sources to mitigate the effects of model collapse and ensure the integrity of LLM outputs. Furthermore, the paper raises concerns about the provenance of data generated by LLMs, suggesting that community-wide coordination may be essential to track and manage the content effectively, thereby preserving the quality and reliability of future model training.

كلمات مفتاحية: حدس، نموذج توليدي