استرجاع معزز بالتوليد الموجه بواسطة رسم المعرفة Knowledge Graph-Guided Retrieval Augmented Generation

المجلة: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
DOI: https://doi.org/10.18653/v1/2025.naacl-long.449
تاريخ النشر: 2025-01-01
المؤلف: Xiangrong Zhu وآخرون
الموضوع الرئيسي: الشبكات العصبية المتقدمة

نظرة عامة

تقدم ورقة البحث إطارًا جديدًا يسمى “استرجاع معزز بالتوليد الموجه بواسطة رسم المعرفة” (KG 2 RAG)، يهدف إلى التخفيف من مشاكل الهلوسة في الردود التي تولدها نماذج اللغة الكبيرة (LLMs). تركز طرق الاسترجاع المعزز التقليدية (RAG) بشكل أساسي على استرجاع أجزاء ذات صلة معزولة بناءً على المعنى، متجاهلة العلاقات بين هذه الأجزاء. بالمقابل، يستفيد KG 2 RAG من الرسوم البيانية للمعرفة (KGs) لتأسيس علاقات على مستوى الحقائق، مما يعزز تنوع وتماسك المعلومات المسترجعة. يتضمن الإطار عملية توسيع أجزاء موجهة بواسطة KG وعملية تنظيم أجزاء قائمة على KG، مما يسهل بشكل جماعي توليد فقرات منظمة جيدًا من الأجزاء المسترجعة.

تظهر التجارب الواسعة على مجموعة بيانات Hot-potQA ونسخها أن KG 2 RAG يتفوق على الأساليب الحالية المعتمدة على RAG من حيث جودة الردود وجودة الاسترجاع. تختتم الورقة بالتأكيد على أهمية توسيع الأجزاء الموجهة بواسطة KG ووحدات تنظيم السياق المعتمدة على KG، التي تعمل معًا لتحسين الفعالية العامة للإطار. تؤكد دراسة الإزالة أيضًا على مساهمات هذه المكونات، مما يؤكد دورها في إنتاج مخرجات متنوعة ومتسقة مناسبة لتوليد ردود عالية الجودة بواسطة LLMs.

مقدمة

تناقش مقدمة ورقة البحث هذه التقدمات والتحديات المرتبطة بنماذج اللغة الكبيرة (LLMs)، وخاصة قابليتها لتوليد الهلوسات – الردود التي قد تحتوي على معلومات قديمة أو غير دقيقة. لمعالجة هذه المشكلة، يقترح المؤلفون إطارًا جديدًا يسمى “استرجاع معزز بالتوليد الموجه بواسطة رسم المعرفة” (KG 2 RAG)، الذي يدمج المعرفة المنظمة من الرسوم البيانية للمعرفة (KGs) في عملية الاسترجاع المعزز بالتوليد. غالبًا ما تؤدي طرق الاسترجاع التقليدية إلى معلومات متجانسة ومتكررة، مما يحد من قدرة LLMs على توليد ردود شاملة. بالمقابل، يعزز KG 2 RAG عملية الاسترجاع من خلال ربط أجزاء الوثائق بالرسوم البيانية ذات الصلة، مما يسمح باستخراج علاقات على مستوى الحقائق ومجموعة أكثر تنوعًا من المعلومات.

يستخدم إطار KG 2 RAG عملية استرجاع من خطوتين: أولاً، يستخدم تقنيات قائمة على المعنى لتحديد الأجزاء الأساسية، ثم يوسع هذا الاختيار من خلال طرق موجهة بواسطة الرسم البياني لتضمين الكيانات والثلاثيات ذات الصلة من KG. لا يزيد هذا النهج من تنوع الأجزاء المسترجعة فحسب، بل ينظمها أيضًا في فقرات متماسكة، مما يعزز المعلوماتية للمدخلات المقدمة إلى LLMs. تظهر النتائج التجريبية على مجموعة بيانات HotpotQA أن KG 2 RAG يتفوق على الأساليب الحالية المعتمدة على RAG من حيث جودة الردود والاسترجاع. كما يقدم المؤلفون دراسة إزالة لتقييم مساهمات المكونات المختلفة داخل الإطار، ويجعلون مجموعة بياناتهم ورمز المصدر متاحين للجمهور لتشجيع المزيد من البحث في هذا المجال.

طرق

توضح قسم المنهجية سير العمل لنظام KG 2 RAG، كما هو موضح في الشكل 2. وهو مقسم إلى ثلاثة أقسام فرعية رئيسية: معالجة الوثائق غير المتصلة (القسم 2.1)، استرجاع الأجزاء المعزز بواسطة KG (القسم 2.2)، وتنظيم السياق القائم على KG (القسم 2.3). يوضح كل قسم فرعي مكونات محددة من سير العمل، موضحًا العمليات المعنية في تحويل الرسوم البيانية للمعرفة (KG) إلى إطار عمل الاسترجاع المعزز بالتوليد (RAG). يهدف هذا النهج المنظم إلى تعزيز كفاءة وفعالية استرجاع المعلومات وتنظيم السياق داخل النظام.

مناقشة

في هذا القسم، يناقش المؤلفون المنهجية والآثار المترتبة على إطارهم المقترح، KG 2 RAG، الذي يعزز الاسترجاع المعزز بالتوليد (RAG) من خلال دمج الرسوم البيانية للمعرفة (KGs). في البداية، يتم تقسيم الوثائق إلى أجزاء، يمكن إثراؤها بسياق إضافي ومعلومات وصفية. يؤكد المؤلفون أن نهجهم في ربط الأجزاء بـ KG لا يعتمد على KGs الموجودة، بل يستخرج الكيانات والعلاقات من الأجزاء لتشكيل الرسوم الفرعية. يتيح ذلك بناء KG يلتقط العلاقات بين الأجزاء، مما يسمح بالمعالجة غير المتصلة والتحديثات التدريجية.

تستخدم الطريقة المقترحة عملية استرجاع من مرحلتين: استرجاع قائم على المعنى يتبعه توسيع موجه بواسطة الرسم البياني. بينما يحدد الاسترجاع القائم على المعنى الأجزاء ذات الصلة بناءً على التشابه مع استعلام المستخدم، فإن التوسيع الموجه بواسطة الرسم البياني يربط هذه الأجزاء من خلال الكيانات المشتركة، مما يعزز التنوع ويقلل من التكرار في المعلومات المسترجعة. يجادل المؤلفون بأن هذه الطريقة تعزز جودة شبكة المعرفة، مما يؤدي إلى تحسين توليد الردود بواسطة نماذج اللغة الكبيرة (LLMs). كما يقدمون وحدة تنظيم السياق المعتمدة على KG التي تقوم بترشيح وترتيب الأجزاء المسترجعة، مما يضمن التماسك والملاءمة لاستعلام المستخدم، وبالتالي تحسين الأداء العام لإطار KG 2 RAG.

القيود

تسلط قسم القيود الضوء على أن إطار العمل المعزز بالتوليد (RAG) يمكن تحسينه من زوايا مختلفة، مثل إعادة كتابة الاستعلام، وتحسين الاسترجاع، والحوار متعدد الأدوار. التركيز الحالي لنهج KG 2 RAG هو تحديدًا على تحسين الاسترجاع، باستخدام الرسوم البيانية للمعرفة (KGs) لتوسيع الاسترجاع وتنظيم السياق. ومع ذلك، فإنه لا يتناول تحسين المكونات الأخرى داخل إطار RAG.

على الرغم من هذه القيود، يشير المؤلفون إلى أن KG 2 RAG يظل متعامدًا ومتوافقًا مع وحدات أخرى، مما يشير إلى إمكانية التكامل. تهدف التطورات المستقبلية إلى تحويل KG 2 RAG إلى أداة قابلة للتوصيل والتشغيل، مما يعزز قابليتها للاستخدام ويسهل تطبيقات البحث الأوسع ضمن المجتمع.

Journal: Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
DOI: https://doi.org/10.18653/v1/2025.naacl-long.449
Publication Date: 2025-01-01
Author(s): Xiangrong Zhu et al.
Primary Topic: Advanced Graph Neural Networks

Overview

The research paper introduces a novel framework called Knowledge Graph-Guided Retrieval Augmented Generation (KG 2 RAG), aimed at mitigating hallucination issues in responses generated by large language models (LLMs). Traditional retrieval-augmented generation (RAG) methods primarily focus on semantic-based retrieval of isolated relevant chunks, neglecting the relationships between these chunks. In contrast, KG 2 RAG leverages knowledge graphs (KGs) to establish fact-level relationships, thereby enhancing the diversity and coherence of the retrieved information. The framework incorporates a KG-guided chunk expansion process and a KG-based chunk organization process, which collectively facilitate the generation of well-structured paragraphs from the retrieved chunks.

Extensive experiments on the Hot-potQA dataset and its variants demonstrate that KG 2 RAG outperforms existing RAG-based approaches in terms of both response quality and retrieval quality. The paper concludes by emphasizing the importance of the KG-guided chunk expansion and KG-based context organization modules, which work in tandem to improve the overall effectiveness of the framework. An ablation study further validates the contributions of these components, confirming their role in producing diverse and coherent outputs suitable for high-quality response generation by LLMs.

Introduction

The introduction of this research paper discusses the advancements and challenges associated with large language models (LLMs), particularly their susceptibility to generating hallucinations—responses that may contain outdated or inaccurate information. To address this issue, the authors propose a novel framework called Knowledge Graph-Guided Retrieval Augmented Generation (KG 2 RAG), which integrates structured knowledge from knowledge graphs (KGs) into the retrieval-augmented generation process. Traditional retrieval methods often yield homogeneous and redundant information, limiting the LLMs’ ability to generate comprehensive responses. In contrast, KG 2 RAG enhances the retrieval process by associating document chunks with relevant KGs, allowing for the extraction of fact-level relationships and a more diverse set of information.

The KG 2 RAG framework employs a two-step retrieval process: first, it utilizes semantic-based techniques to identify seed chunks, and then it expands this selection through graph-guided methods to include related entities and triplets from the KG. This approach not only increases the diversity of retrieved chunks but also organizes them into coherent paragraphs, enhancing the informativeness of the input provided to the LLMs. Experimental results on the HotpotQA dataset demonstrate that KG 2 RAG outperforms existing RAG-based methods in both response and retrieval quality. The authors also provide an ablation study to assess the contributions of different components within the framework, and they make their dataset and source code publicly available to encourage further research in this area.

Methods

The methodology section outlines the workflow of the KG 2 RAG system, as depicted in Figure 2. It is structured into three main subsections: document offline processing (Sec. 2.1), KG-enhanced chunk retrieval (Sec. 2.2), and KG-based context organization (Sec. 2.3). Each subsection elaborates on specific components of the workflow, detailing the processes involved in transforming knowledge graphs (KG) into a retrieval-augmented generation (RAG) framework. This structured approach aims to enhance the efficiency and effectiveness of information retrieval and context organization within the system.

Discussion

In this section, the authors discuss the methodology and implications of their proposed framework, KG 2 RAG, which enhances retrieval-augmented generation (RAG) by integrating knowledge graphs (KGs). Initially, documents are segmented into chunks, which can be enriched with additional context and meta-information. The authors emphasize that their approach to chunk-KG association does not rely on existing KGs but instead extracts entities and relations from the chunks to form subgraphs. This enables the construction of a KG that captures relationships among chunks, allowing for offline processing and incremental updates.

The proposed method employs a two-stage retrieval process: semantic-based retrieval followed by graph-guided expansion. While semantic retrieval identifies relevant chunks based on similarity to a user query, the graph-guided expansion connects these chunks through shared entities, promoting diversity and reducing redundancy in the retrieved information. The authors argue that this method enhances the quality of the knowledge network, leading to improved response generation by large language models (LLMs). They also introduce a KG-based context organization module that filters and arranges the retrieved chunks, ensuring coherence and relevance to the user query, thereby optimizing the overall performance of the KG 2 RAG framework.

Limitations

The section on limitations highlights that the Retrieval-augmented generation (RAG) framework can be improved from various angles, such as query rewriting, retrieval optimization, and multi-turn dialogue. The current focus of the KG 2 RAG approach is specifically on retrieval optimization, utilizing knowledge graphs (KGs) for retrieval expansion and context organization. However, it does not address the optimization of other components within the RAG framework.

Despite this limitation, the authors note that KG 2 RAG remains orthogonal and compatible with other modules, suggesting potential for integration. Future developments aim to transform KG 2 RAG into a plug-and-play tool, enhancing its usability and facilitating broader research applications within the community.