استرجاع الرسم البياني المعرفي المعزز بالتوليد للتوصيات المعتمدة على LLM Knowledge Graph Retrieval-Augmented Generation for LLM-based Recommendation

المجلة: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
DOI: https://doi.org/10.18653/v1/2025.acl-long.1317
تاريخ النشر: 2025-01-01
المؤلف: Shijie Wang وآخرون
الموضوع الرئيسي: الحوسبة المعرفية والشبكات

نظرة عامة

تناقش هذه الفقرة الأهمية المتزايدة لأنظمة التوصية في إدارة overload المعلومات في الخدمات عبر الإنترنت الموجهة للمستخدمين، مع تسليط الضوء بشكل خاص على دور نماذج اللغة الكبيرة (LLMs) في تطوير هذه الأنظمة. على الرغم من إمكانياتها، تواجه أنظمة التوصية المعتمدة على LLM تحديات مثل الهلاوس ونقص المعرفة الحالية والمتخصصة في المجال. للتخفيف من هذه المشكلات، يقدم البحث إطار عمل جديد يسمى K-RagRec، الذي يستخدم تقنيات الجيل المعزز بالاسترجاع (RAG) لدمج معلومات عالية الجودة ومنظمة من الرسوم البيانية للمعرفة (KGs) في عملية التوصية.

يعزز K-RagRec قدرات التوصية لـ LLMs من خلال استخدام شبكة عصبية رسومية (GNN) ونموذج لغة مدرب مسبقًا (PLM) لفهرسة KGs بشكل دلالي، مما يسمح بالاسترجاع الخشن والدقيق. بالإضافة إلى ذلك، يتم تنفيذ سياسة استرجاع انتقائية تعتمد على الشعبية لتحسين كفاءة الاسترجاع بناءً على شعبية العناصر. تصميم الإطار يمكّن من تشفير رسومي أكثر تعبيرًا للرسوم البيانية المعرفية المسترجعة، مما يسمح لـ LLMs باستخدام المعلومات الهيكلية بشكل فعال مع تقليل الحاجة إلى إدخال سياق واسع. تؤكد النتائج التجريبية عبر ثلاثة مجموعات بيانات من العالم الحقيقي فعالية إطار K-RagRec في تحسين نتائج التوصية.

مقدمة

تناقش مقدمة هذه الورقة البحثية الأهمية المتزايدة لأنظمة التوصية عبر مجالات مختلفة، التي تسهلها التقدمات في نماذج اللغة الكبيرة (LLMs). لقد أظهرت هذه النماذج، مثل GPT-4 و LLaMA، قدرات استيعاب لغوي استثنائية، مما يمكنها من التقاط تفضيلات المستخدمين بدقة أكبر. ومع ذلك، تواجه أنظمة التوصية المعتمدة على LLM تحديات، بما في ذلك توليد توصيات خيالية ونقص المعرفة الحالية والمتخصصة في المجال. لمعالجة هذه القضايا، يبرز البحث إمكانيات طرق الجيل المعزز بالاسترجاع (RAG)، التي تستخدم قواعد بيانات خارجية لتعزيز قاعدة المعرفة لـ LLMs، وبالتالي تحسين دقة التوصية.

على الرغم من وعد RAG، غالبًا ما تقدم الطرق التقليدية ضوضاء وتتجاهل العلاقات الهيكلية بين الكيانات، مما يمكن أن يعيق أداء التوصية. يقترح المؤلفون إطار عمل جديد يسمى K-RagRec، الذي يهدف إلى تعزيز التوصيات المعتمدة على LLM من خلال استرجاع المعرفة ذات الصلة من الرسوم البيانية للمعرفة الخاصة بالعناصر (KGs). يتضمن هذا الإطار طريقة فهرسة ذات دقة خشن ودقيق لـ KGs، واستراتيجية استرجاع انتقائية تعتمد على الشعبية لتحسين الكفاءة، وشبكة عصبية رسومية (GNN) لمواءمة المعرفة المسترجعة مع الفضاء الدلالي لـ LLM. يؤكد البحث على أهمية الاستفادة الفعالة من البيانات المنظمة لتحسين قدرات التوصية ويحدد مساهمات الإطار، بما في ذلك التجارب الشاملة على مجموعات بيانات من العالم الحقيقي للتحقق من فعاليتها.

طرق

في هذا القسم، يحدد المؤلفون المنهجية وراء إطار العمل المقترح، K-RagRec، الذي يعزز أداء أنظمة التوصية المعتمدة على نماذج اللغة الكبيرة (LLM) من خلال دمج البيانات المنظمة من الرسوم البيانية للمعرفة. يبرزون قيود الأنظمة الحالية المعتمدة على LLM، التي غالبًا ما تتطلب ضبطًا دقيقًا متكررًا على مجموعات بيانات محددة، مما يؤدي إلى عدم الكفاءة وزيادة التكاليف. يعالج K-RagRec هذه القضايا من خلال استخدام شبكة عصبية رسومية (GNN) للفهرسة التي تسترجع كل من المعرفة الخشنة والدقيقة من الرسوم البيانية للمعرفة، مما يحسن دقة التوصيات مقارنةً بالطرق السابقة لـ GraphRAG التي تستخدم فقط الطبقة الأخيرة من تمثيلات GNN.

يتناول المؤلفون أيضًا تحدي وقت الاسترجاع، الذي يمكن أن يؤثر سلبًا على تجربة المستخدم. يقترحون طريقة جديدة تستفيد من شعبية العناصر، بناءً على توزيع قانون القوة، لتحديد ضرورة الاسترجاع، مما يقلل بشكل كبير من أوقات الاسترجاع مع الحفاظ على دقة التوصية العالية. علاوة على ذلك، يدمج K-RagRec بشكل مبتكر الرسوم البيانية المعرفية المسترجعة في الاستعلام كتحفيز رسومي، بدلاً من نص، مما يقلل من الضوضاء ويعزز قدرات التفكير لـ LLMs من خلال الحفاظ على العلاقات الهيكلية بين الكيانات. لا تعمل هذه الطريقة على تبسيط عملية التوصية فحسب، بل تحسن أيضًا موثوقية النظام بشكل عام.

نقاش

في هذا القسم، يناقش المؤلفون التقدمات والتحديات في تقنيات الجيل المعزز بالاسترجاع (RAG)، خاصة في سياق تعزيز أنظمة التوصية باستخدام الرسوم البيانية للمعرفة (KGs). يبرزون أنه بينما حققت طرق RAG التقليدية، مثل REALM و DPR، تقدمًا في توليد نص متماسك من خلال استرجاع المعلومات ذات الصلة، فإنها غالبًا ما تفشل بسبب الضوضاء والبيانات غير ذات الصلة في عملية الاسترجاع. لقد تحولت الدراسات الحديثة نحو استخدام المعرفة المنظمة من KGs لتحسين دقة وموثوقية المحتوى المولد. يقترح المؤلفون إطار عمل جديد، K-RagRec، الذي يدمج KGs الخارجية لتعزيز نماذج اللغة الكبيرة (LLMs) لتحقيق توصيات أكثر فعالية. يستخدم هذا الإطار عملية متعددة الخطوات تشمل الفهرسة الدلالية للرسوم البيانية المعرفية الفرعية، وسياسة استرجاع انتقائية تعتمد على الشعبية، وآلية إعادة ترتيب لتحسين جودة المعلومات المسترجعة.

تستفيد الطريقة المقترحة من المعرفة الواقعية الغنية المشفرة في KGs، الممثلة كـ triples، لفهم تفضيلات المستخدمين بشكل أفضل بناءً على تفاعلاتهم التاريخية مع العناصر. يوضح المؤلفون بنية K-RagRec، التي تتكون من مكونات للفهرسة والاسترجاع والتوصية، مع التأكيد على أهمية التقاط العلاقات الخشنة والدقيقة داخل KG. كما يقدمون سياسة استرجاع انتقائية تعتمد على الشعبية لتحسين كفاءة الاسترجاع، خاصةً للعناصر ذات البداية الباردة، التي غالبًا ما تكون ممثلة تمثيلًا ناقصًا في تفاعلات المستخدمين. تظهر النتائج التجريبية أن K-RagRec يتفوق بشكل كبير على المعايير الحالية من حيث دقة وكفاءة التوصية، بينما يقلل أيضًا من حدوث الهلاوس في مخرجات LLM. بشكل عام، يبرز هذا العمل إمكانيات دمج KGs مع LLMs لتعزيز أنظمة التوصية، مما يمهد الطريق للبحوث المستقبلية في هذا المجال.

القيود

تحدد الورقة البحثية ثلاث قيود رئيسية في دراستها الرائدة حول الجيل المعزز بالاسترجاع (RAG) للمعايير المعتمدة على نماذج اللغة الكبيرة (LLM). أولاً، كانت التقييمات مقيدة بنماذج تحتوي على 7 مليار (7b) و 8 مليار (8b) من المعلمات بسبب قيود موارد GPU، مما يشير إلى أن الأعمال المستقبلية يجب أن تستكشف نماذج أكبر لتقييم فعالية الإطار وقابليته للتوسع بشكل أفضل.

ثانيًا، استخدمت الدراسة حصريًا Freebase كرسوم بيانية للمعرفة الخارجية (KG)، والتي، على الرغم من كونها شائعة في مهام التوصية، قد لا توفر فهمًا شاملاً لأداء الطريقة. تهدف الأبحاث المستقبلية إلى دمج KGs إضافية مثل YAGO و DBpedia و Wikipedia لتقييم تأثير مصادر المعرفة المختلفة. أخيرًا، تعترف الورقة بتحدي تصميم سياسة استرجاع انتقائية ذكية لأنظمة التوصية المعتمدة على LLM. بينما تستفيد الطريقة الحالية من شعبية العناصر لتعزيز كفاءة الاسترجاع، ستستكشف التحقيقات المستقبلية آليات أكثر تكيفًا، مثل التعلم المعزز، لضبط سياسات الاسترجاع ديناميكيًا استجابةً للاهتمامات المتطورة للمستخدمين.

Journal: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
DOI: https://doi.org/10.18653/v1/2025.acl-long.1317
Publication Date: 2025-01-01
Author(s): Shijie Wang et al.
Primary Topic: Cognitive Computing and Networks

Overview

The section discusses the growing importance of recommender systems in managing information overload in user-oriented online services, particularly highlighting the role of Large Language Models (LLMs) in advancing these systems. Despite their potential, LLM-based recommender systems encounter challenges such as hallucinations and a lack of current, domain-specific knowledge. To mitigate these issues, the paper introduces a novel framework called K-RagRec, which utilizes Retrieval-Augmented Generation (RAG) techniques to incorporate high-quality, structured information from knowledge graphs (KGs) into the recommendation process.

K-RagRec enhances the recommendation capabilities of LLMs by employing a Graph Neural Network (GNN) and Pre-trained Language Model (PLM) for semantic indexing of KGs, allowing for both coarse and fine-grained retrieval. Additionally, a popularity selective retrieval policy is implemented to optimize retrieval efficiency based on item popularity. The framework’s design enables more expressive graph encoding of the retrieved knowledge sub-graphs, allowing LLMs to effectively utilize structural information while minimizing the need for extensive context input. Experimental results across three real-world datasets validate the effectiveness of the K-RagRec framework in improving recommendation outcomes.

Introduction

The introduction of this research paper discusses the growing significance of recommender systems across various domains, facilitated by advancements in Large Language Models (LLMs). These models, such as GPT-4 and LLaMA, have demonstrated exceptional language understanding capabilities, enabling them to capture user preferences more accurately. However, LLM-based recommender systems face challenges, including the generation of fictional recommendations and a lack of up-to-date, domain-specific knowledge. To address these issues, the paper highlights the potential of Retrieval-Augmented Generation (RAG) methods, which utilize external databases to enhance the knowledge base of LLMs, thereby improving recommendation accuracy.

Despite the promise of RAG, traditional methods often introduce noise and overlook the structural relationships between entities, which can hinder recommendation performance. The authors propose a novel framework called K-RagRec, which aims to enhance LLM-based recommendations by retrieving relevant knowledge from item knowledge graphs (KGs). This framework includes a coarse and fine granularity indexing method for KGs, a popularity selective retrieval strategy to optimize efficiency, and a graph neural network (GNN) for aligning retrieved knowledge with the LLM’s semantic space. The paper emphasizes the importance of effectively leveraging structured data to improve recommendation capabilities and outlines the framework’s contributions, including comprehensive experiments on real-world datasets to validate its effectiveness.

Methods

In this section, the authors outline the methodology behind their proposed framework, K-RagRec, which enhances the performance of large language model (LLM)-based recommender systems by integrating structured data from knowledge graphs. They highlight the limitations of existing LLM-based systems, which often require frequent fine-tuning on specific datasets, leading to inefficiencies and increased costs. K-RagRec addresses these issues by employing an indexing Graph Neural Network (GNN) that retrieves both coarse and fine-grained knowledge from knowledge graphs, thereby improving the precision of recommendations compared to previous GraphRAG approaches that only utilize the last layer of GNN representations.

The authors also tackle the challenge of retrieval time, which can adversely affect user experience. They propose a novel method that leverages the popularity of items, based on a power law distribution, to determine the necessity of retrieval, significantly reducing retrieval times while maintaining high recommendation accuracy. Furthermore, K-RagRec innovatively incorporates retrieved knowledge subgraphs into the query as a graph prompt, rather than as text, which minimizes noise and enhances the reasoning capabilities of LLMs by preserving the structural relationships between entities. This approach not only streamlines the recommendation process but also improves the overall reliability of the system.

Discussion

In this section, the authors discuss the advancements and challenges in Retrieval-Augmented Generation (RAG) technologies, particularly in the context of enhancing recommendation systems using Knowledge Graphs (KGs). They highlight that while traditional RAG methods, such as REALM and DPR, have made strides in generating coherent text by retrieving relevant information, they often fall short due to noise and irrelevant data in the retrieval process. Recent studies have shifted focus towards utilizing structured knowledge from KGs to improve the accuracy and reliability of generated content. The authors propose a novel framework, K-RagRec, which integrates external KGs to augment Large Language Models (LLMs) for more effective recommendations. This framework employs a multi-step process that includes semantic indexing of knowledge sub-graphs, a popularity selective retrieval policy, and a re-ranking mechanism to enhance the quality of retrieved information.

The proposed method leverages the rich factual knowledge encoded in KGs, represented as triples, to better understand user preferences based on their historical interactions with items. The authors detail the architecture of K-RagRec, which consists of components for indexing, retrieval, and recommendation, emphasizing the importance of capturing both coarse and fine-grained relationships within the KG. They also introduce a popularity selective retrieval policy to optimize retrieval efficiency, particularly for cold-start items, which are often underrepresented in user interactions. The experimental results demonstrate that K-RagRec significantly outperforms existing baselines in terms of recommendation accuracy and efficiency, while also reducing the incidence of hallucinations in LLM outputs. Overall, this work underscores the potential of integrating KGs with LLMs to enhance recommendation systems, paving the way for future research in this area.

Limitations

The research paper identifies three primary limitations in its pioneering study on knowledge-graph retrieval-augmented generation (RAG) for large language model (LLM)-based recommendations. Firstly, the evaluation was constrained to 7 billion (7b) and 8 billion (8b) parameter models due to GPU resource limitations, suggesting that future work should explore larger models to better assess the framework’s effectiveness and scalability.

Secondly, the study exclusively utilized Freebase as the external knowledge graph (KG), which, while common in recommendation tasks, may not provide a comprehensive understanding of the method’s performance. Future research aims to incorporate additional KGs such as YAGO, DBpedia, and Wikipedia to evaluate the influence of different knowledge sources. Lastly, the paper acknowledges the challenge of designing an intelligent selective retrieval policy for LLM-based recommender systems. While the current approach leverages item popularity to enhance retrieval efficiency, future investigations will explore more adaptive mechanisms, such as reinforcement learning, to dynamically adjust retrieval policies in response to evolving user interests.