فك تشابك جودة براءات الاختراع: استخدام نموذج لغوي كبير لمراجعة أدبية منهجية Disentangling patent quality: using a large language model for a systematic literature review

المجلة: Scientometrics، المجلد: 130، العدد: 1
DOI: https://doi.org/10.1007/s11192-024-05206-w
تاريخ النشر: 2025-01-01
المؤلف: Valentin J. Schmitt
الموضوع الرئيسي: الملكية الفكرية وبراءات الاختراع

نظرة عامة

يتناول هذا القسم من ورقة البحث التعقيدات المرتبطة بتقييم جودة براءات الاختراع، مع تسليط الضوء على قضايا مثل المصطلحات القابلة للتبادل والمؤشرات المتداخلة. لمواجهة هذه التحديات، يقترح المؤلفون إطار عمل شامل يستند إلى نظرية أصحاب المصلحة، والتي تشمل الأبعاد الاقتصادية والقانونية والتكنولوجية لجودة براءات الاختراع. تم إجراء مراجعة منهجية للأدبيات باستخدام قدرات نموذج اللغة الكبير GPT-4، حيث تم تحليل 5,141 مقالة علمية وتضييقها إلى 762 دراسة ذات صلة. من هذه الدراسات، تم تحديد 985 مؤشرًا متميزًا لتقييم جودة براءات الاختراع وتصنيفها وفقًا للأبعاد المحددة.

تشير النتائج إلى أن الاقتباسات الأمامية، وحجم الأسرة، وعدد المطالبات هي المؤشرات الأكثر اقتباسًا، مع تركيز كبير على الجودة التكنولوجية في حوالي ثلثي الأدبيات. كما تحدد الدراسة تحديات مثل ضعف القابلية للتكرار في البحث بسبب التعريفات غير المتسقة لمؤشرات مثل حجم الأسرة. لمعالجة هذه القضايا، يقترح المؤلفون ثمانية مقترحات بحثية تدعو إلى تقييم نقدي للمؤشرات، واستخدام منهجيات متقدمة، وقياس مقاييس معقدة. بشكل عام، تسهم هذه الدراسة في المجال من خلال تقديم إطار عمل منظم للدراسات المستقبلية حول تقييم جودة براءات الاختراع وتظهر إمكانيات نماذج اللغة الكبيرة في تعزيز مراجعات الأدبيات المنهجية.

مقدمة

تؤكد مقدمة ورقة البحث على الدور الحاسم لتحليل براءات الاختراع في تشكيل استراتيجيات الشركات، لا سيما في القطاعات المدفوعة بالتكنولوجيا. تحدد إطار عمل شامل لاستراتيجية براءات الاختراع يتضمن استراتيجيات ملكية، ودفاعية، واستغلالية، تهدف كل منها إلى حماية المزايا التنافسية، وتسهيل التسويق، وتعظيم العوائد الاقتصادية من خلال مزايا التفاوض. تسلط الورقة الضوء على تعقيد جودة براءات الاختراع، التي تختلف بشكل كبير عبر براءات الاختراع والاستراتيجيات المختلفة، وتلاحظ الاستخدام القابل للتبادل لمصطلحات مثل القيمة، والقوة، والجودة في الأدبيات. تعقد هذه الغموض تقييم براءات الاختراع، حيث قد يكون لدى أصحاب المصلحة المختلفين – المهندسين، ومحامي البراءات، والاقتصاديين – تفسيرات متباينة لما يشكل براءة اختراع عالية الجودة.

لمعالجة هذه التحديات، تقترح الورقة إطار عمل متعدد الأبعاد لتقييم جودة براءات الاختراع، مستندة إلى نظرية أصحاب المصلحة لالتقاط وجهات نظر أصحاب المصلحة المختلفة. تحدد ثلاثة أبعاد لجودة براءات الاختراع: الجودة الاقتصادية، التي تقيم قدرة براءة الاختراع على توليد القيمة؛ الجودة القانونية، التي تقيم قدراتها الحماية؛ والجودة التكنولوجية، التي تأخذ في الاعتبار مساهماتها الابتكارية في المجتمع. تمهد المقدمة الطريق لأسئلة البحث التي تهدف إلى توضيح تطبيق مفاهيم جودة براءات الاختراع عبر أصحاب المصلحة وتحديد المؤشرات المناسبة لتقييم شامل. تنوي الدراسة استخدام مراجعة منهجية للأدبيات (SLR) لاستكشاف هذه الأبعاد وتطوير أجندة بحثية لتحسين منهجيات تقييم جودة براءات الاختراع.

الطرق

تشمل المنهجية المستخدمة في هذا البحث مراجعة منهجية للأدبيات (SLR) تهدف إلى تقييم الجودة متعددة الأبعاد للبراءات. تم تصميم SLR لتوفير نظرة شاملة على الدراسات الموجودة في هذا المجال، مما يسهم في تطوير الأطر النظرية المحيطة بتقييم جودة براءات الاختراع (Paré et al., 2015; Petticrew & Roberts, 2012). الهدف الرئيسي هو تحديد المقالات التي تقيم جودة براءات الاختراع من خلال نهج منظم.

استنادًا إلى المبادئ الأساسية لـ SLR كما هو موضح من قبل Grant وBooth (2009) وXiao وWatson (2019)، تتكون المنهجية من ثلاث خطوات رئيسية: (1) البحث في الأدبيات، (2) تحديد الأدبيات ذات الصلة، و(3) استخراج المعلومات ذات الصلة. تتبع هذه الخطوات الممارسات المقبولة على نطاق واسع داخل المجتمع العلمي (Ananthraman et al., 2023; Girgin Kalıp et al., 2022; Grimaldi & Cricelli, 2020). يتم توضيح تدفق المعلومات التفصيلي خلال هذه الخطوات وفقًا لبيان PRISMA (Liberati et al., 2009)، مع تقديم تفاصيل إضافية في الملحق A.

النتائج

يستعرض قسم النتائج النتائج المستخلصة من تحليل المحتوى للمقالات المتعلقة بجودة البراءات متعددة الأبعاد. يبدأ بتقديم نظرة عامة على المؤشرات الأكثر استخدامًا، والتي يتم تصنيفها بعد ذلك إلى أبعاد متميزة لجودة البراءات. يسمح هذا التصنيف بتصنيف المقالات ذات الصلة وفقًا لهذه الأبعاد.

بالإضافة إلى ذلك، يتضمن القسم تحليلًا شاملاً للأبعاد المحددة، مع التركيز على تطورها بمرور الوقت. يتم تفصيل نتائج التحليل الببليومتري، التي تدعم هذه النتائج بشكل أكبر، في الملحق F. يبرز هذا النهج المنظم أهمية المؤشرات المختلفة في تقييم جودة البراءات واتجاهاتها ضمن الأدبيات.

المناقشة

في قسم المناقشة من ورقة البحث، يحدد المؤلفون منهجية مراجعة منهجية للأدبيات (SLR) لتقييم جودة براءات الاختراع، مع التأكيد على أهمية استخدام قواعد بيانات متعددة لضمان تغطية شاملة. اختاروا Scopus وWeb of Science وIEEE Xplore وGoogle Scholar، حيث تساهم كل منها بقوة فريدة في عملية البحث. استخدم المؤلفون أوامر بحث محددة مصممة وفقًا لهيكل كل قاعدة بيانات، مع دمج الاقتطاع والمرادفات لالتقاط مجموعة واسعة من الأدبيات ذات الصلة. أسفر هذا النهج الدقيق عن مجموعة بيانات أولية تضم 5141 مقالة، والتي تم تنقيحها لاحقًا من خلال معايير الإدراج/الاستبعاد الرسمية والفحص الآلي باستخدام نموذج لغة كبير (LLM)، وهو تحديدًا GPT-4.

أدى عملية الفحص إلى تحديد 1100 مقالة ذات صلة، مع مجموعة بيانات نهائية تضم 762 بعد تقييم يدوي إضافي. استخرج المؤلفون ما مجموعه 985 مؤشرًا مرتبطًا بجودة براءات الاختراع من هذه المقالات، مع تسليط الضوء على المؤشرات الأكثر استخدامًا مثل الاقتباسات الأمامية وحجم الأسرة. كما قاموا بتصنيف هذه المؤشرات إلى ثلاثة أبعاد لجودة براءات الاختراع: الاقتصادية، والقانونية، والتكنولوجية، باستخدام GPT-4 للمساعدة في هذا التصنيف. أشارت النتائج إلى مستوى عالٍ من الاتفاق بين GPT-4 والمقيمين البشريين، لا سيما في البعد الاقتصادي، بينما كشفت أيضًا عن تحديات في تحقيق تصنيف متسق عبر جميع الأبعاد. تؤكد هذه الدراسة على إمكانيات نماذج اللغة الكبيرة في مراجعات الأدبيات مع الاعتراف بضرورة الإشراف البشري لمعالجة التحيزات وضمان التصنيفات الدقيقة.

Journal: Scientometrics, Volume: 130, Issue: 1
DOI: https://doi.org/10.1007/s11192-024-05206-w
Publication Date: 2025-01-01
Author(s): Valentin J. Schmitt
Primary Topic: Intellectual Property and Patents

Overview

This section of the research paper addresses the complexities involved in assessing patent quality, highlighting issues such as interchangeable terminology and overlapping indicators. To tackle these challenges, the authors propose a comprehensive framework grounded in stakeholder theory, which encompasses economic, legal, and technological dimensions of patent quality. A systematic literature review utilizing the capabilities of the large language model GPT-4 was conducted, analyzing 5,141 scientific articles and narrowing down to 762 relevant studies. From these, 985 distinct indicators for patent quality assessment were identified and categorized according to the defined dimensions.

The findings indicate that forward citations, family size, and the number of claims are the most frequently cited indicators, with a significant emphasis on technological quality in approximately two-thirds of the literature. The study also identifies challenges such as poor reproducibility in research due to inconsistent definitions of indicators like family size. To address these issues, the authors propose eight research propositions that advocate for a critical evaluation of indicators, the use of advanced methodologies, and the quantification of complex metrics. Overall, this research contributes to the field by providing a structured framework for future studies on patent quality assessment and demonstrates the potential of large language models in enhancing systematic literature reviews.

Introduction

The introduction of the research paper emphasizes the critical role of patent analysis in shaping corporate strategies, particularly in technology-driven sectors. It outlines a comprehensive patent strategy framework that includes proprietary, defensive, and leveraging strategies, each aimed at safeguarding competitive advantages, facilitating commercialization, and maximizing economic returns through negotiation advantages. The paper highlights the complexity of patent quality, which varies significantly across different patents and strategies, and notes the interchangeable use of terms such as value, strength, and quality in the literature. This ambiguity complicates the assessment of patents, as different stakeholders—engineers, patent attorneys, and economists—may have divergent interpretations of what constitutes a high-quality patent.

To address these challenges, the paper proposes a multidimensional framework for assessing patent quality, drawing on stakeholder theory to capture the varying perspectives of different stakeholders. It identifies three dimensions of patent quality: economic quality, which evaluates a patent’s ability to generate value; legal quality, which assesses its protective capabilities; and technological quality, which considers its innovative contributions to society. The introduction sets the stage for the research questions aimed at clarifying the application of patent quality concepts across stakeholders and identifying suitable indicators for a comprehensive assessment. The study intends to utilize a systematic literature review (SLR) to explore these dimensions and develop a research agenda for improving patent quality assessment methodologies.

Methods

The methodology employed in this research involves a systematic literature review (SLR) aimed at assessing the multidimensional quality of patents. The SLR is designed to provide a comprehensive overview of existing studies in this domain, thereby contributing to the development of theoretical frameworks surrounding patent quality assessment (Paré et al., 2015; Petticrew & Roberts, 2012). The primary objective is to identify articles that evaluate patent quality through a structured approach.

Following the foundational principles of SLR as outlined by Grant and Booth (2009) and Xiao and Watson (2019), the methodology consists of three key steps: (1) searching the literature, (2) identifying relevant literature, and (3) extracting pertinent information. These steps adhere to widely accepted practices within the scientific community (Ananthraman et al., 2023; Girgin Kalıp et al., 2022; Grimaldi & Cricelli, 2020). The detailed flow of information throughout these steps is illustrated in accordance with the PRISMA statement (Liberati et al., 2009), with additional specifics provided in Appendix A.

Results

The Results section outlines the findings from a content analysis of articles related to multidimensional patent quality. It begins by providing an overview of the most frequently utilized indicators, which are subsequently categorized into distinct dimensions of patent quality. This categorization allows for the classification of relevant articles according to these dimensions.

Additionally, the section includes a comprehensive analysis of the identified dimensions, focusing on their evolution over time. The bibliometric analysis results, which further support these findings, are detailed in Appendix F. This structured approach highlights the significance of various indicators in assessing patent quality and their trends within the literature.

Discussion

In the discussion section of the research paper, the authors outline a systematic literature review (SLR) methodology for assessing patent quality, emphasizing the importance of utilizing multiple databases to ensure comprehensive coverage. They selected Scopus, Web of Science, IEEE Xplore, and Google Scholar, each contributing unique strengths to the search process. The authors employed specific search commands tailored to each database’s structure, incorporating truncation and synonyms to capture a wide range of relevant literature. This meticulous approach resulted in an initial dataset of 5141 articles, which was subsequently refined through formal inclusion/exclusion criteria and automated screening using a large language model (LLM), specifically GPT-4.

The screening process led to the identification of 1100 relevant articles, with a final dataset of 762 after further manual assessment. The authors extracted a total of 985 indicators related to patent quality from these articles, highlighting the most frequently used indicators such as forward citations and family size. They also categorized these indicators into three dimensions of patent quality: economic, legal, and technological, using GPT-4 to assist in this classification. The results indicated a high level of agreement between GPT-4 and human raters, particularly in the economic dimension, while also revealing challenges in achieving consistent categorization across all dimensions. This study underscores the potential of LLMs in literature reviews while acknowledging the necessity of human oversight to address biases and ensure accurate classifications.