مسح لإثباتات عدم المعرفة القابلة للتحقق المعتمدة على التعلم الآلي A survey of zero-knowledge proof based verifiable machine learning

المجلة: Artificial Intelligence Review، المجلد: 59، العدد: 7
DOI: https://doi.org/10.1007/s10462-026-11557-y
تاريخ النشر: 2026-04-13
المؤلف: Zhenyun Du وآخرون
الموضوع الرئيسي: الصلابة ضد الهجمات في تعلم الآلة

نظرة عامة

تقدم هذه القسم نظرة عامة على المجال المتزايد لتعلم الآلة بدون معرفة (ZKML)، الذي يستفيد من إثباتات عدم المعرفة (ZKPs) لتعزيز نزاهة وسرية عمليات تعلم الآلة في البيئات السحابية. تتيح إثباتات عدم المعرفة لطرف واحد التحقق من أن عملية حسابية قد تمت بشكل صحيح دون الكشف عن بيانات حساسة أو معلمات نموذج ملكية. تستعرض الدراسة أبحاث ZKML من يونيو 2017 إلى أغسطس 2025، مصنفة الدراسات الحالية إلى ثلاث مهام رئيسية: التدريب القابل للتحقق، الاختبار القابل للتحقق، والاستدلال القابل للتحقق. كما تسلط الضوء على الأنظمة الرئيسية، وتقارن بين خيارات تصميمها، وتحدد التحديات الكبيرة في التنفيذ، مثل محدودية تعبير الدوائر، وارتفاع تكاليف الإثبات، وتعقيدات النشر.

تؤكد الخاتمة على انتقال ZKML من العروض النظرية إلى أنظمة أكثر كفاءة وقابلية للنشر مع الاعتراف بالتحديات المستمرة المتعلقة بالتحجيم، وتغطية النماذج، والخصوصية طوال دورة حياة تعلم الآلة، وأصالة البيانات. تهدف الدراسة إلى أن تكون مرجعًا منظمًا للباحثين والممارسين الذين يركزون على تطوير حلول تعلم آلة موثوقة وتحافظ على الخصوصية.

مقدمة

تناقش مقدمة الورقة التكامل المتزايد للذكاء الاصطناعي (AI) وتعلم الآلة (ML) في مختلف القطاعات، بما في ذلك توليد المحتوى والرعاية الصحية، مدفوعة بالتقدم في هياكل النماذج وبنية التدريب. ومع ذلك، يأتي هذا التقدم مع زيادة في التكاليف الحاسوبية، حيث تتطلب النماذج الأكبر مجموعات بيانات أكثر اتساعًا وأجهزة متخصصة. لقد جعل ظهور تعلم الآلة كخدمة (MLaaS) تعلم الآلة أكثر سهولة، ولكنه يثير مخاوف كبيرة بشأن خصوصية البيانات وحماية الملكية الفكرية. يتردد العملاء في مشاركة البيانات الحساسة مع مقدمي الخدمات الخارجيين بسبب مخاطر الخصوصية، بينما غالبًا ما يتردد مقدمو الخدمات في الكشف عن معلمات النموذج، مما يخلق عجزًا في الثقة في مهام تعلم الآلة المستعان بها.

لمعالجة هذه التحديات، تدعو الورقة إلى تعلم الآلة القابل للتحقق (ZKML)، الذي يهدف إلى ضمان نزاهة الحساب والخصوصية من خلال استخدام إثباتات عدم المعرفة (ZKP). تتيح إثباتات عدم المعرفة لطرف واحد التحقق من صحة بيان دون الكشف عن معلومات إضافية، مما يجعلها مناسبة لتأكيد الادعاءات المتعلقة بمخرجات تعلم الآلة وعمليات التدريب. توضح المقدمة المساهمات الرئيسية للورقة، بما في ذلك تقييم أنظمة ZKP لاستدلال تعلم الآلة، واستطلاع أنظمة ZKML المبكرة، وتشكيل نماذج تعلم الآلة المعتمدة على ZKP اللامركزية، وتقييم أطر ZKP مفتوحة المصدر. تهدف هذه المساهمات إلى تعزيز الثقة والأمان في تطبيقات تعلم الآلة المستعان بها مع الحفاظ على سرية البيانات الحساسة والنماذج الملكية.

نقاش

ت outlines قسم النقاش في ورقة البحث سير العمل ومساهمات تعلم الآلة بدون معرفة (ZKML) في ضمان الاستدلال القابل للتحقق مع الحفاظ على سرية النموذج. في سير العمل المقترح لاستدلال ZKML، يعمل مزود خدمة تعلم الآلة (ML) كالمثبت (P)، حيث يقوم بحساب التوقعات بناءً على نموذج خاص ويولد إثباتًا (π) يشهد على صحة المخرجات. يقدم العميل، كالمتحقق (V)، المدخلات ويتحقق من صحة الإثبات دون الوصول إلى معلمات النموذج. يتم صياغة هذه العملية من خلال العلاقة \( F(x, \hat{y}, \theta) = f_\theta(x) – \hat{y} = 0 \)، مما يضمن أن العميل يمكنه الوثوق بالتوقع بينما يظل نموذج المزود سريًا.

تسلط الورقة أيضًا الضوء على الطبيعة المجزأة للدراسات الحالية حول ZKML، التي غالبًا ما تركز على جوانب ضيقة من المجال، مثل تصميم الدوائر أو التطبيقات المحددة. لمعالجة هذه الفجوة، أجرى المؤلفون استطلاعًا منهجيًا لدراسات ZKML من يونيو 2017 إلى أغسطس 2025، حيث قاموا بتحليل 27 عملًا تمثيليًا. تشمل مساهماتهم نظرة شاملة على مشهد ZKML، ورؤى حول التطور الفني وتحديات التنفيذ، وجسرًا من البحث إلى التطبيقات العملية، وتحديد المشكلات المفتوحة واتجاهات البحث المستقبلية. تهدف الدراسة إلى تقديم فهم شامل لـ ZKML، يشمل الفئات الخوارزمية، والتقدم الفني، واستراتيجيات النشر التجارية.

Journal: Artificial Intelligence Review, Volume: 59, Issue: 7
DOI: https://doi.org/10.1007/s10462-026-11557-y
Publication Date: 2026-04-13
Author(s): Zhenyun Du et al.
Primary Topic: Adversarial Robustness in Machine Learning

Overview

This section provides an overview of the growing field of zero-knowledge machine learning (ZKML), which leverages zero-knowledge proofs (ZKPs) to enhance the integrity and confidentiality of machine learning processes in cloud-based environments. ZKPs enable one party to verify that a computation was performed correctly without disclosing sensitive data or proprietary model parameters. The survey reviews ZKML research from June 2017 to August 2025, categorizing existing studies into three primary tasks: verifiable training, verifiable testing, and verifiable inference. It also highlights key systems, compares their design choices, and identifies significant implementation challenges, such as limited circuit expressiveness, high proving costs, and deployment complexities.

The conclusion emphasizes the transition of ZKML from theoretical demonstrations to more efficient and deployable systems while acknowledging ongoing challenges related to scalability, model coverage, privacy throughout the machine learning lifecycle, and data authenticity. The survey aims to serve as a structured reference for researchers and practitioners focused on developing trustworthy and privacy-preserving machine learning solutions.

Introduction

The introduction of the paper discusses the growing integration of artificial intelligence (AI) and machine learning (ML) in various sectors, including content generation and healthcare, driven by advancements in model architectures and training infrastructure. However, this progress comes with increased computational costs, as larger models necessitate more extensive datasets and specialized hardware. The emergence of Machine Learning as a Service (MLaaS) has made ML more accessible but raises significant concerns regarding data privacy and the protection of intellectual property. Clients are hesitant to share sensitive data with external providers due to privacy risks, while providers are often reluctant to disclose model parameters, creating a trust deficit in outsourced ML tasks.

To address these challenges, the paper advocates for verifiable machine learning (ZKML), which aims to ensure computational integrity and privacy through the use of zero-knowledge proofs (ZKP). ZKPs enable one party to verify the truth of a statement without revealing additional information, making them suitable for confirming claims related to ML outputs and training processes. The introduction outlines the main contributions of the paper, including benchmarking ZKP systems for ML inference, surveying early ZKML systems, formalizing decentralized ZKP-based ML models, and assessing open-source ZKP frameworks. These contributions aim to enhance trust and security in outsourced ML applications while maintaining the confidentiality of sensitive data and proprietary models.

Discussion

The discussion section of the research paper outlines the workflow and contributions of Zero-Knowledge Machine Learning (ZKML) in ensuring verifiable inference while maintaining model confidentiality. In the proposed ZKML inference workflow, the machine learning (ML) service provider acts as the prover (P), computing predictions based on a private model and generating a proof (π) that attests to the correctness of the output. The client, as the verifier (V), provides the input and checks the validity of the proof without accessing the model parameters. This process is formalized by the relation \( F(x, \hat{y}, \theta) = f_\theta(x) – \hat{y} = 0 \), ensuring that the client can trust the prediction while the provider’s model remains confidential.

The paper also highlights the fragmented nature of existing surveys on ZKML, which often focus on narrow aspects of the field, such as circuit design or specific applications. To address this gap, the authors conducted a systematic survey of ZKML studies from June 2017 to August 2025, analyzing 27 representative works. Their contributions include a comprehensive overview of the ZKML landscape, insights into technical evolution and implementation challenges, a bridge from research to practical applications, and identification of open problems and future research directions. The survey aims to provide a holistic understanding of ZKML, encompassing algorithmic categories, technical advancements, and commercial deployment strategies.