إطار التعلم الفيدرالي المخصص القائم على البلوكشين لأنظمة توصية الأدوية المقاومة لتسميم النموذج Blockchain-based personalized federated learning framework for drug recommendation systems resilient to model poisoning

المجلة: Neural Computing and Applications، المجلد: 38، العدد: 1
DOI: https://doi.org/10.1007/s00521-025-11828-9
تاريخ النشر: 2026-01-01
المؤلف: Sina Apak وآخرون
الموضوع الرئيسي: تعلم الآلة في الرعاية الصحية

نظرة عامة

تقدم ورقة البحث إطار عمل للتعلم الفيدرالي المخصص (PFL) قائم على تقنية البلوكشين يهدف إلى تعزيز الأمان والخصوصية والكفاءة في بيئات الرعاية الصحية اللامركزية. من خلال السماح لعدة كيانات صحية بتدريب نموذج عالمي بشكل تعاوني دون مشاركة بيانات المرضى الخام، يعالج الإطار الثغرات الموجودة في التعلم الفيدرالي، مثل تسميم النموذج وحقن البيانات. تشمل المكونات الرئيسية دمج تحمل الأخطاء البيزنطية العملية (PBFT) لتجميع النموذج بشكل آمن، وتصنيف الشذوذ باستخدام معيار L2 للدفاع ضد الهجمات، ونموذج هجين محسن باستخدام بحث بنية الشبكات العصبية (NAS) يجمع بين الشبكات العصبية طويلة الأمد (LSTM) ووحدات التكرار المغلقة (GRU) مع آلية الانتباه. تم تصميم هذه البنية للتعامل بكفاءة مع بيانات الرعاية الصحية غير المستقلة وغير الموزعة (non-IID) مع تقليل عدد المعلمات القابلة للتدريب بشكل كبير، مما يجعلها مناسبة للأجهزة ذات الموارد المحدودة.

تم تقييم فعالية الإطار المقترح من خلال مهام توصية الأدوية باستخدام ثلاثة مجموعات بيانات طبية حقيقية: Symptom2Disease وUCL Drug وDermo Questions، محققة درجات F1 تبلغ 0.97 و0.70 و0.83 على التوالي. لم يظهر الإطار أداءً تنافسياً مقارنة بالطرق التقليدية فحسب، بل خفف أيضًا من التهديدات العدائية بشكل فعال، حيث نجح في تحديد 100% من هجمات تسميم النموذج المحاكية. تم تأكيد الحفاظ على الخصوصية من خلال تحليل المعلومات المتبادلة (MI)، الذي أشار إلى تسرب معلومات ضئيل أثناء التدريب الفيدرالي. بشكل عام، تؤكد الدراسة قدرة الإطار على تقديم توصيات آمنة ومخصصة وتحافظ على الخصوصية في أنظمة الرعاية الصحية الذكية.

مقدمة

تسلط المقدمة الضوء على تقارب تقنية البلوكشين والتعلم الفيدرالي (FL) والرعاية الصحية، مع التأكيد على أهميتها في تعزيز تطبيقات التعلم الآلي التي تحافظ على الخصوصية ومرونة في المجالات الحساسة. تقدم الدراسة إطار عمل للتعلم الفيدرالي المخصص (PFL) يتضمن طبقات تكيف خاصة بالعميل لإدارة البيانات غير المستقلة وغير الموزعة (non-IID) بشكل فعال مع الاستفادة من مزايا نموذج فيدرالي عالمي.

في هذا الإطار، يتم مشاركة طبقات النموذج الأساسية فقط بين جميع العملاء، مما يسمح بتعديلات مخصصة على مستوى العميل. يهدف هذا النهج المبتكر إلى تقديم حل قابل للتوسع يحقق توازنًا مثاليًا بين الأمان والخصوصية والأداء، وبالتالي معالجة التحديات الحرجة في مجال أنظمة توصية الأدوية.

طرق

توضح قسم المنهجية تقييم إطار عمل التعلم الفيدرالي المخصص (PFL) القائم على البلوكشين المصمم لأنظمة توصية الأدوية. كانت التجارب تهدف إلى تقييم أداء الإطار عبر عدة أبعاد، بما في ذلك الدقة، والحفاظ على الخصوصية، والقدرة على مواجهة الهجمات العدائية، والكفاءة الحسابية. تم تقييم الإطار المقترح مقابل التعلم الفيدرالي التقليدي (FL) ونماذج التعلم العميق المستقلة باستخدام مجموعات بيانات طبية متنوعة. تشير النتائج الرئيسية إلى أن نموذج LSTM-GRU المدمج المحسن باستخدام NAS، المدعوم بآليات الانتباه، يلتقط بشكل فعال التبعيات التسلسلية المعقدة، مما يحسن دقة وموثوقية التوصيات.

لضمان التحقق الآمن واللامركزي للنموذج، يدمج الإطار طبقة بلوكشين خاصة مرخصة، مما يسهل الثقة الموزعة. تتحقق عقد الموثق من تحديثات النموذج المقدمة من العملاء من خلال طرق تشفير وإحصائية، مما يمنع المساهمات المسمومة من التأثير على النموذج العالمي مع الحفاظ على سجل يمكن تدقيقه لتطور النموذج. تم استخدام خوارزمية توافق تحمل الأخطاء البيزنطية العملية (PBFT) لنهائيتها الحتمية، مما يضمن أن تحديثات النموذج يتم إنهاؤها فقط عندما يتفق عدد كافٍ من عقد الموثق. تتضمن العملية إرسال العملاء لمعلمات النموذج المدربة محليًا، والتي يتم التحقق منها وتقييمها بحثًا عن الشذوذ قبل تقديمها للتوافق. يتم تسجيل التحديثات الناجحة بشكل غير قابل للتغيير على البلوكشين، الذي يخزن بيانات وصفية أساسية، مما يضمن مقاومة التلاعب وقابلية التحقق. تدعم هذه البنية التعلم الفيدرالي الآمن والقابل للتفسير والمرن، خاصة في السياقات العدائية، وتضع الأساس لتقييمات الأداء اللاحقة.

نتائج

تظهر نتائج الدراسة حول التعلم الفيدرالي المخصص فعالية الإطار عبر تكوينات العملاء المختلفة، مع التركيز على مقاييس مثل الدقة، ووقت التدريب لكل جولة، والعبء الاتصالي. تضمنت التجارب زيادة عدد العملاء من 3 إلى 10، مما كشف أن دقة النموذج تحسنت عمومًا مع زيادة عدد العملاء، حيث بلغت ذروتها عند 82.00% مع 10 عملاء. ومع ذلك، كانت هذه الزيادة في الدقة مصحوبة بزيادة في التكاليف الحسابية، حيث ارتفع وقت التدريب لكل جولة من 8 ثوانٍ لثلاثة عملاء إلى 26 ثانية لعشرة عملاء، وارتفع العبء الاتصالي من 15 ميغابايت إلى 50 ميغابايت، على التوالي.

لمعالجة التحديات التي تطرحها توزيعات البيانات غير المستقلة وغير الموزعة، قدمت الدراسة تقنية ضبط دقيقة تسمح لكل عميل بتكييف النموذج المدرب عالميًا مع مجموعة بياناته المحلية. لم يحسن هذا النهج الشخصي أداء التصنيف للعملاء ذوي الخصائص البيانية المتنوعة فحسب، بل حافظ أيضًا على توازن بين كفاءة الاتصال وقابلية التوسع الحسابية. تشير النتائج إلى أنه بينما يمكن أن يؤدي زيادة عدد العملاء إلى تحسين دقة النموذج، فإن دمج ضبط النموذج المحلي أمر حاسم لتحسين الأداء في سيناريوهات التعلم الفيدرالي. كما أوضح تحليل صندوق الرسم توزيع الدقة عبر العملاء، مما يبرز التباينات في الأداء ويقترح أن التكوينات مع 4 و7 عملاء حققت دقة متوسطة أعلى وثباتًا.

نقاش

يسلط قسم النقاش في ورقة البحث الضوء على التقدم الكبير في أنظمة توصية الأدوية، مع التأكيد على دمج تقنيات التعلم الآلي والتعلم العميق. تشمل المنهجيات البارزة تحليل المشاعر لمراجعات الأدوية لتحسين التوصيات بناءً على ملاحظات المرضى، ونماذج هجينة تجمع بين الأساليب التقليدية والحديثة لتعزيز التخصيص. أظهرت الابتكارات الحديثة، مثل بنى المحولات وأطر التعلم العميق، وعدًا في تحسين دقة التوصيات، خاصة في السيناريوهات السريرية المعقدة مثل مرض باركنسون. ومع ذلك، تتطلب النماذج الحالية مزيدًا من الاستكشاف لتحسين تضمينات السياق والفعالية العامة.

تناقش الورقة أيضًا التحديات الحرجة في الذكاء الاصطناعي في الرعاية الصحية، خاصة فيما يتعلق بالهجمات العدائية وتسميم النموذج، التي تهدد موثوقية أنظمة الذكاء الاصطناعي. تقترح استراتيجيات مثل التدريب العدائي والتجميع الآمن للتخفيف من هذه المخاطر، خاصة في بيئات التعلم الفيدرالي حيث تزيد البيانات اللامركزية من الضعف. يتم تقديم دمج تقنية البلوكشين كحل قوي لتعزيز أمان ونزاهة أنظمة التعلم الفيدرالي، مما يضمن أن تحديثات النموذج محصنة ضد التلاعب وموثوقة. يهدف الإطار المقترح إلى سد الفجوات البحثية الحالية من خلال تقديم نهج تعلم فيدرالي مخصص معزز بالبلوكشين يتضمن كشف الشذوذ ونموذج LSTM-GRU المحسن باستخدام NAS، مما يضمن توصيات دوائية فعالة وآمنة مصممة لتلبية احتياجات المرضى الفردية مع الحفاظ على أداء عالٍ في البيئات ذات الموارد المحدودة.

Journal: Neural Computing and Applications, Volume: 38, Issue: 1
DOI: https://doi.org/10.1007/s00521-025-11828-9
Publication Date: 2026-01-01
Author(s): Sina Apak et al.
Primary Topic: Machine Learning in Healthcare

Overview

The research paper introduces a blockchain-based Personalized Federated Learning (PFL) framework aimed at enhancing security, privacy, and efficiency in decentralized healthcare environments. By allowing multiple healthcare entities to collaboratively train a global model without sharing raw patient data, the framework addresses vulnerabilities inherent in federated learning, such as model poisoning and data injection. Key components include the integration of Practical Byzantine Fault Tolerance (PBFT) for secure model aggregation, L2-norm anomaly filtering for adversarial defense, and a Neural Architecture Search (NAS)-optimized hybrid model combining Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) networks with an attention mechanism. This architecture is designed to efficiently handle non-IID healthcare data while significantly reducing the number of trainable parameters, making it suitable for resource-constrained devices.

The effectiveness of the proposed framework was evaluated through drug recommendation tasks using three real-world medical datasets: Symptom2Disease, UCL Drug, and Dermo Questions, achieving F1-scores of 0.97, 0.70, and 0.83, respectively. The framework not only demonstrated competitive performance compared to conventional methods but also effectively mitigated adversarial threats, successfully identifying 100% of simulated model poisoning attacks. Privacy preservation was confirmed through Mutual Information (MI) analysis, which indicated minimal information leakage during federated training. Overall, the study validates the framework’s capability to deliver secure, personalized, and privacy-preserving recommendations in intelligent healthcare systems.

Introduction

The introduction highlights the convergence of blockchain technology, federated learning (FL), and healthcare, emphasizing its significance in enhancing privacy-preserving and resilient machine learning applications in sensitive areas. The study introduces a personalized federated learning (PFL) framework that incorporates client-specific adaptation layers to effectively manage non-independent and identically distributed (non-IID) data while leveraging the advantages of a global federated model.

In this framework, only the base model layers are shared among all clients, allowing for tailored adaptations at the client level. This innovative approach aims to provide a scalable solution that strikes an optimal balance between security, privacy, and performance, thereby addressing critical challenges in the realm of medical recommendation systems.

Methods

The methodology section details the evaluation of a blockchain-based personalized federated learning (PFL) framework designed for drug recommendation systems. The experiments aimed to assess the framework’s performance across several dimensions, including accuracy, privacy preservation, robustness against adversarial attacks, and computational efficiency. The proposed framework was benchmarked against traditional federated learning (FL) and standalone deep learning models using diverse medical datasets. Key findings indicate that the NAS-optimized fusion LSTM-GRU model, enhanced with attention mechanisms, effectively captures complex sequential dependencies, thereby improving recommendation accuracy and reliability.

To ensure secure and decentralized model validation, the framework integrates a private permission blockchain layer, which facilitates distributed trust. Validator nodes verify client-submitted model updates through cryptographic and statistical methods, preventing poisoned contributions from affecting the global model while maintaining an auditable record of model evolution. The Practical Byzantine Fault Tolerance (PBFT) consensus algorithm was employed for its deterministic finality, ensuring that model updates are finalized only when a sufficient number of validator nodes agree. The process involves clients transmitting locally trained model parameters, which are then authenticated and assessed for anomalies before being submitted to consensus. Successful updates are recorded immutably on the blockchain, which stores essential metadata, ensuring tamper resistance and verifiability. This architecture supports secure, explainable, and resilient federated learning, particularly in adversarial contexts, and sets the stage for subsequent performance evaluations.

Results

The results of the study on personalized federated learning demonstrate the framework’s effectiveness across different client configurations, focusing on metrics such as accuracy, training time per round, and communication overhead. The experiments involved increasing the number of clients from 3 to 10, revealing that model accuracy generally improved with more clients, peaking at 82.00% with 10 clients. However, this enhancement in accuracy was accompanied by increased computational costs, with training time per round rising from 8 seconds for 3 clients to 26 seconds for 10 clients, and communication overhead escalating from 15 MB to 50 MB, respectively.

To address the challenges posed by non-IID data distributions, the study introduced a fine-tuning technique that allows each client to adapt the globally trained model to its local dataset. This personalization approach not only improved classification performance for clients with diverse data characteristics but also maintained a balance between communication efficiency and computational scalability. The findings indicate that while increasing client numbers can enhance model accuracy, the integration of local model fine-tuning is crucial for optimizing performance in federated learning scenarios. The box plot analysis further illustrated the accuracy distribution across clients, highlighting variations in performance and suggesting that configurations with 4 and 7 clients yielded higher median accuracy and consistency.

Discussion

The discussion section of the research paper highlights significant advancements in drug recommendation systems, emphasizing the integration of machine learning and deep learning techniques. Notable methodologies include sentiment analysis of drug reviews to refine recommendations based on patient feedback, and hybrid models that combine traditional and modern approaches to enhance personalization. Recent innovations, such as transformer architectures and deep learning frameworks, have shown promise in improving recommendation accuracy, particularly in complex clinical scenarios like Parkinson’s disease. However, existing models require further exploration to optimize contextual embeddings and overall effectiveness.

The paper also addresses critical challenges in healthcare AI, particularly concerning adversarial attacks and model poisoning, which threaten the reliability of AI systems. Strategies such as adversarial training and secure aggregation are proposed to mitigate these risks, especially in federated learning environments where decentralized data increases vulnerability. The integration of blockchain technology is presented as a robust solution to enhance the security and integrity of federated learning systems, ensuring that model updates are tamper-proof and trustworthy. The proposed framework aims to bridge existing research gaps by offering a blockchain-enhanced personalized federated learning approach that incorporates anomaly detection and a Neural Architecture Search (NAS)-optimized LSTM-GRU model, ensuring efficient and secure drug recommendations tailored to individual patient needs while maintaining high performance in resource-constrained settings.