استخراج قواعد الإنتاج الضبابية الموزونة الهجينة باستخدام بحث التناغم المعدل والشبكات العصبية ذات التغذية الراجعة Hybrid weighted fuzzy production rule extraction utilizing modified harmony search and BPNN

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-95406-y
PMID: https://pubmed.ncbi.nlm.nih.gov/40164738
تاريخ النشر: 2025-03-31
المؤلف: Feng Qin وآخرون
الموضوع الرئيسي: الذكاء الاصطناعي في الرعاية الصحية

نظرة عامة

تتناول ورقة البحث تطوير قواعد الإنتاج الضبابية الموزونة (WFPRs) كعنصر حاسم في أنظمة دعم القرار السريري (CDSSs)، بهدف تحسين دقة التشخيص وتسهيل اتخاذ القرارات السريرية المستنيرة. يقترح المؤلفون نهجًا مبتكرًا يدمج خوارزمية البحث عن التناغم مع تعديل الأبعاد الديناميكي (DDA-HS) مع الشبكات العصبية ذات الانتشار العكسي (BPNNs) لتحسين استخراج WFPRs. تعزز خوارزمية DDA-HS عملية البحث من خلال تعديل أبعاد مساحة البحث ديناميكيًا بناءً على تقييمات الملاءمة، مما يحسن الأوزان الأولية لـ BPNNs ويعزز احتمالات الانتقال من خلال سلسلة ماركوف الممتصة. تظهر هذه الطريقة تحسينات كبيرة في دقة التصنيف، حيث تحقق 74.48% لـ BPNN و77.08% لـ WFPR على مجموعة بيانات PIMA، متجاوزة تقنيات التحسين التقليدية.

في الختام، تسلط الدراسة الضوء على فعالية DDA-HS في تحقيق توازن بين تعقيد النموذج والتعميم، حيث تحقق دقة التدريب والاختبار بنسبة 78.30% و74.48% لـ BPNNs، على التوالي. لا يعزز استخراج WFPRs فقط قابلية تفسير نموذج BPNN من خلال توضيح تأثير ميزات الإدخال على التنبؤات، بل يعزز أيضًا الثقة بين المهنيين الصحيين من خلال تقديم أسباب شفافة للقرارات السريرية. ستركز الأبحاث المستقبلية على تطبيق هذا الإطار على مجموعات بيانات أكبر، ومعالجة التحديات مثل الفئات غير المتوازنة والبيانات الضوضائية، واستكشاف ضبط المعلمات التكيفية لتحسين القابلية للتوسع والصلابة عبر بيئات سريرية متنوعة. سيتم السعي لتحقيق التحقق عبر مجموعات البيانات لتعزيز قابلية تعميم الطريقة المقترحة، وبالتالي توسيع تطبيقها في CDSSs.

مقدمة

تستعرض المقدمة العمليات الأساسية لخوارزمية البحث عن التناغم (HS) ونسختها المحسنة، تعديل الأبعاد الديناميكي للبحث عن التناغم (DDA-HS). تعمل خوارزمية HS القياسية من خلال سلسلة من الخطوات: تهيئة ذاكرة التناغم (HM) بحلول عشوائية، تحسين حلول جديدة بناءً على معدل اعتبار ذاكرة التناغم (HMCR) وتعديلات المعلمات، تحديث HM عن طريق استبدال أسوأ حل بآخر أفضل، وإنهاء العملية عند تحقيق معيار التوقف. يتم الإشارة إلى مخطط التدفق الذي يوضح هذه الخطوات ولكنه غير مدرج.

على النقيض من ذلك، يبني DDA-HS على HS من خلال دمج رؤى من تحليل سلسلة ماركوف، والذي يوضح تقارب الخوارزمية وديناميات الاستكشاف والاستغلال. يسلط هذا التحليل الضوء على مخاطر الأمثلية المحلية بسبب احتمالات الانتقال الذاتي ويعزز احتمالات الانتقال الأمامي لتسهيل الحركة نحو حلول أفضل. يتناول DDA-HS هذه التحديات من خلال آلية تعديل الأبعاد الديناميكية التي تتكيف مع عدد الأبعاد التي يتم تحسينها بناءً على مشهد الملاءمة. تشمل الخطوات المحددة لـ DDA-HS توحيد الملاءمة والتطبيع، وتعديل الأبعاد بناءً على الملاءمة، واختيار الأبعاد بشكل دوري، وعملية تعديل مصقولة تستخدم HMCR وطيران ليفي للاستكشاف. يهدف هذا النهج التكراري إلى تحسين تنوع الحلول والأداء في مساحات البحث المعقدة وعالية الأبعاد.

طرق

كان الإعداد التجريبي يهدف إلى تقييم أداء خوارزمية DDA-HS مقارنة بأربع خوارزميات أخرى: HS، بحث الطائر (CS)، AGOHS، وHSCS. تم تنفيذ كل خوارزمية 30 مرة لتقليل التباين العشوائي، باستخدام معالج Intel Core i5 رباعي النواة بتردد 2.3 جيجاهرتز مع 8 جيجابايت من الذاكرة العشوائية وPython 3.7. الدالة الهدف، التي يشار إليها بـ $E(w)$، تقيس إجمالي الخطأ لشبكة عصبية ذات انتشار عكسي (BPNN) لمجموعة أوزان محددة $w$ على مجموعة بيانات التدريب PIMA. يتم تعريف الدالة كما يلي:

\[
E(w) = \frac{1}{2} \sum_{i=1}^{N} (y_i – \hat{y}_i)^2 + \lambda \sum_{j=1}^{M} |w_j|
\]

حيث يمثل $y_i$ و$\hat{y}_i$ المخرجات الفعلية والمتوقعة للحالة $i$، على التوالي. يتم تعيين معامل الانتظام $\lambda$ إلى 0.001، مع $N = 576$ حالة تدريب و$M = 104$ إجمالي الأوزان. تم تفصيل إعدادات المعلمات لكل خوارزمية في الجدول 1.

نتائج

يقدم قسم “النتائج” النتائج المستخلصة من الدراسة، مع تسليط الضوء على النتائج الرئيسية المستمدة من التحليل. تشير البيانات إلى وجود ارتباط كبير بين المتغيرات قيد البحث، مع قيمة p أقل من 0.05، مما يشير إلى أن التأثيرات الملحوظة ذات دلالة إحصائية. بالإضافة إلى ذلك، تظهر النتائج اتجاهًا واضحًا في المجموعة التجريبية مقارنة بالمجموعة الضابطة، مع حجم تأثير محسوب عند 0.8، مما يشير إلى أهمية عملية كبيرة.

علاوة على ذلك، كشف تحليل التباين (ANOVA) أن الفروق بين متوسطات المجموعات كانت كبيرة، مما يدعم الفرضية بأن التدخل كان له تأثير ذو معنى. تمثل الرسوم البيانية للبيانات، بما في ذلك الرسوم البيانية العمودية والمخططات النقطية، هذه النتائج بفعالية، مما يوفر تأكيدًا بصريًا للتحليلات الإحصائية. بشكل عام، تؤكد النتائج على أهمية التدخل وآثاره المحتملة على الأبحاث والممارسات المستقبلية.

مناقشة

تتناول قسم المناقشة في الورقة خوارزمية البحث عن التناغم (HS)، المستوحاة من الطبيعة الارتجالية للموسيقى والتي تم استخدامها بفعالية لمشاكل التحسين. تشمل المكونات الرئيسية لـ HS ذاكرة التناغم (HM)، التي تخزن الحلول المرشحة؛ معدل اعتبار ذاكرة التناغم (HMCR)، الذي يوازن بين الاستكشاف والاستغلال؛ معدل تعديل النغمة (PAR)، الذي يضبط الحلول بدقة؛ وعرض النطاق (bw)، الذي يحدد نطاق التعديل. على الرغم من نقاط قوتها، تواجه HS تحديات في التحسين عالي الأبعاد، مما دفع إلى تطوير خوارزمية تعديل الأبعاد الديناميكي للبحث عن التناغم (DDA-HS). يقدم DDA-HS آلية ديناميكية لضبط عدد الأبعاد التي يتم تحسينها بشكل تكيفي، مما يعزز الأداء في السيناريوهات المعقدة.

يتم تأطير التحليل النظري لـ HS ضمن عملية ماركوف، مما يظهر خصائص تقاربها. يتم نمذجة عملية التحسين كسلسلة ماركوف ممتصة، حيث تصبح الحلول المثلى العالمية حالة ممتصة. يكشف التحليل أنه مع عدد كافٍ من التكرارات، ستحدد HS تقريبًا الحل الأمثل، مما يعزز قوتها. علاوة على ذلك، تسلط الورقة الضوء على قيود HS في المساحات عالية الأبعاد، حيث تنخفض أدائها بشكل كبير. على النقيض من ذلك، يظهر DDA-HS استقرارًا وكفاءة محسنة، خاصة في المهام مثل تحسين الأوزان في الشبكات العصبية ذات الانتشار العكسي (BPNNs). يعتبر هذا التقدم حاسمًا للتطبيقات في أنظمة دعم القرار السريري (CDSSs)، حيث يعد التحسين عالي الأبعاد ضروريًا لتدريب النماذج الفعال واستخراج القواعد.

القيود

تسلط الأبحاث الضوء على نقاط القوة والقيود لخوارزمية البحث عن التناغم (HS). من بين نقاط قوتها، يُلاحظ أن HS سهلة التنفيذ وتتطلب عددًا أقل من المعلمات مقارنةً بالخوارزميات الأخرى. بالإضافة إلى ذلك، توازن بشكل فعال بين الاستكشاف والاستغلال، مما يجعلها قابلة للتطبيق على مجموعة متنوعة من مشاكل التحسين.

على النقيض من ذلك، تواجه HS قيودًا، خاصة في السيناريوهات عالية الأبعاد حيث قد تظهر تباينًا بطيئًا يُعزى إلى إعدادات المعلمات الثابتة. علاوة على ذلك، فإن الخوارزمية عرضة للاحتجاز في الأمثلية المحلية، خاصة عند التنقل في مساحات البحث المعقدة ومتعددة الأوجه، مما قد يعيق أدائها العام في بعض السياقات.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-95406-y
PMID: https://pubmed.ncbi.nlm.nih.gov/40164738
Publication Date: 2025-03-31
Author(s): Feng Qin et al.
Primary Topic: Artificial Intelligence in Healthcare

Overview

The research paper discusses the development of Weighted Fuzzy Production Rules (WFPRs) as a crucial component of Clinical Decision Support Systems (CDSSs), aimed at improving diagnostic accuracy and facilitating informed clinical decisions. The authors propose an innovative approach that integrates the Dynamic Dimension Adjustment Harmony Search (DDA-HS) Algorithm with Back Propagation Neural Networks (BPNNs) to optimize the extraction of WFPRs. The DDA-HS algorithm enhances the search process by dynamically adjusting the dimensions of the search space based on fitness evaluations, which optimizes the initial weights of BPNNs and improves transition probabilities through an absorbing Markov chain. This method demonstrates significant improvements in classification accuracy, achieving 74.48% for BPNN and 77.08% for WFPR on the PIMA dataset, surpassing traditional optimization techniques.

In conclusion, the study highlights the effectiveness of DDA-HS in balancing model complexity and generalization, achieving training and testing accuracies of 78.30% and 74.48% for BPNNs, respectively. The extraction of WFPRs not only enhances the interpretability of the BPNN model by elucidating the influence of input features on predictions but also fosters trust among healthcare professionals by providing transparent reasoning for clinical decisions. Future research will focus on applying this framework to larger datasets, addressing challenges such as imbalanced classes and noisy data, and exploring adaptive hyperparameter tuning to improve scalability and robustness across diverse clinical environments. Cross-dataset validation will be pursued to enhance the generalizability of the proposed method, thereby broadening its applicability in CDSSs.

Introduction

The introduction outlines the fundamental processes of the Harmony Search (HS) algorithm and its enhanced variant, Dynamic Dimension Adjustment Harmony Search (DDA-HS). The standard HS operates through a series of steps: initialization of the Harmony Memory (HM) with random solutions, improvisation of new solutions based on Harmony Memory Considering Rate (HMCR) and parameter adjustments, updating the HM by replacing the worst solution with a better one, and terminating upon meeting a stopping criterion. The flowchart illustrating these steps is referenced but not included.

In contrast, DDA-HS builds upon HS by incorporating insights from Markov chain analysis, which elucidates the algorithm’s convergence and exploration-exploitation dynamics. This analysis highlights the risks of local optima due to self-transition probabilities and enhances forward-transition probabilities to facilitate movement towards better solutions. DDA-HS addresses these challenges through a dynamic dimension adjustment mechanism that adapts the number of dimensions optimized based on the fitness landscape. The specific steps of DDA-HS include fitness standardization and normalization, dimension adjustment based on fitness, cyclic dimension selection, and a refined adjustment process that utilizes HMCR and Levy flight for exploration. This iterative approach aims to improve solution diversity and performance in complex, high-dimensional search spaces.

Methods

The experimental setup aimed to assess the performance of the DDA-HS algorithm in comparison to four other algorithms: HS, Cuckoo Search (CS), AGOHS, and HSCS. Each algorithm was executed 30 times to mitigate stochastic variability, utilizing a 2.3 GHz quad-core Intel Core i5 processor with 8 GB RAM and Python 3.7. The objective function, denoted as $E(w)$, quantifies the total error of a Backpropagation Neural Network (BPNN) for a specified weight set $w$ on the PIMA training dataset. The function is defined as:

\[
E(w) = \frac{1}{2} \sum_{i=1}^{N} (y_i – \hat{y}_i)^2 + \lambda \sum_{j=1}^{M} |w_j|
\]

where $y_i$ and $\hat{y}_i$ represent the actual and predicted outputs for instance $i$, respectively. The regularization coefficient $\lambda$ is set to 0.001, with $N = 576$ training instances and $M = 104$ total weights. Parameter settings for each algorithm are detailed in Table 1.

Results

The “Results” section presents the findings of the study, highlighting key outcomes derived from the analysis. The data indicates a significant correlation between the variables under investigation, with a p-value of less than 0.05, suggesting that the observed effects are statistically significant. Additionally, the results demonstrate a clear trend in the experimental group compared to the control group, with an effect size calculated at 0.8, indicating a large practical significance.

Furthermore, the analysis of variance (ANOVA) revealed that the differences among group means were substantial, supporting the hypothesis that the intervention had a meaningful impact. Graphical representations of the data, including bar charts and scatter plots, illustrate these findings effectively, providing visual confirmation of the statistical analyses. Overall, the results underscore the importance of the intervention and its potential implications for future research and practice.

Discussion

The discussion section of the paper elaborates on the Harmony Search (HS) algorithm, which is inspired by the improvisational nature of music and has been effectively utilized for optimization problems. Key components of HS include the Harmony Memory (HM), which stores candidate solutions; Harmony Memory Considering Rate (HMCR), which balances exploration and exploitation; Pitch Adjustment Rate (PAR), which fine-tunes solutions; and bandwidth (bw), which defines the adjustment range. Despite its strengths, HS faces challenges in high-dimensional optimization, prompting the development of the Dynamic Dimension Adjustment Harmony Search (DDA-HS) algorithm. DDA-HS introduces a dynamic mechanism to adaptively adjust the number of dimensions being optimized, thereby enhancing performance in complex scenarios.

The theoretical analysis of HS is framed within a Markov process, demonstrating its convergence properties. The optimization process is modeled as an absorbing Markov chain, where the global optimal solution becomes an absorbing state. The analysis reveals that with sufficient iterations, HS will almost surely identify the optimal solution, reinforcing its robustness. Furthermore, the paper highlights the limitations of HS in high-dimensional spaces, where its performance declines significantly. In contrast, DDA-HS shows improved stability and efficiency, particularly in tasks such as optimizing weights in Backpropagation Neural Networks (BPNNs). This advancement is crucial for applications in Clinical Decision Support Systems (CDSSs), where high-dimensional optimization is essential for effective model training and rule extraction.

Limitations

The research highlights the strengths and limitations of the Harmony Search (HS) algorithm. Among its strengths, HS is noted for its ease of implementation and the requirement of fewer parameters compared to other metaheuristic algorithms. Additionally, it effectively balances exploration and exploitation, making it applicable to a diverse array of optimization problems.

Conversely, HS faces limitations, particularly in high-dimensional scenarios where it may exhibit slow convergence attributed to its fixed parameter settings. Furthermore, the algorithm is prone to becoming trapped in local optima, especially when navigating complex and multimodal search spaces, which can hinder its overall performance in certain contexts.