تشخيص الزرق باستخدام OCT ومراحل الزرق باستخدام التعلم الآلي القابل للتفسير OCT-based diagnosis of glaucoma and glaucoma stages using explainable machine learning

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-87219-w
PMID: https://pubmed.ncbi.nlm.nih.gov/39875492
تاريخ النشر: 2025-01-28
المؤلف: Md. Mahmudul Hasan وآخرون
الموضوع الرئيسي: الجلوكوما واضطرابات الشبكية

نظرة عامة

تتناول البحث التحدي الصحي المتزايد الذي تسببه الزرق، مع تسليط الضوء على قيود طرق التشخيص الآلي الحالية التي تعتمد على نماذج التعلم العميق غير الشفافة. لتعزيز القابلية للتفسير والثقة في تشخيص الزرق، طورت الدراسة أداة ذكاء اصطناعي قابلة للتفسير (XAI) تستخدم صور التصوير المقطعي البصري (OCT). تتكون مجموعة البيانات من 334 عينًا طبيعية و268 عينًا زرقية، مصنفة إلى مراحل مبكرة ومتوسطة ومتقدمة. استخدمت الدراسة نظرية معالجة الإشارات وقامت بتقييم دقة تفسير النموذج بدقة من خلال SHapley Additive exPlanations (SHAP) وتحليل الاعتماد الجزئي (PDA).

أظهرت أداة XAI الناتجة أداءً مثيرًا للإعجاب، حيث حققت درجات منطقة تحت المنحنى (AUC) تبلغ 0.96 و0.98 و1.00 لتفريق الزرق المبكر والمتوسط والمتقدم، على التوالي. ومن الجدير بالذكر أن نماذج التعلم الآلي تفوقت على الأطباء بنسبة 10.4% إلى 11.2% في الدقة لتشخيص الزرق في مراحله المبكرة وعامة. تقدم الأداة البرمجية سهلة الاستخدام التي تم تطويرها في هذه الدراسة رؤى شفافة وقابلة للتفسير، مما يجعلها موردًا قيمًا لممارسي رعاية العيون لتعزيز اتخاذ القرارات السريرية.

مقدمة

في هذا القسم، يناقش المؤلفون استخراج ميزات المجال المكاني والترددي من صور التصوير المقطعي البصري (OCT)، مع التركيز بشكل خاص على طبقات الشبكية مثل طبقة ألياف الأعصاب الشبكية (RNFL) وطبقة خلايا العقدة-الطبقة الداخلية (GC-IPL) وسماكة البقعة. تعتبر هذه الميزات في المجال المكاني حاسمة لتفريق بين العيون الصحية، والمشتبه في إصابتها بالزرق، والعيون الزرقية، مع درجات متفاوتة من الترابط بين المعلمات. باستخدام برنامج CIRRUS HD-OCT، استخرجت الدراسة بيانات عددية تتعلق بتحليل رأس العصب البصري (ONH) وتحليل RNFL وتحليل سماكة البقعة، وحساب ميزات إحصائية مثل المتوسط، والوسيط، والانحراف المعياري، والانحراف، والكراتوز، بالإضافة إلى مقاييس قائمة على نظرية المعلومات مثل إنتروبيا شانون ومعلومات فيشر.

بالإضافة إلى ذلك، قام المؤلفون بتحويل بيانات المجال المكاني إلى المجال الترددي باستخدام تحويل فورييه السريع (FFT)، مما سمح باستخراج ميزات المجال الترددي، بما في ذلك كثافة الطيف الطاقي (PSD) وإنتروبيا الطيف. كشفت التحليلات عن نمط ‘قمة مزدوجة’ مميز في بيانات سماكة RNFL، يمثل المناطق العلوية والسفلية. تؤكد الدراسة على أهمية كل من ميزات المجال المكاني والترددي في فهم العلاقة الهيكلية-الوظيفية في الزرق، كما هو موضح من خلال أشكال وجداول تكميلية متنوعة.

طرق

يستعرض قسم “المواد والطرق” تصميم التجربة والإجراءات المستخدمة في الدراسة. يوضح المواد المحددة المستخدمة، بما في ذلك أي مواد كيميائية، ومعدات، وعينات بيولوجية، لضمان إمكانية تكرار البحث. تشمل المنهجية التقنيات المستخدمة لجمع البيانات وتحليلها، مع تحديد أي طرق إحصائية تم تطبيقها لتفسير النتائج.

بالإضافة إلى ذلك، قد يصف القسم إعداد التجربة، بما في ذلك ظروف التحكم وأي متغيرات تم التلاعب بها خلال الدراسة. يسمح هذا النهج الشامل بفهم واضح لكيفية اشتقاق النتائج ويدعم صحة الاستنتاجات المستخلصة من البحث.

نتائج

يقدم قسم النتائج النتائج المتعلقة بتطبيق تقنيات التعلم الآلي القابلة للتفسير. تؤكد الدراسة على أهمية القابلية للتفسير في نماذج التعلم الآلي، مع عرض طرق متنوعة تم استخدامها لتوضيح قرارات النموذج. تشير النتائج الرئيسية إلى أن دمج أساليب الذكاء الاصطناعي القابلة للتفسير يعزز بشكل كبير ثقة المستخدم وفهمه لتوقعات النموذج، خاصة في مجموعات البيانات المعقدة.

علاوة على ذلك، تكشف التحليلات أن بعض تقنيات القابلية للتفسير، مثل SHAP (SHapley Additive exPlanations) وLIME (تفسيرات نموذجية محلية قابلة للتفسير)، تبرز بفعالية مساهمات الميزات الفردية في الناتج العام للنموذج. تؤكد هذه النتائج على إمكانيات التعلم الآلي القابل للتفسير لتحسين عمليات اتخاذ القرار عبر تطبيقات متنوعة، مما يعزز في النهاية الشفافية الأكبر في الأنظمة الآلية.

مناقشة

يستعرض قسم المناقشة في ورقة البحث المنهجية والنتائج لدراسة مقطعية أجريت في مركز صحة العين (CFEH) في سيدني، أستراليا، مع التركيز على تشخيص الزرق باستخدام تقنيات التعلم الآلي (ML). التزمت الدراسة بالإرشادات الأخلاقية وشملت جمع البيانات من المرضى الذين تم تشخيصهم بمراحل مختلفة من الزرق بين عامي 2015 و2021. تم تصنيف المرضى بناءً على الانحراف المتوسط (MD) لحقول الرؤية إلى مجموعات صحية، وزرق مبكر، وزرق متوسط، وزرق متقدم. استخدمت الدراسة بروتوكولات شاملة لفحص الزرق، وتم استخدام تقنيات استيفاء البيانات للتعامل مع العيوب في البيانات، مما يضمن سلامة التحليل.

تم إجراء اختيار الميزات باستخدام طرق تصفية أحادية المتغير، وتحديدًا اختبار ANOVA F وارتباط بيرسون، لاستبعاد الميزات غير ذات الصلة وتعزيز دقة النموذج. تم استخدام ثلاثة نماذج تعلم تحت إشراف—K-Nearest Neighbour (KNN)، وآلات الدعم الناقل (SVM)، وغابات عشوائية (RF)—للتصنيف، حيث أظهرت RF أداءً متفوقًا في تمييز مراحل الزرق، محققة منطقة تحت المنحنى (AUC) تبلغ 0.96 للزرق المتقدم. كما نفذت الدراسة SHapley Additive exPlanations (SHAP) لتفسير النموذج، مما يسمح للأطباء بفهم مساهمات الميزات المختلفة في التوقعات. بشكل عام، تشير النتائج إلى أن نماذج التعلم الآلي، وخاصة RF، يمكن أن تصنف مراحل الزرق بفعالية، مما يوفر أداة تشخيصية قيمة للأطباء.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-87219-w
PMID: https://pubmed.ncbi.nlm.nih.gov/39875492
Publication Date: 2025-01-28
Author(s): Md. Mahmudul Hasan et al.
Primary Topic: Glaucoma and retinal disorders

Overview

The research addresses the increasing health challenge posed by glaucoma, highlighting the limitations of current automated diagnostic methods that rely on opaque deep learning models. To enhance explainability and trustworthiness in glaucoma diagnosis, the study developed an explainable artificial intelligence (XAI) tool utilizing optical coherence tomography (OCT) images. The dataset comprised 334 normal and 268 glaucomatous eyes, categorized into early, moderate, and advanced stages. The study employed signal processing theory and rigorously evaluated model interpretability through SHapley Additive exPlanations (SHAP) and partial dependency analysis (PDA).

The resulting XAI tool demonstrated impressive performance, achieving area under the curve (AUC) scores of 0.96, 0.98, and 1.00 for differentiating early, moderate, and advanced glaucoma, respectively. Notably, the machine learning models outperformed clinicians by 10.4% to 11.2% in accuracy for early-stage and overall glaucoma diagnosis. The user-friendly software tool developed in this study offers transparent and interpretable insights, positioning it as a valuable resource for eye care practitioners to enhance clinical decision-making.

Introduction

In this section, the authors discuss the extraction of spatial and frequency domain features from optical coherence tomography (OCT) images, specifically focusing on retinal layers such as the retinal nerve fiber layer (RNFL), ganglion cell-inner plexiform layer (GC-IPL), and macular thickness. These spatial domain features are critical for differentiating between healthy eyes, glaucoma suspects, and glaucomatous eyes, with varying degrees of inter-parameter correlation. Using CIRRUS HD-OCT software, the study extracted numerical data related to optic nerve head (ONH) analysis, RNFL analysis, and macular thickness analysis, computing statistical features such as mean, median, standard deviation, skewness, and kurtosis, as well as information theory-based metrics like Shannon entropy and Fisher information.

Additionally, the authors transformed the spatial domain data into the frequency domain using fast Fourier transform (FFT), allowing for the extraction of frequency-domain features, including power spectral density (PSD) and spectral entropy. The analysis revealed a characteristic ‘double hump pattern’ in the RNFL thickness data, representing the superior and inferior regions. The study emphasizes the importance of both spatial and frequency domain features in understanding the structural-functional relationship in glaucoma, as illustrated through various supplementary figures and tables.

Methods

The “Materials and Methods” section outlines the experimental design and procedures employed in the study. It details the specific materials used, including any reagents, equipment, and biological samples, ensuring reproducibility of the research. The methodology encompasses the techniques for data collection and analysis, specifying any statistical methods applied to interpret the results.

Additionally, the section may describe the experimental setup, including control conditions and any variables manipulated during the study. This comprehensive approach allows for a clear understanding of how the findings were derived and supports the validity of the conclusions drawn from the research.

Results

The results section presents findings related to the application of explainable machine learning (ML) techniques. The study emphasizes the importance of interpretability in ML models, showcasing various methods employed to elucidate model decisions. Key results indicate that the integration of explainable AI approaches significantly enhances user trust and understanding of model predictions, particularly in complex datasets.

Furthermore, the analysis reveals that certain explainability techniques, such as SHAP (SHapley Additive exPlanations) and LIME (Local Interpretable Model-agnostic Explanations), effectively highlight the contributions of individual features to the overall model output. These findings underscore the potential of explainable ML to improve decision-making processes across various applications, ultimately fostering greater transparency in automated systems.

Discussion

The discussion section of the research paper outlines the methodology and findings of a cross-sectional study conducted at the Centre for Eye Health (CFEH) in Sydney, Australia, focusing on glaucoma diagnosis using machine learning (ML) techniques. The study adhered to ethical guidelines and involved data collection from patients diagnosed with varying stages of glaucoma between 2015 and 2021. Patients were categorized based on the mean deviation (MD) of visual fields into healthy, early, moderate, and advanced glaucoma groups. The study utilized comprehensive glaucoma examination protocols, and data imputation techniques were employed to handle artefacts in the data, ensuring the integrity of the analysis.

Feature selection was performed using univariate filter methods, specifically ANOVA F Test and Pearson’s correlation, to eliminate irrelevant features and enhance model accuracy. Three supervised learning models—K-Nearest Neighbour (KNN), Support Vector Machines (SVM), and Random Forests (RF)—were employed for classification, with RF demonstrating superior performance in distinguishing between glaucoma stages, achieving an area under the curve (AUC) of 0.96 for advanced glaucoma. The study also implemented SHapley Additive exPlanations (SHAP) for model interpretability, allowing clinicians to understand the contributions of various features to the predictions. Overall, the findings suggest that ML models, particularly RF, can effectively classify glaucoma stages, providing a valuable diagnostic tool for clinicians.