تعزيز اكتشاف العلامات العينية: تقسيم استراتيجي قائم على الذكاء الاصطناعي لتحسين الدقة وحماية الخصوصية Enhancing ocular sign detection: AI-based strategic segmentation for improved accuracy and privacy protection

المجلة: npj Digital Medicine، المجلد: 9، العدد: 1
DOI: https://doi.org/10.1038/s41746-025-02310-w
PMID: https://pubmed.ncbi.nlm.nih.gov/41495351
تاريخ النشر: 2026-01-06
المؤلف: Chaoyu Lei وآخرون
الموضوع الرئيسي: تصوير الشبكية والتحليل

نظرة عامة

في هذه الدراسة، نتناول الحاجة الملحة للكشف الدقيق عن العلامات العينية للتشخيص المبكر لأمراض العين مع ضمان خصوصية المريض. غالبًا ما تتضمن طرق الذكاء الاصطناعي الحالية التي تستخدم صور الوجه أو الصور العينية الخارجية معلومات زائدة يمكن أن تؤثر على الأداء والخصوصية. لمواجهة ذلك، أجرينا تحليلًا استعاديًا متعدد الجنسيات شمل 2,360 عينًا من 1,180 صورة نصف وجه لمرضى داء الغدة الدرقية العينية عبر مجموعات عرقية متنوعة ومستشفيات متعددة. قدمنا شبكة ضغط وتحفيز كثيفة (DSE-Net) لتجزئة الميزات العينية الرئيسية، محققين درجات Dice مثيرة للإعجاب بلغت 84.7%، 84.8%، 92.7%، و95.1%، متفوقين بذلك على سبعة نماذج تجزئة أخرى.

علاوة على ذلك، طورنا SegmenView، الذي يدمج هياكل متنوعة بما في ذلك LeNet وAlexNet وResNet50 وVGGNet16 للكشف عن علامات عينية محددة مثل وذمة الجفن والتهاب الملتحمة. أظهر SegmenView درجات منطقة تحت المنحنى (AUC) الداخلية بلغت 71.09%، 80.81%، 90.07%، و82.86%، مع AUCs الخارجية تتراوح من 55.58% إلى 84.29% عبر مجموعتي اختبار، متجاوزًا النماذج التقليدية لنصف الوجه والمناطق المحيطة بالعين. سلطت مقارناتنا مع أربع طرق للحفاظ على الخصوصية الضوء على قدرة SegmenView الفائقة في الحفاظ على التوازن بين الخصوصية ودقة التشخيص. بالإضافة إلى ذلك، عزز استخدام رسم خرائط تنشيط الفئة المعتمد على التدرج (Grad-CAM) من قابلية تفسير نموذجنا، مما يبرز إمكانيته للكشف الفعال عن العلامات العينية مع مراعاة الخصوصية.

مقدمة

تسلط المقدمة الضوء على العبء الكبير الذي تفرضه الأمراض العينية على الرعاية الصحية العالمية، حيث يعاني حوالي 0.5% من السكان من العمى وأكثر من 200 مليون فرد يعانون من ضعف بصري معتدل إلى شديد. يمكن أن تعكس التدخلات المبكرة أكثر من مليار حالة من فقدان الرؤية، مما يبرز ضرورة التشخيص الدقيق وفي الوقت المناسب. العديد من الحالات العينية، مثل تلك التي تؤثر على الجفن، الملتحمة، الكارنكول، وكرات العين، تظهر علامات مبكرة غالبًا ما تكون خفية وتعتمد على الحكم السريري الذاتي، مما يعقد جهود الكشف.

لمعالجة هذه التحديات، تقترح الدراسة استخدام الذكاء الاصطناعي (AI) لتعزيز اتساق التشخيص وتقليل الاعتماد على خبرة الأطباء. ومع ذلك، فإن طرق الذكاء الاصطناعي الحالية للكشف عن العلامات العينية غالبًا ما تستخدم صور الوجه، مما يثير مخاوف الخصوصية بموجب لوائح مثل اللائحة العامة لحماية البيانات في الاتحاد الأوروبي (GDPR). تقدم الدراسة SegmenView، وهو نهج تجزئة جديد يستخدم شبكة ضغط وتحفيز كثيفة (DSE-Net) لعزل المناطق العينية الرئيسية للكشف عن العلامات المرتبطة بداء الغدة الدرقية العينية (TED)، وهي حالة تؤثر على 25-50% من مرضى داء غريفز. تم تحسين أداء النموذج باستخدام هياكل متنوعة من الشبكات العصبية التلافيفية (CNN) وتم التحقق منها عبر بيانات مرضى متنوعة، مما يضمن القوة والعمومية. بالإضافة إلى ذلك، تتضمن الدراسة تقنيات للحفاظ على الخصوصية وتعزز قابلية تفسير النموذج من خلال التصورات باستخدام رسم خرائط تنشيط الفئة المعتمد على التدرج (Grad-CAM).

طرق

في هذا القسم، يوضح المؤلفون الطرق المستخدمة لتقييم أداء تقنيات الحفاظ على الخصوصية المختلفة مقارنةً بـ SegmenView. نفذت الدراسة أربع طرق تشفير كلاسيكية: خلط البكسلات، الخرائط اللوجستية الفوضوية، تشفير XOR، والتشفير المتجانس. لتعزيز مجموعة البيانات، تم تطبيق تقنيات زيادة البيانات مثل التدوير العشوائي، والانعكاس الأفقي، وتعديلات السطوع. بعد ذلك، تم استخدام تحويل الموجات لاستخراج ميزات القوام، وتم تحويل صور المرضى إلى فضاء اللون HSV لتتوافق مع الإدراك البصري البشري. تم حساب وتطبيع المدرجات لكل قناة، مما أسفر عن متجه ميزات مشترك تم تشفيره باستخدام الطرق المذكورة أعلاه.

أشارت نتائج الكشف إلى أن أداء هذه الطرق للحفاظ على الخصوصية كان عمومًا أدنى من SegmenView، حيث لم تتجاوز قيم منطقة تحت المنحنى (AUC) 0.65. على وجه التحديد، أسفر خلط البكسلات عن AUCs بلغت 42.78، 43.81، 46.89، و62.89 لحالات مختلفة، بينما أنتجت الخرائط اللوجستية الفوضوية AUCs بلغت 60.81، 51.88، 57.91، و52.73. أسفر تشفير XOR عن AUCs بلغت 51.48، 50.47، 46.59، و52.58، وأظهر التشفير المتجانس AUCs بلغت 50.67، 54.81، 49.27، و53.24. تم تقديم نتائج مفصلة لمجموعات اختبار خارجية في الجدول التكميلي 2.

نتائج

في قسم النتائج، تقدم الدراسة قدرات الكشف عن SegmenView لمختلف الحالات العينية، بما في ذلك وذمة الجفن، التهاب الملتحمة، وذمة الكارنكول أو الطية، وزيادة بروز العين. تشير النتائج إلى أن SegmenView يظهر دقة عالية في تحديد هذه الحالات، على الرغم من عدم تفصيل مقاييس محددة مثل الحساسية، النوعية، أو منطقة تحت المنحنى (AUC) في النص المقدم. تشير تداعيات هذه النتائج إلى أن SegmenView يمكن أن يكون أداة قيمة في الإعدادات السريرية لتقييم صحة العين. قد تكون هناك حاجة إلى مزيد من التحليل الكمي والمقارنات مع طرق التشخيص الحالية لتحديد فعاليته بالكامل.

مناقشة

في هذه الدراسة، طور المؤلفون وصادقوا على SegmenView، وهو نموذج مصمم للكشف عن العلامات العينية لداء الغدة الدرقية العينية (TED). تتكون مجموعة البيانات من 1,160 صورة مريض من خمسة مستشفيات، مع اختلافات كبيرة في معدلات حدوث العلامات العينية عبر مجموعات التدريب والاختبار. أظهر نموذج التجزئة، DSE-Net، أداءً متفوقًا مع أقل عدد من المعلمات (11.02M) بين ثمانية شبكات تم تقييمها، محققًا درجات عالية من معامل Dice (على سبيل المثال، 92.7% للكارنكول الدمعي) وقابلية عامة قوية عبر مجموعات عرقية متنوعة وأجهزة تصوير. تعزز دقة النموذج في عزل مناطق الآفة من دقة الكشف مع تقليل مخاطر الخصوصية من خلال التركيز فقط على المناطق ذات الصلة سريريًا.

تفوق SegmenView على النماذج التقليدية لنصف الوجه والمناطق المحيطة بالعين في الكشف عن العلامات العينية، كما يتضح من قيم منطقة تحت المنحنى (AUC) الأعلى عبر مجموعات بيانات متعددة. أكدت تصورات Grad-CAM أن SegmenView يركز بفعالية على الهياكل التشريحية ذات الصلة، على عكس النماذج الأوسع التي تضمنت مناطق غير ذات صلة، مما قد يخفف من حساسية الكشف. أكد المؤلفون على إمكانيات النموذج للتكامل في سير العمل السريري، لا سيما في الرعاية الأولية والطب عن بُعد، حيث يمكن أن يسهل الكشف المبكر عن التغيرات العينية الخفية. ومع ذلك، اعترفت الدراسة بالقيود، بما في ذلك التركيبة السكانية الصينية بشكل أساسي والتركيز على أربعة علامات عينية فقط، مما يشير إلى الحاجة إلى مزيد من التحقق والتوسع في قدرات النموذج في الأبحاث المستقبلية.

Journal: npj Digital Medicine, Volume: 9, Issue: 1
DOI: https://doi.org/10.1038/s41746-025-02310-w
PMID: https://pubmed.ncbi.nlm.nih.gov/41495351
Publication Date: 2026-01-06
Author(s): Chaoyu Lei et al.
Primary Topic: Retinal Imaging and Analysis

Overview

In this study, we address the critical need for accurate detection of ocular signs for early diagnosis of eye diseases while ensuring patient privacy. Current AI methods utilizing facial or external ocular images often include extraneous information that can compromise both performance and privacy. To tackle this, we conducted a multinational retrospective analysis involving 2,360 eyes from 1,180 half-face images of thyroid eye disease patients across diverse racial groups and multiple hospitals. We introduced a Dense Squeeze-and-Excitation Network (DSE-Net) for segmenting key ocular features, achieving impressive Dice scores of 84.7%, 84.8%, 92.7%, and 95.1%, thereby outperforming seven other segmentation models.

Furthermore, we developed SegmenView, which integrates various architectures including LeNet, AlexNet, ResNet50, and VGGNet16 to detect specific ocular signs such as eyelid edema and conjunctival erythema. SegmenView demonstrated internal area under the curve (AUC) scores of 71.09%, 80.81%, 90.07%, and 82.86%, with external AUCs ranging from 55.58% to 84.29% across two test datasets, surpassing traditional half-face and periocular models. Our comparisons with four privacy-preserving methods highlighted SegmenView’s superior capability to maintain a balance between privacy and diagnostic accuracy. Additionally, the use of Gradient-weighted Class Activation Mapping (Grad-CAM) enhanced the interpretability of our model, underscoring its potential for effective, privacy-conscious ocular sign detection.

Introduction

The introduction highlights the significant global healthcare burden posed by ophthalmic diseases, with approximately 0.5% of the population experiencing blindness and over 200 million individuals suffering from moderate to severe visual impairment. Early intervention can reverse more than one billion cases of vision loss, underscoring the necessity for timely and accurate diagnosis. Many ophthalmic conditions, such as those affecting the eyelid, conjunctiva, lacrimal caruncle, and eyeball, present early signs that are often subtle and reliant on subjective clinical judgment, complicating detection efforts.

To address these challenges, the study proposes the use of artificial intelligence (AI) to enhance diagnostic consistency and reduce dependence on physician experience. However, existing AI methods for ocular sign detection frequently utilize facial images, raising privacy concerns under regulations like the European Union’s General Data Protection Regulation (GDPR). The study introduces SegmenView, a novel segmentation approach utilizing a Dense Squeeze-and-Excitation Network (DSE-Net) to isolate key ocular regions for detecting signs associated with Thyroid Eye Disease (TED), a condition affecting 25-50% of Graves’ disease patients. The model’s performance was optimized using various Convolutional Neural Network (CNN) architectures and validated across diverse patient data, ensuring robustness and generalizability. Additionally, the study incorporates privacy-preserving techniques and enhances model interpretability through visualizations using Gradient-weighted Class Activation Mapping (Grad-CAM).

Methods

In this section, the authors detail the methods employed to evaluate the performance of various privacy-preserving techniques compared to SegmenView. The study implemented four classical encryption approaches: pixel scrambling, chaotic logistic maps, XOR encryption, and homomorphic encryption. To enhance the dataset, data augmentation techniques such as random rotations, horizontal flipping, and brightness adjustments were applied. Subsequently, a wavelet transform was utilized to extract texture features, and patient images were converted to the HSV color space to align with human visual perception. Histograms for each channel were computed and normalized, resulting in a joint feature vector that was encrypted using the aforementioned methods.

The detection results indicated that the performance of these privacy-preserving methods was generally inferior to SegmenView, with Area Under the Curve (AUC) values not exceeding 0.65. Specifically, pixel scrambling yielded AUCs of 42.78, 43.81, 46.89, and 62.89 for various conditions, while chaotic logistic maps produced AUCs of 60.81, 51.88, 57.91, and 52.73. XOR encryption resulted in AUCs of 51.48, 50.47, 46.59, and 52.58, and homomorphic encryption showed AUCs of 50.67, 54.81, 49.27, and 53.24. Detailed results for external test datasets are provided in Supplementary Table 2.

Results

In the results section, the study presents the detection capabilities of SegmenView for various ocular conditions, including eyelid edema, conjunctival erythema, caruncle or plica edema, and exophthalmos. The findings indicate that SegmenView demonstrates high accuracy in identifying these conditions, although specific metrics such as sensitivity, specificity, or area under the curve (AUC) are not detailed in the provided text. The implications of these results suggest that SegmenView could be a valuable tool in clinical settings for the assessment of ocular health. Further quantitative analysis and comparisons with existing diagnostic methods may be warranted to fully establish its efficacy.

Discussion

In this study, the authors developed and validated SegmenView, a model designed for detecting ocular signs of Thyroid Eye Disease (TED). The dataset comprised 1,160 patient images from five hospitals, with significant variations in ocular sign incidence rates across training and testing datasets. The segmentation model, DSE-Net, demonstrated superior performance with the lowest parameter count (11.02M) among eight evaluated networks, achieving high Dice Coefficient scores (e.g., 92.7% for lacrimal caruncle) and robust generalizability across diverse racial groups and imaging devices. The model’s precision in isolating lesion areas enhances detection accuracy while minimizing privacy risks by focusing solely on clinically relevant regions.

SegmenView outperformed traditional half-face and periocular models in detecting ocular signs, as evidenced by higher Area Under Curve (AUC) values across multiple datasets. The Grad-CAM visualizations confirmed that SegmenView effectively concentrated on pertinent anatomical structures, unlike broader models that included irrelevant regions, which could dilute detection sensitivity. The authors emphasized the model’s potential for integration into clinical workflows, particularly in primary care and telemedicine, where it could facilitate early detection of subtle ocular changes. However, the study acknowledged limitations, including a predominantly Chinese population and a focus on only four ocular signs, suggesting the need for further validation and expansion of the model’s capabilities in future research.