معالجة التصنيف غير المتسق باستخدام مطابقة الصور المتقاطعة لتقسيم الصور الطبية المعتمدة على الخربشة Addressing Inconsistent Labeling With Cross Image Matching for Scribble-Based Medical Image Segmentation

المجلة: IEEE Transactions on Image Processing، المجلد: 34
DOI: https://doi.org/10.1109/tip.2025.3530787
PMID: https://pubmed.ncbi.nlm.nih.gov/40031274
تاريخ النشر: 2025-01-01
المؤلف: Jingkun Chen وآخرون
الموضوع الرئيسي: تقنيات تقسيم الصور الطبية

نظرة عامة

في السنوات الأخيرة، اكتسب التعلم شبه المراقب زخمًا في تقسيم الصور الطبية، لا سيما من خلال استخدام تعليقات الخربشة لتقليل تكاليف التعليق. ومع ذلك، فإن التحديات التي تطرحها الطبيعة غير المكتملة والذاتية وغير القياسية لتسمية الخربشة تؤدي إلى تناقضات تعيق عملية التعلم وت degrade أداء التقسيم. لمواجهة هذه المشكلة، يقترح المؤلفون تطوير مجموعة مرجعية مصممة لمطابقة الميزات على مستوى البكسل. تم بناء هذه المجموعة من رموز محددة للفئة وميزات على مستوى البكسل مستمدة من صور متنوعة، وتعمل كمستودع يعرض أنماط بكسل وفئات مختلفة.

تسهل استراتيجية مطابقة الميزات على مستوى البكسل المقارنة الفعالة للبكسلات غير المعلّمة، مما يوفر إرشادات أساسية في السيناريوهات التي تتسم بالخربشات غير المتسقة وغير المكتملة. من خلال دمج تقنيات التنعيم والانحدار، تتماشى هذه الطريقة ميزات البكسل عبر صور مختلفة، مما يعزز قدرة الشبكة على تعلم أنماط متسقة من المجموعة المرجعية. تشير النتائج التجريبية على ثلاثة مجموعات بيانات متاحة للجمهور إلى أن هذه الطريقة تتفوق على التقنيات الحديثة من حيث دقة التقسيم والاستقرار. جعل المؤلفون رمزهم متاحًا للجمهور على https://github.com/jingkunchen/scribble-medical-segmentation.

طرق

في هذا القسم، يوضح المؤلفون إعدادهم التجريبي لتقييم طريقة تقسيم تطبق على الصور القلبية، مع التركيز على التحديات التي تطرحها تعقيد الصور وتنوع تعليقات الخربشة. استخدموا ثلاث مجموعات بيانات قلبية متميزة: ACDC و MS-CMRSeg Scribbles و MS-CMRSeg Challenge، كل منها يقدم خصائص فريدة وأنماط تعليق. تتضمن مجموعة بيانات ACDC، وهي معيار لتقسيم القلب، 200 مسح MRI ثلاثي الأبعاد من 100 مريض، مع التركيز على البطين الأيسر (LV) وعضلة القلب (myo) والبطين الأيمن (RV) مع تعليقات خربشة. تتكون مجموعة بيانات MS-CMRSeg Scribbles من 25 صورة ثلاثية الأبعاد عالية الدقة تم التعليق عليها إلى أربع فئات، بينما تحتوي مجموعة بيانات MS-CMRSeg Challenge على 90 صورة من وضعيتين، تم التعليق عليها لثلاث فئات.

لتعديل المعلمات الفائقة، اتبع المؤلفون تكوينًا من أعمال سابقة، حيث تم تعيين معامل وزن الخسارة $\alpha$ إلى 0.5. قاموا بتقييم طريقتهم مقابل مجموعة متنوعة من الأساليب المعروفة وأجروا تحليلًا مقارنًا مع طريقة MPLS تحت ظروف زيادة التحيز في تعليقات الخربشة. استخدمت التقييمات طريقة التحقق المتقاطع بخمسة أضعاف مع تقسيم 80% للتدريب و20% للاختبار عبر مجموعات البيانات، باستخدام معامل تشابه Dice (DSC) والنسبة المئوية 95 من مسافة هاوسدورف (95HD) كمقاييس للأداء. تم تقييم الأهمية الإحصائية للنتائج باستخدام اختبارات ويلكوكسون لرتبة المجموع لتحديد قيم p للاختلافات الملحوظة بين الطرق.

نقاش

في مناقشة ورقة البحث، يبرز المؤلفون إمكانيات تقسيم الصور المعتمد على الخربشة لتقليل عبء التعليق مع الحفاظ على دقة التقسيم. ومع ذلك، فإنهم يعترفون بأن النماذج المدربة باستخدام تعليقات الخربشة قد تؤدي أداءً أقل مقارنة بتلك التي تستخدم تعليقات كثيفة. لمعالجة ذلك، تم اقتراح طرق متنوعة، بما في ذلك الأساليب المعتمدة على الرسوم البيانية مثل تقسيم المشي العشوائي، وتقنيات متقدمة مثل تنظيم الحد الأدنى من الانتروبيا، وإطارات تنظيم CRF، وإطارات واعية بالشك. تهدف هذه الابتكارات إلى تعزيز أداء نماذج التقسيم في السيناريوهات التي تتوفر فيها تعليقات متناثرة فقط، لا سيما في مجالات معقدة مثل التصوير الطبي.

كما يبرز القسم التحديات المرتبطة بمطابقة تشابه الميزات في سياق تقسيم الصور الطبية المعتمد على الخربشة. غالبًا ما تعتمد تقنيات مطابقة الميزات على مستوى البكسل التقليدية على معلومات بصرية شاملة، والتي لا تتوفر مع تعليقات الخربشة المتناثرة. تعقد هذه القيود، جنبًا إلى جنب مع التعقيدات الفريدة للصور الطبية، مثل الأشكال والملمس غير المنتظم، فعالية هذه الأساليب. يجادل المؤلفون بأن الأساليب الحالية لمطابقة الميزات لا تعالج بشكل كافٍ القيود الجوهرية لتعليقات الخربشة، مما يمكن أن يؤدي إلى تمثيل ميزات غير مثالي ويعيق الأداء العام للتقسيم.

Journal: IEEE Transactions on Image Processing, Volume: 34
DOI: https://doi.org/10.1109/tip.2025.3530787
PMID: https://pubmed.ncbi.nlm.nih.gov/40031274
Publication Date: 2025-01-01
Author(s): Jingkun Chen et al.
Primary Topic: Medical Image Segmentation Techniques

Overview

In recent years, weakly-supervised learning has gained traction in medical image segmentation, particularly through the use of scribble annotations to lower annotation costs. However, the challenges posed by the incomplete, subjective, and non-standardized nature of scribble labeling lead to inconsistencies that hinder the learning process and degrade segmentation performance. To tackle this issue, the authors propose the development of a reference set designed for pixel-level feature matching. This set is constructed from class-specific tokens and pixel-level features derived from diverse images, serving as a repository that showcases various pixel styles and classes.

The proposed pixel-level feature matching strategy facilitates effective comparison of unlabeled pixels, providing essential guidance in scenarios marked by inconsistent and incomplete scribbles. By integrating smoothing and regression techniques, the approach aligns pixel-level features across different images, enhancing the network’s ability to learn consistent patterns from the reference set. Experimental results on three publicly available datasets indicate that this method outperforms state-of-the-art techniques in terms of segmentation accuracy and stability. The authors have made their code publicly available at https://github.com/jingkunchen/scribble-medical-segmentation.

Methods

In this section, the authors detail their experimental setup for evaluating a segmentation method applied to cardiac images, emphasizing the challenges posed by the complexity of the images and variability in scribble annotations. They utilized three distinct cardiac datasets: ACDC, MS-CMRSeg Scribbles, and MS-CMRSeg Challenge, each offering unique characteristics and annotation styles. The ACDC dataset, a benchmark for cardiac segmentation, includes 200 3D MRI scans from 100 patients, focusing on the left ventricle (LV), myocardium (myo), and right ventricle (RV) with scribble annotations. The MS-CMRSeg Scribbles dataset comprises 25 high-resolution 3D images annotated into four classes, while the MS-CMRSeg Challenge dataset features 90 images from two modalities, annotated for three classes.

For hyperparameter tuning, the authors followed a configuration from previous work, setting the loss weight parameter $\alpha$ to 0.5. They benchmarked their method against various established approaches and conducted a comparative analysis with the MPLS method under conditions of increased bias in scribble annotations. The evaluation employed five-fold cross-validation with an 80% training and 20% testing split across datasets, utilizing the Dice Similarity Coefficient (DSC) and the 95th percentile of Hausdorff Distance (95HD) as performance metrics. Statistical significance of the results was assessed using Wilcoxon rank-sum tests to determine p-values for the observed differences between methods.

Discussion

In the discussion of the research paper, the authors highlight the potential of scribble-based segmentation for reducing annotation workload while maintaining segmentation accuracy. They acknowledge, however, that models trained with scribble annotations may underperform compared to those using dense annotations. To address this, various methods have been proposed, including graphical-based approaches like random walk segmentation, and advanced techniques such as minimum entropy regularization, CRF regularized frameworks, and uncertainty-aware frameworks. These innovations aim to enhance the performance of segmentation models in scenarios where only sparse annotations are available, particularly in complex fields like medical imaging.

The section also emphasizes the challenges associated with feature similarity matching in the context of scribble-based medical image segmentation. Traditional pixel-level feature matching techniques often rely on comprehensive visual information, which is not available with sparse scribble annotations. This limitation, combined with the unique complexities of medical images, such as irregular shapes and textures, complicates the effectiveness of these methods. The authors argue that existing feature-matching approaches do not adequately address the inherent limitations of scribble annotations, which can lead to suboptimal feature representation and hinder the overall segmentation performance.