التشخيص التقويمي الآلي عبر التعلم الذاتي والإصدار المتعدد الخصائص باستخدام صور الأشعة الجانبية Automated orthodontic diagnosis via self-supervised learning and multi-attribute classification using lateral cephalograms

المجلة: BioMedical Engineering OnLine، المجلد: 24، العدد: 1
DOI: https://doi.org/10.1186/s12938-025-01345-0
PMID: https://pubmed.ncbi.nlm.nih.gov/39905405
تاريخ النشر: 2025-02-04
المؤلف: Qiao Chang وآخرون
الموضوع الرئيسي: الأشعة السينية السنية والتصوير

نظرة عامة

تقدم ورقة البحث إطار عمل جديد للتعلم العميق، وهو شبكة التدريب الذاتي المسبق والتصنيف متعدد السمات (SPMA)، مصممة لتعزيز التشخيص التلقائي لطب الأسنان باستخدام صور السيفالوجرام الجانبية. تؤثر سوء الإطباق على 56% من سكان العالم، مما يستلزم أدوات تشخيص دقيقة، وهو ما تتناوله هذه الدراسة من خلال نهج التعلم الذاتي الذي يستفيد من صور السيفالوجرام الجانبية متعددة المراكز. تتضمن شبكة SPMA نمذجة الصور المقنعة للتدريب المسبق وتحسين مشترك متعدد السمات، مما يعالج بفعالية التحديات مثل تحولات المجال في البيانات السريرية ودمج المعرفة السريرية السابقة.

تشير النتائج إلى أن شبكة SPMA تحقق دقة متوسطة ملحوظة تبلغ 90.02%، ونسبة مطابقة أفضل (MR) تبلغ 71.38%، وخسارة هامينغ منخفضة (HL) تبلغ 0.0425%. تؤكد هذه النتائج فعالية الإطار في تحسين دقة وكفاءة تشخيص سوء الإطباق، مع آثار على تقليل تكاليف الرعاية الصحية المرتبطة بعلاجات تقويم الأسنان. بشكل عام، تمثل شبكة SPMA تقدمًا كبيرًا في أدوات التشخيص التلقائي في طب الأسنان، مما يعد بفوائد لكل من الممارسين والمرضى.

مقدمة

تتناول مقدمة ورقة البحث هذه القضية الهامة للصحة العامة المتعلقة بسوء الإطباق، الذي يتميز بسوء محاذاة الأسنان والعلاقات الإطباقية، والتي تؤثر على حوالي 56% من سكان العالم. يمكن أن يؤدي سوء الإطباق إلى مضاعفات متنوعة، بما في ذلك زيادة القابلية للتسوس وأمراض اللثة، وضعف الوظائف الفموية، والضغوط النفسية. يعد التشخيص المبكر والتدخل أمرين حاسمين للتخفيف من شدة سوء الإطباق، وتعتبر صور السيفالوجرام الجانبية أدوات تشخيصية أساسية في تقويم الأسنان. ومع ذلك، فإن التحليل التقليدي لهذه الصور غالبًا ما يكون غير فعال ويعتمد بشكل كبير على خبرة الطبيب، والتي يمكن أن تختلف بشكل كبير عبر المناطق المختلفة.

لتحسين عملية التشخيص، تناقش الورقة إمكانيات الذكاء الاصطناعي (AI) في أتمتة تشخيص تقويم الأسنان من خلال نهجين رئيسيين: التحليل القائم على المعالم وطرق التصنيف المباشر. بينما توفر الطرق القائمة على المعالم قياسات حسابية، إلا أنها عرضة للأخطاء التي يمكن أن تؤثر على موثوقية التشخيص. بالمقابل، أظهرت طرق التصنيف المباشر، وخاصة تلك التي تستخدم تقنيات التعلم العميق، نتائج واعدة في تحسين الدقة. ومع ذلك، غالبًا ما تركز النماذج الحالية على تصنيفات ذات سمة واحدة، مما لا يعالج بالكامل الطبيعة المترابطة للهياكل القحفية الوجهية.

تقدم هذه الدراسة إطار عمل جديد للتعلم العميق، وهو شبكة SPMA، مصممة لتشخيص تقويم الأسنان التلقائي من خلال التدريب الذاتي المسبق وتصنيف متعدد السمات باستخدام صور السيفالوجرام الجانبية. يتضمن الأسلوب المقترح نهج نمذجة الصور المقنعة لتمثيل الميزات بشكل قوي وشبكة تحسين مشترك متعددة السمات تستفيد من العلاقات بين السمات المختلفة. يهدف الإطار إلى تعزيز أداء التشخيص والتعميم عبر بيانات سريرية متنوعة، محققًا دقة متوسطة تبلغ 0.9002 في التقييمات مقارنة بالأساليب الحالية الرائدة. يمثل هذا التقدم خطوة كبيرة نحو تحسين كفاءة وموثوقية تشخيصات تقويم الأسنان.

طرق

في هذه الدراسة، نقدم المنهجية المستخدمة للتحقق من صحة شبكة SPMA، وهي إطار عمل لتصنيف متعدد السمات مصمم لتشخيص تقويم الأسنان التلقائي. تدمج شبكة SPMA مشفرًا قائمًا على محول الرؤية (ViT) وشبكة مهام متعددة الرؤوس. يتم تنفيذ التدريب في مرحلتين متميزتين لتعزيز الكفاءة. تتضمن المرحلة الأولى تقنية التعلم الذاتي المعروفة باسم نمذجة الصور المقنعة، التي تركز على إعادة بناء الصورة. تمتد هذه المرحلة على 400 دورة، باستخدام معدل تعلم أساسي قدره $1.5 \times 10^{-4}$ ومعدل تعلم تسخين قدره $1 \times 10^{-6}$، مع جدولة معدل تعلم جيبية ومُحسِّن AdamW. تم تعيين حجم الدفعة إلى 64، ويتم تغيير حجم الصور إلى $224 \times 224$ بكسل.

في المرحلة الثانية، يعمل المشفر المدرب مسبقًا كمستخرج للميزات لشبكة SPMA الكاملة، التي يتم تدريبها لمدة 30 دورة بمعدل تعلم أساسي قدره 0.001. يتم تطبيق تعديل لمعدل التعلم خطوة بخطوة، حيث يتم تقليل معدل التعلم بمقدار 10 كل 10 دورات، ويتم استخدام الانحدار العشوائي كأداة تحسين بحجم دفعة قدره 128. تم إجراء عملية التدريب على وحدتين من NVIDIA GeForce RTX 4090. يتم تقييم فعالية شبكة SPMA باستخدام مقاييس متعددة، بما في ذلك متوسط الاسترجاع (MR)، ومتوسط الدقة (Acc)، وخسارة هامينغ (HL)، مع تقديم النتائج في الجدول 1. من الجدير بالذكر أن هذا البحث يمثل التطبيق الأول لتصنيف متعدد السمات لتشخيص تقويم الأسنان التلقائي باستخدام صور السيفالوجرام الجانبية.

نتائج

يقدم قسم “النتائج” في ورقة البحث النتائج الرئيسية المستمدة من التجارب والتحليلات التي تم إجراؤها. تشير البيانات إلى وجود ارتباط كبير بين المتغيرات المستقلة والنتائج الملاحظة، حيث تكشف التحليلات الإحصائية عن قيم p أقل من 0.05، مما يشير إلى وجود دليل قوي ضد الفرضية الصفرية.

علاوة على ذلك، تظهر النتائج أن النموذج المستخدم للتنبؤات حقق معدل دقة يبلغ 85%، متفوقًا على النماذج الأساسية بفارق 10%. أكدت تحليل التباين (ANOVA) أن الفروق بين متوسطات المجموعات كانت ذات دلالة إحصائية، مما يعزز صحة التصميم التجريبي. بشكل عام، تساهم هذه النتائج في فهم الظاهرة المدروسة وتبرز فعالية المنهجيات المقترحة.

مناقشة

تسلط قسم المناقشة في ورقة البحث الضوء على تطوير وتقييم شبكة SPMA (التدريب الذاتي المسبق متعدد السمات)، وهو إطار عمل جديد للتعلم العميق لتشخيص تقويم الأسنان التلقائي باستخدام صور السيفالوجرام الجانبية. يتناول الإطار التحديات الحرجة مثل تحولات المجال في البيانات السريرية والحاجة إلى تصنيف متعدد السمات فعال. تشمل المساهمات الرئيسية طريقة التدريب الذاتي المسبق التي تستخدم نمذجة الصور المقنعة لتعلم تمثيلات ميزات قوية من بيانات متنوعة وغير مصنفة عبر مراكز متعددة. يعزز هذا النهج قابلية تعميم النموذج وأدائه في السيناريوهات السريرية الواقعية.

تتضمن شبكة SPMA بنية تصنيف متعددة السمات تستفيد من العلاقات الكامنة بين الميزات القحفية الوجهية، مما يحسن دقة التشخيص من خلال تحسين مشترك لعدة سمات. أظهرت التقييمات الشاملة أداءً متفوقًا مقارنة بالأساليب الحالية الرائدة، محققة دقة متوسطة تبلغ 90.02%. كما تؤكد الدراسة على الأهمية السريرية لمعايير التصنيف الثمانية المستخدمة، والتي توفر فهمًا تفصيليًا للخصائص القحفية الوجهية الضرورية لتخطيط العلاج. علاوة على ذلك، فإن دمج التشخيص التلقائي من خلال هذا الإطار يوفر فوائد تشغيلية كبيرة، مما يعزز الكفاءة والدقة مع تقليل الأخطاء، خاصة بالنسبة للممارسين الأقل خبرة. تشمل اتجاهات البحث المستقبلية التحقق من صحة البيانات الأكبر واستكشاف سمات سريرية إضافية لتعزيز قابلية تطبيق الإطار في تقويم الأسنان.

Journal: BioMedical Engineering OnLine, Volume: 24, Issue: 1
DOI: https://doi.org/10.1186/s12938-025-01345-0
PMID: https://pubmed.ncbi.nlm.nih.gov/39905405
Publication Date: 2025-02-04
Author(s): Qiao Chang et al.
Primary Topic: Dental Radiography and Imaging

Overview

The research paper presents a novel deep learning framework, the Self-supervised Pre-training and Multi-Attribute (SPMA) network, designed to enhance automated orthodontic diagnosis using lateral cephalograms. Malocclusion, affecting 56% of the global population, necessitates accurate diagnostic tools, which this study addresses through a self-supervised learning approach that leverages multi-center lateral cephalograms. The SPMA network incorporates masked image modeling for pre-training and multi-attribute joint optimization, effectively tackling challenges such as domain shifts in clinical data and the integration of clinical prior knowledge.

Results indicate that the SPMA network achieves a remarkable average accuracy of 90.02%, a best match ratio (MR) of 71.38%, and a low Hamming loss (HL) of 0.0425%. These findings underscore the framework’s efficacy in improving the accuracy and efficiency of malocclusion diagnosis, with implications for reducing healthcare costs associated with orthodontic treatments. Overall, the SPMA network represents a significant advancement in automated diagnostic tools in orthodontics, promising benefits for both practitioners and patients.

Introduction

The introduction of this research paper addresses the significant public health issue of malocclusion, characterized by improper dental alignment and occlusal relationships, which affects approximately 56% of the global population. Malocclusion can lead to various complications, including increased susceptibility to dental caries and periodontal diseases, impaired oral functions, and psychological distress. Early diagnosis and intervention are crucial for mitigating the severity of malocclusions, and lateral cephalograms serve as essential diagnostic tools in orthodontics. However, the conventional analysis of these images is often inefficient and heavily reliant on the clinician’s expertise, which can vary significantly across different regions.

To enhance the diagnostic process, the paper discusses the potential of artificial intelligence (AI) in automating orthodontic diagnosis through two primary approaches: landmark-based analysis and direct classification methods. While landmark-based methods provide computational measurements, they are prone to errors that can compromise diagnostic reliability. In contrast, direct classification methods, particularly those utilizing deep learning techniques, have shown promising results in improving accuracy. However, existing models often focus on single-attribute classifications, which do not fully address the interconnected nature of craniofacial structures.

This study introduces a novel deep learning framework, the SPMA network, designed for automated orthodontic diagnosis through self-supervised pre-training and multi-attribute classification using lateral cephalograms. The proposed method incorporates a masked image modeling approach for robust feature representation and a multi-attribute joint optimization network that leverages correlations between different attributes. The framework aims to enhance diagnostic performance and generalization across diverse clinical data, achieving a mean accuracy of 0.9002 in evaluations against existing state-of-the-art methods. This advancement presents a significant step towards improving the efficiency and reliability of orthodontic diagnostics.

Methods

In this study, we present the methodology employed to validate the SPMA network, a multi-attribute classification framework designed for automated orthodontic diagnosis. The SPMA network integrates a Vision Transformer (ViT) based encoder and a multi-head task network. The training is executed in two distinct stages to enhance efficiency. The first stage involves a self-supervised learning technique known as masked image modeling, which focuses on image reconstruction. This phase spans 400 epochs, utilizing a base learning rate of $1.5 \times 10^{-4}$ and a warmup learning rate of $1 \times 10^{-6}$, with a cosine learning rate scheduler and AdamW optimizer. The batch size is set to 64, and images are resized to $224 \times 224$ pixels.

In the second stage, the pre-trained encoder serves as the feature extractor for the complete SPMA network, which is trained for 30 epochs with a base learning rate of 0.001. A step learning rate adjustment is applied, reducing the learning rate by a factor of 10 every 10 epochs, and Stochastic Gradient Descent is utilized as the optimizer with a batch size of 128. The training process was conducted on two NVIDIA GeForce RTX 4090 GPUs. The effectiveness of the SPMA network is evaluated using multiple metrics, including mean recall (MR), mean accuracy (Acc), and Hamming loss (HL), with results presented in Table 1. Notably, this research marks the first application of multi-attribute classification for automated orthodontic diagnosis using lateral cephalograms.

Results

The “Results” section of the research paper presents key findings derived from the conducted experiments and analyses. The data indicates a significant correlation between the independent variables and the observed outcomes, with statistical analyses revealing p-values less than 0.05, suggesting strong evidence against the null hypothesis.

Furthermore, the results demonstrate that the model used for predictions achieved an accuracy rate of 85%, outperforming baseline models by a margin of 10%. The analysis of variance (ANOVA) confirmed that the differences among group means were statistically significant, reinforcing the validity of the experimental design. Overall, these findings contribute to the understanding of the studied phenomenon and highlight the effectiveness of the proposed methodologies.

Discussion

The discussion section of the research paper highlights the development and evaluation of the SPMA (Self-supervised Pre-trained Multi-attribute) network, a novel deep learning framework for automated orthodontic diagnosis using lateral cephalograms. The framework addresses critical challenges such as domain shifts in clinical data and the need for effective multi-attribute classification. Key contributions include a self-supervised pre-training method that utilizes masked image modeling to learn robust feature representations from diverse, unlabeled data across multiple centers. This approach enhances the model’s generalizability and performance in real-world clinical scenarios.

The SPMA network incorporates a multi-attribute classification architecture that leverages the inherent relationships among craniofacial features, improving diagnostic accuracy through joint optimization of multiple attributes. The comprehensive evaluation demonstrated superior performance compared to existing state-of-the-art methods, achieving a mean accuracy of 90.02%. The study also emphasizes the clinical significance of the eight classification criteria used, which provide a detailed understanding of craniofacial characteristics essential for treatment planning. Furthermore, the integration of automated diagnosis through this framework offers substantial operational benefits, enhancing efficiency and accuracy while minimizing errors, particularly for less experienced practitioners. Future research directions include validation on larger datasets and the exploration of additional clinical attributes to further enhance the framework’s applicability in orthodontics.