DentoMorph-LDMs: نماذج انتشار تعتمد على نسيج اللثة المتصل بـ 8 نقاط جديدة وفقدان الأسنان اللبنية لزيادة صور الأسنان DentoMorph-LDMs: diffusion models based on novel adaptive 8-connected gum tissue and deciduous teeth loss for dental image augmentation

المجلة: Scientific Reports، المجلد: 15، العدد: 1
DOI: https://doi.org/10.1038/s41598-025-11955-2
PMID: https://pubmed.ncbi.nlm.nih.gov/40715323
تاريخ النشر: 2025-07-26
المؤلف: Hanaa Salem وآخرون
الموضوع الرئيسي: الأشعة السينية السنية والتصوير

نظرة عامة

تقدم هذه البحث DentoMorph-LDMs، إطار عمل مبتكر يعزز إعادة بناء الصور السنية للأطفال واكتشاف الأمراض من خلال دمج الخوارزميات المستوحاة من البيولوجيا في نماذج الانتشار الكامنة (LDMs). يقدم الدراسة وظيفتين جديدتين للخسارة: تعويض بكسل متكيف مع اللثة (GAPI)، الذي يحاكي السلوك التكيفي لأنسجة اللثة، وإعادة بناء قائمة على الانتقال المؤقت (DTBR)، الذي يأخذ في الاعتبار مراحل نمو الأسنان الأولية لدى الأطفال الذين تتراوح أعمارهم بين 2-12 عامًا. تحسن هذه الخوارزميات بشكل كبير من الحفاظ على السلامة التشريحية أثناء إعادة بناء الصورة، محققة مقاييس أداء متفوقة، بما في ذلك درجة بداية قدرها 9.87 وتحسين متوسط الدقة بنسبة 9.1% عبر خمسة نماذج اكتشاف، مما يمكّن من اكتشاف أكثر دقة للتسوس في مراحله المبكرة وغيرها من الشذوذات السنية.

يظهر الإطار عمومية قوية عبر مجموعات بيانات متعددة وفئات عمرية، مع فعالية ملحوظة خلال فترات الأسنان المختلطة. تشير التحقق الطولي إلى فوائد سريرية مستدامة، بما في ذلك تحسين الحساسية في اكتشاف التسوس المبكر وتوفير اقتصادي لكل مريض. ومع ذلك، تعترف الدراسة بالقيود مثل الاعتماد على مجموعة بيانات تدريب واحدة والعبء الحسابي الذي قد يعيق الاعتماد في البيئات ذات الموارد المحدودة. تشمل اتجاهات البحث المستقبلية توسيع الإطار ليشمل التصوير ثلاثي الأبعاد، ودمج عمليات بيولوجية إضافية، واستكشاف تطبيقات أوسع في التصوير الطبي للأطفال. يؤسس هذا العمل نموذجًا جديدًا لتعزيز الصور الطبية يدمج الذكاء الحسابي مع الرؤى البيولوجية، مما يمهد الطريق للتقدم بين التخصصات في الرعاية الصحية للأطفال.

مقدمة

تسلط مقدمة ورقة البحث الضوء على الاهتمام المتزايد في وظائف المسافة الموقعة العصبية (SDFs) كطريقة جديدة للتمثيل البصري ثلاثي الأبعاد، والتي تقدم مزايا على الشبكات التقليدية من الفوكسل والشبكات المثلثية من خلال توفير وظيفة مستمرة لتمثيل الهندسة ثلاثية الأبعاد. تسهل هذه الطريقة التخزين المدمج للبيانات الهندسية التفصيلية. تناقش الورقة تطبيق النماذج التوليدية العميقة، وخاصة نماذج الانتشار الكامنة (LDMs)، في تعزيز الصور الطبية والسنية، مشيرة إلى تفوقها على الشبكات التنافسية التوليدية التقليدية (GANs) في إنتاج صور عالية الدقة مع تماسك هيكلي أفضل. ومع ذلك، فإن التطبيقات الحالية لنماذج LDM في التصوير السني قد أغفلت إلى حد كبير دمج المعرفة التشريحية الخاصة بالمجال.

لمعالجة هذه الفجوة، يقترح المؤلفون وظائف خسارة مستوحاة من البيولوجيا مصممة خصيصًا لتشريح الأسنان للأطفال، من خلال طريقتين مبتكرتين: تعويض بكسل متكيف مع اللثة (GAPI) وإعادة بناء قائمة على الانتقال المؤقت (DTBR). يركز GAPI على استعادة البكسلات المفقودة عند واجهة اللثة والأسنان من خلال تحليل السياق التشريحي المحلي، بينما يقدر DTBR مرحلة نمو المريض لضمان توافق الهياكل المعوضة مع مراحل نمو الأسنان المتوقعة. يمثل هذا البحث جهدًا رائدًا لدمج بيولوجيا الأسنان للأطفال في إطار توليدي قائم على الانتشار لتعزيز الصور وتعويضها، بهدف تحسين دقة وواقعية التصوير السني.

الطرق

تركز المنهجية المقدمة في هذا البحث على الاستفادة من المبادئ البيولوجية من سلوك الأنسجة الفموية لتطوير خوارزميات معالجة الصور المتقدمة لتصوير الأسنان، مستهدفة تطبيقات الأطفال بشكل خاص. يتم تقديم خوارزميتين مبتكرتين، تعويض بكسل متكيف مع اللثة (GAPI) وإعادة بناء قائمة على الانتقال المؤقت (DTBR). يتم دمج هذه الخوارزميات في نماذج الانتشار الكامنة (LDM) لتعزيز إعادة بناء وتعزيز الصور السنية عبر فئات عمرية مختلفة. تحاكي خوارزمية GAPI الدعم الهيكلي التكيفي لأنسجة اللثة، مع إعطاء الأولوية للمسارات التشريحية الهامة لتعويض البكسلات مع الوعي بالسياق، بينما تدمج خوارزمية DTBR أنماط النمو للأسنان الأولية، مستخدمة التعرف على الأنماط لإعادة بناء الصور التي تعكس المظهر المتوقع للأسنان الأولية في مراحل مختلفة.

تم هيكلة المنهجية في ثلاث مراحل: (1) معالجة وتحليل الصور لتحديد المناطق التي تحتاج إلى إعادة بناء، (2) إعادة بناء البكسلات المستوحاة من البيولوجيا باستخدام GAPI وDTBR لاستعادة البكسلات المفقودة أو التالفة، و(3) دمج هذه الخوارزميات مع نماذج الانتشار لتوليد صور سنية ذات مصداقية تشريحية. توجه إدماج وظائف الخسارة المستوحاة من البيولوجيا في إطار LDM عملية إعادة البناء، مما يضمن أن الصور الناتجة تحافظ على الاستمرارية التشريحية. للتحقق من فعالية هذا النهج، يتم استخدام مجموعة البيانات المحسنة التي تم إنتاجها بواسطة التقنيات المستوحاة من البيولوجيا لتدريب وتقييم نماذج اكتشاف الأمراض المختلفة، مما يظهر إمكانيات هذه الخوارزميات في تحسين تحليل الصور السنية.

النتائج

يقدم قسم النتائج في الورقة تقييمًا شاملاً لنموذج الانتشار المعزز المقترح، مقارنًا أدائه ضد نماذج موجودة مختلفة باستخدام مقاييس رئيسية: درجة البداية (IS)، مسافة بداية فريشيت (FID)، مقياس مؤشر التشابه الهيكلي (SSIM)، ونسبة الإشارة إلى الضوضاء القصوى (PSNR). يحقق النموذج المقترح IS قدره 9.87، وأدنى FID قدره 4.21، وSSIM مرتفع قدره 0.952، وPSNR قدره 34.76، مما يشير إلى جودة صورة وواقعية متفوقة مقارنة بالمنافسين مثل DiffusionCLIP وControlNet-Aug. تؤكد التحليلات الإحصائية، بما في ذلك اختبارات t لعينة واحدة واختبارات t لعينة مستقلة، على أهمية هذه التحسينات، مع قيم p أقل من 0.001 عبر جميع المقاييس، مما يعزز فعالية النموذج في توليد صور سنية عالية الجودة.

بالإضافة إلى ذلك، تظهر نتائج اكتشاف الأمراض الفموية تحسينات كبيرة عبر نماذج التصنيف المختلفة بعد تطبيق تقنيات تعزيز البيانات. بشكل ملحوظ، أظهر نموذج Faster R-CNN أعلى تحسين مطلق في متوسط الدقة (mAP) عند +0.076 (تحسين نسبي بنسبة 10.2%)، بينما أظهر YOLOv8 زيادة ملحوظة في الاسترجاع بمقدار +0.065 (تحسين نسبي بنسبة 8.4%). تؤكد هذه النتائج على الدور الحاسم لتعزيز البيانات في تحسين قدرات الاكتشاف، مما يعزز أداء العديد من الخوارزميات في سياق اكتشاف الأمراض الفموية. بشكل عام، تسلط النتائج الضوء على تفوق النموذج المقترح في توليد الصور وتأثيره الإيجابي على مهام التصنيف.

المناقشة

تسلط قسم المناقشة في الورقة الضوء على التقدم الكبير في تطبيقات التعلم العميق ضمن رعاية الصحة الفموية، خاصة من خلال استخدام الشبكات العصبية التلافيفية (CNNs) والشبكات التنافسية التوليدية (GANs). كانت هذه التقنيات أساسية في تحسين تحليل الصور السنية، مما يمكّن من التعرف الدقيق وتصنيف الأمراض السنية مع تبسيط سير العمل السريري. على الرغم من القدرات الواعدة لخوارزميات التعلم العميق، لا تزال التحديات قائمة، مثل أحجام مجموعات البيانات المحدودة التي تؤدي إلى الإفراط في التكيف وصعوبات في دمج هذه النماذج في الممارسة السريرية بسبب مقاييس الأداء غير الموثوقة. يؤكد المؤلفون على ضرورة إجراء مراجعات شاملة للأدبيات الحالية لتحديد المنهجيات الفعالة ومعالجة الحواجز التي تعيق اعتماد التعلم العميق في التصوير السني.

علاوة على ذلك، تقدم الورقة إطارًا مستوحى من البيولوجيا لتعزيز الصور السنية باستخدام نماذج الانتشار الكامنة (LDMs). تستفيد هذه الطريقة من الخصائص التشريحية للهياكل السنية لتحسين جودة الصورة ودقتها في مهام إعادة البناء. تم تصميم إطار LDM لمعالجة الصور السنية بكفاءة من خلال ترميزها في مساحة كامنة، حيث تضيف عملية الانتشار وتزيل الضوضاء لاحقًا، مسترشدة بوظائف خسارة مستوحاة من البيولوجيا جديدة. يهدف هذا الأسلوب إلى إنتاج صور عالية الدقة تحافظ على الاستمرارية التشريحية، خاصة في الحالات السنية للأطفال حيث تكون التغيرات النمائية حرجة. يعزز دمج GAPI (تعويض بكسل متكيف مع اللثة) عملية إعادة البناء من خلال إعطاء الأولوية للأهمية الهيكلية في جيران البكسل، مما يحسن الجودة العامة للتصوير السني.

القيود

تعترف البحث بعدة قيود قد تؤثر على قابلية تطبيق وأداء الإطار المقترح. أولاً، كانت عملية التدريب تعتمد بشكل أساسي على مجموعة بيانات شاملة واحدة، وعلى الرغم من أن التحقق عبر مجموعات البيانات يشير إلى عمومية جيدة، إلا أن التحقق الإضافي من خلال دراسات مستقبلية متعددة المراكز أوسع ضروري لتأسيس قابلية تطبيق الإطار بشكل عالمي. بالإضافة إلى ذلك، فإن اعتماد الإطار على معلومات دقيقة عن العمر وتصنيف مراحل النمو يطرح تحديات؛ قد تؤدي الأخطاء في بيانات عمر المريض أو الأنماط التنموية غير النمطية إلى تقويض الأداء، مما يتطلب تقييمًا سريريًا دقيقًا والتحقق اليدوي المحتمل.

علاوة على ذلك، قد تعيق المتطلبات الحسابية للإطار، على الرغم من أنها قابلة للإدارة للتطبيقات السريرية، اعتماده في البيئات ذات الموارد المحدودة أو برامج الفحص ذات الحجم الكبير حيث تكون المعالجة السريعة ضرورية. أخيرًا، قد يحد التركيز الحالي على طرق التصوير ثنائية الأبعاد من قابلية تطبيق الإطار على تقنيات التصوير ثلاثية الأبعاد الناشئة، مثل التصوير المقطعي المحوسب باستخدام شعاع مخروط (CBCT) وأجهزة المسح داخل الفم، التي أصبحت شائعة بشكل متزايد في ممارسة طب الأسنان للأطفال.

Journal: Scientific Reports, Volume: 15, Issue: 1
DOI: https://doi.org/10.1038/s41598-025-11955-2
PMID: https://pubmed.ncbi.nlm.nih.gov/40715323
Publication Date: 2025-07-26
Author(s): Hanaa Salem et al.
Primary Topic: Dental Radiography and Imaging

Overview

This research presents DentoMorph-LDMs, an innovative framework that enhances pediatric dental image reconstruction and disease detection by integrating biologically-inspired algorithms into Latent Diffusion Models (LDMs). The study introduces two novel loss functions: Gum-Adaptive Pixel Imputation (GAPI), which simulates the adaptive behavior of gum tissue, and Deciduous Transition-Based Reconstruction (DTBR), which accounts for the developmental stages of primary teeth in children aged 2-12 years. These algorithms significantly improve the preservation of anatomical integrity during image reconstruction, achieving superior performance metrics, including an Inception Score of 9.87 and a mean Average Precision improvement of 9.1% across five detection models, thereby enabling more accurate detection of early-stage caries and other dental anomalies.

The framework demonstrates robust generalization across multiple datasets and age groups, with notable effectiveness during mixed dentition periods. Longitudinal validation indicates sustained clinical benefits, including improved sensitivity in early caries detection and economic savings per patient. However, the study acknowledges limitations such as reliance on a single training dataset and the computational overhead that may hinder adoption in resource-limited settings. Future research directions include expanding the framework to three-dimensional imaging, integrating additional biological processes, and exploring broader applications in pediatric medical imaging. This work establishes a new paradigm for medical image enhancement that merges computational intelligence with biological insights, paving the way for interdisciplinary advancements in pediatric healthcare.

Introduction

The introduction of the research paper highlights the growing interest in neural signed distance functions (SDFs) as a novel 3D visual representation method, which offers advantages over traditional voxel grids and triangle meshes by providing a continuous function for 3D geometry representation. This approach facilitates compact storage of detailed geometrical data. The paper discusses the application of deep generative models, particularly Latent Diffusion Models (LDMs), in medical and dental image augmentation, noting their superiority over traditional Generative Adversarial Networks (GANs) in generating high-resolution images with better structural coherence. However, existing LDM applications in dental imaging have largely overlooked the incorporation of domain-specific anatomical knowledge.

To address this gap, the authors propose biologically inspired loss functions tailored to pediatric dental anatomy, specifically through two innovative methods: the Gum-Adaptive Pixel Imputation (GAPI) and the Deciduous Transition-Based Reconstruction (DTBR). GAPI focuses on restoring missing pixels at the gum-tooth interface by analyzing local anatomical context, while DTBR estimates the developmental stage of the patient to ensure that imputed structures align with expected tooth growth stages. This research represents a pioneering effort to integrate pediatric dental biology into a diffusion-based generative framework for image augmentation and imputation, aiming to enhance the accuracy and realism of dental imaging.

Methods

The methodology presented in this research focuses on leveraging biological principles from oral tissue behavior to develop advanced image processing algorithms for dental imagery, specifically targeting pediatric applications. Two novel algorithms, Gum-Adaptive Pixel Imputation (GAPI) and Deciduous Transition-Based Reconstruction (DTBR), are introduced. These algorithms are integrated into Latent Diffusion Models (LDM) to enhance the reconstruction and augmentation of dental images across different age groups. The GAPI algorithm mimics the adaptive structural support of gum tissue, prioritizing significant anatomical pathways for context-aware pixel imputation, while the DTBR algorithm incorporates the developmental patterns of deciduous teeth, utilizing pattern recognition to reconstruct images that reflect the expected appearance of primary dentition at various stages.

The methodology is structured in three phases: (1) image preprocessing and analysis to identify areas needing reconstruction, (2) bio-inspired pixel reconstruction using GAPI and DTBR to restore missing or corrupted pixels, and (3) integration of these algorithms with diffusion models to generate anatomically plausible dental images. The incorporation of biologically inspired loss functions into the LDM framework guides the reconstruction process, ensuring that the generated images maintain anatomical continuity. To validate the effectiveness of this approach, the enhanced dataset produced by the bio-inspired techniques is employed to train and evaluate various disease detection models, demonstrating the potential of these algorithms in improving dental imagery analysis.

Results

The results section of the paper presents a comprehensive evaluation of the proposed augmentation diffusion model, comparing its performance against various existing models using key metrics: Inception Score (IS), Fréchet Inception Distance (FID), Structural Similarity Index Measure (SSIM), and Peak Signal-to-Noise Ratio (PSNR). The proposed model achieves an IS of 9.87, the lowest FID of 4.21, a high SSIM of 0.952, and a PSNR of 34.76, indicating superior image quality and realism compared to competitors such as DiffusionCLIP and ControlNet-Aug. Statistical analyses, including one-sample and independent two-sample t-tests, confirm the significance of these improvements, with p-values less than 0.001 across all metrics, reinforcing the model’s effectiveness in generating high-quality dental images.

Additionally, the results of oral disease detection demonstrate significant enhancements across various classification models following the application of data augmentation techniques. Notably, Faster R-CNN exhibited the highest absolute improvement in mean Average Precision (mAP) at +0.076 (10.2% relative improvement), while YOLOv8 showed a notable increase in Recall of +0.065 (8.4% relative improvement). These findings underscore the critical role of data augmentation in optimizing detection capabilities, enhancing the performance of multiple algorithms in the context of oral disease detection. Overall, the results highlight the proposed model’s superiority in image generation and its positive impact on classification tasks.

Discussion

The discussion section of the paper highlights the significant advancements in deep learning applications within oral health care, particularly through the use of convolutional neural networks (CNNs) and generative adversarial networks (GANs). These technologies have been instrumental in enhancing dental image analysis, enabling precise identification and classification of dental pathologies while streamlining clinical workflows. Despite the promising capabilities of deep learning algorithms, challenges persist, such as limited dataset sizes leading to overfitting and difficulties in integrating these models into clinical practice due to unverified performance metrics. The authors emphasize the necessity for comprehensive reviews of existing literature to identify effective methodologies and address the barriers hindering the adoption of deep learning in dental imaging.

Furthermore, the paper introduces a bio-inspired framework for dental image enhancement utilizing Latent Diffusion Models (LDMs). This approach leverages the anatomical characteristics of dental structures to improve image quality and accuracy in reconstruction tasks. The LDM framework is designed to process dental images efficiently by encoding them into a latent space, where a diffusion process adds and subsequently removes noise, guided by novel bio-inspired loss functions. This method aims to produce high-fidelity images that maintain anatomical continuity, particularly in pediatric cases where developmental variations are critical. The integration of GAPI (Gum-Adaptive Pixel Imputation) further enhances the reconstruction process by prioritizing structural significance in pixel neighborhoods, thereby improving the overall quality of dental imaging.

Limitations

The research acknowledges several limitations that may impact the applicability and performance of the proposed framework. Firstly, the training was predominantly based on a single comprehensive dataset, and while cross-dataset validation indicates good generalization, further validation through broader multi-center prospective studies is necessary to establish the framework’s universal applicability. Additionally, the framework’s reliance on precise age information and developmental stage classification poses challenges; inaccuracies in patient age data or atypical developmental patterns could compromise performance, necessitating careful clinical assessment and potential manual verification.

Moreover, the computational demands of the framework, although manageable for clinical applications, may hinder its adoption in resource-constrained settings or high-volume screening programs where rapid processing is essential. Lastly, the current emphasis on 2D imaging modalities may restrict the framework’s applicability to emerging 3D imaging technologies, such as cone-beam computed tomography (CBCT) and intraoral scanners, which are becoming increasingly prevalent in pediatric dental practice.