تطوير وتقييم نظام قائم على التعلم العميق لتقدير العمر السني باستخدام طريقة ديميرجيان على الأشعة السينية البانورامية Development and evaluation of a deep learning-based system for dental age estimation using the demirjian method on panoramic radiographs

المجلة: BMC Oral Health، المجلد: 25، العدد: 1
DOI: https://doi.org/10.1186/s12903-025-06420-5
PMID: https://pubmed.ncbi.nlm.nih.gov/40670936
تاريخ النشر: 2025-07-16
المؤلف: Yunus Balel وآخرون
الموضوع الرئيسي: دراسات الأنثروبولوجيا الجنائية والبيوآركيولوجيا

نظرة عامة

تقدم هذه الورقة البحثية تطوير وتقييم نموذج قائم على التعلم العميق لتقدير العمر السني تلقائيًا باستخدام طريقة ديميرجيان على الأشعة السينية البانورامية. استخدمت الدراسة مجموعة بيانات تتكون من 4,800 صورة بانورامية لتدريب نموذج YOLOv11 لتصنيف مراحل نمو الأسنان، وتقييم أدائه من خلال مقاييس مثل الدقة، والاسترجاع، ودرجة F1، ومتوسط الدقة العامة (mAP). تم استخدام مجموعة بيانات منفصلة تتكون من 650 فردًا لمقارنة العمر الزمني مع التقييمات اليدوية وتلك المدعومة بالذكاء الاصطناعي، مما كشف أن كلا الطريقتين قد بالغتا في تقدير العمر الزمني بشكل كبير (p < 0.001) دون وجود فرق كبير بينهما (p = 0.433). أظهر النموذج أعلى أداء له في مجموعة 2nd Molar-H، محققًا دقة قدرها 0.99 واسترجاعًا قدره 1.0، بينما لوحظ أدنى أداء في مجموعة 1st Molar-B. تشير النتائج إلى أن دقة النظام المدعوم بالذكاء الاصطناعي قابلة للمقارنة مع طريقة ديميرجيان اليدوية، حيث تميزت بشكل خاص في المراحل المتأخرة من نمو الأسنان. أظهر تحليل الانحدار علاقة ضعيفة ولكنها ذات دلالة إحصائية بين العمر وخطأ التقدير، وكانت أكثر وضوحًا في النموذج المدعوم بالذكاء الاصطناعي (R² = 0.042). تعزز البرنامج النصي بلغة بايثون والواجهة الرسومية التطبيق العملي للنموذج في البيئات السريرية والجنائية. ومع ذلك، يوصي المؤلفون بمزيد من التحقق مع مجموعات سكانية متنوعة واستكشاف هياكل نماذج بديلة لتحسين الدقة والموثوقية. بشكل عام، تمثل هذه الدراسة تقدمًا كبيرًا في أتمتة عمليات تقدير العمر في العلوم الجنائية وطب الأسنان، مع إمكانية تقديم نتائج أسرع ومعيارية وموضوعية، في انتظار التحقق على نطاق واسع قبل الدمج السريري.

مقدمة

تسلط مقدمة هذه الورقة البحثية الضوء على الدور الحاسم لتقدير العمر في مجالات مختلفة، وخاصة في العلوم الجنائية وطب الأسنان، حيث يعد أمرًا أساسيًا للتحقق من الهوية، والإجراءات القانونية، والتطبيقات الطبية. لقد أثبت تحليل نمو الأسنان، وخاصة من خلال طرق مثل طريقة ديميرجيان، فعاليته في تقدير العمر الزمني من خلال فحص مراحل النمو الشعاعي للأسنان. ومع ذلك، يمكن أن تتأثر دقة هذه الطرق التقليدية بالتنوعات العرقية والجغرافية في نمو الأسنان.

لقد قدمت التطورات الحديثة في الذكاء الاصطناعي (AI) والتعلم العميق طرقًا مبتكرة لتقدير العمر، وخاصة من خلال استخدام الشبكات العصبية التلافيفية (CNNs) التي تحلل الأشعة السينية البانورامية للأسنان. تقدم هذه الدراسة نظامًا آليًا جديدًا قائمًا على التعلم العميق لا يقدر العمر باستخدام طريقة ديميرجيان فحسب، بل يتميز أيضًا بخط أنابيب شامل مع واجهة بديهية للمستخدمين غير التقنيين. يهدف النظام إلى تعزيز الكفاءة والدقة في تقدير العمر مع تقديم أدلة تجريبية على موثوقيته مقارنة بالطرق اليدوية. بشكل عام، تسهم هذه البحث في المجال من خلال تقديم نموذج عالي الأداء ي prioritizes الوصول للمحترفين في العلوم الجنائية وأطباء الأسنان.

طرق

تحدد قسم “المواد والطرق” في الورقة البحثية التصميم التجريبي والإجراءات المستخدمة للتحقيق في فرضية البحث. يوضح المواد المحددة المستخدمة، بما في ذلك أي مواد كيميائية، ومعدات، وعينات بيولوجية، مما يضمن إمكانية تكرار الدراسة. تشمل المنهجية إعداد التجربة، وتقنيات جمع البيانات، والأساليب التحليلية، والتي قد تشمل التحليلات الإحصائية أو النماذج الحاسوبية.

بالإضافة إلى ذلك، قد يصف القسم أي ضوابط أو متغيرات تم التلاعب بها خلال التجارب، فضلاً عن المعايير لاختيار العينات أو الموضوعات. تعتبر هذه النظرة الشاملة ضرورية لفهم صلاحية وموثوقية النتائج المقدمة في الدراسة.

نتائج

حللت الدراسة مجموعة بيانات تتكون من 4,800 مريض بمتوسط عمر 10.64 سنوات لتقييم أداء نموذج في تصنيف النضج السني باستخدام طريقة ديميرجيان. أشارت مقاييس الأداء إلى أن فئة 1st Molar-B أظهرت أدنى أداء (الدقة: 0.471، الاسترجاع: 0.833، mAP50-95: 0.436، درجة F1: 0.601)، بينما حققت فئة 2nd Molar-H أعلى أداء (الدقة: 0.99، الاسترجاع: 1.0، mAP50-95: 0.954، درجة F1: 0.995). أظهر مصفوفة الارتباك المعادلة صفريًا دقة تصنيف قوية بشكل عام، حيث كانت معظم التوقعات متوافقة بشكل صحيح على طول القطر، على الرغم من ملاحظات بسيطة عن التصنيفات الخاطئة في المراحل المبكرة من النمو بسبب تداخل الميزات.

تم استخدام مجموعة بيانات مستقلة تتكون من 650 فردًا لتقييم الأداء اليدوي، مما كشف عن اختلافات ذات دلالة إحصائية بين العمر الزمني وكلا من تقديرات ديميرجيان اليدوية والمدعومة بالذكاء الاصطناعي (p < 0.001). بالغت كلا الطريقتين في تقدير العمر مقارنة بالعمر الزمني، مع اختلافات متوسطة قدرها -0.766 سنة للطريقة اليدوية و-0.707 سنة للطريقة المدعومة بالذكاء الاصطناعي. ومع ذلك، لم يتم العثور على فرق كبير بين الطريقتين (p = 0.433). أظهرت تحليلات الانحدار الخطي أن العمر الزمني كان له تأثير محدود على دقة كلا الطريقتين، حيث أظهرت الطريقة المدعومة بالذكاء الاصطناعي ارتباطًا أقوى قليلاً (R² = 0.042) مقارنة بالطريقة اليدوية (R² = 0.012). اقترحت معاملات الانحدار السلبية أن الدقة تحسنت مع انخفاض العمر الزمني، خاصة في النموذج المدعوم بالذكاء الاصطناعي، الذي كان له علاقة ذات دلالة إحصائية أكبر (p < 0.001).

مناقشة

في هذه الدراسة، تم تطوير نموذج تعلم عميق يستخدم بنية YOLOv11 لتقدير العمر تلقائيًا بناءً على طريقة ديميرجيان باستخدام الأشعة السينية البانورامية للأسنان. أظهر النموذج دقة عالية، خاصة في الأسنان التي اكتمل فيها نمو الجذر، وأدى أداءً مشابهًا للطرق اليدوية التقليدية. كشفت التحليلات الإحصائية أن دقة النموذج تحسنت مع زيادة العمر الزمني، مما يبرز التحديات التي تطرحها المراحل المبكرة من النمو حيث تظهر الأسنان ميزات متداخلة مع الأسنان الأولية. قدمت الدراسة أيضًا برنامج نصي سهل الاستخدام بلغة بايثون وواجهة، مما يمكّن المستخدمين الذين ليس لديهم خبرة في البرمجة من إجراء تقديرات العمر بكفاءة.

على الرغم من النتائج الواعدة، تعترف الدراسة بعدة قيود، بما في ذلك خصوصية مجموعة البيانات الجغرافية، والتي قد تؤثر على إمكانية تعميمها على مجموعات سكانية متنوعة، والاختلافات المحتملة في جودة الأشعة السينية بسبب أجهزة التصوير المختلفة. بالإضافة إلى ذلك، قد يؤدي الاعتماد على الحكم البشري في تصنيف مراحل نمو الأسنان إلى إدخال تحيز. يقترح المؤلفون أن التحقق الإضافي عبر مجموعات بيانات متعددة المراكز ومتنوعة عرقيًا ضروري لتعزيز قابلية تطبيق النموذج. بشكل عام، تمثل هذه البحث تقدمًا كبيرًا في أتمتة تقدير العمر في السياقات الجنائية وطب الأسنان، مع إمكانية توحيد وتسريع العملية مع تقليل الاعتماد على التقييمات الخبيرة.

Journal: BMC Oral Health, Volume: 25, Issue: 1
DOI: https://doi.org/10.1186/s12903-025-06420-5
PMID: https://pubmed.ncbi.nlm.nih.gov/40670936
Publication Date: 2025-07-16
Author(s): Yunus Balel et al.
Primary Topic: Forensic Anthropology and Bioarchaeology Studies

Overview

This research paper presents the development and evaluation of a deep learning-based model for automatic dental age estimation using the Demirjian method on panoramic radiographs. The study utilized a dataset of 4,800 panoramic radiographs to train a YOLOv11 model for tooth development staging, assessing its performance through metrics such as precision, recall, F1 score, and mean Average Precision (mAP). A separate dataset of 650 individuals was employed to compare chronological age with manual and AI-assisted Demirjian assessments, revealing that both methods significantly overestimated chronological age (p < 0.001) without a significant difference between them (p = 0.433). The model demonstrated its highest performance in the 2nd Molar-H group, achieving a precision of 0.99 and a recall of 1.0, while the lowest performance was noted in the 1st Molar-B group. The findings indicate that the AI-assisted system's accuracy is comparable to that of the manual Demirjian method, particularly excelling in later stages of tooth development. Regression analysis showed a weak but statistically significant relationship between age and prediction error, more pronounced in the AI-assisted model (R² = 0.042). The developed Python script and graphical interface enhance the model's practical application in clinical and forensic settings. However, the authors recommend further validation with diverse populations and exploration of alternative model architectures to improve accuracy and robustness. Overall, this study marks a significant advancement in automating age estimation processes in forensic science and dentistry, with the potential to provide faster, standardized, and objective outcomes, pending large-scale validation before clinical integration.

Introduction

The introduction of this research paper highlights the critical role of age estimation in various fields, particularly in forensic sciences and dentistry, where it is essential for identity verification, legal proceedings, and medical applications. The analysis of dental development, especially through methods like the Demirjian method, has proven effective for estimating chronological age by examining the radiographic developmental stages of teeth. However, the accuracy of these traditional methods can be influenced by ethnic and geographical variations in dental development.

Recent advancements in artificial intelligence (AI) and deep learning have introduced innovative approaches to age estimation, particularly through the use of Convolutional Neural Networks (CNNs) that analyze dental panoramic radiographs. This study presents a novel deep learning-based automated system that not only estimates age using the Demirjian method but also features an end-to-end pipeline with an intuitive interface for non-technical users. The system aims to enhance efficiency and accuracy in age estimation while providing empirical evidence of its reliability compared to manual methods. Overall, this research contributes to the field by offering a high-performing model that prioritizes accessibility for forensic professionals and dentists.

Methods

The “Materials and Methods” section of the research paper outlines the experimental design and procedures employed to investigate the research hypothesis. It details the specific materials used, including any reagents, equipment, and biological samples, ensuring reproducibility of the study. The methodology encompasses the experimental setup, data collection techniques, and analytical approaches, which may include statistical analyses or computational models.

Additionally, the section may describe any controls or variables that were manipulated during the experiments, as well as the criteria for selecting samples or subjects. This comprehensive overview is crucial for understanding the validity and reliability of the findings presented in the study.

Results

The study analyzed a dataset of 4,800 patients with a mean age of 10.64 years to evaluate the performance of a model in classifying dental maturity using the Demirjian method. Performance metrics indicated that the 1st Molar-B class exhibited the lowest performance (Precision: 0.471, Recall: 0.833, mAP50-95: 0.436, F1 Score: 0.601), while the 2nd Molar-H class achieved the highest performance (Precision: 0.99, Recall: 1.0, mAP50-95: 0.954, F1 Score: 0.995). A row-normalized confusion matrix demonstrated strong overall classification accuracy, with most predictions correctly aligned along the diagonal, although minor misclassifications were noted in early developmental stages due to overlapping features.

An independent dataset of 650 individuals was used for manual performance evaluation, revealing statistically significant differences between chronological age and both manual and AI-assisted Demirjian estimates (p < 0.001). Both methods overestimated age compared to chronological age, with mean differences of -0.766 years for the manual method and -0.707 years for the AI-assisted method. However, no significant difference was found between the two methods (p = 0.433). Linear regression analyses indicated that chronological age had a limited effect on the accuracy of both methods, with the AI-assisted method showing a slightly stronger correlation (R² = 0.042) compared to the manual method (R² = 0.012). Negative regression coefficients suggested that accuracy improved as chronological age decreased, particularly in the AI-assisted model, which had a more statistically significant relationship (p < 0.001).

Discussion

In this study, a deep learning model utilizing the YOLOv11 architecture was developed for automatic age estimation based on the Demirjian method using dental panoramic radiographs. The model demonstrated high accuracy, particularly in teeth with completed root development, and performed comparably to traditional manual methods. Statistical analyses revealed that the model’s accuracy improved with increasing chronological age, highlighting the challenges posed by early developmental stages where teeth exhibit overlapping features with primary teeth. The study also introduced a user-friendly Python script and interface, enabling users without programming expertise to conduct age estimations efficiently.

Despite its promising results, the study acknowledges several limitations, including the dataset’s geographic specificity, which may affect generalizability to diverse populations, and potential variability in radiograph quality due to different imaging devices. Additionally, the reliance on human judgment for tooth development staging may introduce bias. The authors suggest that further validation across multi-center and ethnically diverse datasets is necessary to enhance the model’s applicability. Overall, this research represents a significant advancement in automating age estimation in forensic and dental contexts, with the potential to standardize and expedite the process while reducing dependency on expert evaluations.