التعرف على الأسنان ثلاثي الأبعاد لطب الأسنان الشرعي باستخدام التعلم العميق 3D tooth identification for forensic dentistry using deep learning

المجلة: BMC Oral Health، المجلد: 25، العدد: 1
DOI: https://doi.org/10.1186/s12903-025-06017-y
PMID: https://pubmed.ncbi.nlm.nih.gov/40301795
تاريخ النشر: 2025-04-29
المؤلف: Hamza Mouncif وآخرون
الموضوع الرئيسي: الأشعة السينية السنية والتصوير

نظرة عامة

تصنيف هياكل الأسنان داخل الفم أمر ضروري في التحليل السني وطب الأسنان الشرعي، ومع ذلك فإن طرق التصوير التقليدية ثنائية الأبعاد غالبًا ما تفتقر إلى الدقة بسبب الطبيعة ثلاثية الأبعاد المعقدة لتشريح الأسنان. تقدم هذه الدراسة نهجًا جديدًا يستخرج الميزات الرئيسية من نماذج الأسنان ثلاثية الأبعاد ويحولها إلى صور ثنائية الأبعاد للتحليل التفصيلي. من خلال استخدام بنية الشبكة العصبية المتكررة (RNN)، يحدد الأسلوب أنماطًا معقدة ضرورية للتصنيف الدقيق، بينما يعزز أيضًا كفاءة المعالجة ويقلل من الحاجة إلى التحليل اليدوي. تعالج هذه الاستراتيجية المبتكرة عدم انتظام بيانات ثلاثية الأبعاد وتضع معيارًا جديدًا في تحديد الأسنان.

تظهر الطريقة المقترحة للتعلم العميق تحسينات كبيرة في دقة التصنيف عبر فئات متعددة (16، 8، و4 فئات) من خلال تحويل الميزات من الوجوه مثلثية الشكل للأجسام ثلاثية الأبعاد إلى صور رمادية ثنائية الأبعاد ومعاملة كل صف كسلسلة لإدخال الشبكة العصبية المتكررة. على الرغم من تفوقها على الطرق الحالية، تعترف الدراسة بالقيود، بما في ذلك مجموعة بيانات تحتوي على 448 كائن سن قد تحد من إمكانية التعميم وتزيد من خطر الإفراط في التكيف. ستهدف الأبحاث المستقبلية إلى توسيع مجموعة البيانات، وتعزيز دقة النموذج من خلال هياكل أكثر تعقيدًا، والتحقق من النهج في مجموعات سكانية متنوعة وإعدادات سريرية، مما يدمجها في سير العمل الشرعي لتحسين التحليل السني الآلي.

مقدمة

تؤكد مقدمة ورقة البحث على الدور الحاسم للتعرف على الأسنان البشرية الفردية في التعليم السني وتطبيقاتها في طب الأسنان الشرعي وعلم الآثار. تبرز مرونة الأسنان كوسيلة موثوقة للتعرف في السيناريوهات الكارثية، مشيرة إلى حالات مثل تسونامي تايلاند عام 2004، حيث كانت الأدلة السنية محورية في تحديد 79% من الضحايا. تشير الورقة إلى أنه على الرغم من أن الكثير من الأبحاث قد ركزت على الأسنان الفردية لتحديد النسب والجنس والعمر، فإن ظهور التقنيات الرقمية والتعلم العميق قد حول تشخيص الأسنان، خاصة في تصنيف الأسنان من الصور.

يستعرض المؤلفون منهجيات متنوعة، بما في ذلك الشبكات العصبية التلافيفية (CNNs) وطرق النمذجة ثلاثية الأبعاد، التي تم استخدامها لتصنيف الأسنان. يشيرون إلى قيود الطرق الحالية، وخاصة تلك التي تعتمد فقط على الصور ثنائية الأبعاد، والتي تكافح مع التعقيد المكاني والتغيرات في ظروف العرض. بالمقابل، يهدف النهج الهجين المقترح إلى تحويل شبكات الأسنان ثلاثية الأبعاد إلى تمثيلات ثنائية الأبعاد لمعالجتها بواسطة الشبكات العصبية المتكررة (RNNs)، مما يعزز كل من دقة التصنيف وكفاءة الحوسبة. تمهد المقدمة الطريق لاستكشاف مفصل للنهج المقترح، مجموعة البيانات، النتائج التجريبية، والنقاشات التي تلي ذلك في الورقة.

النتائج

تشير نتائج النهج المقترح لتصنيف الأسنان ثلاثية الأبعاد إلى تحسين كبير في الدقة مقارنة بالطرق الحالية الرائدة. حقق الأسلوب دقة قدرها 98.01% على مجموعة بيانات من 16 فئة، و98.88% على مجموعة بيانات من 8 فئات، و99.75% على مجموعة بيانات من 4 فئات. يُعزى هذا الأداء إلى التحويل الفعال للميزات ثلاثية الأبعاد إلى تنسيق صورة ثنائية الأبعاد، والذي، عند معالجته من خلال الشبكات المتصلة بالكامل، يلتقط العلاقات المكانية المعقدة والتغيرات الدقيقة بين أنواع الأسنان المختلفة.

تم تقديم تقييم مفصل لأداء التصنيف في الجداول 2 و3، والتي تتضمن مقاييس الدقة والاسترجاع ودرجة F1. بينما تحقق البنية المقترحة باستمرار درجات عالية عبر جميع إصدارات مجموعة البيانات، تم ملاحظة بعض التحديات مع تصنيف الأضراس الثانية والثالثة بسبب تشابهها الشكلي مع الأسنان المجاورة وتمثيلها المحدود في مجموعة البيانات. ومع ذلك، أدى دمج الأسنان في فئات أوسع في مجموعات البيانات من 8 فئات و4 فئات إلى تحسين اتساق التصنيف، مما يظهر قوة ومرونة الأسلوب عبر مستويات تصنيف الأسنان المختلفة. بشكل عام، تؤكد النتائج فعالية النهج المقترح في التمييز بين أنواع الأسنان المختلفة على الرغم من التشابهات الهيكلية المتأصلة.

نقاش

في هذه الدراسة، طورنا نهجًا جديدًا للتعلم العميق لتصنيف الأسنان ثلاثية الأبعاد من خلال الاستفادة من الميزات الهندسية لتمثيلات الشبكة مثلثية الشكل. يستخدم أسلوبنا الشبكات العصبية المتكررة، وبشكل خاص وحدات البوابة المتكررة (GRUs)، لالتقاط العلاقات المكانية المعقدة المتأصلة في الهياكل السنية ثلاثية الأبعاد بشكل فعال. تتضمن عملية استخراج الميزات تحويل بيانات الشبكة مثلثية الشكل إلى تنسيق صورة رمادية ثنائية الأبعاد، مما يسمح لـ GRUs بمعالجة الطبيعة التسلسلية للبيانات مع الحفاظ على السياق الهندسي. أظهرت تجاربنا أن GRUs تفوقت بشكل كبير على الشبكات العصبية التلافيفية التقليدية (CNNs) وهياكل متكررة أخرى، محققة دقة تصنيف قدرها 98.01%، 98.88%، و99.75% عبر مجموعات البيانات التي تحتوي على 16، 8، و4 فئات سنية، على التوالي.

كشفت دراسة الإزالة أن دمج إحداثيات الرؤوس مع مراكز الوجوه مثلثية الشكل قدمت التمثيل الأكثر شمولاً لهندسة الأسنان، مما عزز أداء التصنيف. بينما كانت الشبكات العصبية التلافيفية تكافح مع الاعتماديات التسلسلية، تمكنت GRUs من إدارة هذه التعقيدات بشكل فعال، مما أدى إلى تعميم أفضل عبر مهام التصنيف المختلفة. الآثار السريرية لنتائجنا كبيرة، خاصة في طب الأسنان الشرعي، حيث يمكن أن يقوم أسلوبنا بأتمتة تحديد السجلات السنية في سيناريوهات الكوارث، مما يسرع عملية التعرف. ستركز الأعمال المستقبلية على توسيع مجموعة البيانات والتحقق من أداء النموذج في إعدادات سريرية متنوعة لتعزيز قابليته للتطبيق في التحقيقات الشرعية في العالم الحقيقي.

Journal: BMC Oral Health, Volume: 25, Issue: 1
DOI: https://doi.org/10.1186/s12903-025-06017-y
PMID: https://pubmed.ncbi.nlm.nih.gov/40301795
Publication Date: 2025-04-29
Author(s): Hamza Mouncif et al.
Primary Topic: Dental Radiography and Imaging

Overview

The classification of intraoral teeth structures is essential in dental analysis and forensic dentistry, yet traditional 2D imaging methods often lack accuracy due to the complex 3D nature of dental anatomy. This study introduces a novel approach that extracts key features from 3D tooth models and converts them into 2D images for detailed analysis. By employing a recurrent neural network (RNN) architecture, the method effectively identifies complex patterns necessary for accurate classification, while also enhancing processing efficiency and reducing the need for manual analysis. This innovative strategy addresses the irregularities of 3D data and sets a new standard in dental identification.

The proposed deep learning method demonstrates significant improvements in classification accuracy across multiple categories (16, 8, and 4 categories) by transforming features from the triangular faces of 3D objects into 2D grayscale images and treating each row as a sequence for RNN input. Despite outperforming existing methods, the study acknowledges limitations, including a dataset of 448 tooth objects that may restrict generalizability and increase the risk of overfitting. Future research will aim to expand the dataset, enhance model accuracy through more complex architectures, and validate the approach in diverse populations and clinical settings, ultimately integrating it into forensic workflows to improve automated dental analysis.

Introduction

The introduction of the research paper emphasizes the critical role of individual human tooth recognition in dental education and its applications in forensic odontology and archaeology. It highlights the resilience of teeth as a reliable means of identification in catastrophic scenarios, citing instances such as the 2004 Thailand tsunami, where dental evidence was pivotal in identifying 79% of victims. The paper notes that while much research has focused on single teeth for determining ancestry, sex, and age, the advent of digital technologies and deep learning has transformed dental diagnostics, particularly in classifying teeth from images.

The authors review various methodologies, including convolutional neural networks (CNNs) and 3D modeling approaches, that have been employed for tooth classification. They point out the limitations of existing methods, particularly those relying solely on 2D images, which struggle with spatial complexity and variations in viewing conditions. In contrast, the proposed hybrid approach aims to convert 3D tooth meshes into 2D representations for processing by recurrent neural networks (RNNs), enhancing both classification accuracy and computational efficiency. The introduction sets the stage for the detailed exploration of the proposed method, dataset, experimental results, and discussions that follow in the paper.

Results

The results of the proposed 3D teeth classification approach indicate a significant improvement in accuracy compared to existing state-of-the-art methods. The method achieved an accuracy of 98.01% on a 16-class dataset, 98.88% on an 8-class dataset, and an impressive 99.75% on a 4-class dataset. This performance is attributed to the effective transformation of 3D features into a 2D image format, which, when processed through RGU and fully connected networks, captures complex spatial relationships and subtle variations among different tooth types.

A detailed evaluation of the classification performance is provided in Tables 2 and 3, which include precision, recall, and F1-score metrics. While the proposed architecture consistently achieves high scores across all dataset versions, some challenges were noted with the classification of 2nd and 3rd molars due to their morphological similarities with adjacent teeth and limited representation in the dataset. However, consolidating teeth into broader categories in the 8-class and 4-class datasets improved classification consistency, demonstrating the method’s robustness and adaptability across varying levels of tooth categorization. Overall, the findings underscore the effectiveness of the proposed approach in distinguishing between different tooth types despite inherent structural similarities.

Discussion

In this study, we developed a novel deep learning approach for classifying 3D teeth by leveraging the geometric features of triangular mesh representations. Our method utilizes recurrent neural networks, specifically Gated Recurrent Units (GRUs), to effectively capture the intricate spatial relationships inherent in 3D dental structures. The feature extraction process involves converting the triangular mesh data into a 2D grayscale image format, allowing the GRUs to process the sequential nature of the data while maintaining the geometric context. Our experiments demonstrated that GRUs significantly outperformed traditional convolutional neural networks (CNNs) and other recurrent architectures, achieving classification accuracies of 98.01%, 98.88%, and 99.75% across datasets with 16, 8, and 4 tooth categories, respectively.

The ablation study revealed that combining vertex coordinates with triangular face centers provided the most comprehensive representation of the tooth geometry, enhancing classification performance. While CNNs struggled with sequential dependencies, GRUs effectively managed these complexities, leading to superior generalization across various classification tasks. The clinical implications of our findings are substantial, particularly for forensic dentistry, where our method can automate the identification of dental records in disaster scenarios, thereby expediting the identification process. Future work will focus on expanding the dataset and validating the model’s performance in diverse clinical settings to further enhance its applicability in real-world forensic investigations.