التقسيم التلقائي الكامل بالذكاء الاصطناعي للأنسجة المتعلقة بجراحة الفم استنادًا إلى صور الأشعة المقطعية المخروطية Fully automatic AI segmentation of oral surgery-related tissues based on cone beam computed tomography images

المجلة: International Journal of Oral Science، المجلد: 16، العدد: 1
DOI: https://doi.org/10.1038/s41368-024-00294-z
PMID: https://pubmed.ncbi.nlm.nih.gov/38719817
تاريخ النشر: 2024-05-08
المؤلف: Yu Liu وآخرون
الموضوع الرئيسي: الأشعة السينية السنية والتصوير

نظرة عامة

تقدم ورقة البحث نظام تقسيم الأنسجة الآلي بالكامل الذي يهدف إلى تحسين جراحة زراعة الأسنان من خلال زيادة الدقة والكفاءة في معالجة صور التصوير المقطعي المحوسب باستخدام شعاع مخروط (CBCT). تستخدم الطريقة المقترحة تقنية معالجة الصور التكيفية بناءً على توزيعات البيانات، مما يسمح بتقسيم فعال للهياكل التشريحية الحيوية، بما في ذلك العظم السنخي، والأسنان، والجيوب الأنفية العلوية. يحقق النظام درجات Dice متوسطة مثيرة للإعجاب تبلغ 96.5%، 95.4%، 93.6%، و94.8% لتقسيم الأسنان، والعظم السنخي، والجيوب الأنفية العلوية، وقناة الفك السفلي، على التوالي، مما يظهر أداءً متفوقًا مقارنة بالطرق الحالية.

في الختام، تقدم الدراسة نهج تقسيم شامل قائم على التعلم العميق يبسط عملية التخطيط الجراحي من خلال القضاء على الحاجة إلى التحديد اليدوي وتقسيم متعدد الخطوات. تضمن قدرة الطريقة على التكيف مع التغيرات في صور CBCT من مصنّعين مختلفين أداءً متسقًا عبر مجموعات بيانات متنوعة. ميزة ملحوظة هي القدرة على تحديد الأسنان المفقودة تلقائيًا باستخدام نظام الترقيم الثنائي FDI، مما يعزز شمولية التقييمات السنية. بينما النتائج واعدة، يعترف المؤلفون بالحاجة إلى مزيد من التحقق عبر مجموعات سكانية وسيناريوهات سريرية متنوعة لمعالجة القيود وتحسين المتانة. يجب أن تركز الأبحاث المستقبلية على توسيع مجموعة البيانات وتنقيح الطريقة للتعامل مع التغيرات الشديدة في جودة الصورة.

مقدمة

تسلط مقدمة ورقة البحث هذه الضوء على الدور التحويلي للتصوير المقطعي المحوسب باستخدام شعاع مخروط (CBCT) في الممارسات السنية الحديثة، مع التأكيد على قدراته في التصوير ثلاثي الأبعاد عالي الدقة التي تعزز التخطيط والتنفيذ الجراحي. تساعد تقنية CBCT في وضع زراعة الأسنان بدقة، مما يقلل من المخاطر على الهياكل التشريحية المحيطة، وتسهّل التقييمات التقويمية والعمليات الجراحية للفك والوجه. ومع ذلك، لا يزال تقسيم الهياكل السنية تلقائيًا من صور CBCT يمثل تحديًا كبيرًا بسبب تعقيد تشريح الأسنان وقيود الطرق الحالية، التي تعتمد غالبًا على الميزات اليدوية وتعيقها الكفاءة والدقة المنخفضة.

يحدد المؤلفون أربعة عيوب رئيسية في تقنيات التقسيم الحالية: (1) تصنيف غير كافٍ للأسنان وفقًا لنظام الترقيم الثنائي FDI، مما يؤدي إلى صعوبات في اكتشاف الأسنان المفقودة؛ (2) تباين في توزيع التدرجات الرمادية والتباين عبر أجهزة CBCT المختلفة، مما يعقد المعالجة المسبقة؛ (3) تعقيد مفرط في الطرق الحالية، التي تتطلب خطوات متعددة ويمكن أن تقدم أخطاء؛ و(4) نقص في حلول التقسيم الآلي بالكامل من البداية إلى النهاية التي تشمل جميع الهياكل الفموية ذات الصلة. لمعالجة هذه القضايا، تقترح الورقة نظام تقسيم آلي بالكامل قائم على التعلم العميق يستخدم توزيع البيانات في الرسم البياني لمعالجة الصور ويستخدم الشبكات العصبية المتقدمة لتقسيم دقيق للأسنان، والعظام الفكية، وقناة الفك السفلي. تشير النتائج الأولية إلى أن هذا النهج يعزز بشكل كبير دقة وكفاءة التقسيم مقارنة بالطرق الحالية.

طرق

في هذا القسم، يصف المؤلفون طريقة معالجة مسبقة جديدة لصور التصوير المقطعي المحوسب باستخدام شعاع مخروط (CBCT)، مع معالجة التحديات الفريدة التي تطرحها مقياس قيمة كثافة التدرجات الرمادية والتمثيل المحدود للهياكل السنية داخل الصور. تهدف المعالجة المسبقة إلى تعزيز كفاءة ودقة التدريب للشبكات العصبية من خلال التركيز على الخصائص المحددة لصور CBCT، التي تظهر عادةً توزيعًا منحرفًا لقيم التدرجات الرمادية بسبب هيمنة الأنسجة الرخوة والوجود المحدود للفوكيلات السنية. يستخدم المؤلفون تحليل الرسم البياني لتحديد وإزالة البيانات غير ذات الصلة، مستهدفين بشكل خاص قيم التدرجات الرمادية التي تقل عن تلك الخاصة بالأنسجة العظمية، ويطبقون تقنيات معالجة المجال الترددي لتحسين توزيع التدرجات الرمادية من أجل نتائج تقسيم محسنة.

بالنسبة لمهام التقسيم، يستخدم المؤلفون بنية شبكة هجينة تجمع بين 3D-UNet لتقسيم الهيكل العظمي وأنبوب الأعصاب مع Swin-UNETR لتقسيم حالات الأسنان. يستفيد Swin-UNETR من نقاط القوة في Swin-Transformer لالتقاط الاعتمادات بعيدة المدى والميزات المحلية، مما يعالج قيود الشبكات العصبية التلافيفية التقليدية (CNNs) في تقسيم الهياكل السنية المتقاربة. تتكون البنية من تصميم مشفر-مفكك يدمج عمليات استخراج الميزات واستعادتها، مما يعزز قدرة النموذج على إنتاج نتائج تقسيم دقيقة. تدعم المنهجية التفصيلية وبنية الشبكة التحليلات الإحصائية والتمثيلات البصرية لعمليات المعالجة المسبقة والتقسيم، مما يوضح فعالية النهج المقترح في تحسين دقة تقسيم الأنسجة السنية في صور CBCT.

نتائج

يقدم قسم النتائج تقييمًا شاملاً للطريقة المقترحة لتقسيم صور التصوير المقطعي المحوسب باستخدام شعاع مخروط (CBCT). تستخدم الطريقة عملية متعددة المراحل تشمل معالجة الصور، وتقسيم الهيكل العظمي باستخدام شبكة تعلم عميق، ودمج نتائج تقسيم متنوعة (الأسنان، العظم العلوي، العظم السفلي، والجيوب الأنفية العلوية) لإنتاج نتائج تقسيم نهائية. حقق النهج المقترح مقاييس مثيرة للإعجاب على مجموعة بيانات التحقق، بما في ذلك درجة Dice متوسطة تبلغ 96.5% لتقسيم الأسنان ودرجات عالية عبر مهام تقسيم أخرى: 92.4% للعظم العلوي، 98.3% للعظم السفلي، 93.6% للجيوب الأنفية العلوية، و94.8% لتقسيم أنبوب الأعصاب السفلي.

أظهرت التحليلات المقارنة مع الطرق الحالية القائمة على التعلم العميق أن الطريقة المقترحة تفوقت على الآخرين، محققة دقة متفوقة عبر جميع المقاييس التي تم تقييمها. أكدت دراسات الإلغاء فعالية تقنية المعالجة المسبقة التكيفية، التي حسنت بشكل كبير أداء التقسيم مقارنة بالطرق العامة. كما نجحت الطريقة في تحديد مواقع الأسنان النسبية، مما يسمح بتسمية متسقة وفقًا لنظام الترقيم الثنائي FDI. أظهرت التحقق على مجموعات بيانات CBCT متنوعة عمومية قوية، مع تسجيل أفضل أداء على مجموعة بيانات LargeV (درجة Dice تبلغ 96.9%). علاوة على ذلك، أظهرت الطريقة الآلية للتقسيم كفاءة أكبر بمئات المرات مقارنة بالتحديد اليدوي التقليدي مع الحفاظ على دقة مماثلة، مما يبرز إمكاناتها للتطبيق السريري.

مناقشة

في هذه الدراسة، نقدم طريقة جديدة قائمة على التعلم العميق لتقسيم الهياكل السنية في صور التصوير المقطعي المحوسب باستخدام شعاع مخروط (CBCT)، تستهدف بشكل خاص الأسنان، والعظام العلوية والسفلية، والجيوب الأنفية العلوية، وقناة العصب السفلي. تتميز طريقتنا بعملية تقسيم من البداية إلى النهاية التي تبسط التخطيط الجراحي لزراعة الأسنان، مما يلغي الحاجة إلى التحديد اليدوي وخطوات التقسيم المتعددة. تشير النتائج التجريبية إلى أن طريقتنا تعزز بشكل كبير الكفاءة والدقة، محققة درجة Dice متوسطة تبلغ 95.1%، والتي تتجاوز التقنيات الحالية بنسبة 3.6% في مهام تقسيم حالات الأسنان. بالإضافة إلى ذلك، تتكيف خوارزمية التكيف لدينا بشكل فعال مع التغيرات في صور CBCT من مصنّعين مختلفين، مما يظهر أداءً قويًا عبر ظروف التصوير المتنوعة.

تتمثل إحدى الابتكارات الرئيسية لطريقتنا في قدرتها على تصنيف الأسنان باستخدام نظام الترقيم الثنائي FDI، مما يسهل التعرف التلقائي على الأسنان المفقودة – وهو جانب أساسي من تخطيط العلاج السني غالبًا ما يتم تجاهله من قبل الطرق الحالية. بينما نتائجنا واعدة، نعترف بالقيود، بما في ذلك إجراء التقييم على مجموعة بيانات محددة لا تشمل العيوب الشديدة أو ضباب الحركة. يجب أن تركز الأبحاث المستقبلية على التحقق من طريقتنا عبر مجموعات سكانية وسيناريوهات سريرية أوسع، بالإضافة إلى تعزيز أدائها في وجود تغييرات شديدة في جودة الصورة. بشكل عام، تقدم دراستنا تقدمًا كبيرًا في تقسيم الهياكل السنية الآلي، مع إمكانية تحسين التخطيط قبل الجراحة ونتائج الجراحة في إجراءات زراعة الأسنان.

Journal: International Journal of Oral Science, Volume: 16, Issue: 1
DOI: https://doi.org/10.1038/s41368-024-00294-z
PMID: https://pubmed.ncbi.nlm.nih.gov/38719817
Publication Date: 2024-05-08
Author(s): Yu Liu et al.
Primary Topic: Dental Radiography and Imaging

Overview

The research paper presents a fully automated tissue segmentation system aimed at enhancing dental implant surgery through improved accuracy and efficiency in processing cone beam computed tomography (CBCT) images. The proposed method utilizes an adaptive image preprocessing technique based on data distribution histograms, which allows for effective segmentation of critical anatomical structures, including alveolar bone, teeth, and the maxillary sinus. The system achieves impressive average Dice scores of 96.5%, 95.4%, 93.6%, and 94.8% for tooth, alveolar bone, maxillary sinus, and mandibular canal segmentation, respectively, demonstrating superior performance compared to existing methods.

In conclusion, the study introduces a comprehensive deep learning-based segmentation approach that simplifies the surgical planning process by eliminating the need for manual outlining and multi-step segmentation. The method’s adaptability to variations in CBCT images from different manufacturers ensures consistent performance across diverse datasets. A notable feature is the capability to automatically identify missing teeth using the FDI Two-Digit Notation, enhancing the comprehensiveness of dental assessments. While the results are promising, the authors acknowledge the need for further validation across varied populations and clinical scenarios to address limitations and improve robustness. Future research should focus on expanding the dataset and refining the method to handle extreme variations in image quality.

Introduction

The introduction of this research paper highlights the transformative role of cone beam computed tomography (CBCT) in modern dental practices, emphasizing its high-resolution 3D imaging capabilities that enhance surgical planning and execution. CBCT technology aids in accurately positioning dental implants, thereby minimizing risks to surrounding anatomical structures, and facilitates orthodontic evaluations and maxillofacial surgeries. However, the automatic segmentation of dental structures from CBCT images remains a significant challenge due to the complexity of dental anatomy and limitations of existing methods, which often rely on manual features and are hindered by low efficiency and accuracy.

The authors identify four primary shortcomings in current segmentation techniques: (1) inadequate classification of teeth according to the FDI Two-Digit Notation, leading to difficulties in detecting missing teeth; (2) variability in grayscale distribution and contrast across different CBCT devices, complicating preprocessing; (3) excessive complexity in existing methods, which require multiple steps and can introduce errors; and (4) a lack of fully automated, end-to-end segmentation solutions that encompass all relevant oral structures. To address these issues, the paper proposes a novel deep learning-based fully automated segmentation system that employs a data distribution histogram for image preprocessing and utilizes advanced neural networks for accurate segmentation of teeth, mandibular bones, and the mandibular canal. Preliminary results indicate that this approach significantly enhances segmentation accuracy and efficiency compared to existing methods.

Methods

In this section, the authors describe a novel preprocessing method for Cone Beam Computed Tomography (CBCT) images, addressing the unique challenges posed by their grayscale density value scale and the limited representation of dental structures within the images. The preprocessing aims to enhance the training efficiency and accuracy of neural networks by focusing on the specific characteristics of CBCT images, which typically exhibit a skewed distribution of grayscale values due to the predominance of soft tissue and the minimal presence of dental voxels. The authors utilize histogram analysis to identify and remove irrelevant data, specifically targeting grayscale values below that of bone tissue, and apply frequency domain processing techniques to refine the grayscale distribution for improved segmentation outcomes.

For the segmentation tasks, the authors employ a hybrid network architecture combining the 3D-UNet for skeletal and neural tube segmentation with the Swin-UNETR for tooth instance segmentation. The Swin-UNETR leverages the strengths of the Swin-Transformer to capture long-range dependencies and local features, addressing the limitations of traditional convolutional neural networks (CNNs) in segmenting closely spaced dental structures. The architecture consists of an encoder-decoder design that integrates feature extraction and restoration processes, enhancing the model’s ability to produce precise segmentation results. The detailed methodology and network structure are supported by statistical analyses and visual representations of the preprocessing and segmentation processes, demonstrating the efficacy of the proposed approach in improving the accuracy of dental tissue segmentation in CBCT images.

Results

The results section presents a comprehensive evaluation of the proposed method for segmenting oral Cone Beam Computed Tomography (CBCT) images. The method employs a multi-stage process that includes image preprocessing, skeletal segmentation using a deep learning network, and the integration of various segmentation results (teeth, maxillary bone, mandibular bone, and maxillary sinus) to produce final segmentation outcomes. The proposed approach achieved impressive metrics on the validation dataset, including an average Dice score of 96.5% for tooth segmentation and high scores across other segmentation tasks: 92.4% for maxillary, 98.3% for mandibular, 93.6% for maxillary sinus, and 94.8% for mandibular neural tube segmentation.

Comparative analysis with existing deep learning methods demonstrated that the proposed method outperformed others, achieving superior accuracy across all evaluated metrics. Ablation studies confirmed the effectiveness of the adaptive preprocessing technique, which significantly enhanced segmentation performance compared to generic methods. The method also successfully identified relative tooth positions, allowing for consistent labeling according to the FDI Two-Digit Notation. Validation on diverse CBCT datasets indicated robust generalizability, with the best performance recorded on the LargeV dataset (Dice score of 96.9%). Furthermore, the proposed automated segmentation method exhibited hundreds of times greater efficiency than traditional manual outlining while maintaining comparable accuracy, underscoring its potential for clinical application.

Discussion

In this study, we introduce a novel deep learning-based method for the segmentation of dental structures in oral cone beam computed tomography (CBCT) images, specifically targeting teeth, maxillary and mandibular bones, maxillary sinus, and mandibular nerve canal. Our approach features an end-to-end segmentation process that streamlines surgical planning for dental implants, eliminating the need for manual annotations and multiple segmentation steps. Experimental results indicate that our method significantly enhances efficiency and accuracy, achieving an average Dice score of 95.1%, which surpasses existing techniques by 3.6% in tooth instance segmentation tasks. Additionally, our adaptive algorithm effectively accommodates variations in CBCT images from different manufacturers, demonstrating robust performance across diverse imaging conditions.

A key innovation of our method is its ability to classify teeth using the FDI Two-Digit Notation, facilitating the automatic identification of missing teeth—an essential aspect of dental treatment planning often overlooked by current methods. While our findings are promising, we acknowledge limitations, including the evaluation being conducted on a specific dataset that does not encompass severe artifacts or motion blur. Future research should focus on validating our method across broader populations and clinical scenarios, as well as enhancing its performance in the presence of extreme variations in image quality. Overall, our study presents a significant advancement in the automated segmentation of dental structures, with the potential to improve preoperative planning and surgical outcomes in dental implant procedures.