تشخيص سرطان الفم القائم على الذكاء الاصطناعي باستخدام الشبكات العصبية التلافيفية العميقة Xception وMobileNet-v2 Artificial intelligence—based diagnosis of oral leukoplakia using deep convolutional neural networks Xception and MobileNet-v2

المجلة: Frontiers in Oral Health، المجلد: 6
DOI: https://doi.org/10.3389/froh.2025.1414524
PMID: https://pubmed.ncbi.nlm.nih.gov/40191066
تاريخ النشر: 2025-03-21
المؤلف: Elakya Ramesh وآخرون
الموضوع الرئيسي: أمراض الفم وعلاجها

نظرة عامة

تدرس الدراسة فعالية الشبكات العصبية التلافيفية (CNNs)، وبالتحديد Xception وMobileNet-v2، في تشخيص اللويحات البيضاء الفموية (OL) والتفريق بينها وبين الآفات البيضاء الأخرى في تجويف الفم. تم استخدام مجموعة بيانات تتكون من 659 صورة سريرية، بما في ذلك 202 من OL و457 من آفات أخرى مثل التقرن الاحتكاكي وسرطان الفم. تم تقسيم مجموعة البيانات إلى عينات تدريب، والتحقق، والاختبار، مع تطبيق تحسين البيانات لتعزيز مجموعة التدريب. تم استخدام مقاييس الأداء بما في ذلك الدقة، والدقة الإيجابية، والاسترجاع، ودرجة F1 لتقييم النماذج.

أشارت النتائج إلى أن كلا النموذجين من CNN نجحا في تشخيص OL وآفات أخرى، حيث تفوق MobileNet-v2 على Xception من حيث درجة F1 والدقة العامة، محققًا نطاق دقة يتراوح بين 89%-92%. تستنتج الدراسة أنه بينما تظهر CNNs إمكانات كبيرة في تشخيص الاضطرابات الفموية المحتملة الخبيثة، فإن المزيد من البحث ضروري للتحقق من صحة هذه النماذج في بيئات سريرية متنوعة واستكشاف تكاملها في الممارسة الروتينية. يجب أن تأخذ التحقيقات المستقبلية أيضًا في الاعتبار الأساليب متعددة الوسائط التي تجمع بين التصوير والبيانات السريرية والنسجية لتعزيز التشخيص المدعوم بالذكاء الاصطناعي.

مقدمة

تناقش مقدمة ورقة البحث الاضطرابات الفموية المحتملة الخبيثة (OPMDs)، وهي آفات في الغشاء المخاطي الفموي تحمل خطرًا مرتفعًا للتحول الخبيث. تشمل OPMDs الرئيسية المحددة اللويحات البيضاء، واللويحات الحمراء، واللويحات الفموية، من بين أمور أخرى. يتم تقديم تعريف مصقول للويحات البيضاء، مع التأكيد على خصائصها كآفة بيضاء بشكل أساسي، لا رجعة فيها، تشكل خطرًا أكبر للإصابة بالسرطان، وغالبًا ما ترتبط باستخدام التبغ والكحول. يتم تسليط الضوء على تصنيف اللويحات البيضاء إلى أنواع متجانسة وغير متجانسة، مع أوصاف محددة لمظاهرها السريرية.

تؤكد الورقة على أهمية التقييم النسجي لتقييم خطر التحول الخبيث في OPMDs، على الرغم من توفر تقنيات تشخيصية مساعدة مثل الخزعات السائلة وأنظمة التصوير الضوئي. الكشف المبكر أمر حاسم لتحسين معدلات البقاء وجودة الحياة، حيث أن سرطان الفم المتقدم يقلل بشكل كبير من التوقعات. تستكشف المقدمة أيضًا إمكانيات الذكاء الاصطناعي (AI) وتعلم الآلة (ML) في تعزيز دقة التشخيص وتقليل التكاليف في إدارة OPMDs. يقترح المؤلفون منهجية ذكاء اصطناعي لتحليل اللويحات البيضاء من خلال الصور الرقمية، بهدف تحسين التمييز بين الأنواع السريرية وتسهيل التشخيص في الوقت المناسب.

طرق البحث

تمت الموافقة على البحث من الناحية الأخلاقية من قبل مجلس مراجعة الأخلاقيات في كلية طب الأسنان SRM، رامابورام، تحت رقم المرجع SRMU/M&HS/SRMDC/2023/PG/014. تضمن هذه الموافقة أن الدراسة تلتزم بالمعايير الأخلاقية في البحث الذي يشمل البشر، على الرغم من عدم تقديم تفاصيل محددة بشأن المواد والأساليب المستخدمة في الدراسة في النص المستخرج. ستكون المعلومات الإضافية ضرورية لفهم كامل للمنهجيات المستخدمة في البحث.

النتائج

يقدم قسم النتائج تحليلًا مقارنًا لأداء نموذجين من الشبكات العصبية التلافيفية (CNN)، MobileNetV2 وXception، في تصنيف الآفات الفموية إلى ثلاث فئات: اللويحات البيضاء المتجانسة (OL)، اللويحات البيضاء غير المتجانسة، وآفات بيضاء أخرى. توضح الأشكال 2 و3 قدرات التعرف للنماذج، بينما تلخص الجدول 1 مقاييس أداء التشخيص الخاصة بها، بما في ذلك الدقة، والدقة الإيجابية، والاسترجاع، ودرجة F1.

أظهر MobileNetV2 حساسية متفوقة، خاصة بالنسبة لـ OL غير المتجانسة (92%) وآفات بيضاء أخرى (94%)، مقارنةً بـ Xception، الذي حقق 85% حساسية لـ OL غير المتجانسة ولكنه قدم أداءً جيدًا بنسبة 89% لـ OL المتجانسة و91% للآفات البيضاء الأخرى. أظهرت كلا النموذجين خصوصية عالية، حيث حقق Xception 96% لـ OL المتجانسة و94% لـ OL غير المتجانسة، بينما وصل MobileNetV2 إلى 96% لـ OL المتجانسة و93% لـ OL غير المتجانسة، وكان لديه أعلى خصوصية (95%) للآفات البيضاء الأخرى. بشكل عام، يضع الأداء المتوازن لـ MobileNetV2 مع حساسية وخصوصية أعلى كونه النموذج الأكثر فعالية في تشخيص اللويحات البيضاء الفموية والتفريق بينها وبين الآفات البيضاء الأخرى، كما هو موضح في الجدول 2.

المناقشة

تدرس الدراسة تطبيق الشبكات العصبية التلافيفية (CNNs)، وبالتحديد Xception وMobileNetV2، لتشخيص اللويحات البيضاء الفموية وآفات بيضاء أخرى باستخدام الصور السريرية. تم تجميع مجموعة بيانات تتكون من 659 صورة، مع تمثيل 202 صورة للويحات البيضاء الفموية و457 صورة توضح آفات بيضاء أخرى. تم تقسيم مجموعة البيانات إلى مجموعات تدريب، والتحقق، والاختبار، مع استخدام تقنيات تحسين البيانات لتعزيز تنوع مجموعة بيانات التدريب. تم تدريب النماذج لتصنيف الصور إلى ثلاث فئات: اللويحات البيضاء المتجانسة، اللويحات البيضاء غير المتجانسة، وآفات بيضاء أخرى. تفوق نموذج MobileNetV2 على Xception، محققًا دقة إجمالية تبلغ 92% ودرجة F1 تبلغ 0.92، مما يدل على قدرته التشخيصية المتفوقة.

تؤكد النتائج على إمكانيات أدوات التشخيص المدفوعة بالذكاء الاصطناعي في تحسين اتخاذ القرارات السريرية للاضطرابات الفموية المحتملة الخبيثة. تسلط الدراسة الضوء على أهمية الكشف المبكر عن اللويحات البيضاء الفموية، التي تحمل خطرًا كبيرًا للتحول الخبيث، وتقترح أن نماذج CNN يمكن أن تكون إضافات قيمة لطرق التشخيص التقليدية. ومع ذلك، تم الاعتراف بالقيود مثل مجموعة البيانات الصغيرة والتحيزات المحتملة بسبب أخذ العينات المريحة. يجب أن تركز الأبحاث المستقبلية على التحقق من صحة هذه النماذج عبر مجموعات سكانية متنوعة ودمج الأساليب متعددة الوسائط لتعزيز دقة التشخيص وقابلية التطبيق السريري.

Journal: Frontiers in Oral Health, Volume: 6
DOI: https://doi.org/10.3389/froh.2025.1414524
PMID: https://pubmed.ncbi.nlm.nih.gov/40191066
Publication Date: 2025-03-21
Author(s): Elakya Ramesh et al.
Primary Topic: Oral Health Pathology and Treatment

Overview

The study investigates the efficacy of convolutional neural networks (CNNs), specifically Xception and MobileNet-v2, in diagnosing oral leukoplakia (OL) and differentiating it from other white lesions in the oral cavity. A dataset comprising 659 clinical photographs, including 202 of OL and 457 of other lesions such as frictional keratosis and oral carcinoma, was utilized. The dataset was divided into training, validation, and test samples, with data augmentation applied to enhance the training set. Performance metrics including accuracy, precision, recall, and F1-score were employed to evaluate the models.

Results indicated that both CNN models successfully diagnosed OL and other lesions, with MobileNet-v2 outperforming Xception in terms of F1-score and overall accuracy, achieving an accuracy range of 89%-92%. The study concludes that while CNNs show significant potential for diagnosing oral potentially malignant disorders, further research is necessary to validate these models in varied clinical settings and to explore their integration into routine practice. Future investigations should also consider multimodal approaches that combine imaging with clinical and histopathological data to enhance AI-assisted diagnostics.

Introduction

The introduction of the research paper discusses Oral Potentially Malignant Disorders (OPMDs), which are lesions of the oral mucosa with an elevated risk of malignant transformation. Key OPMDs identified include leukoplakia, erythroplakia, and oral lichen planus, among others. A refined definition of leukoplakia is provided, emphasizing its characteristics as a predominantly white, irreversible lesion that poses a higher cancer risk, often linked to tobacco and alcohol use. The classification of leukoplakia into homogeneous and non-homogeneous subtypes is highlighted, with specific descriptions of their clinical appearances.

The paper underscores the importance of histopathological evaluation for assessing the risk of malignant transformation in OPMDs, despite the availability of adjunct diagnostic technologies such as liquid biopsies and optical imaging systems. Early detection is crucial for improving survival rates and quality of life, as advanced oral cancer significantly reduces prognosis. The introduction also explores the potential of artificial intelligence (AI) and machine learning (ML) in enhancing diagnostic accuracy and reducing costs in the management of OPMDs. The authors propose an AI methodology for analyzing leukoplakia through digital images, aiming to improve the differentiation of clinical types and facilitate timely diagnosis.

Methods

The research conducted was ethically approved by the Ethics Review Board of SRM Dental College, Ramapuram, under the reference number SRMU/M&HS/SRMDC/2023/PG/014. This approval ensures that the study adheres to ethical standards in research involving human subjects, although specific details regarding the materials and methods employed in the study are not provided in the extracted text. Further information would be necessary to fully understand the methodologies utilized in the research.

Results

The results section presents a comparative analysis of the performance of two convolutional neural network (CNN) models, MobileNetV2 and Xception, in classifying oral lesions into three categories: homogenous leukoplakia (OL), non-homogenous OL, and other white lesions. Figures 2 and 3 illustrate the models’ identification capabilities, while Table 1 summarizes their diagnostic performance metrics, including accuracy, precision, recall, and F1-score.

MobileNetV2 demonstrated superior sensitivity, particularly for non-homogeneous OL (92%) and other white lesions (94%), compared to Xception, which achieved 85% sensitivity for non-homogeneous OL but performed well with 89% for homogenous OL and 91% for other white lesions. Both models exhibited high specificity, with Xception achieving 96% for homogenous OL and 94% for non-homogeneous OL, while MobileNetV2 reached 96% for homogenous OL and 93% for non-homogeneous OL, and had the highest specificity (95%) for other white lesions. Overall, MobileNetV2’s balanced performance with higher sensitivity and specificity positions it as the more effective model for diagnosing oral leukoplakia and differentiating it from other white lesions, as detailed in Table 2.

Discussion

The study investigates the application of convolutional neural networks (CNNs), specifically Xception and MobileNetV2, for the diagnosis of oral leukoplakia and other white lesions using clinical photographs. A dataset comprising 659 images was curated, with 202 images representing oral leukoplakia and 457 images depicting other white lesions. The dataset was divided into training, validation, and testing sets, with data augmentation techniques employed to enhance the training dataset’s variability. The models were trained to classify images into three categories: homogeneous leukoplakia, non-homogeneous leukoplakia, and other white lesions. The MobileNetV2 model outperformed Xception, achieving an overall accuracy of 92% and an F1-score of 0.92, indicating its superior diagnostic capability.

The findings underscore the potential of AI-driven diagnostic tools in improving clinical decision-making for oral potentially malignant disorders. The study highlights the importance of early detection of oral leukoplakia, which has a significant risk of malignant transformation, and suggests that CNN models could serve as valuable adjuncts to traditional diagnostic methods. However, limitations such as the small dataset and potential biases due to convenience sampling were acknowledged. Future research should focus on validating these models across diverse populations and integrating multimodal approaches to enhance diagnostic accuracy and clinical applicability.